pgbench can now run multiple benchmarking processes/threads at once. Earlier versions could easily become limited by the speed of the pgbench client coordination part of the program itself instead. The contrib/auto_explain output displays the query being executed, in addition to its plan.
Query log data collected by contrib/pg_stat_statements includes counts for how much buffer cache activity was associated with each statement.