| author | Tom Lane <tgl@sss.pgh.pa.us> | 2011-05-30 17:05:40 -0400 |
|---|---|---|
| committer | Tom Lane <tgl@sss.pgh.pa.us> | 2011-05-30 17:07:19 -0400 |
| commit | b503da135ab0bdd97ac3d3f720c35854e084e525 | |
| tree | e2862a05fd2982b2189bb4da303a72558bcc249b /src/include | |
| parent | d6a9e7a1e45a3569b3c3558b5817e78756224be2 | |
Fix VACUUM so that it always updates pg_class.reltuples/relpages.

When we added the ability for vacuum to skip heap pages by consulting the
visibility map, we made it just not update the reltuples/relpages
statistics if it skipped any pages. But this could leave us with extremely
out-of-date stats for a table that contains any unchanging areas,
especially for TOAST tables which never get processed by ANALYZE. In
particular this could result in autovacuum making poor decisions about when
to process the table, as in a recent report from Florian Helmberger. And in
general it's a bad idea to not update the stats at all. Instead, use the
previous values of reltuples/relpages as an estimate of the tuple density
in unvisited pages. This approach results in a "moving average" estimate
of reltuples, which should converge to the correct value over multiple
VACUUM and ANALYZE cycles even when individual measurements aren't very
good.
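
The density-based estimate described above can be sketched in C as follows. This is a minimal illustration under stated assumptions, not the code from this commit: the function name `estimate_reltuples_sketch`, its plain `double` parameters, and the handling of the edge cases (nothing scanned, no previous statistics) are hypothetical. It counts the scanned tuples exactly and assumes the unvisited pages still hold tuples at the old reltuples/relpages density.

```c
/*
 * Hypothetical helper, NOT the function added by this commit.
 * old_pages / old_tuples stand in for the previous pg_class.relpages
 * and pg_class.reltuples values.
 */
static double
estimate_reltuples_sketch(double old_pages, double old_tuples,
                          double total_pages, double scanned_pages,
                          double scanned_tuples)
{
    double old_density;
    double unvisited_pages;

    /* Whole table scanned: the count is exact, so use it as-is. */
    if (scanned_pages >= total_pages)
        return scanned_tuples;

    /* Assumption: nothing scanned means no new information; keep the old value. */
    if (scanned_pages <= 0)
        return old_tuples;

    /* Assumption: with no previous statistics, extrapolate the scanned density. */
    if (old_pages <= 0)
        return scanned_tuples / scanned_pages * total_pages;

    /* Exact count for scanned pages, old density for the unvisited ones. */
    old_density = old_tuples / old_pages;
    unvisited_pages = total_pages - scanned_pages;
    return scanned_tuples + old_density * unvisited_pages;
}
```

For example, with old statistics of 100000 tuples in 1000 pages and a pass that scans 200 of 1000 pages and finds 10000 tuples, the sketch yields 10000 + 100 * 800 = 90000. Feeding that result back in as the old value and repeating the same measurement gives 82000, so successive passes pull the estimate toward the density actually observed, which is the "moving average" behavior the message describes.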

This new method for updating reltuples is used by both VACUUM and ANALYZE,
with the result that we no longer need the grotty interconnections that
caused ANALYZE to not update the stats depending on what had happened
in the parent VACUUM command.

Also, fix the logic for skipping all-visible pages during VACUUM so that it
looks ahead rather than behind to decide what to do, as per a suggestion
from Greg Stark. This eliminates useless scanning of all-visible pages at
the start of the relation or just after a not-all-visible page. In
particular, the first few pages of the relation will not be invariably
included in the scanned pages, which seems to help in not overweighting
them in the reltuples estimate.
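
The look-ahead idea can be sketched as below. This is an illustration only, not the vacuumlazy.c code: the names `next_not_all_visible`, `plan_scan_lookahead`, and `SKIP_RUN_THRESHOLD` are hypothetical, the visibility map is modeled as a plain `bool` array, and the rule that a run of all-visible pages is skipped only when it reaches a minimum length (4 here) is an assumption made for the example.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical threshold: skip only runs of at least this many all-visible pages. */
#define SKIP_RUN_THRESHOLD 4

/* Return the first block >= start that is not all-visible, or nblocks if none. */
static size_t
next_not_all_visible(const bool *all_visible, size_t nblocks, size_t start)
{
    size_t blk = start;

    while (blk < nblocks && all_visible[blk])
        blk++;
    return blk;
}

/*
 * Mark in scanned[] which pages a pass would visit, deciding about each run
 * of all-visible pages by looking ahead to where the run ends.
 * Returns the number of pages scanned.
 */
static size_t
plan_scan_lookahead(const bool *all_visible, size_t nblocks, bool *scanned)
{
    size_t next_dirty = next_not_all_visible(all_visible, nblocks, 0);
    /* The run at the start of the relation is pages 0 .. next_dirty - 1. */
    bool   skipping = (next_dirty >= SKIP_RUN_THRESHOLD);
    size_t nscanned = 0;

    for (size_t blk = 0; blk < nblocks; blk++)
    {
        if (blk == next_dirty)
        {
            /* This page must be scanned; measure the all-visible run after it. */
            next_dirty = next_not_all_visible(all_visible, nblocks, blk + 1);
            skipping = (next_dirty - blk - 1 >= SKIP_RUN_THRESHOLD);
        }
        else if (skipping)
        {
            scanned[blk] = false;   /* inside a long all-visible run */
            continue;
        }
        scanned[blk] = true;
        nscanned++;
    }
    return nscanned;
}
```

With pages 0 to 4 all-visible, page 5 not, pages 6 and 7 all-visible, page 8 not, and page 9 all-visible, the sketch skips pages 0 to 4 outright and scans pages 5 to 9, because the later runs are shorter than the assumed threshold. The leading pages are skipped because the decision is made before reaching them rather than after.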

Back-patch to 8.4, where the visibility map was introduced.

Diffstat (limited to 'src/include')
| -rw-r--r-- | src/include/commands/vacuum.h | 8 |
| -rw-r--r-- | src/include/pgstat.h | 3 |
2 files changed, 7 insertions, 4 deletions
diff --git a/src/include/commands/vacuum.h b/src/include/commands/vacuum.h
index 22008e0b47c..9012b4e2e38 100644
--- a/src/include/commands/vacuum.h
+++ b/src/include/commands/vacuum.h
@@ -131,6 +131,10 @@ extern void vacuum(VacuumStmt *vacstmt, Oid relid, bool do_toast,
 extern void vac_open_indexes(Relation relation, LOCKMODE lockmode,
                  int *nindexes, Relation **Irel);
 extern void vac_close_indexes(int nindexes, Relation *Irel, LOCKMODE lockmode);
+extern double vac_estimate_reltuples(Relation relation, bool is_analyze,
+                       BlockNumber total_pages,
+                       BlockNumber scanned_pages,
+                       double scanned_tuples);
 extern void vac_update_relstats(Relation relation,
                     BlockNumber num_pages,
                     double num_tuples,
@@ -147,10 +151,10 @@ extern void vacuum_delay_point(void);
 
 /* in commands/vacuumlazy.c */
 extern bool lazy_vacuum_rel(Relation onerel, VacuumStmt *vacstmt,
-                BufferAccessStrategy bstrategy, bool *scanned_all);
+                BufferAccessStrategy bstrategy);
 
 /* in commands/analyze.c */
 extern void analyze_rel(Oid relid, VacuumStmt *vacstmt,
-            BufferAccessStrategy bstrategy, bool update_reltuples);
+            BufferAccessStrategy bstrategy);
 
 #endif /* VACUUM_H */
diff --git a/src/include/pgstat.h b/src/include/pgstat.h
index 7fa25175f44..bac5262ed79 100644
--- a/src/include/pgstat.h
+++ b/src/include/pgstat.h
@@ -284,7 +284,6 @@ typedef struct PgStat_MsgVacuum
     Oid m_tableoid;
     bool m_analyze;
     bool m_autovacuum;
-    bool m_scanned_all;
     TimestampTz m_vacuumtime;
     PgStat_Counter m_tuples;
 } PgStat_MsgVacuum;
@@ -632,7 +631,7 @@ extern void pgstat_clear_snapshot(void);
 extern void pgstat_reset_counters(void);
 
 extern void pgstat_report_autovac(Oid dboid);
-extern void pgstat_report_vacuum(Oid tableoid, bool shared, bool scanned_all,
+extern void pgstat_report_vacuum(Oid tableoid, bool shared,
                      bool analyze, PgStat_Counter tuples);
 extern void pgstat_report_analyze(Relation rel,
                       PgStat_Counter livetuples,
