user/sven/postgresql.git

Age	Commit message	Author
2011-06-10	Work around gcc 4.6.0 bug that breaks WAL replay.	Tom Lane
	ReadRecord's habit of using both direct references to tmpRecPtr and references to *RecPtr (which is pointing at tmpRecPtr) triggers an optimization bug in gcc 4.6.0, which apparently has forgotten about aliasing rules. Avoid the compiler bug, and make the code more readable to boot, by getting rid of the direct references. Improve the comments while at it. Back-patch to all supported versions, in case they get built with 4.6.0. Tom Lane, with some cosmetic suggestions from Alex Hunsaker
2011-05-11	Shut down WAL receiver if it's still running at end of recovery. We used to	Heikki Linnakangas
	just check that it's not running and PANIC if it was, but that can rightfully happen if recovery stops at recovery target.
2011-04-13	Revert the patch to check if we've reached end-of-backup also when doing	Heikki Linnakangas
	crash recovery, and throw an error if not. hubert depesz lubaczewski pointed out that that situation also happens in the crash recovery following a system crash that happens during an online backup. We might want to do something smarter in 9.1, like put the check back for backups taken with pg_basebackup, but that's for another patch.
2011-03-30	Check that we've reached end-of-backup also when we're not performing	Heikki Linnakangas
	archive recovery. It's possible to restore an online backup without recovery.conf, by simply copying all the necessary WAL files to pg_xlog. "pg_basebackup -x" does that too. That's the use case where this cross-check is useful. Backpatch to 9.0. We used to do this in earlier versions, but in 9.0 the code was inadvertently changed so that the check is only performed after archive recovery. Fujii Masao.
2011-03-23	Prevent intermittent hang in recovery from bgwriter interaction.	Simon Riggs
	Startup process waited for cleanup lock but when hot_standby = off the pid was not registered, so that the bgwriter would not wake the waiting process as intended.
2010-12-07	Fix bugs in the hot standby known-assigned-xids tracking logic. If there's	Heikki Linnakangas
	an old transaction running in the master, and a lot of transactions have started and finished since, and a WAL-record is written in the gap between creating the running-xacts snapshot and WAL-logging it, recovery will fail with "too many KnownAssignedXids" error. This bug was reported by Joachim Wieland on Nov 19th. In the same scenario, when fewer transactions have started so that all the xids fit in KnownAssignedXids despite the first bug, a more serious bug arises. We incorrectly initialize the clog code with the oldest still running transaction, and when we see the WAL record belonging to a transaction with an XID larger than one that committed already before the checkpoint we're recovering from, we zero the clog page containing the already committed transaction, leading to data loss. In hindsight, trying to track xids in the known-assigned-xids array before seeing the running-xacts record was too complicated. To fix that, hold XidGenLock while the running-xacts snapshot is taken and WAL-logged. That ensures that no transaction can begin or end in that gap, so that in recovery we know that the snapshot contains all transactions running at that point in WAL.
2010-12-06	Fix two typos, by Fujii Masao.	Heikki Linnakangas

2010-11-11	Fix bug introduced by the recent patch to check that the checkpoint redo	Heikki Linnakangas
	location read from backup label file can be found: wasShutdown was set incorrectly when a backup label file was found. Jeff Davis, with a little tweaking by me.
2010-11-02	Bootstrap WAL to begin at segment logid=0 logseg=1 (000000010000000000000001)	Heikki Linnakangas
	rather than 0/0, so that we can safely use 0/0 as an invalid value. This is a more future-proof fix for the corner-case bug in streaming replication that was fixed yesterday. We had a similar corner-case bug with log/seg 0/0 back in February as well. Avoiding 0/0 as a valid value should prevent bugs like that in the future. Per Tom Lane's idea. Back-patch to 9.0. Since this only affects bootstrapping, it makes no difference to existing installations. We don't need to worry about the bug in existing installations, because if you've managed to get past the initial base backup already, you won't hit the bug in the future either.
2010-11-01	Fix corner-case bug in tracking of latest removed WAL segment during	Heikki Linnakangas
	streaming replication. We used log/seg 0/0 to indicate that no WAL segments have been removed since startup, but 0/0 is a valid value for the very first WAL segment after initdb. To make that unambiguous, store (latest removed WAL segment + 1) in the global variable. Per report from Matt Chesler, also reproduced by Greg Smith.
2010-10-26	Before removing backup_label and irrevocably changing pg_control file, check	Heikki Linnakangas
	that WAL file containing the checkpoint redo-location can be found. This avoids making the cluster irrecoverable if the redo location is in an earlier WAL file than the checkpoint record. Report, analysis and patch by Jeff Davis, with small changes by me.
2010-10-20	Don't try to fetch database name when SetTransactionIdLimit() is executed	Tom Lane
	outside a transaction. This repairs brain fade in my patch of 2009-08-30: the reason we had been storing oldest-database name, not OID, in ShmemVariableCache was of course to avoid having to do a catalog lookup at times when it might be unsafe. This error explains why Aleksandr Dushein is having trouble getting out of an XID wraparound state in bug #5718, though not how he got into that state in the first place. I suspect pg_upgrade is at fault there.
2010-10-14	Fix bug in comment of timeline history file.	Simon Riggs
	Fujii Masao
2010-08-30	Fix misleading DEBUG2 issued during RemoveOldXlogFiles()	Simon Riggs

2010-08-30	Truncate subtrans after each restartpoint.	Simon Riggs
	Issue reported by Harald Kolb, patch by Fujii Masao, review by me.
2010-08-26	Remove duplicate translatable phrase	Alvaro Herrera

2010-08-13	Make RecordTransactionCommit() respect wal_level.	Robert Haas
	Since the only purpose of WAL-logging SharedInvalidationMessages is to support Hot Standby operation, they needn't be included when wal_level < hot_standby. Back-patch to 9.0. Review by Heikki Linnakangas and Fujii Masao.
2010-08-12	Correct sundry errors in Hot Standby-related comments.	Robert Haas
	Fujii Masao
2010-08-01	Back-patch fix for renaming asyncCommitLSN to asyncXactLSN.	Tom Lane
	AIUI this was supposed to go into 9.0 as well as HEAD.
2010-07-23	Avoid deep recursion when assigning XIDs to multiple levels of subxacts.	Robert Haas
	Backpatch to 8.0. Andres Freund, with cleanup and adjustment for older branches by me.
2010-07-08	Update obsolete comment. Noted by Josh Tolley.	Tom Lane

2010-07-06	pgindent run for 9.0, second run	Bruce Momjian

2010-07-03	Don't set recoveryLastXTime when replaying a checkpoint --- that was a bogus	Tom Lane
	idea from the start since the variable is only meant to track commit/abort events. This patch reverts the logic around the variable to what it was in 8.4, except that the value is now kept in shared memory rather than a static variable, so that it can be reported correctly by CreateRestartPoint (which is executed in the bgwriter).
2010-07-03	Replace max_standby_delay with two parameters, max_standby_archive_delay and	Tom Lane
	max_standby_streaming_delay, and revise the implementation to avoid assuming that timestamps found in WAL records can meaningfully be compared to clock time on the standby server. Instead, the delay limits are compared to the elapsed time since we last obtained a new WAL segment from archive or since we were last "caught up" to WAL data arriving via streaming replication. This avoids problems with clock skew between primary and standby, as well as other corner cases that the original coding would misbehave in, such as the primary server having significant idle time between transactions. Per my complaint some time ago and considerable ensuing discussion. Do some desultory editing on the hot standby documentation, too.
2010-06-29	Add C comment about why synchronous_commit=off behavior can lose	Bruce Momjian
	committed transactions in a postmaster crash.
2010-06-28	emode_for_corrupt_record shouldn't reduce LOG messages to WARNING.	Robert Haas
	In non-interactive sessions, WARNING sorts below LOG.
2010-06-17	Make RemoveOldXlogFiles's debug printout match style used elsewhere:	Tom Lane
	log and seg aren't an XLogRecPtr and shouldn't be printed like one. Fujii Masao
2010-06-17	Don't allow walsender to send WAL data until it's been safely fsync'd on the	Tom Lane
	master. Otherwise a subsequent crash could cause the master to lose WAL that has already been applied on the slave, resulting in the slave being out of sync and soon corrupt. Per recent discussion and an example from Robert Haas. Fujii Masao
2010-06-14	If a corrupt WAL record is received by streaming replication, disconnect	Heikki Linnakangas
	and retry. If the record is genuinely corrupt in the master database, there's little hope of recovering, but it's better than simply retrying to apply the corrupt WAL record in a tight loop without even trying to retransmit it, which is what we used to do.
2010-06-12	Fix typo/bug, found by Clang compiler	Peter Eisentraut

2010-06-10	Rename restartpoint_command to archive_cleanup_command.	Itagaki Takahiro

2010-06-10	Make TriggerFile variable static. It's not used outside xlog.c.	Heikki Linnakangas
	Fujii Masao
2010-06-10	Return NULL instead of 0/0 in pg_last_xlog_receive_location() and	Heikki Linnakangas
	pg_last_xlog_replay_location(). Per Robert Haas's suggestion, after Itagaki Takahiro pointed out an issue in the docs. Also, some wording changes in the docs by me.
2010-06-09	In standby mode, respect checkpoint_segments in addition to	Heikki Linnakangas
	checkpoint_timeout to trigger restartpoints. We used to deliberately only do time-based restartpoints, because if checkpoint_segments is small we would spend time doing restartpoints more often than really necessary. But now that restartpoints are done in bgwriter, they're not as disruptive as they used to be. Secondly, because streaming replication stores the streamed WAL files in pg_xlog, we want to clean it up more often to avoid running out of disk space when checkpoint_timeout is large and checkpoint_segments small. Patch by Fujii Masao, with some minor changes by me.
2010-06-09	Make the walwriter close its handle to an old xlog segment if it's no longer	Magnus Hagander
	the current one. Not doing this would leave the walwriter with a handle to a deleted file if there was nothing for it to do for a long period of time, preventing the file from being completely removed. Reported by Tollef Fog Heen, and thanks to Heikki for some hand-holding with the patch.
2010-06-03	Fix some inconsistent quoting of wal_level values in messages	Peter Eisentraut
	When referring to postgresql.conf syntax, then it's without quotes (wal_level=archive); in narrative it's with double quotes. But never single quotes.
2010-06-03	On clean shutdown during recovery, don't warn about possible corruption.	Robert Haas
	Fujii Masao. Review by Heikki Linnakangas and myself.
2010-06-02	Fix obsolete comments that I neglected to update in a previous patch.	Heikki Linnakangas
	Fujii Masao
2010-05-27	Adjust comment to reflect that we now have Hot Standby. Pointed out by	Heikki Linnakangas
	Robert Haas.
2010-05-15	Rename PM_RECOVERY_CONSISTENT and PMSIGNAL_RECOVERY_CONSISTENT.	Robert Haas
	The new names PM_HOT_STANDBY and PMSIGNAL_BEGIN_HOT_STANDBY more accurately reflect their actual function.
2010-05-15	Fix bug in processing of checkpoint time for max_standby_delay. Latest	Simon Riggs
	log time was incorrectly set, typically leading to dates in the past, which would cause more cancellations in Hot Standby on a quiet server.
2010-05-14	Add many new Asserts in code and fix simple bug that slipped through	Simon Riggs
	without them, related to previous commit. Report by Bruce Momjian.
2010-05-13	Ensure that top level aborts call XLogSetAsyncCommit(). Not doing	Simon Riggs
	so simply leads to data waiting in wal_buffers which then causes later commits to potentially do emergency writes and for all forms of replication to be potentially delayed without need or benefit. Issue pointed out exactly by Fujii Masao, following bug report by Robert Haas on a separate though related topic.
2010-05-13	Cleanup initialization of Hot Standby. Clarify working with reanalysis	Simon Riggs
	of requirements and documentation on LogStandbySnapshot(). Fixes two minor bugs reported by Tom Lane that would lead to an incorrect snapshot after transaction wraparound. Also fix two other problems discovered that would give incorrect snapshots in certain cases. ProcArrayApplyRecoveryInfo() substantially rewritten. Some minor refactoring of xact_redo_apply() and ExpireTreeKnownAssignedTransactionIds().
2010-05-03	Need to hold ControlFileLock while updating control file. Update	Heikki Linnakangas
	minRecoveryPoint in control file when replaying a parameter change record, to ensure that we don't allow hot standby on WAL generated without wal_level='hot_standby' after a standby restart.
2010-05-02	Clean up some awkward, inaccurate, and inefficient processing around	Tom Lane
	MaxStandbyDelay. Use the GUC units mechanism for the value, and choose more appropriate timestamp functions for performing tests with it. Make the ps_activity manipulation in ResolveRecoveryConflictWithVirtualXIDs have behavior similar to ps_activity code elsewhere, notably not updating the display when update_process_title is off and not truncating the display contents at an arbitrarily-chosen length. Improve the docs to be explicit about what MaxStandbyDelay actually measures, viz the difference between primary and standby servers' clocks, and the possible hazards if their clocks aren't in sync.
2010-04-29	Adjust error checks in pg_start_backup and pg_stop_backup to make it possible	Tom Lane
	to perform a backup without archive_mode being enabled. This gives up some user-error protection in order to improve usefulness for streaming-replication scenarios. Per discussion.
2010-04-29	Rename the parameter recovery_connections to hot_standby, to reduce possible	Tom Lane
	confusion with streaming-replication settings. Also, change its default value to "off", because of concern about executing new and poorly-tested code during ordinary non-replicating operation. Per discussion. In passing do some minor editing of related documentation.
2010-04-28	Modify ShmemInitStruct and ShmemInitHash to throw errors internally,	Tom Lane
	rather than returning NULL for some-but-not-all failures as they used to. Remove now-redundant tests for NULL from call sites. We had to do something about this because many call sites were failing to check for NULL; and changing it like this seems a lot more useful and mistake-proof than adding checks to the call sites without them.
2010-04-28	Introduce wal_level GUC to explicitly control if information needed for	Heikki Linnakangas
	archival or hot standby should be WAL-logged, instead of deducing that from other options like archive_mode. This replaces recovery_connections GUC in the primary, where it now has no effect, but it's still used in the standby to enable/disable hot standby. Remove the WAL-logging of "unlogged operations", like creating an index without WAL-logging and fsyncing it at the end. Instead, we keep a copy of the wal_level setting and the settings that affect how much shared memory a hot standby server needs to track master transactions (max_connections, max_prepared_xacts, max_locks_per_xact) in pg_control. Whenever the settings change, at server restart, write a WAL record noting the new settings and update pg_control. This allows us to notice the change in those settings in the standby at the right moment; they used to be included in checkpoint records, but that meant that a changed value was not reflected in the standby until the first checkpoint after the change. Bump PG_CONTROL_VERSION and XLOG_PAGE_MAGIC. Whack XLOG_PAGE_MAGIC back to the sequence it used to follow, before hot standby and subsequent patches changed it to 0x9003.