user/sven/postgresql.git

Age	Commit message (Collapse)	Author
2007-02-06	Add support for cross-type hashing in hashed subplans (hashed IN/NOT IN cases	Tom Lane
	that aren't turned into true joins). Since this is the last missing bit of infrastructure, go ahead and fill out the hash integer_ops and float_ops opfamilies with cross-type operators. The operator family project is now DONE ... er, except for documentation ...
2007-02-05	Rename MaxTupleSize to MaxHeapTupleSize to clarify that it's not meant to	Tom Lane
	describe the maximum size of index tuples (which is typically AM-dependent anyway); and consequently remove the bogus deduction for "special space" that was built into it. Adjust TOAST_TUPLE_THRESHOLD and TOAST_MAX_CHUNK_SIZE to avoid wasting two bytes per toast chunk, and to ensure that the calculation correctly tracks any future changes in page header size. The computation had been inaccurate in a way that didn't cause any harm except space wastage, but future changes could have broken it more drastically. Fix the calculation of BTMaxItemSize, which was formerly computed as 1 byte more than it could safely be. This didn't cause any harm in practice because it's only compared against maxalign'd lengths, but future changes in the size of page headers or btree special space could have exposed the problem. initdb forced because of change in TOAST_MAX_CHUNK_SIZE, which alters the storage of toast tables.
2007-02-04	Don't MAXALIGN in the checks to decide whether a tuple is over TOAST's	Tom Lane
	threshold for tuple length. On 4-byte-MAXALIGN machines, the toast code creates tuples that have t_len exactly TOAST_TUPLE_THRESHOLD ... but this number is not itself maxaligned, so if heap_insert maxaligns t_len before comparing to TOAST_TUPLE_THRESHOLD, it'll uselessly recurse back to tuptoaster.c, wasting cycles. (It turns out that this does not happen on 8-byte-MAXALIGN machines, because for them the outer MAXALIGN in the TOAST_MAX_CHUNK_SIZE macro reduces TOAST_MAX_CHUNK_SIZE so that toast tuples will be less than TOAST_TUPLE_THRESHOLD in size. That MAXALIGN is really incorrect, but we can't remove it now, see below.) There isn't any particular value in maxaligning before comparing to the thresholds, so just don't do that, which saves a small number of cycles in itself. These numbers should be rejiggered to minimize wasted space on toast-relation pages, but we can't do that in the back branches because changing TOAST_MAX_CHUNK_SIZE would force an initdb (by changing the contents of toast tables). We can move the toast decision thresholds a bit, though, which is what this patch effectively does. Thanks to Pavan Deolasee for discovering the unintended recursion. Back-patch into 8.2, but not further, pending more testing. (HEAD is about to get a further patch modifying the thresholds, so it won't help much for testing this form of the patch.)
2007-02-04	Change vacuum lazy "compacting" warning message to:	Bruce Momjian
	errhint("Consider using VACUUM FULL on this relation or increasing the configuration parameter \"max_fsm_pages\".")));
2007-02-03	Update SQL conformance information about XML features.	Peter Eisentraut

2007-02-03	Implement XMLSERIALIZE for real. Analogously, make the xml to text cast	Peter Eisentraut
	observe the xmloption. Reorganize the representation of the XML option in the parse tree and the API to make it easier to manage and understand. Add regression tests for parsing back XML expressions.
2007-02-02	Repair failure to check that a table is still compatible with a previously	Tom Lane
	made query plan. Use of ALTER COLUMN TYPE creates a hazard for cached query plans: they could contain Vars that claim a column has a different type than it now has. Fix this by checking during plan startup that Vars at relation scan level match the current relation tuple descriptor. Since at that point we already have at least AccessShareLock, we can be sure the column type will not change underneath us later in the query. However, since a backend's locks do not conflict against itself, there is still a hole for an attacker to exploit: he could try to execute ALTER COLUMN TYPE while a query is in progress in the current backend. Seal that hole by rejecting ALTER TABLE whenever the target relation is already open in the current backend. This is a significant security hole: not only can one trivially crash the backend, but with appropriate misuse of pass-by-reference datatypes it is possible to read out arbitrary locations in the server process's memory, which could allow retrieving database content the user should not be able to see. Our thanks to Jeff Trout for the initial report. Security: CVE-2007-0556
2007-02-02	Repair insufficiently careful type checking for SQL-language functions:	Tom Lane
	we should check that the function code returns the claimed result datatype every time we parse the function for execution. Formerly, for simple scalar result types we assumed the creation-time check was sufficient, but this fails if the function selects from a table that's been redefined since then, and even more obviously fails if check_function_bodies had been OFF. This is a significant security hole: not only can one trivially crash the backend, but with appropriate misuse of pass-by-reference datatypes it is possible to read out arbitrary locations in the server process's memory, which could allow retrieving database content the user should not be able to see. Our thanks to Jeff Trout for the initial report. Security: CVE-2007-0555
2007-02-01	Wording cleanup for error messages. Also change can't -> cannot.	Bruce Momjian
	Standard English uses "may", "can", and "might" in different ways: may - permission, "You may borrow my rake." can - ability, "I can lift that log." might - possibility, "It might rain today." Unfortunately, in conversational English, their use is often mixed, as in, "You may use this variable to do X", when in fact, "can" is a better choice. Similarly, "It may crash" is better stated, "It might crash".
2007-02-01	Fix a few typos in comments in GiN.	Neil Conway

2007-01-31	Revert error message change for may/can/might --- needs discussion.	Bruce Momjian

2007-01-31	Update documentation on may/can/might:	Bruce Momjian
	Standard English uses "may", "can", and "might" in different ways: may - permission, "You may borrow my rake." can - ability, "I can lift that log." might - possibility, "It might rain today." Unfortunately, in conversational English, their use is often mixed, as in, "You may use this variable to do X", when in fact, "can" is a better choice. Similarly, "It may crash" is better stated, "It might crash". Also update two error messages mentioned in the documenation to match.
2007-01-31	Rewrite uuid input and output routines to avoid dependency on the	Neil Conway
	nonportable "hh" sprintf(3) length modifier. Instead, do the parsing and output by hand. The code to do this isn't ideal, but this is an interim measure anyway: the uuid type should probably use the in-memory struct layout specified by RFC 4122. For now, this patch should hopefully rectify the buildfarm failures for the uuid test. Along the way, re-add pg_cast entries for uuid <-> varchar, which I mistakenly removed earlier, and bump the catversion.
2007-01-31	Revert gincostestimate changes.	Teodor Sigaev

2007-01-31	Allow GIN's extractQuery method to signal that nothing can satisfy the query.	Teodor Sigaev
	In this case extractQuery should returns -1 as nentries. This changes prototype of extractQuery method to use int32* instead of uint32* for nentries argument. Based on that gincostestimate may see two corner cases: nothing will be found or seqscan should be used. Per proposal at http://archives.postgresql.org/pgsql-hackers/2007-01/msg01581.php PS tsearch_core patch should be sightly modified to support changes, but I'm waiting a verdict about reviewing of tsearch_core patch.
2007-01-30	Update documentation for pg_get_serial_sequence() function.	Bruce Momjian

2007-01-30	Add support for cross-type hashing in hash index searches and hash joins.	Tom Lane
	Hashing for aggregation purposes still needs work, so it's not time to mark any cross-type operators as hashable for general use, but these cases work if the operators are so marked by hand in the system catalogs.
2007-01-29	Add comment noting that hashm_procid in a hash index's metapage isn't	Tom Lane
	actually used for anything.
2007-01-29	Update process termination message to display signal number and name	Bruce Momjian
	from exec.c and postmaster.c.
2007-01-28	Improve hash join to discard input tuples immediately if they can't	Tom Lane
	match because they contain a null join key (and the join operator is known strict). Improves performance significantly when the inner relation contains a lot of nulls, as per bug #2930.
2007-01-28	Rename the uuid_t type to pg_uuid_t, to avoid a conflict with any	Neil Conway
	definitions of uuid_t that may be provided by the system headers. This should hopefully fix the Win32 build problems reported by Magnus.
2007-01-28	Remove some unnecessary conversion work in build_regtype_array().	Tom Lane

2007-01-28	Repair oversight in creation of "append relations": we should set up	Tom Lane
	rel->tuples as well as rel->rows, since some estimation functions expect both to be valid in every baserel. Per report from Dave Dutcher.
2007-01-28	Add a new builtin type, "uuid". This implements a UUID type, similar to	Neil Conway
	that defined in RFC 4122. This patch includes the basic implementation, plus regression tests. Documentation and perhaps some additional functionality will come later. Catversion bumped. Patch from Gevik Babakhani; review from Peter, Tom, and myself.
2007-01-28	Clean up broken usage of HAVE_DECL_SYS_SIGLIST and inconsistent/poorly	Tom Lane
	formatted error messages.
2007-01-28	Use autoconf build-in sys_siglist macro AC_DECL_SYS_SIGLIST, rather than	Bruce Momjian
	create our own.
2007-01-28	Dept of second thoughts: the IQ of estimate_array_length() needs to be	Tom Lane
	kept on par with that of scalararraysel(), else estimates that should track might not. Hence teach it about binary-compatible cases, too.
2007-01-28	Fix scalararraysel() to cope with binary-compatible cases, such as text[]	Tom Lane
	versus varchar[]. This oversight probably explains Ryan Holmes' recent complaint --- he was getting a generic selectivity estimate instead of anything intelligent.
2007-01-28	Use sys_siglist[] to print out signal names for signal exits, rather	Bruce Momjian
	than just numbers.
2007-01-27	Correct an old logic error in btree page splitting: when considering a split	Tom Lane
	exactly at the point where we need to insert a new item, the calculation used the wrong size for the "high key" of the new left page. This could lead to choosing an unworkable split, resulting in "PANIC: failed to add item to the left sibling" (or "right sibling") failure. Although this bug has been there a long time, it's very difficult to trigger a failure before 8.2, since there was generally a lot of free space on both sides of a chosen split. In 8.2, where the user-selected fill factor determines how much free space the code tries to leave, an unworkable split is much more likely. Report by Joe Conway, diagnosis and fix by Heikki Linnakangas.
2007-01-27	Reactivate libxml memory management via palloc, now that I think I've	Peter Eisentraut
	classified the conditions under which this is safe to do (see source code comment).
2007-01-27	Add trailing zero byte in Unicode codepoint conversion.	Peter Eisentraut

2007-01-26	On Windows, use pgwin32_waitforsinglesocket() instead of select() to wait for	Tom Lane
	input in the stats collector. Our select() emulation is apparently buggy for UDP sockets :-(. This should resolve problems with stats collection (and hence autovacuum) failing under more than minimal load. Diagnosis and patch by Magnus Hagander. Patch probably needs to be back-ported to 8.1 and 8.0, but first let's see if it makes the buildfarm happy...
2007-01-25	Correction: temp_tablespaces was implemented by Albert Cervera Areny,	Bruce Momjian
	with cleanup by Jaime Casanova.
2007-01-25	Various fixes in the logic of XML functions:	Peter Eisentraut
	- Add new SQL command SET XML OPTION (also available via regular GUC) to control the DOCUMENT vs. CONTENT option in implicit parsing and serialization operations. - Subtle corrections in the handling of the standalone property in xmlroot(). - Allow xmlroot() to work on content fragments. - Subtle corrections in the handling of the version property in xmlconcat(). - Code refactoring for producing XML declarations.
2007-01-25	Add GUC temp_tablespaces to provide a default location for temporary	Bruce Momjian
	objects. Jaime Casanova
2007-01-25	Properly detoast access to bytea field pg_trigger.tgargs. Old code	Bruce Momjian
	might cause server crash. Backpatch to 8.2.X.
2007-01-25	Prevent WAL logging when COPY is done in the same transation that	Bruce Momjian
	created it. Simon Riggs
2007-01-24	Get pg_utf_mblen(), pg_utf2wchar_with_len(), and utf2ucs() all on the same	Tom Lane
	page about the maximum UTF8 sequence length we support (4 bytes since 8.1, 3 before that). pg_utf2wchar_with_len never got updated to support 4-byte characters at all, and in any case had a buffer-overrun risk in that it could produce multiple pg_wchars from what mblen claims to be just one UTF8 character. The only reason we don't have a major security hole is that most callers allocate worst-case output buffers; the sole exception in released versions appears to be pre-8.2 iwchareq() (ie, ILIKE), which can be crashed due to zeroing out its return address --- but AFAICS that can't be exploited for anything more than a crash, due to inability to control what gets written there. Per report from James Russell and Michael Fuhr. Pre-8.1 the risk is much less, but I still think pg_utf2wchar_with_len's behavior given an incomplete final character risks buffer overrun, so back-patch that logic change anyway. This patch also makes sure that UTF8 sequences exceeding the supported length (whichever it is) are consistently treated as error cases, rather than being treated like a valid shorter sequence in some places.
2007-01-24	Relax an Assert() that has been found to be too strict in some situations	Tom Lane
	involving unions of types having typmods. Variants of the failure are known to occur in 8.1 and up; not sure if it's possible in 8.0 and 7.4, but since the code exists that far back, I'll just patch 'em all. Per report from Brian Hurt.
2007-01-23	Simplify handling of XML error messages: Just use the string provided by	Peter Eisentraut
	libxml as the detail message. As per <http://archives.postgresql.org/pgsql-hackers/2006-12/msg01087.php>. For converting error codes to messages, we only need to cover those codes that we raise ourselves now.
2007-01-23	Add CREATE/ALTER/DROP OPERATOR FAMILY commands, also COMMENT ON OPERATOR	Tom Lane
	FAMILY; and add FAMILY option to CREATE OPERATOR CLASS to allow adding a class to a pre-existing family. Per previous discussion. Man, what a tedious lot of cutting and pasting ...
2007-01-23	Back out use of FormatMessage(), does error values, not exception	Bruce Momjian
	values. Point to /include/ntstatus.h for an exception list, rather than a URL.
2007-01-23	Print meaningfull error text for abonormal process exit on Win32, rather	Bruce Momjian
	than hex codes, using FormatMessage().
2007-01-22	Put back planner's ability to cache the results of mergejoinscansel(),	Tom Lane
	which I had removed in the first cut of the EquivalenceClass rewrite to simplify that patch a little. But it's still important --- in a four-way join problem mergejoinscansel() was eating about 40% of the planning time according to gprof. Also, improve the EquivalenceClass code to re-use join RestrictInfos rather than generating fresh ones for each join considered. This saves some memory space but more importantly improves the effectiveness of caching planning info in RestrictInfos.
2007-01-22	Use errhint() for WIN32 SIGTERM message, where possible.	Bruce Momjian

2007-01-22	When system() fails in Win32, report it as an exception, print the	Bruce Momjian
	exception value in hex, and give a URL where the value can be looked-up.
2007-01-22	Add COST and ROWS options to CREATE/ALTER FUNCTION, plus underlying pg_proc	Tom Lane
	columns procost and prorows, to allow simple user adjustment of the estimated cost of a function call, as well as control of the estimated number of rows returned by a set-returning function. We might eventually wish to extend this to allow function-specific estimation routines, but there seems to be consensus that we should try a simple constant estimate first. In particular this provides a relatively simple way to control the order in which different WHERE clauses are applied in a plan node, which is a Good Thing in view of the fact that the recent EquivalenceClass planner rewrite made that much less predictable than before.
2007-01-21	Refactor some lsyscache routines to eliminate duplicate code and save	Tom Lane
	a couple of syscache lookups in make_pathkey_from_sortinfo().
2007-01-20	Simplify pg_am representation of ordering-capable access methods:	Tom Lane
	provide just a boolean 'amcanorder', instead of fields that specify the sort operator strategy numbers. We have decided to require ordering-capable AMs to use btree-compatible strategy numbers, so the old fields are overkill (and indeed misleading about what's allowed).