user/sven/postgresql.git

Age	Commit message (Collapse)	Author
2006-05-30	Fix ancient misdescription of namegt/namege in comment. Greg Stark	Tom Lane

2006-05-28	Fix up pg_dump to do string escaping fully correctly for client encoding	Tom Lane
	and standard_conforming_strings; likewise for the other client programs that need it. As per previous discussion, a pg_dump dump now conforms to the standard_conforming_strings setting of the source database. We don't use E'' syntax in the dump, thereby improving portability of the SQL. I added a SET escape_strings_warning = off command to keep the dumps from getting a lot of back-chatter from that.
2006-05-26	Use E'' strings internally only when standard_conforming_strings =	Bruce Momjian
	'off'. This allows pg_dump output with standard_conforming_strings = 'on' to generate proper strings that can be loaded into other databases without the backslash doubling we typically do. I have added the dumping of the standard_conforming_strings value to pg_dump. I also added standard backslash handling for plpgsql.
2006-05-23	Tweak writetup_heap/readtup_heap to avoid storing the tuple identity	Tom Lane
	and transaction visibility fields of tuples being sorted. These are always uninteresting in a tuple being sorted (if the fields were actually selected, they'd have been pulled out into user columns beforehand). This saves about 24 bytes per row being sorted, which is a useful savings for any but the widest of sort rows. Per recent discussion.
2006-05-21	Add a new GUC parameter backslash_quote, which determines whether the SQL	Tom Lane
	parser will allow "\'" to be used to represent a literal quote mark. The "\'" representation has been deprecated for some time in favor of the SQL-standard representation "''" (two single quote marks), but it has been used often enough that just disallowing it immediately won't do. Hence backslash_quote allows the settings "on", "off", and "safe_encoding", the last meaning to allow "\'" only if client_encoding is a valid server encoding. That is now the default, and the reason is that in encodings such as SJIS that allow 0x5c (ASCII backslash) to be the last byte of a multibyte character, accepting "\'" allows SQL-injection attacks as per CVE-2006-2314 (further details will be published after release). The "on" setting is available for backward compatibility, but it must not be used with clients that are exposed to untrusted input. Thanks to Akio Ishida and Yasuo Ohgaki for identifying this security issue.
2006-05-21	Change the backend to reject strings containing invalidly-encoded multibyte	Tom Lane
	characters in all cases. Formerly we mostly just threw warnings for invalid input, and failed to detect it at all if no encoding conversion was required. The tighter check is needed to defend against SQL-injection attacks as per CVE-2006-2313 (further details will be published after release). Embedded zero (null) bytes will be rejected as well. The checks are applied during input to the backend (receipt from client or COPY IN), so it no longer seems necessary to check in textin() and related routines; any string arriving at those functions will already have been validated. Conversion failure reporting (for characters with no equivalent in the destination encoding) has been cleaned up and made consistent while at it. Also, fix a few longstanding errors in little-used encoding conversion routines: win1251_to_iso, win866_to_iso, euc_tw_to_big5, euc_tw_to_mic, mic_to_euc_tw were all broken to varying extents. Patches by Tatsuo Ishii and Tom Lane. Thanks to Akio Ishida and Yasuo Ohgaki for identifying the security issues.
2006-05-19	Add last-vacuum/analyze-time columns to the stats collector, both manual and	Alvaro Herrera
	issued by autovacuum. Add accessor functions to them, and use those in the pg_stat_*_tables system views. Catalog version bumped due to changes in the pgstat views and the pgstat file. Patch from Larry Rosenman, minor improvements by me.
2006-05-19	Have autovacuum report its activities to the stat collector.	Alvaro Herrera

2006-05-11	Code review for standard_conforming_strings patch. Fix it so it does not	Tom Lane
	throw warnings for 100%-SQL-standard constructs, clean up some minor infelicities, try to un-break ecpg to the best of my ability. (It's not clear how ecpg is going to find out the setting of standard_conforming_strings, though.) I think pg_dump still needs work, too.
2006-05-06	Further minor simplification of relcache startup: don't need a static	Tom Lane
	needNewCacheFile flag anymore, it can just be local in RelationCacheInitializePhase2.
2006-05-04	Simplify relcache startup sequence. With the new design of InitPostgres	Tom Lane
	it's not necessary to have three separate calls anymore. This patch also fixes things so we don't try to read pg_internal.init until after we've obtained lock on the target database; which was fairly harmless, but it's certainly cleaner this way.
2006-05-04	Rethink the locking mechanisms used for CREATE/DROP/RENAME DATABASE.	Tom Lane
	The former approach used ExclusiveLock on pg_database, which being a cluster-wide lock meant only one of these operations could proceed at a time; worse, it also blocked all incoming connections in ReverifyMyDatabase. Now that we have LockSharedObject(), we can use locks of different types applied to databases considered as objects. This allows much more flexible management of the interlocking: two CREATE DATABASEs need not block each other, and need not block connections except to the template database being used. Similarly DROP DATABASE doesn't block unrelated operations. The locking used in flatfiles.c is also much narrower in scope than before. Per recent proposal.
2006-05-03	Create a syscache for pg_database-indexed-by-oid, and make use of it	Tom Lane
	in various places that were previously doing ad hoc pg_database searches. This may speed up database-related privilege checks a little bit, but the main motivation is to eliminate the performance reason for having ReverifyMyDatabase do such a lot of stuff (viz, avoiding repeat scans of pg_database during backend startup). The locking reason for having that routine is about to go away, and it'd be good to have the option to break it up.
2006-05-02	GIN: Generalized Inverted iNdex.	Teodor Sigaev
	text[], int4[], Tsearch2 support for GIN.
2006-05-02	Avoid assuming that statistics for a parent relation reflect the properties of	Tom Lane
	the union of its child relations as well. This might have been a good idea when it was originally coded, but it's a fatally bad idea when inheritance is being used for partitioning. It's better to have no stats at all than completely misleading stats. Per report from Mark Liberman. The bug arguably exists all the way back, but I've only patched HEAD and 8.1 because we weren't particularly trying to support partitioning before 8.1. Eventually we ought to look at deriving union statistics instead of just punting, but for now the drop kick looks good.
2006-05-01	Provide a namespace.c function for lookup of an operator with exact	Tom Lane
	input datatypes given, and use this before trying OpernameGetCandidates. This is faster than the old method when there's an exact match, and it does not seem materially slower when there's not. And it definitely makes some of the callers cleaner, because they didn't really want to know about a list of candidates anyway. Per discussion with Atsushi Ogawa.
2006-04-30	Code review for GRANT CONNECT patch. Spell the privilege as CONNECT not	Tom Lane
	CONNECTION, fix a number of places that were missed (eg pg_dump support), avoid executing an extra search of pg_database during startup.
2006-04-30	Improve the representation of FOR UPDATE/FOR SHARE so that we can	Tom Lane
	support both FOR UPDATE and FOR SHARE in one command, as well as both NOWAIT and normal WAIT behavior. The more general code is actually simpler and cleaner.
2006-04-30	Add GRANT CONNECTION ON DATABASE, to be used in addition to pg_hba.conf.	Bruce Momjian
	Gevik Babakhani
2006-04-27	Generalize mcv_selectivity() to support both VAR OP CONST and CONST OP VAR	Tom Lane
	cases. This was not needed in the existing uses within selfuncs.c, but if we're gonna export it for general use, the extra generality seems helpful. Motivated by looking at ltree example.
2006-04-27	If we're going to expose VariableStatData for contrib modules to use,	Tom Lane
	then we should export a reasonable set of the supporting routines too.
2006-04-26	Move ltree parentsel() selectivity function into /contrib/ltree.	Bruce Momjian

2006-04-26	Enhanced containment selectivity function for /contrib/ltree	Bruce Momjian
	Matteo Beccati
2006-04-25	Arrange to cache btree metapage data in the relcache entry for the index,	Tom Lane
	thereby saving a visit to the metapage in most index searches/updates. This wouldn't actually save any I/O (since in the old regime the metapage generally stayed in cache anyway), but it does provide a useful decrease in bufmgr traffic in high-contention scenarios. Per my recent proposal.
2006-04-25	Back out RESET CONNECTION until there is more discussion.	Bruce Momjian

2006-04-25	Add RESET CONNECTION, to reset all aspects of a session.	Bruce Momjian
	Hans-J?rgen Sch?nig
2006-04-25	Add statement_timestamp(), clock_timestamp(), and	Bruce Momjian
	transaction_timestamp() (just like now()). Also update statement_timeout() to mention it is statement arrival time that is measured. Catalog version updated.
2006-04-24	Improve our private implementation of cbrt() to give results of the	Tom Lane
	accuracy expected by the regression tests. Per suggestion from Martijn van Oosterhout.
2006-04-24	Remove compiler warning by casting SNPRINTF() call to void.	Bruce Momjian
	Report from Gevik Babakhani.
2006-04-22	Simplify ParamListInfo data structure to support only numbered parameters,	Tom Lane
	not named ones, and replace linear searches of the list with array indexing. The named-parameter support has been dead code for many years anyway, and recent profiling suggests that the searching was costing a noticeable amount of performance for complex queries.
2006-04-20	Eliminate some no-longer-needed workarounds for palloc's old behavior	Tom Lane
	of rejecting palloc(0). Also, tweak like_selectivity() to avoid assuming the presented pattern is nonempty; although that assumption is valid, it doesn't really help much, and the new coding is more correct anyway since it properly handles redundant wildcards. In combination these changes should eliminate a Coverity warning noted by Martijn.
2006-04-19	Fix problem that sscanf(buf, "%d", &val) eats leading white space, but	Bruce Momjian
	our to_* functions were not handling that.
2006-04-19	C code whitespace inprovement for formatting.c.	Bruce Momjian

2006-04-13	Fix similar_escape() so that SIMILAR TO works properly for patterns involving	Tom Lane
	alternatives ("\|" symbol). The original coding allowed the added ^ and $ constraints to be absorbed into the first and last alternatives, producing a pattern that would match more than it should. Per report from Eric Noriega. I also changed the pattern to add an ARE director ("***:"), ensuring that SIMILAR TO patterns do not change behavior if regex_flavor is changed. This is necessary to make the non-capturing parentheses work, and seems like a good idea on general principles. Back-patched as far as 7.4. 7.3 also has the bug, but a fix seems impractical because that version's regex engine doesn't have non-capturing parens.
2006-04-10	Suppress unused-variable warning on platforms without HAVE_SYSLOG.	Tom Lane
	Magnus
2006-04-08	Fix EXPLAIN so that it can drill down through multiple levels of subplan	Tom Lane
	when trying to locate the referent of a RECORD variable. This fixes the 'record type has not been registered' failure reported by Stefan Kaltenbrunner about a month ago. A side effect of the way I chose to fix it is that most variable references in join conditions will now be properly labeled with the variable's source table name, instead of the not-too-helpful 'outer' or 'inner' we used to use.
2006-04-05	Fix a bunch of problems with domains by making them use special input functions	Tom Lane
	that apply the necessary domain constraint checks immediately. This fixes cases where domain constraints went unchecked for statement parameters, PL function local variables and results, etc. We can also eliminate existing special cases for domains in places that had gotten it right, eg COPY. Also, allow domains over domains (base of a domain is another domain type). This almost worked before, but was disallowed because the original patch hadn't gotten it quite right.
2006-04-04	Modify all callers of datatype input and receive functions so that if these	Tom Lane
	functions are not strict, they will be called (passing a NULL first parameter) during any attempt to input a NULL value of their datatype. Currently, all our input functions are strict and so this commit does not change any behavior. However, this will make it possible to build domain input functions that centralize checking of domain constraints, thereby closing numerous holes in our domain support, as per previous discussion. While at it, I took the opportunity to introduce convenience functions InputFunctionCall, OutputFunctionCall, etc to use in code that calls I/O functions. This eliminates a lot of grotty-looking casts, but the main motivation is to make it easier to grep for these places if we ever need to touch them again.
2006-04-03	Eliminate ajust scan code. Since concurrent GiST it doesn't	Teodor Sigaev
	do real work. That was missed during concurrence development.
2006-03-29	Clean up and document the API for XLogOpenRelation and XLogReadBuffer.	Tom Lane
	This commit doesn't make much functional change, but it does eliminate some duplicated code --- for instance, PageIsNew tests are now done inside XLogReadBuffer rather than by each caller. The GIST xlog code still needs a lot of love, but I'll worry about that separately.
2006-03-23	Add error location info to ResTarget parse nodes. Allows error cursor to be ↵	Tom Lane
	supplied for various mistakes involving INSERT and UPDATE target columns.
2006-03-19	Fix a few places that were checking for the return value of palloc() to be	Neil Conway
	non-NULL: palloc() ereports on OOM, so we can safely assume it returns a valid pointer.
2006-03-16	Clean up representation of function RTEs for functions returning RECORD.	Tom Lane
	The original coding stored the raw parser output (ColumnDef and TypeName nodes) which was ugly, bulky, and wrong because it failed to create any dependency on the referenced datatype --- and in fact would not track type renamings and suchlike. Instead store a list of column type OIDs in the RTE. Also fix up general failure of recordDependencyOnExpr to do anything sane about recording dependencies on datatypes. While there are many cases where there will be an indirect dependency (eg if an operator returns a datatype, the dependency on the operator is enough), we do have to record the datatype as a separate dependency in examples like CoerceToDomain. initdb forced because of change of stored rules.
2006-03-14	Improve parser so that we can show an error cursor position for errors	Tom Lane
	during parse analysis, not only errors detected in the flex/bison stages. This is per my earlier proposal. This commit includes all the basic infrastructure, but locations are only tracked and reported for errors involving column references, function calls, and operators. More could be done later but this seems like a good set to start with. I've also moved the ReportSyntaxErrorPosition logic out of psql and into libpq, which should make it available to more people --- even within psql this is an improvement because warnings weren't handled by ReportSyntaxErrorPosition.
2006-03-11	Remove copyright notices from Jan (per author approval), and those files	Bruce Momjian
	derived from Jan's.
2006-03-11	Add CVS tag lines to files that were lacking them.	Bruce Momjian

2006-03-11	Remove a few places that attempted to define INT_MAX, SCHAR_MAX, and	Neil Conway
	similar constants if they were not previously defined. All these constants must be defined by limits.h according to C89, so we can safely assume they are present.
2006-03-10	Recent changes in memory management in tuplesort.c had a problem: the	Tom Lane
	case where we run low on array slots before we run low on memory is much more probable than I had thought, and so it's important to treat each tape fairly in that case. To fix this, track per-tape slot allocations just like we track per-tape space allocation. Also, in the FINALMERGE code path avoid scanning all the input tapes when we really only need to read from one. This should fix poor behavior with very large work_mem as exhibited by Stefan Kaltenbrunner. I didn't do anything about putting an upper bound on the number of tapes, but maybe we should still consider that.
2006-03-10	Implement 4 new aggregate functions from SQL2003. Specifically: var_pop(),	Neil Conway
	var_samp(), stddev_pop(), and stddev_samp(). var_samp() and stddev_samp() are just renamings of the historical Postgres aggregates variance() and stddev() -- the latter names have been kept for backward compatibility. This patch includes updates for the documentation and regression tests. The catversion has been bumped. NB: SQL2003 requires that DISTINCT not be specified for any of these aggregates. Per discussion on -patches, I have NOT implemented this restriction: if the user asks for stddev(DISTINCT x), presumably they know what they are doing.
2006-03-08	Tweak trace_sort code to show the merge order (number of active input	Tom Lane
	tapes) for each merge step. This will give us some idea of how effective the merge distribution algorithm is.