diff options
Diffstat (limited to 'doc/FAQ')
-rw-r--r-- | doc/FAQ | 1057 |
1 files changed, 0 insertions, 1057 deletions
diff --git a/doc/FAQ b/doc/FAQ deleted file mode 100644 index 4052044954d..00000000000 --- a/doc/FAQ +++ /dev/null @@ -1,1057 +0,0 @@ - - Frequently Asked Questions (FAQ) for PostgreSQL - - Last updated: Tue Jun 11 06:36:10 EDT 2002 - - Current maintainer: Bruce Momjian (pgman@candle.pha.pa.us) - - The most recent version of this document can be viewed at - http://www.PostgreSQL.org/docs/faq-english.html. - - Platform-specific questions are answered at - http://www.PostgreSQL.org/users-lounge/docs/faq.html. - _________________________________________________________________ - - General Questions - - 1.1) What is PostgreSQL? How is it pronounced? - 1.2) What is the copyright on PostgreSQL? - 1.3) What Unix platforms does PostgreSQL run on? - 1.4) What non-Unix ports are available? - 1.5) Where can I get PostgreSQL? - 1.6) Where can I get support? - 1.7) What is the latest release? - 1.8) What documentation is available? - 1.9) How do I find out about known bugs or missing features? - 1.10) How can I learn SQL? - 1.11) Is PostgreSQL Y2K compliant? - 1.12) How do I join the development team? - 1.13) How do I submit a bug report? - 1.14) How does PostgreSQL compare to other DBMSs? - 1.15) How can I financially assist PostgreSQL? - - User Client Questions - - 2.1) Are there ODBC drivers for PostgreSQL? - 2.2) What tools are available for using PostgreSQL with Web pages? - 2.3) Does PostgreSQL have a graphical user interface? A report - generator? An embedded query language interface? - 2.4) What languages are available to communicate with PostgreSQL? - - Administrative Questions - - 3.1) How do I install PostgreSQL somewhere other than - /usr/local/pgsql? - 3.2) When I start postmaster, I get a Bad System Call or core dumped - message. Why? - 3.3) When I try to start postmaster, I get IpcMemoryCreate errors. - Why? - 3.4) When I try to start postmaster, I get IpcSemaphoreCreate errors. - Why? - 3.5) How do I control connections from other hosts? - 3.6) How do I tune the database engine for better performance? - 3.7) What debugging features are available? - 3.8) Why do I get "Sorry, too many clients" when trying to connect? - 3.9) What are the pg_sorttempNNN.NN files in my database directory? - - Operational Questions - - 4.1) What is the difference between binary cursors and normal cursors? - 4.2) How do I SELECT only the first few rows of a query? - 4.3) How do I get a list of tables or other things I can see in psql? - 4.4) How do you remove a column from a table? - 4.5) What is the maximum size for a row, a table, and a database? - 4.6) How much database disk space is required to store data from a - typical text file? - 4.7) How do I find out what tables, indexes, databases, and users are - defined? - 4.8) My queries are slow or don't make use of the indexes. Why? - 4.9) How do I see how the query optimizer is evaluating my query? - 4.10) What is an R-tree index? - 4.11) What is the Genetic Query Optimizer? - 4.12) How do I perform regular expression searches and - case-insensitive regular expression searches? How do I use an index - for case-insensitive searches? - 4.13) In a query, how do I detect if a field is NULL? - 4.14) What is the difference between the various character types? - 4.15.1) How do I create a serial/auto-incrementing field? - 4.15.2) How do I get the value of a SERIAL insert? - 4.15.3) Don't currval() and nextval() lead to a race condition with - other users? - 4.15.4) Why aren't my sequence numbers reused on transaction abort? - Why are there gaps in the numbering of my sequence/SERIAL column? - 4.16) What is an OID? What is a TID? - 4.17) What is the meaning of some of the terms used in PostgreSQL? - 4.18) Why do I get the error "ERROR: Memory exhausted in - AllocSetAlloc()"? - 4.19) How do I tell what PostgreSQL version I am running? - 4.20) Why does my large-object operations get "invalid large obj - descriptor"? - 4.21) How do I create a column that will default to the current time? - 4.22) Why are my subqueries using IN so slow? - 4.23) How do I perform an outer join? - 4.24) How do I perform queries using multiple databases? - 4.25) How do I return multiple rows or columns from a function? - 4.26) Why can't I reliably create/drop temporary tables in PL/PgSQL - functions? - - Extending PostgreSQL - - 5.1) I wrote a user-defined function. When I run it in psql, why does - it dump core? - 5.2) How can I contribute some nifty new types and functions to - PostgreSQL? - 5.3) How do I write a C function to return a tuple? - 5.4) I have changed a source file. Why does the recompile not see the - change? - _________________________________________________________________ - - General Questions - - 1.1) What is PostgreSQL? - - PostgreSQL is pronounced Post-Gres-Q-L. - - PostgreSQL is an enhancement of the POSTGRES database management - system, a next-generation DBMS research prototype. While PostgreSQL - retains the powerful data model and rich data types of POSTGRES, it - replaces the PostQuel query language with an extended subset of SQL. - PostgreSQL is free and the complete source is available. - - PostgreSQL development is performed by a team of Internet developers - who all subscribe to the PostgreSQL development mailing list. The - current coordinator is Marc G. Fournier (scrappy@PostgreSQL.org). (See - below on how to join). This team is now responsible for all - development of PostgreSQL. - - The authors of PostgreSQL 1.01 were Andrew Yu and Jolly Chen. Many - others have contributed to the porting, testing, debugging, and - enhancement of the code. The original Postgres code, from which - PostgreSQL is derived, was the effort of many graduate students, - undergraduate students, and staff programmers working under the - direction of Professor Michael Stonebraker at the University of - California, Berkeley. - - The original name of the software at Berkeley was Postgres. When SQL - functionality was added in 1995, its name was changed to Postgres95. - The name was changed at the end of 1996 to PostgreSQL. - - 1.2) What is the copyright on PostgreSQL? - - PostgreSQL is subject to the following COPYRIGHT: - - PostgreSQL Data Base Management System - - Portions copyright (c) 1996-2002, PostgreSQL Global Development Group - Portions Copyright (c) 1994-6 Regents of the University of California - - Permission to use, copy, modify, and distribute this software and its - documentation for any purpose, without fee, and without a written - agreement is hereby granted, provided that the above copyright notice - and this paragraph and the following two paragraphs appear in all - copies. - - IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY - FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, - INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE AND - ITS DOCUMENTATION, EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN - ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. - - THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES, - INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF - MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE - PROVIDED HEREUNDER IS ON AN "AS IS" BASIS, AND THE UNIVERSITY OF - CALIFORNIA HAS NO OBLIGATIONS TO PROVIDE MAINTENANCE, SUPPORT, - UPDATES, ENHANCEMENTS, OR MODIFICATIONS. - - The above is the BSD license, the classic open-source license. It has - no restrictions on how the source code may be used. We like it and - have no intention of changing it. - - 1.3) What Unix platforms does PostgreSQL run on? - - In general, a modern Unix-compatible platform should be able to run - PostgreSQL. The platforms that had received explicit testing at the - time of release are listed in the installation instructions. - - 1.4) What non-Unix ports are available? - - Client - - It is possible to compile the libpq C library, psql, and other - interfaces and binaries to run on MS Windows platforms. In this case, - the client is running on MS Windows, and communicates via TCP/IP to a - server running on one of our supported Unix platforms. A file - win31.mak is included in the distribution for making a Win32 libpq - library and psql. PostgreSQL also communicates with ODBC clients. - - Server - - The database server can run on Windows NT and Win2k using Cygwin, the - Cygnus Unix/NT porting library. See pgsql/doc/FAQ_MSWIN in the - distribution or the MS Windows FAQ on our web site. We have no plan to - do a native port to any Microsoft platform. - - 1.5) Where can I get PostgreSQL? - - The primary anonymous ftp site for PostgreSQL is - ftp://ftp.PostgreSQL.org/pub. For mirror sites, see our main web site. - - 1.6) Where can I get support? - - The main mailing list is: pgsql-general@PostgreSQL.org. It is - available for discussion of matters pertaining to PostgreSQL. To - subscribe, send mail with the following lines in the body (not the - subject line): - subscribe - end - - to pgsql-general-request@PostgreSQL.org. - - There is also a digest list available. To subscribe to this list, send - email to: pgsql-general-digest-request@PostgreSQL.org with a body of: - subscribe - end - - Digests are sent out to members of this list whenever the main list - has received around 30k of messages. - - The bugs mailing list is available. To subscribe to this list, send - email to pgsql-bugs-request@PostgreSQL.org with a body of: - subscribe - end - - There is also a developers discussion mailing list available. To - subscribe to this list, send email to - pgsql-hackers-request@PostgreSQL.org with a body of: - subscribe - end - - Additional mailing lists and information about PostgreSQL can be found - via the PostgreSQL WWW home page at: - - http://www.PostgreSQL.org - - There is also an IRC channel on EFNet, channel #PostgreSQL. I use the - Unix command irc -c '#PostgreSQL' "$USER" irc.phoenix.net. - - A list of commercial support companies is available at - http://www.postgresql.org/users-lounge/commercial-support.html. - - 1.7) What is the latest release? - - The latest release of PostgreSQL is version 7.2.1. - - We plan to have major releases every four months. - - 1.8) What documentation is available? - - Several manuals, manual pages, and some small test examples are - included in the distribution. See the /doc directory. You can also - browse the manual online at - http://www.PostgreSQL.org/users-lounge/docs/. - - There are two PostgreSQL books available online at - http://www.PostgreSQL.org/docs/awbook.html and - http://www.commandprompt.com/ppbook/. There is a list of PostgreSQL - books available for purchase at http://www.postgresql.org/books/. - There is also a collection of PostgreSQL technical articles at - http://techdocs.postgresql.org/. - - psql has some nice \d commands to show information about types, - operators, functions, aggregates, etc. - - Our web site contains even more documentation. - - 1.9) How do I find out about known bugs or missing features? - - PostgreSQL supports an extended subset of SQL-92. See our TODO list - for known bugs, missing features, and future plans. - - 1.10) How can I learn SQL? - - The PostgreSQL book at http://www.PostgreSQL.org/docs/awbook.html - teaches SQL. There is another PostgreSQL book at - http://www.commandprompt.com/ppbook. There is a nice tutorial at - http://www.intermedia.net/support/sql/sqltut.shtm, at - http://ourworld.compuserve.com/homepages/graeme_birchall/HTM_COOK.HTM, - and at http://sqlcourse.com. - - Another one is "Teach Yourself SQL in 21 Days, Second Edition" at - http://members.tripod.com/er4ebus/sql/index.htm - - Many of our users like The Practical SQL Handbook, Bowman, Judith S., - et al., Addison-Wesley. Others like The Complete Reference SQL, Groff - et al., McGraw-Hill. - - 1.11) Is PostgreSQL Y2K compliant? - - Yes, we easily handle dates past the year 2000 AD, and before 2000 BC. - - 1.12) How do I join the development team? - - First, download the latest source and read the PostgreSQL Developers - documentation on our web site, or in the distribution. Second, - subscribe to the pgsql-hackers and pgsql-patches mailing lists. Third, - submit high quality patches to pgsql-patches. - - There are about a dozen people who have commit privileges to the - PostgreSQL CVS archive. They each have submitted so many high-quality - patches that it was impossible for the existing committers to keep up, - and we had confidence that patches they committed were of high - quality. - - 1.13) How do I submit a bug report? - - Please visit the PostgreSQL BugTool page, which gives guidelines and - directions on how to submit a bug. - - Also check out our ftp site ftp://ftp.PostgreSQL.org/pub to see if - there is a more recent PostgreSQL version or patches. - - 1.14) How does PostgreSQL compare to other DBMSs? - - There are several ways of measuring software: features, performance, - reliability, support, and price. - - Features - PostgreSQL has most features present in large commercial DBMSs, - like transactions, subselects, triggers, views, foreign key - referential integrity, and sophisticated locking. We have some - features they do not have, like user-defined types, - inheritance, rules, and multi-version concurrency control to - reduce lock contention. - - Performance - PostgreSQL has performance similar to other commercial and open - source databases. it is faster for some things, slower for - others. In comparison to MySQL or leaner database systems, we - are slower on inserts/updates because of transaction overhead. - Of course, MySQL does not have any of the features mentioned in - the Features section above. We are built for reliability and - features, though we continue to improve performance in every - release. There is an interesting Web page comparing PostgreSQL - to MySQL at http://openacs.org/why-not-mysql.html - - Reliability - We realize that a DBMS must be reliable, or it is worthless. We - strive to release well-tested, stable code that has a minimum - of bugs. Each release has at least one month of beta testing, - and our release history shows that we can provide stable, solid - releases that are ready for production use. We believe we - compare favorably to other database software in this area. - - Support - Our mailing list provides a large group of developers and users - to help resolve any problems encountered. While we can not - guarantee a fix, commercial DBMSs do not always supply a fix - either. Direct access to developers, the user community, - manuals, and the source code often make PostgreSQL support - superior to other DBMSs. There is commercial per-incident - support available for those who need it. (See support FAQ - item.) - - Price - We are free for all use, both commercial and non-commercial. - You can add our code to your product with no limitations, - except those outlined in our BSD-style license stated above. - - 1.15) How can I financially assist PostgreSQL? - - PostgreSQL has had a first-class infrastructure since we started six - years ago. This is all thanks to Marc Fournier, who has created and - managed this infrastructure over the years. - - Quality infrastructure is very important to an open-source project. It - prevents disruptions that can greatly delay forward movement of the - project. - - Of course, this infrastructure is not cheap. There are a variety of - monthly and one-time expenses that are required to keep it going. If - you or your company has money it can donate to help fund this effort, - please go to http://www.pgsql.com/pg_goodies and make a donation. - - Although the web page mentions PostgreSQL, Inc, the "contributions" - item is solely to support the PostgreSQL project and does not fund any - specific company. If you prefer, you can also send a check to the - contact address. - _________________________________________________________________ - - User Client Questions - - 2.1) Are there ODBC drivers for PostgreSQL? - - There are two ODBC drivers available, PsqlODBC and OpenLink ODBC. - - PsqlODBC is included in the distribution. More information about it - can be gotten from ftp://ftp.PostgreSQL.org/pub/odbc/. - - OpenLink ODBC can be gotten from http://www.openlinksw.com. It works - with their standard ODBC client software so you'll have PostgreSQL - ODBC available on every client platform they support (Win, Mac, Unix, - VMS). - - They will probably be selling this product to people who need - commercial-quality support, but a freeware version will always be - available. Please send questions to postgres95@openlink.co.uk. - - See also the ODBC chapter of the Programmer's Guide. - - 2.2) What tools are available for using PostgreSQL with Web pages? - - A nice introduction to Database-backed Web pages can be seen at: - http://www.webreview.com - - There is also one at http://www.phone.net/home/mwm/hotlist/. - - For Web integration, PHP is an excellent interface. It is at - http://www.php.net. - - For complex cases, many use the Perl interface and CGI.pm. - - 2.3) Does PostgreSQL have a graphical user interface? A report generator? - An embedded query language interface? - - We have a nice graphical user interface called pgaccess, which is - shipped as part of the distribution. pgaccess also has a report - generator. The Web page is http://www.flex.ro/pgaccess - - We also include ecpg, which is an embedded SQL query language - interface for C. - - 2.4) What languages are available to communicate with PostgreSQL? - - We have: - * C (libpq) - * C++ (libpq++) - * Embedded C (ecpg) - * Java (jdbc) - * Perl (perl5) - * ODBC (odbc) - * Python (PyGreSQL) - * TCL (libpgtcl) - * C Easy API (libpgeasy) - * Embedded HTML (PHP from http://www.php.net) - _________________________________________________________________ - - Administrative Questions - - 3.1) How do I install PostgreSQL somewhere other than /usr/local/pgsql? - - Specify the --prefix option when running configure. - - 3.2) When I start postmaster, I get a Bad System Call or core dumped - message. Why? - - It could be a variety of problems, but first check to see that you - have System V extensions installed in your kernel. PostgreSQL requires - kernel support for shared memory and semaphores. - - 3.3) When I try to start postmaster, I get IpcMemoryCreate errors. Why? - - You either do not have shared memory configured properly in your - kernel or you need to enlarge the shared memory available in the - kernel. The exact amount you need depends on your architecture and how - many buffers and backend processes you configure for postmaster. For - most systems, with default numbers of buffers and processes, you need - a minimum of ~1 MB. See the PostgreSQL Administrator's Guide for more - detailed information about shared memory and semaphores. - - 3.4) When I try to start postmaster, I get IpcSemaphoreCreate errors. Why? - - If the error message is IpcSemaphoreCreate: semget failed (No space - left on device) then your kernel is not configured with enough - semaphores. Postgres needs one semaphore per potential backend - process. A temporary solution is to start postmaster with a smaller - limit on the number of backend processes. Use -N with a parameter less - than the default of 32. A more permanent solution is to increase your - kernel's SEMMNS and SEMMNI parameters. - - Inoperative semaphores can also cause crashes during heavy database - access. - - If the error message is something else, you might not have semaphore - support configured in your kernel at all. See the PostgreSQL - Administrator's Guide for more detailed information about shared - memory and semaphores. - - 3.5) How do I control connections from other hosts? - - By default, PostgreSQL only allows connections from the local machine - using Unix domain sockets. Other machines will not be able to connect - unless you add the -i flag to postmaster, and enable host-based - authentication by modifying the file $PGDATA/pg_hba.conf accordingly. - This will allow TCP/IP connections. - - 3.6) How do I tune the database engine for better performance? - - Certainly, indexes can speed up queries. The EXPLAIN command allows - you to see how PostgreSQL is interpreting your query, and which - indexes are being used. - - If you are doing many INSERTs, consider doing them in a large batch - using the COPY command. This is much faster than individual INSERTS. - Second, statements not in a BEGIN WORK/COMMIT transaction block are - considered to be in their own transaction. Consider performing several - statements in a single transaction block. This reduces the transaction - overhead. Also, consider dropping and recreating indexes when making - large data changes. - - There are several tuning options. You can disable fsync() by starting - postmaster with a -o -F option. This will prevent fsync()s from - flushing to disk after every transaction. - - You can also use the postmaster -B option to increase the number of - shared memory buffers used by the backend processes. If you make this - parameter too high, the postmaster may not start because you have - exceeded your kernel's limit on shared memory space. Each buffer is 8K - and the default is 64 buffers. - - You can also use the backend -S option to increase the maximum amount - of memory used by the backend process for temporary sorts. The -S - value is measured in kilobytes, and the default is 512 (i.e. 512K). - - You can also use the CLUSTER command to group data in tables to match - an index. See the CLUSTER manual page for more details. - - 3.7) What debugging features are available? - - PostgreSQL has several features that report status information that - can be valuable for debugging purposes. - - First, by running configure with the --enable-cassert option, many - assert()s monitor the progress of the backend and halt the program - when something unexpected occurs. - - Both postmaster and postgres have several debug options available. - First, whenever you start postmaster, make sure you send the standard - output and error to a log file, like: - cd /usr/local/pgsql - ./bin/postmaster >server.log 2>&1 & - - This will put a server.log file in the top-level PostgreSQL directory. - This file contains useful information about problems or errors - encountered by the server. Postmaster has a -d option that allows even - more detailed information to be reported. The -d option takes a number - that specifies the debug level. Be warned that high debug level values - generate large log files. - - If postmaster is not running, you can actually run the postgres - backend from the command line, and type your SQL statement directly. - This is recommended only for debugging purposes. Note that a newline - terminates the query, not a semicolon. If you have compiled with - debugging symbols, you can use a debugger to see what is happening. - Because the backend was not started from postmaster, it is not running - in an identical environment and locking/backend interaction problems - may not be duplicated. - - If postmaster is running, start psql in one window, then find the PID - of the postgres process used by psql. Use a debugger to attach to the - postgres PID. You can set breakpoints in the debugger and issue - queries from psql. If you are debugging postgres startup, you can set - PGOPTIONS="-W n", then start psql. This will cause startup to delay - for n seconds so you can attach to the process with the debugger, set - any breakpoints, and continue through the startup sequence. - - The postgres program has -s, -A, and -t options that can be very - useful for debugging and performance measurements. - - You can also compile with profiling to see what functions are taking - execution time. The backend profile files will be deposited in the - pgsql/data/base/dbname directory. The client profile file will be put - in the client's current directory. Linux requires a compile with - -DLINUX_PROFILE for proper profiling. - - 3.8) Why do I get "Sorry, too many clients" when trying to connect? - - You need to increase postmaster's limit on how many concurrent backend - processes it can start. - - The default limit is 32 processes. You can increase it by restarting - postmaster with a suitable -N value or modifying postgresql.conf. - - Note that if you make -N larger than 32, you must also increase -B - beyond its default of 64; -B must be at least twice -N, and probably - should be more than that for best performance. For large numbers of - backend processes, you are also likely to find that you need to - increase various Unix kernel configuration parameters. Things to check - include the maximum size of shared memory blocks, SHMMAX; the maximum - number of semaphores, SEMMNS and SEMMNI; the maximum number of - processes, NPROC; the maximum number of processes per user, MAXUPRC; - and the maximum number of open files, NFILE and NINODE. The reason - that PostgreSQL has a limit on the number of allowed backend processes - is so your system won't run out of resources. - - In PostgreSQL versions prior to 6.5, the maximum number of backends - was 64, and changing it required a rebuild after altering the - MaxBackendId constant in include/storage/sinvaladt.h. - - 3.9) What are the pg_sorttempNNN.NN files in my database directory? - - They are temporary files generated by the query executor. For example, - if a sort needs to be done to satisfy an ORDER BY, and the sort - requires more space than the backend's -S parameter allows, then - temporary files are created to hold the extra data. - - The temporary files should be deleted automatically, but might not if - a backend crashes during a sort. If you have no backends running at - the time, it is safe to delete the pg_tempNNN.NN files. - _________________________________________________________________ - - Operational Questions - - 4.1) What is the difference between binary cursors and normal cursors? - - See the DECLARE manual page for a description. - - 4.2) How do I SELECT only the first few rows of a query? - - See the FETCH manual page, or use SELECT ... LIMIT.... - - The entire query may have to be evaluated, even if you only want the - first few rows. Consider a query that has an ORDER BY. If there is an - index that matches the ORDER BY, PostgreSQL may be able to evaluate - only the first few records requested, or the entire query may have to - be evaluated until the desired rows have been generated. - - 4.3) How do I get a list of tables or other things I can see in psql? - - You can read the source code for psql in file - pgsql/src/bin/psql/describe.c. It contains SQL commands that generate - the output for psql's backslash commands. You can also start psql with - the -E option so it will print out the queries it uses to execute the - commands you give. - - 4.4) How do you remove a column from a table? - - We do not support ALTER TABLE DROP COLUMN, but do this: - BEGIN; - LOCK TABLE old_table; - SELECT ... -- select all columns but the one you want to remove - INTO TABLE new_table - FROM old_table; - DROP TABLE old_table; - ALTER TABLE new_table RENAME TO old_table; - COMMIT; - - 4.5) What is the maximum size for a row, a table, and a database? - - These are the limits: - Maximum size for a database? unlimited (500 GB databases exist) - Maximum size for a table? 16 TB - Maximum size for a row? unlimited in 7.1 and later - Maximum size for a field? 1 GB in 7.1 and later - Maximum number of rows in a table? unlimited - Maximum number of columns in a table? 250-1600 depending on column types - Maximum number of indexes on a table? unlimited - - Of course, these are not actually unlimited, but limited to available - disk space and memory/swap space. Performance may suffer when these - values get unusually large. - - The maximum table size of 16 TB does not require large file support - from the operating system. Large tables are stored as multiple 1 GB - files so file system size limits are not important. - - The maximum table size and maximum number of columns can be increased - if the default block size is increased to 32k. - - 4.6) How much database disk space is required to store data from a typical - text file? - - A PostgreSQL database may require up to five times the disk space to - store data from a text file. - - As an example, consider a file of 100,000 lines with an integer and - text description on each line. Suppose the text string avergages - twenty bytes in length. The flat file would be 2.8 MB. The size of the - PostgreSQL database file containing this data can be estimated as 6.4 - MB: - 36 bytes: each row header (approximate) - 24 bytes: one int field and one text filed - + 4 bytes: pointer on page to tuple - ---------------------------------------- - 64 bytes per row - - The data page size in PostgreSQL is 8192 bytes (8 KB), so: - - 8192 bytes per page - ------------------- = 128 rows per database page (rounded down) - 64 bytes per row - - 100000 data rows - -------------------- = 782 database pages (rounded up) - 128 rows per page - -782 database pages * 8192 bytes per page = 6,406,144 bytes (6.4 MB) - - Indexes do not require as much overhead, but do contain the data that - is being indexed, so they can be large also. - - 4.7) How do I find out what tables, indexes, databases, and users are - defined? - - psql has a variety of backslash commands to show such information. Use - \? to see them. There are also system tables beginning with pg_ that - describe these too. Also, psql -l will list all databases. - - Also try the file pgsql/src/tutorial/syscat.source. It illustrates - many of the SELECTs needed to get information from the database system - tables. - - 4.8) My queries are slow or don't make use of the indexes. Why? - - Indexes are not automatically used by every query. Indexes are only - used if the table is larger than a minimum size, and the query selects - only a small percentage of the rows in the table. This is because the - random disk access caused by an index scan is sometimes slower than a - straight read through the table, or sequential scan. - - To determine if an index should be used, PostgreSQL must have - statistics about the table. These statistics are collected using - VACUUM ANALYZE, or simply ANALYZE. Using statistics, the optimizer - knows how many rows are in the table, and can better determine if - indexes should be used. Statistics are also valuable in determining - optimal join order and join methods. Statistics collection should be - performed periodically as the contents of the table change. - - Indexes are normally not used for ORDER BY or to perform joins. A - sequential scan followed by an explicit sort is usually faster than an - index scan of a large table. - However, LIMIT combined with ORDER BY often will use an index because - only a small portion of the table is returned. - - When using wild-card operators such as LIKE or ~, indexes can only be - used if the beginning of the search is anchored to the start of the - string. Therefore, to use indexes, LIKE patterns must not start with - %, and ~(regular expression) patterns must start with ^. - - 4.9) How do I see how the query optimizer is evaluating my query? - - See the EXPLAIN manual page. - - 4.10) What is an R-tree index? - - An R-tree index is used for indexing spatial data. A hash index can't - handle range searches. A B-tree index only handles range searches in a - single dimension. R-trees can handle multi-dimensional data. For - example, if an R-tree index can be built on an attribute of type - point, the system can more efficiently answer queries such as "select - all points within a bounding rectangle." - - The canonical paper that describes the original R-tree design is: - - Guttman, A. "R-trees: A Dynamic Index Structure for Spatial - Searching." Proceedings of the 1984 ACM SIGMOD Int'l Conf on Mgmt of - Data, 45-57. - - You can also find this paper in Stonebraker's "Readings in Database - Systems". - - Built-in R-trees can handle polygons and boxes. In theory, R-trees can - be extended to handle higher number of dimensions. In practice, - extending R-trees requires a bit of work and we don't currently have - any documentation on how to do it. - - 4.11) What is the Genetic Query Optimizer? - - The GEQO module speeds query optimization when joining many tables by - means of a Genetic Algorithm (GA). It allows the handling of large - join queries through nonexhaustive search. - - 4.12) How do I perform regular expression searches and case-insensitive - regular expression searches? How do I use an index for case-insensitive - searches? - - The ~ operator does regular expression matching, and ~* does - case-insensitive regular expression matching. The case-insensitive - variant of LIKE is called ILIKE in PostgreSQL 7.1 and later. - - Case-insensitive equality comparisons are normally expressed as: - SELECT * - FROM tab - WHERE lower(col) = 'abc' - - This will not use an standard index. However, if you create a - functional index, it will be used: - CREATE INDEX tabindex on tab (lower(col)); - - 4.13) In a query, how do I detect if a field is NULL? - - You test the column with IS NULL and IS NOT NULL. - - 4.14) What is the difference between the various character types? - -Type Internal Name Notes --------------------------------------------------- -"char" char 1 character -CHAR(#) bpchar blank padded to the specified fixed length -VARCHAR(#) varchar size specifies maximum length, no padding -TEXT text no specific upper limit on length -BYTEA bytea variable-length byte array (null-byte safe) - - You will see the internal name when examining system catalogs and in - some error messages. - - The last four types above are "varlena" types (i.e., the first four - bytes on disk are the length, followed by the data). Thus the actual - space used is slightly greater than the declared size. However, these - data types are also subject to compression or being stored out-of-line - by TOAST, so the space on disk might also be less than expected. - - CHAR() is best when storing strings that are usually the same length. - VARCHAR() is best when storing variable-length strings but it limits - how long a string can be. TEXT is for strings of unlimited length, - maximum 1 gigabyte. BYTEA is for storing binary data, particularly - values that include NULL bytes. - - 4.15.1) How do I create a serial/auto-incrementing field? - - PostgreSQL supports a SERIAL data type. It auto-creates a sequence and - index on the column. For example, this: - CREATE TABLE person ( - id SERIAL, - name TEXT - ); - - is automatically translated into this: - CREATE SEQUENCE person_id_seq; - CREATE TABLE person ( - id INT4 NOT NULL DEFAULT nextval('person_id_seq'), - name TEXT - ); - CREATE UNIQUE INDEX person_id_key ON person ( id ); - - See the create_sequence manual page for more information about - sequences. You can also use each row's OID field as a unique value. - However, if you need to dump and reload the database, you need to use - pg_dump's -o option or COPY WITH OIDS option to preserve the OIDs. - - 4.15.2) How do I get the value of a SERIAL insert? - - One approach is to retrieve the next SERIAL value from the sequence - object with the nextval() function before inserting and then insert it - explicitly. Using the example table in 4.15.1, that might look like - this in Perl: - new_id = output of "SELECT nextval('person_id_seq')" - INSERT INTO person (id, name) VALUES (new_id, 'Blaise Pascal'); - - You would then also have the new value stored in new_id for use in - other queries (e.g., as a foreign key to the person table). Note that - the name of the automatically created SEQUENCE object will be named - <table>_<serialcolumn>_seq, where table and serialcolumn are the names - of your table and your SERIAL column, respectively. - - Alternatively, you could retrieve the assigned SERIAL value with the - currval() function after it was inserted by default, e.g., - INSERT INTO person (name) VALUES ('Blaise Pascal'); - new_id = output of "SELECT currval('person_id_seq')"; - - Finally, you could use the OID returned from the INSERT statement to - look up the default value, though this is probably the least portable - approach. In Perl, using DBI with Edmund Mergl's DBD::Pg module, the - oid value is made available via $sth->{pg_oid_status} after - $sth->execute(). - - 4.15.3) Don't currval() and nextval() lead to a race condition with other - users? - - No. Currval() returns the current value assigned by your backend, not - by all users. - - 4.15.4) Why aren't my sequence numbers reused on transaction abort? Why are - there gaps in the numbering of my sequence/SERIAL column? - - To improve concurrency, sequence values are given out to running - transactions as needed and are not locked until the transaction - completes. This causes gaps in numbering from aborted transactions. - - 4.16) What is an OID? What is a TID? - - OIDs are PostgreSQL's answer to unique row ids. Every row that is - created in PostgreSQL gets a unique OID. All OIDs generated during - initdb are less than 16384 (from backend/access/transam.h). All - user-created OIDs are equal to or greater than this. By default, all - these OIDs are unique not only within a table or database, but unique - within the entire PostgreSQL installation. - - PostgreSQL uses OIDs in its internal system tables to link rows - between tables. These OIDs can be used to identify specific user rows - and used in joins. It is recommended you use column type OID to store - OID values. You can create an index on the OID field for faster - access. - - OIDs are assigned to all new rows from a central area that is used by - all databases. If you want to change the OID to something else, or if - you want to make a copy of the table, with the original OIDs, there is - no reason you can't do it: - CREATE TABLE new_table(old_oid oid, mycol int); - SELECT old_oid, mycol INTO new FROM old; - COPY new TO '/tmp/pgtable'; - DELETE FROM new; - COPY new WITH OIDS FROM '/tmp/pgtable'; - - OIDs are stored as 4-byte integers, and will overflow at 4 billion. No - one has reported this ever happening, and we plan to have the limit - removed before anyone does. - - TIDs are used to identify specific physical rows with block and offset - values. TIDs change after rows are modified or reloaded. They are used - by index entries to point to physical rows. - - 4.17) What is the meaning of some of the terms used in PostgreSQL? - - Some of the source code and older documentation use terms that have - more common usage. Here are some: - * table, relation, class - * row, record, tuple - * column, field, attribute - * retrieve, select - * replace, update - * append, insert - * OID, serial value - * portal, cursor - * range variable, table name, table alias - - A list of general database terms can be found at: - http://www.comptechnews.com/~reaster/dbdesign.html - - 4.18) Why do I get the error "ERROR: Memory exhausted in AllocSetAlloc()"? - - If you are running a version older than 7.1, an upgrade may fix the - problem. Also it is possible you have run out of virtual memory on - your system, or your kernel has a low limit for certain resources. Try - this before starting postmaster: - ulimit -d 262144 - limit datasize 256m - - Depending on your shell, only one of these may succeed, but it will - set your process data segment limit much higher and perhaps allow the - query to complete. This command applies to the current process, and - all subprocesses created after the command is run. If you are having a - problem with the SQL client because the backend is returning too much - data, try it before starting the client. - - 4.19) How do I tell what PostgreSQL version I am running? - - From psql, type select version(); - - 4.20) Why does my large-object operations get "invalid large obj - descriptor"? - - You need to put BEGIN WORK and COMMIT around any use of a large object - handle, that is, surrounding lo_open ... lo_close. - - Currently PostgreSQL enforces the rule by closing large object handles - at transaction commit. So the first attempt to do anything with the - handle will draw invalid large obj descriptor. So code that used to - work (at least most of the time) will now generate that error message - if you fail to use a transaction. - - If you are using a client interface like ODBC you may need to set - auto-commit off. - - 4.21) How do I create a column that will default to the current time? - - Use CURRENT_TIMESTAMP: -CREATE TABLE test (x int, modtime timestamp DEFAULT CURRENT_TIMESTAMP ); - - 4.22) Why are my subqueries using IN so slow? - - Currently, we join subqueries to outer queries by sequentially - scanning the result of the subquery for each row of the outer query. A - workaround is to replace IN with EXISTS: -SELECT * - FROM tab - WHERE col1 IN (SELECT col2 FROM TAB2) - - to: -SELECT * - FROM tab - WHERE EXISTS (SELECT col2 FROM TAB2 WHERE col1 = col2) - - We hope to fix this limitation in a future release. - - 4.23) How do I perform an outer join? - - PostgreSQL 7.1 and later supports outer joins using the SQL standard - syntax. Here are two examples: - SELECT * - FROM t1 LEFT OUTER JOIN t2 ON (t1.col = t2.col); - - or - SELECT * - FROM t1 LEFT OUTER JOIN t2 USING (col); - - These identical queries join t1.col to t2.col, and also return any - unjoined rows in t1 (those with no match in t2). A RIGHT join would - add unjoined rows of t2. A FULL join would return the matched rows - plus all unjoined rows from t1 and t2. The word OUTER is optional and - is assumed in LEFT, RIGHT, and FULL joins. Ordinary joins are called - INNER joins. - - In previous releases, outer joins can be simulated using UNION and NOT - IN. For example, when joining tab1 and tab2, the following query does - an outer join of the two tables: - SELECT tab1.col1, tab2.col2 - FROM tab1, tab2 - WHERE tab1.col1 = tab2.col1 - UNION ALL - SELECT tab1.col1, NULL - FROM tab1 - WHERE tab1.col1 NOT IN (SELECT tab2.col1 FROM tab2) - ORDER BY col1 - - 4.24) How do I perform queries using multiple databases? - - There is no way to query any database except the current one. Because - PostgreSQL loads database-specific system catalogs, it is uncertain - how a cross-database query should even behave. - - Of course, a client can make simultaneous connections to different - databases and merge the information that way. - - 4.25) How do I return multiple rows or columns from a function? - - You can return result sets from PL/pgSQL functions using refcursors. - See - http://developer.postgresql.org/docs/postgres/plpgsql-cursors.html, - section 23.7.3.3. - - 4.26) Why can't I reliably create/drop temporary tables in PL/PgSQL - functions? - - PL/PgSQL caches function contents, and an unfortunate side effect is - that if a PL/PgSQL function accesses a temporary table, and that table - is later dropped and recreated, and the function called again, the - function will fail because the cached function contents still point to - the old temporary table. The solution is to use EXECUTE for temporary - table access in PL/PgSQL. This will cause the query to be reparsed - every time. - _________________________________________________________________ - - Extending PostgreSQL - - 5.1) I wrote a user-defined function. When I run it in psql, why does it - dump core? - - The problem could be a number of things. Try testing your user-defined - function in a stand-alone test program first. - - 5.2) How can I contribute some nifty new types and functions to PostgreSQL? - - Send your extensions to the pgsql-hackers mailing list, and they will - eventually end up in the contrib/ subdirectory. - - 5.3) How do I write a C function to return a tuple? - - This requires wizardry so extreme that the authors have never tried - it, though in principle it can be done. - - 5.4) I have changed a source file. Why does the recompile not see the - change? - - The Makefiles do not have the proper dependencies for include files. - You have to do a make clean and then another make. If you are using - GCC you can use the --enable-depend option of configure to have the - compiler compute the dependencies automatically. |