postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
*	Revise TupleTableSlot code to avoid unnecessary construction and disassembly	Tom Lane	2005-03-16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	of tuples when passing data up through multiple plan nodes. A slot can now hold either a normal "physical" HeapTuple, or a "virtual" tuple consisting of Datum/isnull arrays. Upper plan levels can usually just copy the Datum arrays, avoiding heap_formtuple() and possible subsequent nocachegetattr() calls to extract the data again. This work extends Atsushi Ogawa's earlier patch, which provided the key idea of adding Datum arrays to TupleTableSlots. (I believe however that something like this was foreseen way back in Berkeley days --- see the old comment on ExecProject.) A test case involving many levels of join of fairly wide tables (about 80 columns altogether) showed about 3x overall speedup, though simple queries will probably not be helped very much. I have also duplicated some code in heaptuple.c in order to provide versions of heap_formtuple and friends that use "bool" arrays to indicate null attributes, instead of the old convention of "char" arrays containing either 'n' or ' '. This provides a better match to the convention used by ExecEvalExpr. While I have not made a concerted effort to get rid of uses of the old routines, I think they should be deprecated and eventually removed.
*	Avoid O(N^2) overhead in repeated nocachegetattr calls when columns of	Tom Lane	2005-03-14
\| \| \| \| \| \| \| \|	a tuple are being accessed via ExecEvalVar and the attcacheoff shortcut isn't usable (due to nulls and/or varlena columns). To do this, cache Datums extracted from a tuple in the associated TupleTableSlot. Also some code cleanup in and around the TupleTable handling. Atsushi Ogawa with some kibitzing by Tom Lane.
*	Adjust creation/destruction of TupleDesc data structure to reduce the	Tom Lane	2005-03-07
\| \| \| \| \| \|	number of palloc calls. This has a salutory impact on plpgsql operations with record variables (which create and destroy tupdescs constantly) and probably helps a bit in some other cases too.
*	Remove some no-longer-needed kluges for bootstrapping, in particular	Tom Lane	2005-02-20
\| \| \| \| \| \| \| \|	the AMI_OVERRIDE flag. The fact that TransactionLogFetch treats BootstrapTransactionId as always committed is sufficient to make bootstrap work, and getting rid of extra tests in heavily used code paths seems like a win. The files produced by initdb are demonstrably the same after this change.
*	Add code to prevent transaction ID wraparound by enforcing a safe limit	Tom Lane	2005-02-20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	in GetNewTransactionId(). Since the limit value has to be computed before we run any real transactions, this requires adding code to database startup to scan pg_database and determine the oldest datfrozenxid. This can conveniently be combined with the first stage of an attack on the problem that the 'flat file' copies of pg_shadow and pg_group are not properly updated during WAL recovery. The code I've added to startup resides in a new file src/backend/utils/init/flatfiles.c, and it is responsible for rewriting the flat files as well as initializing the XID wraparound limit value. This will eventually allow us to get rid of GetRawDatabaseInfo too, but we'll need an initdb so we can add a trigger to pg_database.
*	Move plpgsql DEBUG from DEBUG2 to DEBUG1 because it is a user-requested	Bruce Momjian	2005-02-12
\| \| \| \| \| \|	DEBUG. Fix a few places where DEBUG1 crept in that should have been DEBUG2.
*	Marginal hack to merge adjacent ReleaseBuffer/ReadBuffer calls into	Tom Lane	2005-02-05
\| \| \| \| \|	ReleaseAndReadBuffer during GIST index searches. We already did this in btree and rtree, might as well do it here too.
*	Change heap_modifytuple() to require a TupleDesc rather than a	Neil Conway	2005-01-27
\| \| \| \| \|	Relation. Patch from Alvaro Herrera, minor editorializing by Neil Conway.
*	Fix memory leak in rtdosplit, per report from Clive Page.	Tom Lane	2005-01-24
\|
*	This patch makes some improvements to the rtree index implementation:	Neil Conway	2005-01-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(1) Keep a pin on the scan's current buffer and mark buffer. This avoids the need to do a ReadBuffer() for each tuple produced by the scan. Since ReadBuffer() is expensive, this is a significant win. (2) Convert a ReleaseBuffer(); ReadBuffer() pair into ReleaseAndReadBuffer(). Surely not a huge win, but it saves a lock acquire/release... (3) Remove a bunch of duplicated code in rtget.c; make rtnext() handle both the "initial result" and "subsequent result" cases. (4) Add support for index tuple killing (5) Remove rtscancache(): it is dead code, for the same reason that gistscancache() is dead code (an index scan ought not be invoked with NoMovementScanDirection). The end result is about a 10% improvement in rtree index scan perf, according to contrib/rtree_gist/bench.
*	Phase 1 of fix for 'SMgrRelation hashtable corrupted' problem. This	Tom Lane	2005-01-10
\| \| \| \| \| \|	is the minimum required fix. I want to look next at taking advantage of it by simplifying the message semantics in the shared inval message queue, but that part can be held over for 8.1 if it turns out too ugly.
*	Update copyrights that were missed.	Bruce Momjian	2005-01-01
\|
*	Tag appropriate files for rc3	PostgreSQL Daemon	2004-12-31
\| \| \| \| \| \| \| \|	Also performed an initial run through of upgrading our Copyright date to extend to 2005 ... first run here was very simple ... change everything where: grep 1996-2004 && the word 'Copyright' ... scanned through the generated list with 'less' first, and after, to make sure that I only picked up the right entries ...
*	Awhile back I added some code to StartupCLOG() to forcibly zero out	Tom Lane	2004-12-22
\| \| \| \| \| \| \| \| \| \|	the remainder of the current clog page during system startup. While this was a good idea, it turns out the code fails if nextXid is exactly at a page boundary, because we won't have created the "current" clog page yet in that case. Since the page will be correctly zeroed when we execute the first transaction on it, the solution is just to do nothing when exactly at a page boundary. Per trouble report from Dave Hartwig.
*	Fix is-it-time-for-a-checkpoint logic so that checkpoint_segments can	Tom Lane	2004-12-17
\| \| \| \|	usefully be larger than 255. Per gripe from Simon Riggs.
*	Calculation of keys_are_unique flag was wrong for cases involving	Tom Lane	2004-12-15
\| \| \| \|	redundant cross-datatype comparisons. Per example from Merlin Moncure.
*	Change planner to use the current true disk file size as its estimate of	Tom Lane	2004-12-01
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	a relation's number of blocks, rather than the possibly-obsolete value in pg_class.relpages. Scale the value in pg_class.reltuples correspondingly to arrive at a hopefully more accurate number of rows. When pg_class contains 0/0, estimate a tuple width from the column datatypes and divide that into current file size to estimate number of rows. This improved methodology allows us to jettison the ancient hacks that put bogus default values into pg_class when a table is first created. Also, per a suggestion from Simon, make VACUUM (but not VACUUM FULL or ANALYZE) adjust the value it puts into pg_class.reltuples to try to represent the mean tuple density instead of the minimal density that actually prevails just after VACUUM. These changes alter the plans selected for certain regression tests, so update the expected files accordingly. (I removed join_1.out because it's not clear if it still applies; we can add back any variant versions as they are shown to be needed.)
*	Minor adjustment of message style.	Tom Lane	2004-11-17
\|
*	Micro-optimization of markpos() and restrpos() in btree and hash indexes.	Neil Conway	2004-11-17
\| \| \| \| \| \|	Rather than using ReadBuffer() to increment the reference count on an already-pinned buffer, we should use IncrBufferRefCount() as it is faster and does not require acquiring the BufMgrLock.
*	Don't allow pg_start_backup() to be invoked if archive_command has not	Neil Conway	2004-11-17
\| \| \| \|	been defined. Patch from Gavin Sherry, editorializing by Neil Conway.
*	There is no need for ReadBuffer() call sites to check that the returned	Neil Conway	2004-11-14
\| \| \| \| \| \|	buffer is valid, as ReadBuffer() will elog on error. Most of the call sites of ReadBuffer() got this right, but this patch fixes those call sites that did not.
*	Remove obsolete comment from btbuild() and hashbuild(): we no longer use	Neil Conway	2004-11-11
\| \| \| \|	a global variable to control building indexes.
*	Small message clarifications	Peter Eisentraut	2004-11-05
\|
*	Change COMMIT back to the old behavior of emitting command tag COMMIT,	Tom Lane	2004-10-30
\| \| \| \| \|	not ROLLBACK, for the case of COMMIT outside a transaction block. Alvaro Herrera
*	Rearrange order of pre-commit operations: must close cursors before doing	Tom Lane	2004-10-29
\| \| \| \|	ON COMMIT actions. Per bug report from Michael Guerin.
*	Add DEBUG1-level logging of checkpoint start and end. Also, reduce the	Tom Lane	2004-10-29
\| \| \| \| \| \|	'recycled log files' and 'removed log files' messages from DEBUG1 to DEBUG2, replacing them with a count of files added/removed/recycled in the checkpoint end message, as per suggestion from Simon Riggs.
*	Make heap_fetch API more consistent by having the buffer remain pinned	Tom Lane	2004-10-26
\| \| \| \| \| \|	in all cases when keep_buf = true. This allows ANALYZE's inner loop to use heap_release_fetch, which saves multiple buffer lookups for the same page and avoids overestimation of cost by the vacuum cost mechanism.
*	Allow functions returning void or cstring to appear in FROM clause,	Tom Lane	2004-10-20
\| \| \| \| \| \|	to make life cushy for the JDBC driver. Centralize the decision-making that affects this by inventing a get_type_func_class() function, rather than adding special cases in half a dozen places.
*	Give the ResourceOwner mechanism full responsibility for releasing buffer	Tom Lane	2004-10-16
\| \| \| \| \| \| \| \|	pins at end of transaction, and reduce AtEOXact_Buffers to an Assert cross-check that this was done correctly. When not USE_ASSERT_CHECKING, AtEOXact_Buffers is a complete no-op. This gets rid of an O(NBuffers) bottleneck during transaction commit/abort, which recent testing has shown becomes significant above a few tens of thousands of shared buffers.
*	Repair possible failure to update hint bits back to disk, per	Tom Lane	2004-10-15
\| \| \| \| \| \| \| \| \| \|	http://archives.postgresql.org/pgsql-hackers/2004-10/msg00464.php. This fix is intended to be permanent: it moves the responsibility for calling SetBufferCommitInfoNeedsSave() into the tqual.c routines, eliminating the requirement for callers to test whether t_infomask changed. Also, tighten validity checking on buffer IDs in bufmgr.c --- several routines were paranoid about out-of-range shared buffer numbers but not about out-of-range local ones, which seems a tad pointless.
*	Add 'int' cast for getpid() because some Solaris releases return long	Bruce Momjian	2004-10-14
\| \| \| \|	for getpid().
*	Message style revisions	Peter Eisentraut	2004-10-12
\|
*	Make getpid() use %d consistently for printing.	Bruce Momjian	2004-10-09
\|
*	Adjust comments previously moved to column 1 by pgident.	Bruce Momjian	2004-10-07
\|
*	PortalRun must guard against the possibility that the portal it's	Tom Lane	2004-10-04
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	running contains VACUUM or a similar command that will internally start and commit transactions. In such a case, the original caller values of CurrentMemoryContext and CurrentResourceOwner will point to objects that will be destroyed by the internal commit. We must restore these pointers to point to the newly-manufactured transaction context and resource owner, rather than possibly pointing to deleted memory. Also tweak xact.c so that AbortTransaction and AbortSubTransaction forcibly restore a sane value for CurrentResourceOwner, much as they have always done for CurrentMemoryContext. I'm not certain this is necessary but I'm feeling paranoid today. Responds to Sean Chittenden's bug report of 4-Oct.
*	Code review for NOWAIT patch: downgrade NOWAIT from fully reserved keyword	Tom Lane	2004-10-01
\| \| \| \| \| \|	to unreserved keyword, use ereport not elog, assign a separate error code for 'could not obtain lock' so that applications will be able to detect that case cleanly.
*	Adjust index locking rules as per my proposal of earlier today. You	Tom Lane	2004-09-30
\| \| \| \| \| \|	now are supposed to take some kind of lock on an index whenever you are going to access the index contents, rather than relying only on a lock on the parent table.
*	Code cleanup: don't bother casting the argument to pfree() to void *	Neil Conway	2004-09-27
\| \| \| \| \|	from another pointer type. Per C89, this is unnecessary, and it is common practice throughout the rest of the tree anyway.
*	Now that xmax and cmin are distinct fields again, we should zero xmax when	Tom Lane	2004-09-17
\| \| \| \| \| \|	creating a new tuple. This is just for debugging sanity, though, since nothing should be paying any attention to xmax when the HEAP_XMAX_INVALID bit is set.
*	Add some marginal tweaks to eliminate memory leakages associated with	Tom Lane	2004-09-16
\| \| \| \| \|	subtransactions. Trivial subxacts (such as a plpgsql exception block containing no database access) now demonstrably leak zero bytes.
*	RecentXmin is too recent to use as the cutoff point for accessing	Tom Lane	2004-09-16
\| \| \| \| \| \| \|	pg_subtrans --- what we need is the oldest xmin of any snapshot in use in the current top transaction. Introduce a new variable TransactionXmin to play this role. Fixes intermittent regression failure reported by Neil Conway.
*	Restructure subtransaction handling to reduce resource consumption,	Tom Lane	2004-09-16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	as per recent discussions. Invent SubTransactionIds that are managed like CommandIds (ie, counter is reset at start of each top transaction), and use these instead of TransactionIds to keep track of subtransaction status in those modules that need it. This means that a subtransaction does not need an XID unless it actually inserts/modifies rows in the database. Accordingly, don't assign it an XID nor take a lock on the XID until it tries to do that. This saves a lot of overhead for subtransactions that are only used for error recovery (eg plpgsql exceptions). Also, arrange to release a subtransaction's XID lock as soon as the subtransaction exits, in both the commit and abort cases. This avoids holding many unique locks after a long series of subtransactions. The price is some additional overhead in XactLockTableWait, but that seems acceptable. Finally, restructure the state machine in xact.c to have a more orthogonal set of states for subtransactions.
*	Redesign query-snapshot timing so that volatile functions in READ COMMITTED	Tom Lane	2004-09-13
\| \| \| \| \| \| \| \| \| \| \| \| \|	mode see a fresh snapshot for each command in the function, rather than using the latest interactive command's snapshot. Also, suppress fresh snapshots as well as CommandCounterIncrement inside STABLE and IMMUTABLE functions, instead using the snapshot taken for the most closely nested regular query. (This behavior is only sane for read-only functions, so the patch also enforces that such functions contain only SELECT commands.) As per my proposal of 6-Sep-2004; I note that I floated essentially the same proposal on 19-Jun-2002, but that discussion tailed off without any action. Since 8.0 seems like the right place to be taking possibly nontrivial backwards compatibility hits, let's get it done now.
*	Renumber SnapshotNow and the other special snapshot codes so that	Tom Lane	2004-09-11
\| \| \| \| \| \| \| \|	((Snapshot) NULL) can no longer be confused with a valid snapshot, as per my recent suggestion. Define a macro InvalidSnapshot for 0. Use InvalidSnapshot instead of SnapshotAny as the do-nothing special case for heap_update and heap_delete crosschecks; this seems a little cleaner even though the behavior is really the same.
*	Fire non-deferred AFTER triggers immediately upon query completion,	Tom Lane	2004-09-10
\| \| \| \| \| \| \| \| \| \| \| \| \|	rather than when returning to the idle loop. This makes no particular difference for interactively-issued queries, but it makes a big difference for queries issued within functions: trigger execution now occurs before the calling function is allowed to proceed. This responds to numerous complaints about nonintuitive behavior of foreign key checking, such as http://archives.postgresql.org/pgsql-bugs/2004-09/msg00020.php, and appears to be required by the SQL99 spec. Also take the opportunity to simplify the data structures used for the pending-trigger list, rename them for more clarity, and squeeze out a bit of space.
*	Fix incorrect ordering of smgr cleanup relative to buffer pin cleanup	Tom Lane	2004-09-06
\| \| \| \| \|	during transaction abort. Add a regression test case to catch related mistakes in future. Alvaro Herrera and Tom Lane.
*	Downgrade LOG messages to DEBUG1 for normal recycling of xlog, clog,	Tom Lane	2004-09-06
\| \| \| \|	subtrans segments. Per Greg Mullane and Chris K-L.
*	Ensure that the remainder of the current pg_clog page is zeroed during	Tom Lane	2004-08-30
\| \| \| \|	startup, just to be sure that there's no leftover junk there.
*	Fix failure to advance nextXID beyond subtransactions whose XIDs appear	Tom Lane	2004-08-30
\| \| \| \|	only within COMMIT or ABORT records.
*	Another pgindent run with lib typedefs added.	Bruce Momjian	2004-08-30
\|