postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
*	pgindent run for 9.0	Bruce Momjian	2010-02-26
\|
*	Wrap calls to SearchSysCache and related functions using macros.	Robert Haas	2010-02-14
\| \| \| \| \| \| \| \| \| \| \| \|	The purpose of this change is to eliminate the need for every caller of SearchSysCache, SearchSysCacheCopy, SearchSysCacheExists, GetSysCacheOid, and SearchSysCacheList to know the maximum number of allowable keys for a syscache entry (currently 4). This will make it far easier to increase the maximum number of keys in a future release should we choose to do so, and it makes the code shorter, too. Design and review by Tom Lane.
*	Remove old-style VACUUM FULL (which was known for a little while as	Tom Lane	2010-02-08
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	VACUUM FULL INPLACE), along with a boatload of subsidiary code and complexity. Per discussion, the use case for this method of vacuuming is no longer large enough to justify maintaining it; not to mention that we don't wish to invest the work that would be needed to make it play nicely with Hot Standby. Aside from the code directly related to old-style VACUUM FULL, this commit removes support for certain WAL record types that could only be generated within VACUUM FULL, redirect-pointer removal in heap_page_prune, and nontransactional generation of cache invalidation sinval messages (the last being the sticking point for Hot Standby). We still have to retain all code that copes with finding HEAP_MOVED_OFF and HEAP_MOVED_IN flag bits on existing tuples. This can't be removed as long as we want to support in-place update from pre-9.0 databases.
*	Tighten integrity checks on ALTER TABLE ... ALTER COLUMN ... RENAME.	Robert Haas	2010-02-01
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a column is renamed, we recursively rename the same column in all descendent tables. But if one of those tables also inherits that column from a table outside the inheritance hierarchy rooted at the named table, we must throw an error. The previous coding correctly prohibited the rename when the parent had inherited the column from elsewhere, but overlooked the case where the parent was OK but a child table also inherited the same column from a second, unrelated parent. For now, not backpatched due to lack of complaints from the field. KaiGai Kohei, with further changes by me. Reviewed by Bernd Helme and Tom Lane.
*	Replace ALTER TABLE ... SET STATISTICS DISTINCT with a more general mechanism.	Robert Haas	2010-01-22
\| \| \| \| \| \| \| \| \|	Attributes can now have options, just as relations and tablespaces do, and the reloptions code is used to parse, validate, and store them. For simplicity and because these options are not performance critical, we store them in a separate cache rather than the main relcache. Thanks to Alex Hunsaker for the review.
*	Update copyright for the year 2010.	Bruce Momjian	2010-01-02
\|
*	Dept of second thoughts: recursive case in ANALYZE shouldn't emit a	Tom Lane	2009-12-30
\| \| \| \| \|	pgstats message. This might need to be done differently later, but with the current logic that's what should happen.
*	Revise pgstat's tracking of tuple changes to improve the reliability of	Tom Lane	2009-12-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	decisions about when to auto-analyze. The previous code depended on n_live_tuples + n_dead_tuples - last_anl_tuples, where all three of these numbers could be bad estimates from ANALYZE itself. Even worse, in the presence of a steady flow of HOT updates and matching HOT-tuple reclamations, auto-analyze might never trigger at all, even if all three numbers are exactly right, because n_dead_tuples could hold steady. To fix, replace last_anl_tuples with an accurately tracked count of the total number of committed tuple inserts + updates + deletes since the last ANALYZE on the table. This can still be compared to the same threshold as before, but it's much more trustworthy than the old computation. Tracking this requires one more intra-transaction counter per modified table within backends, but no additional memory space in the stats collector. There probably isn't any measurable speed difference; if anything it might be a bit faster than before, since I was able to eliminate some per-tuple arithmetic operations in favor of adding sums once per (sub)transaction. Also, simplify the logic around pgstat vacuum and analyze reporting messages by not trying to fold VACUUM ANALYZE into a single pgstat message. The original thought behind this patch was to allow scheduling of analyzes on parent tables by artificially inflating their changes_since_analyze count. I've left that for a separate patch since this change seems to stand on its own merit.
*	Add the ability to store inheritance-tree statistics in pg_statistic,	Tom Lane	2009-12-29
\| \| \| \| \| \| \| \|	and teach ANALYZE to compute such stats for tables that have subclasses. Per my proposal of yesterday. autovacuum still needs to be taught about running ANALYZE on parent tables when their subclasses change, but the feature is useful even without that.
*	Prevent indirect security attacks via changing session-local state within	Tom Lane	2009-12-09
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	an allegedly immutable index function. It was previously recognized that we had to prevent such a function from executing SET/RESET ROLE/SESSION AUTHORIZATION, or it could trivially obtain the privileges of the session user. However, since there is in general no privilege checking for changes of session-local state, it is also possible for such a function to change settings in a way that might subvert later operations in the same session. Examples include changing search_path to cause an unexpected function to be called, or replacing an existing prepared statement with another one that will execute a function of the attacker's choosing. The present patch secures VACUUM, ANALYZE, and CREATE INDEX/REINDEX against these threats, which are the same places previously deemed to need protection against the SET ROLE issue. GUC changes are still allowed, since there are many useful cases for that, but we prevent security problems by forcing a rollback of any GUC change after completing the operation. Other cases are handled by throwing an error if any change is attempted; these include temp table creation, closing a cursor, and creating or deleting a prepared statement. (In 7.4, the infrastructure to roll back GUC changes doesn't exist, so we settle for rejecting changes of "search_path" in these contexts.) Original report and patch by Gurjeet Singh, additional analysis by Tom Lane. Security: CVE-2009-4136
*	Provide a parenthesized-options syntax for VACUUM, analogous to that recently	Tom Lane	2009-11-16
\| \| \| \| \| \| \| \|	adopted for EXPLAIN. This will allow additional options to be implemented in future without having to make them fully-reserved keywords. The old syntax remains available for existing options, however. Itagaki Takahiro
*	Fix old bug in log_autovacuum_min_duration code: it was relying on being able	Tom Lane	2009-08-12
\| \| \| \| \| \| \| \|	to access a Relation entry it had just closed. I happened to be testing with CLOBBER_CACHE_ALWAYS, which made this a guaranteed core dump (at least on machines where sprintf %s isn't forgiving of a NULL pointer). It's probably quite unlikely that it would fail in the field, but a bug is a bug. Fix by moving the relation_close call down past the logging action.
*	Add ALTER TABLE ... ALTER COLUMN ... SET STATISTICS DISTINCT	Tom Lane	2009-08-02
\| \| \| \|	Robert Haas
*	8.4 pgindent run, with new combined Linux/FreeBSD/MinGW typedef list	Bruce Momjian	2009-06-11
\| \| \| \|	provided by Andrew.
*	Improve the IndexVacuumInfo/IndexBulkDeleteResult API to allow somewhat sane	Tom Lane	2009-06-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	behavior in cases where we don't know the heap tuple count accurately; in particular partial vacuum, but this also makes the API a bit more useful for ANALYZE. This patch adds "estimated_count" flags to both structs so that an approximate count can be flagged as such, and adjusts the logic so that approximate counts are not used for updating pg_class.reltuples. This fixes my previous complaint that VACUUM was putting ridiculous values into pg_class.reltuples for indexes. The actual impact of that bug is limited, because the planner only pays attention to reltuples for an index if the index is partial; which probably explains why beta testers hadn't noticed a degradation in plan quality from it. But it needs to be fixed. The whole thing is a bit messy and should be redesigned in future, because reltuples now has the potential to drift quite far away from reality when a long period elapses with no non-partial vacuums. But this is as good as it's going to get for 8.4.
*	Update relpages and reltuples estimates in stand-alone ANALYZE, even if	Heikki Linnakangas	2009-05-19
\| \| \| \| \| \| \| \|	there's no analyzable attributes or indexes. We also used to report 0 live and dead tuples for such tables, which messed with autovacuum threshold calculations. This fixes bug #4812 reported by George Su. Backpatch back to 8.1.
*	Avoid integer overflow in the loop that extracts histogram entries from	Tom Lane	2009-05-05
\| \| \| \| \| \|	ANALYZE's total sample. The original coding is at risk of overflow for statistics targets exceeding about 2675; this was not a problem before 8.4 but it is now. Per bug #4793 from Dennis Noordsij.
*	Modify the relcache to record the temp status of both local and nonlocal	Tom Lane	2009-03-31
\| \| \| \| \| \| \| \| \| \|	temp relations; this is no more expensive than before, now that we have pg_class.relistemp. Insert tests into bufmgr.c to prevent attempting to fetch pages from nonlocal temp relations. This provides a low-level defense against bugs-of-omission allowing temp pages to be loaded into shared buffers, as in the contrib/pgstattuple problem reported by Stuart Bishop. While at it, tweak a bunch of places to use new relcache tests (instead of expensive probes into pg_namespace) to detect local or nonlocal temp tables.
*	Implement "fastupdate" support for GIN indexes, in which we try to accumulate	Tom Lane	2009-03-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	multiple index entries in a holding area before adding them to the main index structure. This helps because bulk insert is (usually) significantly faster than retail insert for GIN. This patch also removes GIN support for amgettuple-style index scans. The API defined for amgettuple is difficult to support with fastupdate, and the previously committed partial-match feature didn't really work with it either. We might eventually figure a way to put back amgettuple support, but it won't happen for 8.4. catversion bumped because of change in GIN's pg_am entry, and because the format of GIN indexes changed on-disk (there's a metapage now, and possibly a pending list). Teodor Sigaev
*	Support column-level privileges, as required by SQL standard.	Tom Lane	2009-01-22
\| \| \| \|	Stephen Frost, with help from KaiGai Kohei and others
*	Clarify a confusing comment about MCVs vs histogram entries.	Tom Lane	2009-01-06
\| \| \| \|	Per Nathan Boley.
*	Update copyright for 2009.	Bruce Momjian	2009-01-01
\|
*	Don't reset pg_class.reltuples and relpages in VACUUM, if any pages were	Heikki Linnakangas	2008-12-17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	skipped. We could update relpages anyway, but it seems better to only update it together with reltuples, because we use the reltuples/relpages ratio in the planner. Also don't update n_live_tuples in pgstat. ANALYZE in VACUUM ANALYZE now needs to update pg_class, if the VACUUM-phase didn't do so. Added some boolean-passing to let analyze_rel know if it should update pg_class or not. I also moved the relcache invalidation (to update rd_targblock) from vac_update_relstats to where RelationTruncate is called, because vac_update_relstats is not called for partial vacuums anymore. It's more obvious to send the invalidation close to the truncation that requires it. Per report by Ned T. Crigler.
*	Increase the default value of default_statistics_target from 10 to 100,	Tom Lane	2008-12-13
\| \| \| \| \|	and its maximum value from 1000 to 10000. ALTER TABLE SET STATISTICS similarly now allows a value up to 10000. Per discussion.
*	Make relhasrules and relhastriggers work like relhasindex, namely we let	Tom Lane	2008-11-10
\| \| \| \|	VACUUM reset them to false rather than trying to clean 'em up during DROP.
*	Remove all uses of the deprecated functions heap_formtuple, heap_modifytuple,	Tom Lane	2008-11-02
\| \| \| \| \| \| \| \| \| \| \|	and heap_deformtuple in favor of the newer functions heap_form_tuple et al (which do the same things but use bool control flags instead of arbitrary char values). Eliminate the former duplicate coding of these functions, reducing the deprecated functions to mere wrappers around the newer ones. We can't get rid of them entirely because add-on modules probably still contain many instances of the old coding style. Kris Jurka
*	Unite ReadBufferWithFork, ReadBufferWithStrategy, and ZeroOrReadBuffer	Heikki Linnakangas	2008-10-31
\| \| \| \| \| \| \| \| \| \| \| \|	functions into one ReadBufferExtended function, that takes the strategy and mode as argument. There's three modes, RBM_NORMAL which is the default used by plain ReadBuffer(), RBM_ZERO, which replaces ZeroOrReadBuffer, and a new mode RBM_ZERO_ON_ERROR, which allows callers to read corrupt pages without throwing an error. The FSM needs the new mode to recover from corrupt pages, which could happend if we crash after extending an FSM file, and the new page is "torn". Add fork number to some error messages in bufmgr.c, that still lacked it.
*	Move exprType(), exprTypmod(), expression_tree_walker(), and related routines	Tom Lane	2008-08-25
\| \| \| \| \| \|	into nodes/nodeFuncs, so as to reduce wanton cross-subsystem #includes inside the backend. There's probably more that should be done along this line, but this is a start anyway.
*	Rearrange the querytree representation of ORDER BY/GROUP BY/DISTINCT items	Tom Lane	2008-08-02
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	as per my recent proposal: 1. Fold SortClause and GroupClause into a single node type SortGroupClause. We were already relying on them to be struct-equivalent, so using two node tags wasn't accomplishing much except to get in the way of comparing items with equal(). 2. Add an "eqop" field to SortGroupClause to carry the associated equality operator. This is cheap for the parser to get at the same time it's looking up the sort operator, and storing it eliminates the need for repeated not-so-cheap lookups during planning. In future this will also let us represent GROUP/DISTINCT operations on datatypes that have hash opclasses but no btree opclasses (ie, they have equality but no natural sort order). The previous representation simply didn't work for that, since its only indicator of comparison semantics was a sort operator. 3. Add a hasDistinctOn boolean to struct Query to explicitly record whether the distinctClause came from DISTINCT or DISTINCT ON. This allows removing some complicated and not 100% bulletproof code that attempted to figure that out from the distinctClause alone. This patch doesn't in itself create any new capability, but it's necessary infrastructure for future attempts to use hash-based grouping for DISTINCT and UNION/INTERSECT/EXCEPT.
*	Extend VacAttrStats to allow typanalyze functions to store statistic values	Heikki Linnakangas	2008-07-01
\| \| \| \| \| \| \| \|	of different types than the underlying column. The capability isn't yet used for anything, but will be required by upcoming patch to analyze tsvector columns. Jan Urbanski
*	Move BufferGetPageSize and BufferGetPage from bufpage.h to bufmgr.h. It is	Alvaro Herrera	2008-06-08
\| \| \| \| \| \| \| \| \| \|	more logical that way, and also it reduces the amount of unnecessary includes in bufpage.h, which is widely used. Zdenek Kotala. My previous patch to bufpage.h should also have credited him as author, but I forgot (sorry about that).
*	Put back bufmgr.h in bufpage.h -- it is needed by some macros.	Alvaro Herrera	2008-05-12
\| \| \| \| \|	Remove #include bufmgr.h from (most?) source files which already include bufpage.h.
*	Restructure some header files a bit, in particular heapam.h, by removing some	Alvaro Herrera	2008-05-12
\| \| \| \| \| \| \| \| \| \| \| \|	unnecessary #include lines in it. Also, move some tuple routine prototypes and macros to htup.h, which allows removal of heapam.h inclusion from some .c files. For this to work, a new header file access/sysattr.h needed to be created, initially containing attribute numbers of system columns, for pg_dump usage. While at it, make contrib ltree, intarray and hstore header files more consistent with our header style.
*	Allow float8, int8, and related datatypes to be passed by value on machines	Tom Lane	2008-04-21
\| \| \| \| \| \| \| \| \| \|	where Datum is 8 bytes wide. Since this will break old-style C functions (those still using version 0 calling convention) that have arguments or results of these types, provide a configure option to disable it and retain the old pass-by-reference behavior. Likewise, provide a configure option to disable the recently-committed float4 pass-by-value change. Zoltan Boszormenyi, plus configurability stuff by me.
*	Modify the float4 datatype to be pass-by-val. Along the way, remove the last	Alvaro Herrera	2008-04-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	uses of the long-deprecated float32 in contrib/seg; the definitions themselves are still there, but no longer used. fmgr/README updated to match. I added a CREATE FUNCTION to account for existing seg_center() code in seg.c too, and some tests for it and the neighbor functions. At the same time, remove checks for NULL which are not needed (because the functions are declared STRICT). I had to do some adjustments to contrib's btree_gist too. The choices for representation there are not ideal for changing the underlying types :-( Original patch by Zoltan Boszormenyi, with some adjustments by me.
*	Teach ANALYZE to distinguish dead and in-doubt tuples, which it formerly	Tom Lane	2008-04-03
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	classed all as "dead"; also get it to count DEAD item pointers as dead rows, instead of ignoring them as before. Also improve matters so that tuples previously inserted or deleted by our own transaction are handled nicely: the stats collector's live-tuple and dead-tuple counts will end up correct after our transaction ends, regardless of whether we end in commit or abort. While there's more work that could be done to improve the counting of in-doubt tuples in both VACUUM and ANALYZE, this commit is enough to alleviate some known bad behaviors in 8.3; and the other stuff that's been discussed seems like research projects anyway. Pavan Deolasee and Tom Lane
*	Move the HTSU_Result enum definition into snapshot.h, to avoid including	Alvaro Herrera	2008-03-26
\| \| \| \| \| \|	tqual.h into heapam.h. This makes all inclusion of tqual.h explicit. I also sorted alphabetically the includes on some source files.
*	Improve error messages emitted when VACUUM and ANALYZE skip a table.	Alvaro Herrera	2008-02-20
\| \| \| \| \|	Per gripe from Clodoaldo Pinto Neto on Message-ID: <a595de7a0801060326qbfc790ax2a60573043c2e2be@mail.gmail.com>
*	Make standard maintenance operations (including VACUUM, ANALYZE, REINDEX,	Tom Lane	2008-01-03
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	and CLUSTER) execute as the table owner rather than the calling user, using the same privilege-switching mechanism already used for SECURITY DEFINER functions. The purpose of this change is to ensure that user-defined functions used in index definitions cannot acquire the privileges of a superuser account that is performing routine maintenance. While a function used in an index is supposed to be IMMUTABLE and thus not able to do anything very interesting, there are several easy ways around that restriction; and even if we could plug them all, there would remain a risk of reading sensitive information and broadcasting it through a covert channel such as CPU usage. To prevent bypassing this security measure, execution of SET SESSION AUTHORIZATION and SET ROLE is now forbidden within a SECURITY DEFINER context. Thanks to Itagaki Takahiro for reporting this vulnerability. Security: CVE-2007-6600
*	Update copyrights in source tree to 2008.	Bruce Momjian	2008-01-01
\|
*	Re-run pgindent with updated list of typedefs. (Updated README should	Bruce Momjian	2007-11-15
\| \| \| \|	avoid this problem in the future.)
*	pgindent run for 8.3.	Bruce Momjian	2007-11-15
\|
*	Rearrange vacuum-related bits in PGPROC as a bitmask, to better support	Alvaro Herrera	2007-10-24
\| \| \| \| \| \| \| \| \|	having several of them. Add two more flags: whether the process is executing an ANALYZE, and whether a vacuum is for Xid wraparound (which is obviously only set by autovacuum). Sneakily move the worker's recently-acquired PostAuthDelay to a more useful place.
*	Simplify and rename some GUC variables, per various recent discussions:	Tom Lane	2007-09-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* stats_start_collector goes away; we always start the collector process, unless prevented by a problem with setting up the stats UDP socket. * stats_reset_on_server_start goes away; it seems useless in view of the availability of pg_stat_reset(). * stats_block_level and stats_row_level are merged into a single variable "track_counts", which controls all reports sent to the collector process. * stats_command_string is renamed to track_activities. * log_autovacuum is renamed to log_autovacuum_min_duration to better reflect its meaning. The log_autovacuum change is not a compatibility issue since it didn't exist before 8.3 anyway. The other changes need to be release-noted.
*	Make large sequential scans and VACUUMs work in a limited-size "ring" of	Tom Lane	2007-05-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	buffers, rather than blowing out the whole shared-buffer arena. Aside from avoiding cache spoliation, this fixes the problem that VACUUM formerly tended to cause a WAL flush for every page it modified, because we had it hacked to use only a single buffer. Those flushes will now occur only once per ring-ful. The exact ring size, and the threshold for seqscans to switch into the ring usage pattern, remain under debate; but the infrastructure seems done. The key bit of infrastructure is a new optional BufferAccessStrategy object that can be passed to ReadBuffer operations; this replaces the former StrategyHintVacuum API. This patch also changes the buffer usage-count methodology a bit: we now advance usage_count when first pinning a buffer, rather than when last unpinning it. To preserve the behavior that a buffer's lifetime starts to decrease when it's released, the clock sweep code is modified to not decrement usage_count of pinned buffers. Work not done in this commit: teach GiST and GIN indexes to use the vacuum BufferAccessStrategy for vacuum-driven fetches. Original patch by Simon, reworked by Heikki and again by Tom.
*	Implement rate-limiting logic on how often backends will attempt to send	Tom Lane	2007-04-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	messages to the stats collector. This avoids the problem that enabling stats_row_level for autovacuum has a significant overhead for short read-only transactions, as noted by Arjen van der Meijden. We can avoid an extra gettimeofday call by piggybacking on the one done for WAL-logging xact commit or abort (although that doesn't help read-only transactions, since they don't WAL-log anything). In my proposal for this, I noted that we could change the WAL log entries for commit/abort to record full TimestampTz precision, instead of only time_t as at present. That's not done in this patch, but will be committed separately.
*	Silence compiler warnings, per Bruce.	Alvaro Herrera	2007-04-19
\|
*	Enable configurable log of autovacuum actions. Initial patch from Simon	Alvaro Herrera	2007-04-18
\| \| \| \|	Riggs, additional code and docs by me. Per discussion.
*	Support varlena fields with single-byte headers and unaligned storage.	Tom Lane	2007-04-06
\| \| \| \| \| \| \| \| \|	This commit breaks any code that assumes that the mere act of forming a tuple (without writing it to disk) does not "toast" any fields. While all available regression tests pass, I'm not totally sure that we've fixed every nook and cranny, especially in contrib. Greg Stark with some help from Tom Lane
*	Support ORDER BY ... NULLS FIRST/LAST, and add ASC/DESC/NULLS FIRST/NULLS LAST	Tom Lane	2007-01-09
\| \| \| \| \| \| \| \| \| \| \| \|	per-column options for btree indexes. The planner's support for this is still pretty rudimentary; it does not yet know how to plan mergejoins with nondefault ordering options. The documentation is pretty rudimentary, too. I'll work on improving that stuff later. Note incompatible change from prior behavior: ORDER BY ... USING will now be rejected if the operator is not a less-than or greater-than member of some btree opclass. This prevents less-than-sane behavior if an operator that doesn't actually define a proper sort ordering is selected.