postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
...
*	Add monitoring function pg_last_xact_replay_timestamp.	Robert Haas	2010-11-09
\| \| \| \|	Fujii Masao, with a little wordsmithing by me.
*	Repair memory leakage while ANALYZE-ing complex index expressions.	Tom Lane	2010-11-09
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The general design of memory management in Postgres is that intermediate results computed by an expression are not freed until the end of the tuple cycle. For expression indexes, ANALYZE has to re-evaluate each expression for each of its sample rows, and it wasn't bothering to free intermediate results until the end of processing of that index. This could lead to very substantial leakage if the intermediate results were large, as in a recent example from Jakub Ouhrabka. Fix by doing ResetExprContext for each sample row. This necessitates adding a datumCopy step to ensure that the final expression value isn't recycled too. Some quick testing suggests that this change adds at worst about 10% to the time needed to analyze a table with an expression index; which is annoying, but seems a tolerable price to pay to avoid unexpected out-of-memory problems. Back-patch to all supported branches.
*	In rewriteheap.c (used by VACUUM FULL and CLUSTER), calculate the tuple	Heikki Linnakangas	2010-11-09
\| \| \| \| \| \| \| \| \| \|	length stored in the line pointer the same way it's calculated in the normal heap_insert() codepath. As noted by Jeff Davis, the length stored by raw_heap_insert() included padding but the one stored by the normal codepath did not. While the mismatch seems to be harmless, inconsistency isn't good, and the normal codepath has received a lot more testing over the years. Backpatch to 8.3 where the heap rewrite code was introduced.
*	Fix error handling in temp-file deletion with log_temp_files active.	Tom Lane	2010-11-08
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The original coding in FileClose() reset the file-is-temp flag before unlinking the file, so that if control came back through due to an error, it wouldn't try to unlink the file twice. This was correct when written, but when the log_temp_files feature was added, the logging action was put in between those two steps. An error occurring during the logging action --- such as a query cancel --- would result in the unlink not getting done at all, as in recent report from Michael Glaesemann. To fix this, make sure that we do both the stat and the unlink before doing anything that could conceivably CHECK_FOR_INTERRUPTS. There is a judgment call here, which is which log message to emit first: if you can see only one, which should it be? I chose to log unlink failure at the risk of losing the log_temp_files log message --- after all, if the unlink does fail, the temp file is still there for you to see. Back-patch to all versions that have log_temp_files. The code was OK before that.
*	Fix permanent memory leak in autovacuum launcher	Alvaro Herrera	2010-11-08
\| \| \| \| \| \| \| \| \| \| \| \|	get_database_list was uselessly allocating its output data, along some created along the way, in a permanent memory context. This didn't matter when autovacuum was a single, short-lived process, but now that the launcher is permanent, it shows up as a permanent leak. To fix, make get_database list allocate its output data in the caller's context, which is in charge of freeing it when appropriate; and the memory leaked by heap_beginscan et al is allocated in a throwaway transaction context.
*	Use appendrel planning logic for top-level UNION ALL structures.	Tom Lane	2010-11-08
\| \| \| \| \| \| \| \| \| \| \| \| \|	Formerly, we could convert a UNION ALL structure inside a subquery-in-FROM into an appendrel, as a side effect of pulling up the subquery into its parent; but top-level UNION ALL always caused use of plan_set_operations(). That didn't matter too much because you got an Append-based plan either way. However, now that the appendrel code can do things with MergeAppend, it's worthwhile to hack up the top-level case so it also uses appendrels. This is a bit of a stopgap; but going much further than this will require a major rewrite of the planner's set-operations support, which I'm not prepared to undertake now. For the moment let's grab the low-hanging fruit.
*	Prevent invoking I/O conversion casts via functional/attribute notation.	Tom Lane	2010-11-07
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	PG 8.4 added a built-in feature for casting pretty much any data type to string types (text, varchar, etc). We allowed this to work in any of the historically-allowed syntaxes: CAST(x AS text), x::text, text(x), or x.text. However, multiple complaints have shown that it's too easy to invoke such casts unintentionally in the latter two styles, particularly field selection. To cure the problem with the narrowest possible change of behavior, disallow use of I/O conversion casts from composite types to string types via functional/attribute syntax. The new functionality is still available via cast syntax. In passing, document the equivalence of functional and attribute syntax in a more visible place.
*	Add support for detecting register-stack overrun on IA64.	Tom Lane	2010-11-06
\| \| \| \| \| \| \| \| \|	Per recent investigation, the register stack can grow faster than the regular stack depending on compiler and choice of options. To avoid crashes we must check both stacks in check_stack_depth(). Since this is poorly-tested code, committing only to HEAD for the moment ... but we might want to consider back-patching later.
*	Make get_stack_depth_rlimit() handle RLIM_INFINITY more sanely.	Tom Lane	2010-11-06
\| \| \| \| \| \| \| \| \| \| \| \| \|	Rather than considering this result as meaning "unknown", report LONG_MAX. This won't change what superusers can set max_stack_depth to, but it will cause InitializeGUCOptions() to set the built-in default to 2MB not 100kB. The latter seems like a fairly unreasonable interpretation of "infinity". Per my investigation of odd buildfarm results as well as an old complaint from Heikki. Since this should persuade all the buildfarm animals to use a reasonable stack depth setting during "make check", revert previous patch that dumbed down a recursive regression test to only 5 levels.
*	Include the current value of max_stack_depth in stack depth complaints.	Tom Lane	2010-11-04
\| \| \| \| \| \| \|	I'm mainly interested in finding out what it is on buildfarm machines, but including the active value in the message seems like good practice in any case. Add the info to the HINT, not the ERROR string, so as not to change the regression tests' expected output.
*	Use appendStringInfoString() where appropriate in elog.c.	Tom Lane	2010-11-04
\| \| \| \| \| \| \| \| \|	The nominally equivalent call appendStringInfo(buf, "%s", str) can be significantly slower when str is large. In particular, the former usage in EVALUATE_MESSAGE led to O(N^2) behavior when collecting a large number of context lines, as I found out while testing recursive functions. The other changes are just neatnik-ism and seem unlikely to save anything meaningful, but a cycle shaved is a cycle earned.
*	Reimplement planner's handling of MIN/MAX aggregate optimization.	Tom Lane	2010-11-04
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Per my recent proposal, get rid of all the direct inspection of indexes and manual generation of paths in planagg.c. Instead, set up EquivalenceClasses for the aggregate argument expressions, and let the regular path generation logic deal with creating paths that can satisfy those sort orders. This makes planagg.c a bit more visible to the rest of the planner than it was originally, but the approach is basically a lot cleaner than before. A major advantage of doing it this way is that we get MIN/MAX optimization on inheritance trees (using MergeAppend of indexscans) practically for free, whereas in the old way we'd have had to add a whole lot more duplicative logic. One small disadvantage of this approach is that MIN/MAX aggregates can no longer exploit partial indexes having an "x IS NOT NULL" predicate, unless that restriction or something that implies it is specified in the query. The previous implementation was able to use the added "x IS NOT NULL" condition as an extra predicate proof condition, but in this version we rely entirely on indexes that are considered usable by the main planning process. That seems a fair tradeoff for the simplicity and functionality gained.
*	Fix adjust_semi_join to be more cautious about clauseless joins.	Tom Lane	2010-11-02
\| \| \| \| \| \| \|	It was reporting that these were fully indexed (hence cheap), when of course they're the exact opposite of that. I'm not certain if the case would arise in practice, since a clauseless semijoin is hard to produce in SQL, but if it did happen we'd make some dumb decisions.
*	Ensure an index that uses a whole-row Var still depends on its table.	Tom Lane	2010-11-02
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We failed to record any dependency on the underlying table for an index declared like "create index i on t (foo(t.*))". This would create trouble if the table were dropped without previously dropping the index. To fix, simplify some overly-cute code in index_create(), accepting the possibility that sometimes the whole-table dependency will be redundant. Also document this hazard in dependency.c. Per report from Kevin Grittner. In passing, prevent a core dump in pg_get_indexdef() if the index's table can't be found. I came across this while experimenting with Kevin's example. Not sure it's a real issue when the catalogs aren't corrupt, but might as well be cautious. Back-patch to all supported versions.
*	Bootstrap WAL to begin at segment logid=0 logseg=1 (000000010000000000000001)	Heikki Linnakangas	2010-11-02
\| \| \| \| \| \| \| \| \| \| \| \| \|	rather than 0/0, so that we can safely use 0/0 as an invalid value. This is a more future-proof fix for the corner-case bug in streaming replication that was fixed yesterday. We had a similar corner-case bug with log/seg 0/0 back in February as well. Avoiding 0/0 as a valid value should prevent bugs like that in the future. Per Tom Lane's idea. Back-patch to 9.0. Since this only affects bootstrapping, it makes no difference to existing installations. We don't need to worry about the bug in existing installations, because if you've managed to get past the initial base backup already, you won't hit the bug in the future either.
*	Avoid using a local FunctionCallInfoData struct in ExecMakeFunctionResult	Tom Lane	2010-11-01
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	and related routines. We already had a redundant FunctionCallInfoData struct in FuncExprState, but were using that copy only in set-returning-function cases, to avoid keeping function evaluation state in the expression tree for the benefit of plpgsql's "simple expression" logic. But of course that didn't work anyway. Given the recent fixes in plpgsql there is no need to have two separate behaviors here. Getting rid of the local FunctionCallInfoData structs should make things a little faster (because we don't need to do InitFunctionCallInfoData each time), and it also makes for a noticeable reduction in stack space consumption during recursive calls.
*	Fix corner-case bug in tracking of latest removed WAL segment during	Heikki Linnakangas	2010-11-01
\| \| \| \| \| \| \| \| \|	streaming replication. We used log/seg 0/0 to indicate that no WAL segments have been removed since startup, but 0/0 is a valid value for the very first WAL segment after initdb. To make that disambiguous, store (latest removed WAL segment + 1) in the global variable. Per report from Matt Chesler, also reproduced by Greg Smith.
*	Provide hashing support for arrays.	Tom Lane	2010-10-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The core of this patch is hash_array() and associated typcache infrastructure, which works just about exactly like the existing support for array comparison. In addition I did some work to ensure that the planner won't think that an array type is hashable unless its element type is hashable, and similarly for sorting. This includes adding a datatype parameter to op_hashjoinable and op_mergejoinable, and adding an explicit "hashable" flag to SortGroupClause. The lack of a cross-check on the element type was a pre-existing bug in mergejoin support --- but it didn't matter so much before, because if you couldn't sort the element type there wasn't any good alternative to failing anyhow. Now that we have the alternative of hashing the array type, there are cases where we can avoid a failure by being picky at the planner stage, so it's time to be picky. The issue of exactly how to combine the per-element hash values to produce an array hash is still open for discussion, but the rest of this is pretty solid, so I'll commit it as-is.
*	Fix comparisons of pointers with zero to compare with NULL instead.	Tom Lane	2010-10-29
\| \| \| \| \| \| \|	Per C standard, these are semantically the same thing; but saying NULL when you mean NULL is good for readability. Marti Raudsepp, per results of INRIA's Coccinelle.
*	Oops, missed one fix for EquivalenceClass rearrangement.	Tom Lane	2010-10-29
\| \| \| \| \| \| \| \| \| \|	Now that we're expecting a mergeclause's left_ec/right_ec to persist from the initial assignments, we can't just blithely zero these out when transforming such a clause in adjust_appendrel_attrs. But really it should be okay to keep the parent's values, since a child table's derived Var ought to be equivalent to the parent Var for all EquivalenceClass purposes. (Indeed, I'm wondering whether we couldn't find a way to dispense with add_child_rel_equivalences altogether. But this is wrong in any case.)
*	Avoid creation of useless EquivalenceClasses during planning.	Tom Lane	2010-10-29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Zoltan Boszormenyi exhibited a test case in which planning time was dominated by construction of EquivalenceClasses and PathKeys that had no actual relevance to the query (and in fact got discarded immediately). This happened because we generated PathKeys describing the sort ordering of every index on every table in the query, and only after that checked to see if the sort ordering was relevant. The EC/PK construction code is O(N^2) in the number of ECs, which is all right for the intended number of such objects, but it gets out of hand if there are ECs for lots of irrelevant indexes. To fix, twiddle the handling of mergeclauses a little bit to ensure that every interesting EC is created before we begin path generation. (This doesn't cost anything --- in fact I think it's a bit cheaper than before --- since we always eventually created those ECs anyway.) Then, if an index column can't be found in any pre-existing EC, we know that that sort ordering is irrelevant for the query. Instead of creating a useless EC, we can just not build a pathkey for the index column in the first place. The index will still be considered if it's useful for non-order-related reasons, but we will think of its output as unsorted.
*	Previous patch had no detectable virtue other than being a one-liner.	Tom Lane	2010-10-27
\| \| \| \| \|	Try to make the code look self-consistent again, so it doesn't confuse future developers.
*	Fix long-standing segfault when accept() or one of the calls made right	Heikki Linnakangas	2010-10-27
\| \| \| \| \|	after accepting a connection fails, and the server is compiled with GSSAPI support. Report and patch by Alexander V. Chernikov, bug #5731.
*	Add a client authentication hook.	Robert Haas	2010-10-26
\| \| \| \|	KaiGai Kohei, with minor cleanup of the comments by me.
*	Fix dumb typo in SECURITY LABEL error message.	Robert Haas	2010-10-26
\| \| \| \|	Report by Peter Eisentraut.
*	Before removing backup_label and irrevocably changing pg_control file, check	Heikki Linnakangas	2010-10-26
\| \| \| \| \| \| \| \|	that WAL file containing the checkpoint redo-location can be found. This avoids making the cluster irrecoverable if the redo location is in an earlie WAL file than the checkpoint record. Report, analysis and patch by Jeff Davis, with small changes by me.
*	Refactor typenameTypeId()	Peter Eisentraut	2010-10-25
\| \| \| \| \| \|	Split the old typenameTypeId() into two functions: A new typenameTypeId() that returns only a type OID, and typenameTypeIdAndMod() that returns type OID and typmod. This isolates call sites better that actually care about the typmod.
*	Fix overly-enthusiastic Assert in printing of Param reference expressions.	Tom Lane	2010-10-25
\| \| \| \| \| \| \| \|	A NestLoopParam's value can only be a Var or Aggref, but this isn't the case in general for SubPlan parameters, so print_parameter_expr had better be prepared to cope. Brain fade in my recent patch to print the referenced expression instead of just printing $N for PARAM_EXEC Params. Per report from Pavel Stehule.
*	Fix inline_set_returning_function() to preserve the invalItems list properly.	Tom Lane	2010-10-25
\| \| \| \| \| \| \| \| \|	This avoids a possible crash when inlining a SRF whose argument list contains a reference to an inline-able user function. The crash is quite reproducible with CLOBBER_FREED_MEMORY enabled, but would be less certain in a production build. Problem introduced in 9.0 by the named-arguments patch, which requires invoking eval_const_expressions() before we can try to inline a SRF. Per report from Brendan Jurd.
*	Work around rounding misbehavior exposed by buildfarm.	Tom Lane	2010-10-25
\|
*	Allow new values to be added to an existing enum type.	Tom Lane	2010-10-24
\| \| \| \| \| \| \|	After much expenditure of effort, we've got this to the point where the performance penalty is pretty minimal in typical cases. Andrew Dunstan, reviewed by Brendan Jurd, Dean Rasheed, and Tom Lane
*	Support suffix matching of host names in pg_hba.conf	Peter Eisentraut	2010-10-24
\| \| \| \| \|	A name starting with a dot can be used to match a suffix of the actual host name (e.g., .example.com matches foo.example.com).
*	Add semicolon, missed in previous patch. And update the keyword list in	Heikki Linnakangas	2010-10-22
\| \| \| \|	the docs to reflect that OFF is now unreserved. Spotted by Tom Lane.
*	Make OFF keyword unreserved. It's not hard to imagine wanting to use 'off'	Heikki Linnakangas	2010-10-22
\| \| \| \| \| \| \| \|	as a variable or column name, and it's not reserved in recent versions of the SQL spec either. This became particularly annoying in 9.0, before that PL/pgSQL replaced variable names in queries with parameter markers, so it was possible to use OFF and many other backend parser keywords as variable names. Because of that, backpatch to 9.0.
*	Improve handling of domains over arrays.	Tom Lane	2010-10-21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch eliminates various bizarre behaviors caused by sloppy thinking about the difference between a domain type and its underlying array type. In particular, the operation of updating one element of such an array has to be considered as yielding a value of the underlying array type, not a value of the domain, because there's no assurance that the domain's CHECK constraints are still satisfied. If we're intending to store the result back into a domain column, we have to re-cast to the domain type so that constraints are re-checked. For similar reasons, such a domain can't be blindly matched to an ANYARRAY polymorphic parameter, because the polymorphic function is likely to apply array-ish operations that could invalidate the domain constraints. For the moment, we just forbid such matching. We might later wish to insert an automatic downcast to the underlying array type, but such a change should also change matching of domains to ANYELEMENT for consistency. To ensure that all such logic is rechecked, this patch removes the original hack of setting a domain's pg_type.typelem field to match its base type; the typelem will always be zero instead. In those places where it's really okay to look through the domain type with no other logic changes, use the newly added get_base_element_type function in place of get_element_type. catversion bumped due to change in pg_type contents. Per bug #5717 from Richard Huxton and subsequent discussion.
*	Remove obsolete comment, per Josh Kupershmidt.	Tom Lane	2010-10-20
\|
*	Don't try to fetch database name when SetTransactionIdLimit() is executed	Tom Lane	2010-10-20
\| \| \| \| \| \| \| \| \| \| \| \|	outside a transaction. This repairs brain fade in my patch of 2009-08-30: the reason we had been storing oldest-database name, not OID, in ShmemVariableCache was of course to avoid having to do a catalog lookup at times when it might be unsafe. This error explains why Aleksandr Dushein is having trouble getting out of an XID wraparound state in bug #5718, though not how he got into that state in the first place. I suspect pg_upgrade is at fault there.
*	Remove AtStart_Cache() call in CommandCounterIncrement().	Alvaro Herrera	2010-10-20
\| \| \| \| \| \| \| \|	This call was present in the aboriginal code from Berkeley, and has never been touched; it may very well be that it was there to mask effects of bugs in other places and it may no longer be necessary. The removal has been foreseen in a code comment since 2007; this seems to be a good time to test this hypothesis.
*	Fix incorrect generation of whole-row variables in planner.	Tom Lane	2010-10-19
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	A couple of places in the planner need to generate whole-row Vars, and were cutting corners by setting vartype = RECORDOID in the Vars, even in cases where there's an identifiable named composite type for the RTE being referenced. While we mostly got away with this, it failed when there was also a parser-generated whole-row reference to the same RTE, because the two Vars weren't equal() due to the difference in vartype. Fix by providing a subroutine the planner can call to generate whole-row Vars the same way the parser does. Per bug #5716 from Andrew Tipton. Back-patch to 9.0 where one of the bogus calls was introduced (the other one is new in HEAD).
*	Unbreak comments on composite type attributes.	Robert Haas	2010-10-19
\| \| \| \|	Report and diagnosis by Peter Eisentraut.
*	Support key word 'all' in host column of pg_hba.conf	Peter Eisentraut	2010-10-18
\|
*	Fix a passel of inappropriately-named global functions in GIN.	Tom Lane	2010-10-17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The GIN code has absolutely no business exporting GIN-specific functions with names as generic as compareItemPointers() or newScanKey(); that's just trouble waiting to happen. I got annoyed about this again just now and decided to fix it. This commit ensures that all global symbols defined in access/gin/ have names including "gin" or "Gin". There were a couple of cases, like names involving "PostingItem", where arguably the names were already sufficiently nongeneric; but I figured as long as I was risking creating merge problems for unapplied GIN patches I might as well impose a uniform policy. I didn't touch any static symbol names. There might be some places where it'd be appropriate to rename some static functions to match siblings that are exported, but I'll leave that for another time.
*	Improve GIN indexscan cost estimation.	Tom Lane	2010-10-17
\| \| \| \| \| \| \| \| \| \| \| \| \|	The better estimate requires more statistics than we previously stored: in particular, counts of "entry" versus "data" pages within the index, as well as knowledge of the number of distinct key values. We collect this information during initial index build and update it during VACUUM, storing the info in new fields on the index metapage. No initdb is required because these fields will read as zeroes in a pre-existing index, and the new gincostestimate code is coded to behave (reasonably) sanely if they are zeroes. Teodor Sigaev, reviewed by Jan Urbanski, Tom Lane, and Itagaki Takahiro.
*	Fix recent changes to not break non-IPV6-aware systems.	Tom Lane	2010-10-16
\|
*	Allow WITH clauses to be attached to INSERT, UPDATE, DELETE statements.	Tom Lane	2010-10-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is not the hoped-for facility of using INSERT/UPDATE/DELETE inside a WITH, but rather the other way around. It seems useful in its own right anyway. Note: catversion bumped because, although the contents of stored rules might look compatible, there's actually a subtle semantic change. A single Query containing a WITH and INSERT...VALUES now represents writing the WITH before the INSERT, not before the VALUES. While it's not clear that that matters to anyone, it seems like a good idea to have it cited in the git history for catversion.h. Original patch by Marko Tiikkaja, with updating and cleanup by Hitoshi Harada.
*	Support host names in pg_hba.conf	Peter Eisentraut	2010-10-15
\| \| \| \|	Peter Eisentraut, reviewed by KaiGai Kohei and Tom Lane
*	Change references to SQL/XML:2003 to :2008 and renumber sections accordingly	Peter Eisentraut	2010-10-15
\|
*	Fix low-risk potential denial of service against RADIUS login.	Magnus Hagander	2010-10-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Corrupt RADIUS responses were treated as errors and not ignored (which the RFC2865 states they should be). This meant that a user with unfiltered access to the network of the PostgreSQL or RADIUS server could send a spoofed RADIUS response to the PostgreSQL server causing it to reject a valid login, provided the attacker could also guess (or brute-force) the correct port number. Fix is to simply retry the receive in a loop until the timeout has expired or a valid (signed by the correct RADIUS server) packet arrives. Reported by Alan DeKok in bug #5687.
*	Improve comment about ignoring 128 error code on Windows:	Bruce Momjian	2010-10-15
\| \| \| \| \|	* Microsoft reports it is related to mutex failure: * http://archives.postgresql.org/pgsql-hackers/2010-09/msg00790.php
*	Support MergeAppend plans, to allow sorted output from append relations.	Tom Lane	2010-10-14
\| \| \| \| \| \| \| \| \|	This patch eliminates the former need to sort the output of an Append scan when an ordered scan of an inheritance tree is wanted. This should be particularly useful for fast-start cases such as queries with LIMIT. Original patch by Greg Stark, with further hacking by Hans-Jurgen Schonig, Robert Haas, and Tom Lane.