postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
*	Install check_stack_depth() protection in two recursive tsquery	Tom Lane	2007-08-31
\| \| \| \|	processing routines. Per Heikki.
*	Rewrite make_outerjoininfo's construction of min_lefthand and min_righthand	Tom Lane	2007-08-31
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	sets for outer joins, in the light of bug #3588 and additional thought and experimentation. The original methodology was fatally flawed for nests of more than two outer joins: it got the relationships between adjacent joins right, but didn't always come to the right conclusions about whether a join could be interchanged with one two or more levels below it. This was largely caused by a mistaken idea that we should use the min_lefthand + min_righthand sets of a sub-join as the minimum left or right input set of an upper join when we conclude that the sub-join can't commute with the upper one. If there's a still-lower join that the sub-join can commute with, this method led us to think that that one could commute with the topmost join; which it can't. Another problem (not directly connected to bug #3588) was that make_outerjoininfo's processing-order-dependent method for enforcing outer join identity #3 didn't work right: if we decided that join A could safely commute with lower join B, we dropped all information about sub-joins under B that join A could perhaps not safely commute with, because we removed B's entire min_righthand from A's. To fix, make an explicit computation of all inner join combinations that occur below an outer join, and add to that the full syntactic relsets of any lower outer joins that we determine it can't commute with. This method gives much more direct enforcement of the outer join rearrangement identities, and it turns out not to cost a lot of additional bookkeeping. Thanks to Richard Harris for the bug report and test case.
*	Fix int8mul so that overflow check is applied correctly for INT64_IS_BUSTED	Tom Lane	2007-08-30
\| \| \| \| \|	case, per Florian Pflug. Not back-patched since it's unclear that anyone but me still cares ...
*	Relax permissions checks on dbsize functions, per discussion. Revert out all	Tom Lane	2007-08-29
\| \| \| \| \| \| \| \| \| \|	checks for individual-table-size functions, since anyone in the database could get approximate values from pg_class.relpages anyway. Allow database-size to users with CONNECT privilege for the target database (note that this is granted by default). Allow tablespace-size if the user has CREATE privilege on the tablespace (which is not granted by default), or if the tablespace is the default tablespace for the current database (since we treat that as implicitly allowing use of the tablespace).
*	Add a debug logging message when a resource manager rejects an attempted	Tom Lane	2007-08-28
\| \| \| \|	restart point. Per suggestion from Simon Riggs.
*	Improve behavior of log_lock_waits patch. Ensure that something gets logged	Tom Lane	2007-08-28
\| \| \| \| \| \| \| \| \| \|	even if the "deadlock detected" ERROR message is suppressed by an exception catcher. Be clearer about the event sequence when a soft deadlock is fixed: the fixing process might or might not still have to wait, so log that separately. Fix race condition when someone releases us from the lock partway through printing all this junk --- we'd not get confused about our state, but the log message sequence could have been misleading, ie, a "still waiting" message with no subsequent "acquired" message. Greg Stark and Tom Lane.
*	Fix generation of snowball_create.sql on msvc builds.	Magnus Hagander	2007-08-27
\|
*	Fix a couple of misbehaviors rooted in the fact that the default creation	Tom Lane	2007-08-27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	namespace isn't necessarily first in the search path (there could be implicit schemas ahead of it). Examples are test=# set search_path TO s1; test=# create view pg_timezone_names as select * from pg_timezone_names(); ERROR: "pg_timezone_names" is already a view test=# create table pg_class (f1 int primary key); ERROR: permission denied: "pg_class" is a system catalog You'd expect these commands to create the requested objects in s1, since names beginning with pg_ aren't supposed to be reserved anymore. What is happening is that we create the requested base table and then execute additional commands (here, CREATE RULE or CREATE INDEX), and that code is passed the same RangeVar that was in the original command. Since that RangeVar has schemaname = NULL, the secondary commands think they should do a path search, and that means they find system catalogs that are implicitly in front of s1 in the search path. This is perilously close to being a security hole: if the secondary command failed to apply a permission check then it'd be possible for unprivileged users to make schema modifications to system catalogs. But as far as I can find, there is no code path in which a check doesn't occur. Which makes it just a weird corner-case bug for people who are silly enough to want to name their tables the same as a system catalog. The relevant code has changed quite a bit since 8.2, which means this patch wouldn't work as-is in the back branches. Since it's a corner case no one has reported from the field, I'm not going to bother trying to back-patch.
*	Remove the 'not in' operator (!!=). This was a hangover from Berkeley	Tom Lane	2007-08-27
\| \| \| \| \| \| \|	days that was obsolete the moment we had IN (SELECT ...) capability. It's arguably a security hole since it applied no permissions check to the table it searched, and since it was never documented anywhere, removing it seems more appropriate than fixing it.
*	Restrict pg_relation_size to relation owner, pg_database_size to DB owner,	Tom Lane	2007-08-27
\| \| \| \| \| \|	and pg_tablespace_size to superusers. Perhaps we could weaken the first case to just require SELECT privilege, but that doesn't work for the other cases, so use ownership as the common concept.
*	Make currtid() functions require SELECT privileges on the target table.	Tom Lane	2007-08-27
\| \| \| \| \| \|	While it's not clear that TID linkage info is of any great use to a nefarious user, it's certainly unexpected that these functions wouldn't insist on read privileges.
*	Make ARRAY(SELECT ...) return an empty array, rather than a NULL, when the	Tom Lane	2007-08-26
\| \| \| \| \|	sub-select returns zero rows. Per complaint from Jens Schicke. Since this is more in the nature of a definition change than a bug, not back-patched.
*	Fix brain fade in DefineIndex(): it was continuing to access the table's	Tom Lane	2007-08-25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	relcache entry after having heap_close'd it. This could lead to misbehavior if a relcache flush wiped out the cache entry meanwhile. In 8.2 there is a very real risk of CREATE INDEX CONCURRENTLY using the wrong relid for locking and waiting purposes. I think the bug is only cosmetic in 8.0 and 8.1, because their transgression is limited to using RelationGetRelationName(rel) in an ereport message immediately after heap_close, and there's no way (except with special debugging options) for a cache flush to occur in that interval. Not quite sure that it's cosmetic in 7.4, but seems best to patch anyway. Found by trying to run the regression tests with CLOBBER_CACHE_ALWAYS enabled. Maybe we should try to do that on a regular basis --- it's awfully slow, but perhaps some fast buildfarm machine could do it once in awhile.
*	Simplify implementation of ts_debug() function --- use a join instead	Tom Lane	2007-08-25
\| \| \| \| \|	of redundant sub-selects. initdb not forced, since this is just a cosmetic change, but the new code won't show up till you do one.
*	Fix synonym-dict breakage introduced in last patch :-(.	Tom Lane	2007-08-25
\| \| \| \|	Minor other cleanups.
*	Rename built-in Snowball stemmer dictionaries to be english_stem,	Tom Lane	2007-08-25
\| \| \| \|	russian_stem, etc. Per discussion.
*	Cleanup for some problems in tsearch patch:	Tom Lane	2007-08-25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	- ispell initialization crashed on empty dictionary file - ispell initialization crashed on affix file with prefixes but no suffixes - stop words file was run through pg_verify_mbstr, with database encoding, but it's supposed to be UTF-8; similar bug for synonym files - bunch of comments added, typos fixed, and other cleanup Introduced consistent encoding checking/conversion of data read from tsearch configuration files, by doing this in a single t_readline() subroutine (replacing direct usages of fgets). Cleaned up API for readstopwords too. Heikki Linnakangas
*	Reduce memory requirements for writing CSVlogs, so it will work with about	Andrew Dunstan	2007-08-23
\| \| \| \|	the same amount of memory in ErrorContext as standard logs.
*	Suppress testing the options of CREATE TEXT SEARCH DICTIONARY during	Tom Lane	2007-08-22
\| \| \| \| \|	initdb. We should create all the standard dictionaries even though some of them may not work in template1's encoding. Per Teodor.
*	Fix VPATH-build problem in new tsearch makefile, per Chad Wagner.	Tom Lane	2007-08-22
\|
*	Remove option to change parser of an existing text search configuration.	Tom Lane	2007-08-22
\| \| \| \| \| \|	This prevents needing to do complex and poorly-defined updates of the mapping table if the new parser has different token types than the old. Per discussion.
*	Whoops, missed updating dsynonym_init for new dictionary parameter method.	Tom Lane	2007-08-22
\|
*	Simplify the syntax of CREATE/ALTER TEXT SEARCH DICTIONARY by treating the	Tom Lane	2007-08-22
\| \| \| \| \| \| \| \| \| \| \| \|	init options of the template as top-level options in the syntax. This also makes ALTER a bit easier to use, since options can be replaced individually. I also made these statements verify that the tmplinit method will accept the new settings before they get stored; in the original coding you didn't find out about mistakes until the dictionary got invoked. Under the hood, init methods now get options as a List of DefElem instead of a raw text string --- that lets tsearch use existing options-pushing code instead of duplicating functionality.
*	Simplify CREATE TEXT SEARCH CONFIGURATION by eliminating the separate	Tom Lane	2007-08-21
\| \| \| \| \| \| \| \| \|	'with map' parameter; as things now stand there's really not much point in specifying a config-to-copy if you don't copy its map. Also, use COPY instead of TEMPLATE as the key word for a config-to-copy, so as to avoid confusion with text search templates. Per discussion; the just-committed reference page for the command already describes it this way.
*	Avoid using TEXT as a Bison symbol, since this provokes warnings on	Tom Lane	2007-08-21
\| \| \| \| \|	Windows builds. In passing, fix an obsolete comment, per gripe from Greg Stark.
*	Remove extraneous semicolon --- buildfarm member bear, for one,	Tom Lane	2007-08-21
\| \| \| \|	objects to it.
*	Fix cash_mul_int4 and cash_div_int4 for overenthusiastic substitution	Tom Lane	2007-08-21
\| \| \| \|	of int64 for int32. Per reports from Merlin Moncure and Andrew Chernow.
*	Fix money type's send/receive functions to conform to recent widening	Tom Lane	2007-08-21
\| \| \| \|	of the datatype to int64. Per Andrew Chernow.
*	Fix potential access-off-the-end-of-memory in varbit_out(): it fetched the	Tom Lane	2007-08-21
\| \| \| \| \| \|	byte after the last full byte of the bit array, regardless of whether that byte was part of the valid data or not. Found by buildfarm testing. Thanks to Stefan Kaltenbrunner for nailing down the cause.
*	Suppress uninitialized-variable warning.	Tom Lane	2007-08-21
\|
*	Fix a small 64-bit problem in tsearch patch.	Tom Lane	2007-08-21
\|
*	Tsearch2 functionality migrates to core. The bulk of this work is by	Tom Lane	2007-08-21
\| \| \| \| \| \| \| \|	Oleg Bartunov and Teodor Sigaev, but I did a lot of editorializing, so anything that's broken is probably my fault. Documentation is nonexistent as yet, but let's land the patch so we can get some portability testing done.
*	Provide for logfiles in machine readable CSV format. In consequence, rename	Andrew Dunstan	2007-08-19
\| \| \| \| \| \|	redirect_stderr to logging_collector. Original patch from Arul Shaji, subsequently modified by Greg Smith, and then heavily modified by me.
*	Arrange to cache a ResultRelInfo in the executor's EState for relations that	Tom Lane	2007-08-15
\| \| \| \| \| \| \| \| \| \| \| \| \|	are not one of the query's defined result relations, but nonetheless have triggers fired against them while the query is active. This was formerly impossible but can now occur because of my recent patch to fix the firing order for RI triggers. Caching a ResultRelInfo avoids duplicating work by repeatedly opening and closing the same relation, and also allows EXPLAIN ANALYZE to "see" and report on these extra triggers. Use the same mechanism to cache open relations when firing deferred triggers at transaction shutdown; this replaces the former one-element-cache strategy used in that case, and should improve performance a bit when there are deferred triggers on a number of relations.
*	Repair problems occurring when multiple RI updates have to be done to the same	Tom Lane	2007-08-15
\| \| \| \| \| \| \| \| \|	row within one query: we were firing check triggers before all the updates were done, leading to bogus failures. Fix by making the triggers queued by an RI update go at the end of the outer query's trigger event list, thereby effectively making the processing "breadth-first". This was indeed how it worked pre-8.0, so the bug does not occur in the 7.x branches. Per report from Pavel Stehule.
*	Fix oversight in async-commit patch: there were some places in heapam.c	Tom Lane	2007-08-14
\| \| \| \| \| \|	that still thought they could set HEAP_XMAX_COMMITTED immediately after seeing the other transaction commit. Make them use the same logic as tqual.c does to determine if the hint bit can be set yet.
*	TEMPORARILY make synchronous_commit default to OFF, so that we can get more	Tom Lane	2007-08-13
\| \| \| \| \|	thorough testing of async-commit mode from the buildfarm. This patch MUST get reverted before 8.3 release!
*	Fix two bugs induced in VACUUM FULL by async-commit patch.	Tom Lane	2007-08-13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	First, we cannot assume that XLogAsyncCommitFlush guarantees hint bits will be settable, because clog.c's inexact LSN bookkeeping results in windows where a previously flushed transaction is considered unhintable because it shares an LSN slot with a later unflushed transaction. But repair_frag requires XMIN_COMMITTED to be correct so that it can distinguish tuples moved by the current vacuum. Since not being able to set the bit is an uncommon corner case, the most practical way of dealing with it seems to be to abandon shrinking (ie, don't invoke repair_frag) when we find a non-dead tuple whose XMIN_COMMITTED bit couldn't be set. Second, it is possible for the same reason that a RECENTLY_DEAD tuple does not get its XMAX_COMMITTED bit set during scan_heap. But by the time repair_frag examines the tuple it might be possible to set the bit. We therefore must take buffer content lock when calling HeapTupleSatisfiesVacuum a second time, else we can get an Assert failure in SetBufferCommitInfoNeedsSave. This latter bug is latent in existing releases, but I think it cannot actually occur without async commit, since the first HeapTupleSatisfiesVacuum call should always have set the bit. So I'm not going to back-patch it. In passing, reduce the existing "cannot shrink relation" messages from NOTICE to LOG level. The new message must be no higher than LOG if we don't want unpredictable regression test failures, and consistency seems like a good idea. Also arrange that only one such message is reported per VACUUM FULL; in typical scenarios you could get spammed with many such messages, which seems a bit useless.
*	Remove an "optimization" I installed in 2001, to make repalloc() attempt to	Tom Lane	2007-08-12
\| \| \| \| \| \| \| \| \| \| \| \|	enlarge the memory chunk in-place when it was feasible to do so. This turns out to not work well at all for scenarios involving repeated cycles of palloc/repalloc/pfree: the eventually freed chunks go into the wrong freelist for the next initial palloc request, and so we consume memory indefinitely. While that could be defended against, the number of cases where the optimization can still be applied drops significantly, and adjusting the initial sizes of StringInfo buffers makes it drop to almost nothing. Seems better to just remove the extra complexity. Per recent discussion and testing.
*	Increase the initial size of StringInfo buffers to 1024 bytes (from 256);	Tom Lane	2007-08-12
\| \| \| \| \| \| \| \| \|	likewise increase the initial size of the scanner's literal buffer to 1024 (from 128). Instrumentation of the regression tests suggests that this saves a useful amount of repalloc() traffic --- the number of calls occurring during one set of tests drops from about 6900 to about 3900. The old sizes were chosen in the late 90's with an eye to machines much smaller than are common today.
*	Avoid memory leakage across successive calls of regexp_matches() or	Tom Lane	2007-08-11
\| \| \| \| \| \| \|	regexp_split_to_table() within a single query. This is only a partial solution, as it turns out that with enough matches per string these functions can also tickle a repalloc() misbehavior. But fixing that is a topic for a separate patch.
*	Code review for regexp_matches/regexp_split patch. Refactor to avoid assuming	Tom Lane	2007-08-11
\| \| \| \| \| \| \| \| \|	that cached compiled patterns will still be there when the function is next called. Clean up looping logic, thereby fixing bug identified by Pavel Stehule. Share setup code between the two functions, add some comments, and avoid risky mixing of int and size_t variables. Clean up the documentation a tad, and accept all the flag characters mentioned in table 9-19 rather than just a subset.
*	Revise postmaster startup/shutdown logic to eliminate the problem that a	Tom Lane	2007-08-09
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	constant flow of new connection requests could prevent the postmaster from completing a shutdown or crash restart. This is done by labeling child processes that are "dead ends", that is, we know that they were launched only to tell a client that it can't connect. These processes are managed separately so that they don't confuse us into thinking that we can't advance to the next stage of a shutdown or restart sequence, until the very end where we must wait for them to drain out so we can delete the shmem segment. Per discussion of a misbehavior reported by Keaton Adams. Since this code was baroque already, and my first attempt at fixing the problem made it entirely impenetrable, I took the opportunity to rewrite it in a state-machine style. That eliminates some duplicated code sections and hopefully makes everything a bit clearer.
*	Fix a gradual memory leak in ExecReScanAgg(). Because the aggregation	Neil Conway	2007-08-08
\| \| \| \| \| \| \| \| \| \| \| \|	hash table is allocated in a child context of the agg node's memory context, MemoryContextReset() will reset but not delete the child context. Since ExecReScanAgg() proceeds to build a new hash table from scratch (in a new sub-context), this results in leaking the header for the previous memory context. Therefore, use MemoryContextResetAndDeleteChildren() instead. Credit: My colleague Sailesh Krishnamurthy at Truviso for isolating the cause of the leak.
*	Fix thinko in multi-autovac-workers code: validity checks made by	Tom Lane	2007-08-08
\| \| \| \|	GUC assign hooks are supposed to be made whether doit is true or not.
*	Adjust the output of MemoryContextStats() so that the stats for a	Neil Conway	2007-08-07
\| \| \| \| \| \|	child memory contexts is indented two spaces to the right of its parent context. This should make it easier to deduce the memory context hierarchy from the output of MemoryContextStats().
*	Fix up bad layout of some comments (probably pg_indent's fault), and	Tom Lane	2007-08-04
\| \| \| \|	improve grammar a tad. Per Greg Stark.
*	Fix crash caused by log_timezone patch if we attempt to emit any elog messages	Tom Lane	2007-08-04
\| \| \| \| \| \| \| \| \|	between the setting of log_line_prefix and the setting of log_timezone. We can't realistically set log_timezone any earlier than we do now, so the best behavior seems to be to use GMT zone if any timestamps are to be logged during early startup. Create a dummy zone variable with a minimal definition of GMT (in particular it will never know about leap seconds), so that we can set it up without reference to any external files.
*	Fix a problem in my recent patch to initialize cancel_key for autovac workers	Tom Lane	2007-08-04
\| \| \| \| \| \| \| \|	as well as regular backends: if no regular backend launches before the autovac launcher tries to start an autovac worker, the postmaster would get an Assert fault due to calling PostmasterRandom before random_seed was initialized. Cleanest solution seems to be to take the initialization of random_seed out of ServerLoop and let PostmasterRandom do it for itself.
*	Switch over to using the src/timezone functions for formatting timestamps	Tom Lane	2007-08-04
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	displayed in the postmaster log. This avoids Windows-specific problems with localized time zone names that are in the wrong encoding, and generally seems like a good idea to forestall other potential platform-dependent issues. To preserve the existing behavior that all backends will log in the same time zone, create a new GUC variable log_timezone that can only be changed on a system-wide basis, and reference log-related calculations to that zone instead of the TimeZone variable. This fixes the issue reported by Hiroshi Saito that timestamps printed by xlog.c startup could be improperly localized on Windows. We still need a simpler patch for that problem in the back branches, however.