postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
*	Allow Pin/UnpinBuffer to operate in a lockfree manner.	Andres Freund	2016-04-10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Pinning/Unpinning a buffer is a very frequent operation; especially in read-mostly cache resident workloads. Benchmarking shows that in various scenarios the spinlock protecting a buffer header's state becomes a significant bottleneck. The problem can be reproduced with pgbench -S on larger machines, but can be considerably worse for queries which touch the same buffers over and over at a high frequency (e.g. nested loops over a small inner table). To allow atomic operations to be used, cram BufferDesc's flags, usage_count, buf_hdr_lock, refcount into a single 32bit atomic variable; that allows to manipulate them together using 32bit compare-and-swap operations. This requires reducing MAX_BACKENDS to 2^18-1 (which could be lifted by using a 64bit field, but it's not a realistic configuration atm). As not all operations can easily implemented in a lockfree manner, implement the previous buf_hdr_lock via a flag bit in the atomic variable. That way we can continue to lock the header in places where it's needed, but can get away without acquiring it in the more frequent hot-paths. There's some additional operations which can be done without the lock, but aren't in this patch; but the most important places are covered. As bufmgr.c now essentially re-implements spinlocks, abstract the delay logic from s_lock.c into something more generic. It now has already two users, and more are coming up; there's a follupw patch for lwlock.c at least. This patch is based on a proof-of-concept written by me, which Alexander Korotkov made into a fully working patch; the committed version is again revised by me. Benchmarking and testing has, amongst others, been provided by Dilip Kumar, Alexander Korotkov, Robert Haas. On a large x86 system improvements for readonly pgbench, with a high client count, of a factor of 8 have been observed. Author: Alexander Korotkov and Andres Freund Discussion: 2400449.GjM57CE0Yg@dinodell
*	Improve contrib/bloom regression test using code coverage info.	Tom Lane	2016-04-10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Originally, this test created a 100000-row test table, which made it run rather slowly compared to other contrib tests. Investigation with gcov showed that we got no further improvement in code coverage after the first 700 or so rows, making the large table 99% a waste of time. Cut it back to 2000 rows to fix the runtime problem and still leave some headroom for testing behaviors that may appear later. A closer look at the gcov results showed that the main coverage omissions in contrib/bloom occurred because the test never filled more than one entry in the notFullPage array; which is unsurprising because it exercised index cleanup only in the scenario of complete table deletion, allowing every page in the index to become deleted rather than not-full. Add testing that allows the not-full path to be exercised as well. Also, test the amvalidate function, because blvalidate.c had zero coverage without that, and besides it's a good idea to check for mistakes in the bloom opclass definitions.
*	Get rid of blinsert()'s use of GenericXLogUnregister().	Tom Lane	2016-04-09
\| \| \| \| \| \| \| \|	That routine is dangerous, and unnecessary once we get rid of this one caller. In passing, fix failure to clean up temp memory context, or switch back to caller's context, during slowest exit path.
*	Add the "snapshot too old" feature	Kevin Grittner	2016-04-08
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This feature is controlled by a new old_snapshot_threshold GUC. A value of -1 disables the feature, and that is the default. The value of 0 is just intended for testing. Above that it is the number of minutes a snapshot can reach before pruning and vacuum are allowed to remove dead tuples which the snapshot would otherwise protect. The xmin associated with a transaction ID does still protect dead tuples. A connection which is using an "old" snapshot does not get an error unless it accesses a page modified recently enough that it might not be able to produce accurate results. This is similar to the Oracle feature, and we use the same SQLSTATE and error message for compatibility.
*	Modify BufferGetPage() to prepare for "snapshot too old" feature	Kevin Grittner	2016-04-08
\| \| \| \| \| \| \| \| \| \| \|	This patch is a no-op patch which is intended to reduce the chances of failures of omission once the functional part of the "snapshot too old" patch goes in. It adds parameters for snapshot, relation, and an enum to specify whether the snapshot age check needs to be done for the page at this point. This initial patch passes NULL for the first two new parameters and BGP_NO_SNAPSHOT_TEST for the third. The follow-on patch will change the places where the test needs to be made.
*	Revert CREATE INDEX ... INCLUDING ...	Teodor Sigaev	2016-04-08
\| \| \| \| \| \|	It's not ready yet, revert two commits 690c543550b0d2852060c18d270cdb534d339d9a - unstable test output 386e3d7609c49505e079c40c65919d99feb82505 - patch itself
*	Fix output of regression test of contrib/tsearch2	Teodor Sigaev	2016-04-08
\| \| \| \|	Just forget to add in 1ec4c7c055ca045c5df6352a4cdacd9aa778e598
*	CREATE INDEX ... INCLUDING (column[, ...])	Teodor Sigaev	2016-04-08
\| \| \| \| \| \| \| \| \| \|	Now indexes (but only B-tree for now) can contain "extra" column(s) which doesn't participate in index structure, they are just stored in leaf tuples. It allows to use index only scan by using single index instead of two or more indexes. Author: Anastasia Lubennikova with minor editorializing by me Reviewers: David Rowley, Peter Geoghegan, Jeff Janes
*	Replace printf format %i by %d	Peter Eisentraut	2016-04-08
\| \| \| \|	see also ce8d7bb6440710058503d213b2aafcdf56a5b481
*	Fix printf format	Peter Eisentraut	2016-04-08
\|
*	Phrase full text search.	Teodor Sigaev	2016-04-07
\| \| \| \| \| \| \| \| \| \| \| \| \|	Patch introduces new text search operator (<-> or <DISTANCE>) into tsquery. On-disk and binary in/out format of tsquery are backward compatible. It has two side effect: - change order for tsquery, so, users, who has a btree index over tsquery, should reindex it - less number of parenthesis in tsquery output, and tsquery becomes more readable Authors: Teodor Sigaev, Oleg Bartunov, Dmitry Ivanov Reviewers: Alexander Korotkov, Artur Zakirov
*	Run pgindent on a batch of (mostly-planner-related) source files.	Tom Lane	2016-04-06
\| \| \| \| \|	Getting annoyed at the amount of unrelated chatter I get from pgindent'ing Rowley's unique-joins patch. Re-indent all the files it touches.
*	Modify test_decoding/messages to remove non-ascii chars	Simon Riggs	2016-04-06
\|
*	Generic Messages for Logical Decoding	Simon Riggs	2016-04-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	API and mechanism to allow generic messages to be inserted into WAL that are intended to be read by logical decoding plugins. This commit adds an optional new callback to the logical decoding API. Messages are either text or bytea. Messages can be transactional, or not, and are identified by a prefix to allow multiple concurrent decoding plugins. (Not to be confused with Generic WAL records, which are intended to allow crash recovery of extensible objects.) Author: Petr Jelinek and Andres Freund Reviewers: Artur Zakirov, Tomas Vondra, Simon Riggs Discussion: 5685F999.6010202@2ndquadrant.com
*	Fix typo	Teodor Sigaev	2016-04-04
\| \| \| \|	Michael Paquier
*	Clean up dubious code in contrib/seg.	Tom Lane	2016-04-03
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The restore() function assumed that the result of sprintf() with %e format would necessarily contain an 'e', which is false: what if the supplied number is an infinity or NaN? If that did happen, we'd get a null-pointer-dereference core dump. The case appears impossible currently, because seg_in() does not accept such values, and there are no seg-creating functions that would create one. But it seems unwise to rely on it never happening in future. Quite aside from that, the code was pretty ugly: it relied on modifying a static format string when it could use a "*" precision argument, and it used strtok() entirely gratuitously, and it stripped off trailing spaces by hand instead of just not asking for them to begin with. Coverity noticed the potential null pointer dereference (though I wonder why it didn't complain years ago, since this code is ancient). Since this is just code cleanup and forestalling a hypothetical future bug, there seems no need for back-patching.
*	Fix contrib/bloom to not fail under CLOBBER_CACHE_ALWAYS.	Tom Lane	2016-04-03
\| \| \| \| \| \|	The code was supposing that rd_amcache wouldn't disappear from under it during a scan; which is wrong. Copy the data out of the relcache rather than trying to reference it there.
*	Clean up some stuff in new contrib/bloom module.	Tom Lane	2016-04-03
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Coverity complained about implicit sign-extension in the BloomPageGetFreeSpace macro, probably because sizeOfBloomTuple isn't wide enough for size calculations. No overflow is really possible as long as maxoff and sizeOfBloomTuple are small enough to represent a realistic situation, but it seems like a good idea to declare sizeOfBloomTuple as Size not int32. Add missing check on BloomPageAddItem() result, again from Coverity. Avoid core dump due to not allocating so->sign array when scan->numberOfKeys is zero. Also thanks to Coverity. Use FLEXIBLE_ARRAY_MEMBER rather than declaring an array as size 1 when it isn't necessarily. Very minor beautification of related code. Unfortunately, none of the Coverity-detected mistakes look like they could account for the remaining buildfarm unhappiness with this module. It's barely possible that the FLEXIBLE_ARRAY_MEMBER mistake does account for that, if it's enabling bogus compiler optimizations; but I'm not terribly optimistic. We probably still have bugs to find here.
*	Add missing "static".	Tom Lane	2016-04-02
\| \| \| \|	Per buildfarm member pademelon.
*	Fix condition in e9e441c9fac6cbc0510cded6abb9d0e6b646ecaf	Teodor Sigaev	2016-04-02
\| \| \| \|	Comment is right, but if - not.
*	Prevent mark as deleted and as 'has free space' page in bloom module	Teodor Sigaev	2016-04-02
\| \| \| \| \|	Vacuum might put page into list of pages with some free space and mark as deleted at the same time.
*	Fixes in bloom contrib module	Teodor Sigaev	2016-04-02
\| \| \| \| \| \| \| \|	Looking at result of buildfarm member jaguarundi it seems to me that BloomOptions isn't inited sometime, but I don't see yet how it's possible. Nevertheless, check of signature length's is missed, so, add a limit of it. Also add missed GenericXLogAbort() in case of already deleted page in vacuum + minor code refactoring.
*	Copyedit comments and documentation.	Noah Misch	2016-04-01
\|
*	Fixes in bloom contrib module missed during review	Teodor Sigaev	2016-04-01
\| \| \| \| \|	- macroses llike (var & FLAG) are changed to ((var & FLAG) != 0) - do not copy uninitialized part of notFullPage array to page
*	Bloom index contrib module	Teodor Sigaev	2016-04-01
\| \| \| \| \| \| \| \| \| \| \|	Module provides new access method. It is actually a simple Bloom filter implemented as pgsql's index. It could give some benefits on search with large number of columns. Module is a single way to test generic WAL interface committed earlier. Author: Teodor Sigaev, Alexander Korotkov Reviewers: Aleksander Alekseev, Michael Paquier, Jim Nasby
*	Don't require a user mapping for FDWs to work.	Robert Haas	2016-03-28
\| \| \| \| \| \| \| \| \|	Commit fbe5a3fb73102c2cfec11aaaa4a67943f4474383 accidentally changed this behavior; put things back the way they were, and add some regression tests. Report by Andres Freund; patch by Ashutosh Bapat, with a bit of kibitzing by me.
*	Add missing checks to some of pageinspect's BRIN functions	Alvaro Herrera	2016-03-28
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	brin_page_type() and brin_metapage_info() did not enforce being called by superuser, like other pageinspect functions that take bytea do. Since they don't verify the passed page thoroughly, it is possible to use them to read the server memory with a carefully crafted bytea value, up to a file kilobytes from where the input bytea is located. Have them throw errors if called by a non-superuser. Report and initial patch: Andreas Seltenreich Security: CVE-2016-3065
*	Don't use !! but != 0/NULL to force boolean evaluation.	Andres Freund	2016-03-27
\| \| \| \| \| \| \|	I introduced several uses of !! to force bit arithmetic to be boolean, but per discussion the project prefers != 0/NULL. Discussion: CA+TgmoZP5KakLGP6B4vUjgMBUW0woq_dJYi0paOz-My0Hwt_vQ@mail.gmail.com
*	postgres_fdw: Fix crash when pushing down multiple joins.	Robert Haas	2016-03-23
\| \| \| \| \| \| \| \| \|	A join clause might mention multiple relations on either side, so it need not be the case that a given joinrel's constituent relations are all on one side of the join clause or all on the other. Report by Rajkumar Raghuwanshi. Analysis and fix by Michael Paquier and Ashutosh Bapat.
*	Clean up some Coverity complaints about commit 0bf3ae88af330496.	Tom Lane	2016-03-21
\| \| \| \| \| \| \| \| \|	The two get_tle_by_resno() calls introduced by this commit lacked any check for a NULL return, unlike any other calls of that function anywhere in our tree. Coverity quite properly complained about it. Also fix a misindented line in process_query_params(), which Coverity also complained about on the grounds that the bad indentation suggested possible programmer misinterpretation.
*	Fix phony .PHONY.	Tom Lane	2016-03-19
\| \| \| \|	A couple makefiles had misspelled the magic .PHONY target as PHONY.
*	Directly modify foreign tables.	Robert Haas	2016-03-18
\| \| \| \| \| \| \| \| \|	postgres_fdw can now sent an UPDATE or DELETE statement directly to the foreign server in simple cases, rather than sending a SELECT FOR UPDATE statement and then updating or deleting rows one-by-one. Etsuro Fujita, reviewed by Rushabh Lathia, Shigeru Hanada, Kyotaro Horiguchi, Albe Laurenz, Thom Brown, and me.
*	Various minor corrections of and improvements to comments.	Robert Haas	2016-03-18
\| \| \| \|	Aleksander Alekseev
*	pg_trgm's set_limit() now uses SetConfigOption()	Teodor Sigaev	2016-03-18
\| \| \| \| \| \| \| \| \| \|	Deprecated set_limit() is modified to use SetConfigOption() to set similarity_threshold which is actually an instance of pg_trgm.similarity_threshold GUC variable. Previous coding directly sets similarity_threshold what could cause an inconsistency between states of actual variable and GUC representation. Per gripe from Tom Lane
*	Add files forgotten in f576b17cd6ba653bdace1f0da9a3b57f4984e460	Teodor Sigaev	2016-03-16
\|
*	Add word_similarity to pg_trgm contrib module.	Teodor Sigaev	2016-03-16
\| \| \| \| \| \| \| \| \| \|	Patch introduces a concept of similarity over string and just a word from another string. Version of extension is not changed because 1.2 was already introduced in 9.6 release cycle, so, there wasn't a public version. Author: Alexander Korotkov, Artur Zakirov
*	GUC variable pg_trgm.similarity_threshold insead of set_limit()	Teodor Sigaev	2016-03-16
\| \| \| \| \| \| \| \|	Use GUC variable pg_trgm.similarity_threshold insead of set_limit()/show_limit() which was introduced when defining GUC varuables by modules was absent. Author: Artur Zakirov
*	fix typo in comment	Teodor Sigaev	2016-03-16
\|
*	Improve script generating unaccent rules	Teodor Sigaev	2016-03-16
\| \| \| \| \| \|	Script now use the standard Unicode transliterator Latin-ASCII. Author: Leonard Benedetti
*	Fix typos.	Robert Haas	2016-03-15
\| \| \| \|	Oskari Saarenmaa
*	postgres_fdw: make_tuple_from_result_row should set cur_attno for ctid.	Robert Haas	2016-03-15
\| \| \| \| \| \| \| \| \|	There's no reason for this function to do this for every other attribute number and omit it for CTID, especially since conversion_error_callback has code to handle that case. This seems to be an oversight in commit e690b9515072fd7767fdeca5c54166f6a77733bc. Etsuro Fujita
*	Allow callers of create_foreignscan_path to specify nondefault PathTarget.	Tom Lane	2016-03-14
\| \| \| \| \| \| \| \| \|	Although the default choice of rel->reltarget should typically be sufficient for scan or join paths, it's not at all sufficient for the purposes PathTargets were invented for; in particular not for upper-relation Paths. So break API compatibility by adding a PathTarget argument to create_foreignscan_path(). To ease updating of existing code, accept a NULL value of the argument as selecting rel->reltarget.
*	Rethink representation of PathTargets.	Tom Lane	2016-03-14
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	In commit 19a541143a09c067 I did not make PathTarget a subtype of Node, and embedded a RelOptInfo's reltarget directly into it rather than having a separately-allocated Node. In hindsight that was misguided micro-optimization, enabled by the fact that at that point we didn't have any Paths with custom PathTargets. Now that PathTarget processing has been fleshed out some more, it's easier to see that it's better to have PathTarget as an indepedent Node type, even if it does cost us one more palloc to create a RelOptInfo. So change it while we still can. This commit just changes the representation, without doing anything more interesting than that.
*	Update more comments for 96198d94cb7adc664bda341842dc8db671d8be72.	Robert Haas	2016-03-14
\| \| \| \| \|	Etsuro Fujita, reviewed (though not completely endorsed) by Ashutosh Bapat, and slightly expanded by me.
*	Rename auto_explain.sample_ratio to sample_rate	Magnus Hagander	2016-03-13
\| \| \| \| \| \|	Per suggestion from Tomas Vondra Author: Julien Rouhaud
*	Widen query numbers-of-tuples-processed counters to uint64.	Tom Lane	2016-03-12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch widens SPI_processed, EState's es_processed field, PortalData's portalPos field, FuncCallContext's call_cntr and max_calls fields, ExecutorRun's count argument, PortalRunFetch's result, and the max number of rows in a SPITupleTable to uint64, and deals with (I hope) all the ensuing fallout. Some of these values were declared uint32 before, and others "long". I also removed PortalData's posOverflow field, since that logic seems pretty useless given that portalPos is now always 64 bits. The user-visible results are that command tags for SELECT etc will correctly report tuple counts larger than 4G, as will plpgsql's GET GET DIAGNOSTICS ... ROW_COUNT command. Queries processing more tuples than that are still not exactly the norm, but they're becoming more common. Most values associated with FETCH/MOVE distances, such as PortalRun's count argument and the count argument of most SPI functions that have one, remain declared as "long". It's not clear whether it would be worth promoting those to int64; but it would definitely be a large dollop of additional API churn on top of this, and it would only help 32-bit platforms which seem relatively less likely to see any benefit. Andreas Scherbaum, reviewed by Christian Ullrich, additional hacking by me
*	Allow setting sample ratio for auto_explain	Magnus Hagander	2016-03-11
\| \| \| \| \| \| \| \| \|	New configuration parameter auto_explain.sample_ratio makes it possible to log just a fraction of the queries meeting the configured threshold, to reduce the amount of logging. Author: Craig Ringer and Julien Rouhaud Review: Petr Jelinek
*	Refactor pull_var_clause's API to make it less tedious to extend.	Tom Lane	2016-03-10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In commit 1d97c19a0f748e94 and later c1d9579dd8bf3c92, we extended pull_var_clause's API by adding enum-type arguments. That's sort of a pain to maintain, though, because it means every time we add a new behavior we must touch every last one of the call sites, even if there's a reasonable default behavior that most of them could use. Let's switch over to using a bitmask of flags, instead; that seems more maintainable and might save a nanosecond or two as well. This commit changes no behavior in itself, though I'm going to follow it up with one that does add a new behavior. In passing, remove flatten_tlist(), which has not been used since 9.1 and would otherwise need the same API changes. Removing these enums means that optimizer/tlist.h no longer needs to depend on optimizer/var.h. Changing that caused a number of C files to need addition of #include "optimizer/var.h" (probably we can thank old runs of pgrminclude for that); but on balance it seems like a good change anyway.
*	Avoid unlikely data-loss scenarios due to rename() without fsync.	Andres Freund	2016-03-09
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Renaming a file using rename(2) is not guaranteed to be durable in face of crashes. Use the previously added durable_rename()/durable_link_or_rename() in various places where we previously just renamed files. Most of the changed call sites are arguably not critical, but it seems better to err on the side of too much durability. The most prominent known case where the previously missing fsyncs could cause data loss is crashes at the end of a checkpoint. After the actual checkpoint has been performed, old WAL files are recycled. When they're filled, their contents are fdatasynced, but we did not fsync the containing directory. An OS/hardware crash in an unfortunate moment could then end up leaving that file with its old name, but new content; WAL replay would thus not replay it. Reported-By: Tomas Vondra Author: Michael Paquier, Tomas Vondra, Andres Freund Discussion: 56583BDD.9060302@2ndquadrant.com Backpatch: All supported branches
*	pgcrypto: support changing S2K iteration count	Alvaro Herrera	2016-03-09
\| \| \| \| \| \| \| \| \| \|	pgcrypto already supports key-stretching during symmetric encryption, including the salted-and-iterated method; but the number of iterations was not configurable. This commit implements a new s2k-count parameter to pgp_sym_encrypt() which permits selecting a larger number of iterations. Author: Jeff Janes