postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
...
*	Support fls().	Robert Haas	2012-02-07
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The immediate impetus for this is that Noah Misch's patch to elide unnecessary table and index rebuilds when changing typmod for temporal types uses it; and this is extracted from that patch, with some further commentary by me. But it seems logically separate from the remainder of the patch, so I'm committing it separately; this is not the first time someone has wanted fls() in the backend and probably won't be the last. If we end up using this in more performance-critical spots it may be worthwhile to add some architecture-specific optimizations to our src/port version of fls() - e.g. any x86 platform can implement this using the assembly instruction BSRL. But performance won't matter a bit for assessing typmod changes, so I'm not worried about that right now.
*	Add a transform function for varbit typmod coercisions.	Robert Haas	2012-02-07
\| \| \| \| \| \| \| \|	This enables ALTER TABLE to skip table and index rebuilds when the new type is unconstraint varbit, or when the allowable number of bits is not decreasing. Noah Misch, with review and a fix for an OID collision by me.
*	Add a transform function for numeric typmod coercisions.	Robert Haas	2012-02-07
\| \| \| \| \| \| \| \| \|	This enables ALTER TABLE to skip table and index rebuilds when a column is changed to an unconstrained numeric, or when the scale is unchanged and the precision does not decrease. Noah Misch, with a few stylistic changes and a fix for an OID collision by me.
*	Add TIMING option to EXPLAIN, to allow eliminating of timing overhead.	Robert Haas	2012-02-07
\| \| \| \| \| \| \| \|	Sometimes it may be useful to get actual row counts out of EXPLAIN (ANALYZE) without paying the cost of timing every node entry/exit. With this patch, you can say EXPLAIN (ANALYZE, TIMING OFF) to get that. Tomas Vondra, reviewed by Eric Theise, with minor doc changes by me.
*	Add array_to_json and row_to_json functions.	Andrew Dunstan	2012-02-03
\| \| \| \| \| \| \|	Also move the escape_json function from explain.c to json.c where it seems to belong. Andrew Dunstan, Reviewd by Abhijit Menon-Sen.
*	Allow spgist's text_ops to handle pattern-matching operators.	Robert Haas	2012-02-02
\| \| \| \| \| \| \|	This was presumably intended to work this way all along, but a few key bits of indxpath.c didn't get the memo. Robert Haas and Tom Lane
*	Catversion bump for JSON patch.	Robert Haas	2012-01-31
\| \| \| \|	Sigh.
*	Built-in JSON data type.	Robert Haas	2012-01-31
\| \| \| \| \| \| \| \| \| \|	Like the XML data type, we simply store JSON data as text, after checking that it is valid. More complex operations such as canonicalization and comparison may come later, but this is enough for not. There are a few open issues here, such as whether we should attempt to detect UTF-8 surrogate pairs represented as \uXXXX\uYYYY, but this gets the basic framework in place.
*	Make group commit more effective.	Heikki Linnakangas	2012-01-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a backend needs to flush the WAL, and someone else is already flushing the WAL, wait until it releases the WALInsertLock and check if we still need to do the flush or if the other backend already did the work for us, before acquiring WALInsertLock. This helps group commit, because when the WAL flush finishes, all the backends that were waiting for it can be woken up in one go, and the can all concurrently observe that they're done, rather than waking them up one by one in a cascading fashion. This is based on a new LWLock function, LWLockWaitUntilFree(), which has peculiar semantics. If the lock is immediately free, it grabs the lock and returns true. If it's not free, it waits until it is released, but then returns false without grabbing the lock. This is used in XLogFlush(), so that when the lock is acquired, the backend flushes the WAL, but if it's not, the backend first checks the current flush location before retrying. Original patch and benchmarking by Peter Geoghegan and Simon Riggs, although this patch as committed ended up being very different from that.
*	Various minor comments changes from bgwriter to checkpointer.	Simon Riggs	2012-01-30
\|
*	Assorted comment fixes, mostly just typos, but some obsolete statements.	Tom Lane	2012-01-29
\| \| \| \|	YAMAMOTO Takashi
*	Fix typo in comment.	Tom Lane	2012-01-29
\| \| \| \|	Peter Geoghegan
*	Use parameterized paths to generate inner indexscans more flexibly.	Tom Lane	2012-01-27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch fixes the planner so that it can generate nestloop-with- inner-indexscan plans even with one or more levels of joining between the indexscan and the nestloop join that is supplying the parameter. The executor was fixed to handle such cases some time ago, but the planner was not ready. This should improve our plans in many situations where join ordering restrictions formerly forced complete table scans. There is probably a fair amount of tuning work yet to be done, because of various heuristics that have been added to limit the number of parameterized paths considered. However, we are not going to find out what needs to be adjusted until the code gets some real-world use, so it's time to get it in there where it can be tested easily. Note API change for index AM amcostestimate functions. I'm not aware of any non-core index AMs, but if there are any, they will need minor adjustments.
*	Show default privileges in information schema	Peter Eisentraut	2012-01-27
\| \| \| \| \| \| \| \| \| \| \| \|	Hitherto, the information schema only showed explicitly granted privileges that were visible in the *acl catalog columns. If no privileges had been granted, the implicit privileges were not shown. To fix that, add an SQL-accessible version of the acldefault() function, and use that inside the aclexplode() calls to substitute the catalog-specific default privilege set for null values. reviewed by Abhijit Menon-Sen
*	Disallow ALTER DOMAIN on non-domain type everywhere	Peter Eisentraut	2012-01-27
\| \| \| \| \| \|	This has been the behavior already in most cases, but through omission, ALTER DOMAIN / OWNER TO and ALTER DOMAIN / SET SCHEMA would silently work on non-domain types as well.
*	Hide most variable-length fields from Form_pg_* structs	Peter Eisentraut	2012-01-27
\| \| \| \| \| \| \| \| \| \| \| \| \|	Those fields only appear in the structs so that genbki.pl can create the BKI bootstrap files for the catalogs. But they are not actually usable from C. So hiding them can prevent coding mistakes, saves stack space, and can help the compiler. In certain catalogs, the first variable-length field has been kept visible after manual inspection. These exceptions are noted in C comments. reviewed by Tom Lane
*	Make bgwriter sleep longer when it has no work to do, to save electricity.	Heikki Linnakangas	2012-01-26
\| \| \| \| \| \| \| \| \|	To make it wake up promptly when activity starts again, backends nudge it by setting a latch in MarkBufferDirty(). The latch is kept set while bgwriter is active, so there is very little overhead from that when the system is busy. It is only armed before going into longer sleep. Peter Geoghegan, with some changes by me.
*	Add deadlock counter to pg_stat_database	Magnus Hagander	2012-01-26
\| \| \| \| \| \| \|	Adds a counter that tracks number of deadlocks that occurred in each database to pg_stat_database. Magnus Hagander, reviewed by Jaime Casanova
*	Classify DROP operations by whether or not they are user-initiated.	Robert Haas	2012-01-26
\| \| \| \| \| \| \|	This doesn't do anything useful just yet, but is intended as supporting infrastructure for allowing sepgsql to sensibly check DROP permissions. KaiGai Kohei and Robert Haas
*	Track temporary file count and size in pg_stat_database	Magnus Hagander	2012-01-26
\| \| \| \| \| \| \| \|	Add counters for number and size of temporary files used for spill-to-disk queries for each database to the pg_stat_database view. Tomas Vondra, review by Magnus Hagander
*	Instrument index-only scans to count heap fetches performed.	Robert Haas	2012-01-25
\| \| \| \|	Patch by me; review by Tom Lane, Jeff Davis, and Peter Geoghegan.
*	Allow pg_basebackup from standby node with safety checking.	Simon Riggs	2012-01-25
\| \| \| \| \| \| \|	Base backup follows recommended procedure, plus goes to great lengths to ensure that partial page writes are avoided. Jun Ishizuka and Fujii Masao, with minor modifications
*	Add pg_trigger_depth() function	Alvaro Herrera	2012-01-25
\| \| \| \| \| \| \| \| \| \|	This reports the depth level of triggers currently in execution, or zero if not called from inside a trigger. No catversion bump in this patch, but you have to initdb if you want access to the new function. Author: Kevin Grittner
*	Add new replication mode synchronous_commit = 'write'.	Simon Riggs	2012-01-24
\| \| \| \| \| \| \| \| \|	Replication occurs only to memory on standby, not to disk, so provides additional performance if user wishes to reduce durability level slightly. Adds concept of multiple independent sync rep queues. Fujii Masao and Simon Riggs
*	Resolve timing issue with logging locks for Hot Standby.	Simon Riggs	2012-01-23
\| \| \| \| \| \| \| \| \| \|	We log AccessExclusiveLocks for replay onto standby nodes, but because of timing issues on ProcArray it is possible to log a lock that is still held by a just committed transaction that is very soon to be removed. To avoid any timing issue we avoid applying locks made by transactions with InvalidXid. Simon Riggs, bug report Tom Lane, diagnosis Pavan Deolasee
*	ALTER <thing> [IF EXISTS] ... allows silent DDL if required,	Simon Riggs	2012-01-23
\| \| \| \| \| \|	e.g. ALTER FOREIGN TABLE IF EXISTS foo RENAME TO bar Pavel Stehule
*	Add bitwise AND, OR, and NOT operators for macaddr data type.	Robert Haas	2012-01-19
\| \| \| \|	Brendan Jurd, reviewed by Fujii Masao
*	Separate state from query string in pg_stat_activity	Magnus Hagander	2012-01-19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This separates the state (running/idle/idleintransaction etc) into it's own field ("state"), and leaves the query field containing just query text. The query text will now mean "current query" when a query is running and "last query" in other states. Accordingly,the field has been renamed from current_query to query. Since backwards compatibility was broken anyway to make that, the procpid field has also been renamed to pid - along with the same field in pg_stat_replication for consistency. Scott Mead and Magnus Hagander, review work from Greg Smith
*	Prevent adding relations to a concurrently dropped schema.	Robert Haas	2012-01-16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In the previous coding, it was possible for a relation to be created via CREATE TABLE, CREATE VIEW, CREATE SEQUENCE, CREATE FOREIGN TABLE, etc. in a schema while that schema was meanwhile being concurrently dropped. This led to a pg_class entry with an invalid relnamespace value. The same problem could occur if a relation was moved using ALTER .. SET SCHEMA while the target schema was being concurrently dropped. This patch prevents both of those scenarios by locking the schema to which the relation is being added using AccessShareLock, which conflicts with the AccessExclusiveLock taken by DROP. As a desirable side effect, this also prevents the use of CREATE OR REPLACE VIEW to queue for an AccessExclusiveLock on a relation on which you have no rights: that will now fail immediately with a permissions error, before trying to obtain a lock. We need similar protection for all other object types, but as everything other than relations uses a slightly different set of code paths, I'm leaving that for a separate commit. Original complaint (as far as I could find) about CREATE by Nikhil Sontakke; risk for ALTER .. SET SCHEMA pointed out by Tom Lane; further details by Dan Farina; patch by me; review by Hitoshi Harada.
*	Fix CLUSTER/VACUUM FULL for toast values owned by recently-updated rows.	Tom Lane	2012-01-12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In commit 7b0d0e9356963d5c3e4d329a917f5fbb82a2ef05, I made CLUSTER and VACUUM FULL try to preserve toast value OIDs from the original toast table to the new one. However, if we have to copy both live and recently-dead versions of a row that has a toasted column, those versions may well reference the same toast value with the same OID. The patch then led to duplicate-key failures as we tried to insert the toast value twice with the same OID. (The previous behavior was not very desirable either, since it would have silently inserted the same value twice with different OIDs. That wastes space, but what's worse is that the toast values inserted for already-dead heap rows would not be reclaimed by subsequent ordinary VACUUMs, since they go into the new toast table marked live not deleted.) To fix, check if the copied OID already exists in the new toast table, and if so, assume that it stores the desired value. This is reasonably safe since the only case where we will copy an OID from a previous toast pointer is when toast_insert_or_update was given that toast pointer and so we just pulled the data from the old table; if we got two different values that way then we have big problems anyway. We do have to assume that no other backend is inserting items into the new toast table concurrently, but that's surely safe for CLUSTER and VACUUM FULL. Per bug #6393 from Maxim Boguk. Back-patch to 9.0, same as the previous patch.
*	Remove useless 'needlock' argument from GetXLogInsertRecPtr. It was always	Heikki Linnakangas	2012-01-11
\| \| \| \|	passed as 'true'.
*	Rename the internal structures of the CREATE TABLE (LIKE ...) facility	Peter Eisentraut	2012-01-07
\| \| \| \| \| \| \| \| \|	The original implementation of this interpreted it as a kind of "inheritance" facility and named all the internal structures accordingly. This turned out to be very confusing, because it has nothing to do with the INHERITS feature. So rename all the internal parser infrastructure, update the comments, adjust the error messages, and split up the regression tests.
*	Use __sync_lock_test_and_set() for spinlocks on ARM, if available.	Tom Lane	2012-01-07
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Historically we've used the SWPB instruction for TAS() on ARM, but this is deprecated and not available on ARMv6 and later. Instead, make use of a GCC builtin if available. We'll still fall back to SWPB if not, so as not to break existing ports using older GCC versions. Eventually we might want to try using __sync_lock_test_and_set() on some other architectures too, but for now that seems to present only risk and not reward. Back-patch to all supported versions, since people might want to use any of them on more recent ARM chips. Martin Pitt
*	Slightly reorganize struct SnapshotData.	Robert Haas	2012-01-06
\| \| \| \| \| \| \| \| \|	This squeezes out a bunch of alignment padding, reducing the size from 72 to 56 bytes on my machine. At least in my testing, this didn't produce any measurable performance improvement, but the space savings seem like enough justification. Andres Freund
*	Improve behavior of concurrent ALTER TABLE, and do some refactoring.	Robert Haas	2012-01-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ALTER TABLE (and ALTER VIEW, ALTER SEQUENCE, etc.) now use a RangeVarGetRelid callback to check permissions before acquiring a table lock. We also now use the same callback for all forms of ALTER TABLE, rather than having separate, almost-identical callbacks for ALTER TABLE .. SET SCHEMA and ALTER TABLE .. RENAME, and no callback at all for everything else. I went ahead and changed the code so that no form of ALTER TABLE works on foreign tables; you must use ALTER FOREIGN TABLE instead. In 9.1, it was possible to use ALTER TABLE .. SET SCHEMA or ALTER TABLE .. RENAME on a foreign table, but not any other form of ALTER TABLE, which did not seem terribly useful or consistent. Patch by me; review by Noah Misch.
*	Make the number of CLOG buffers adaptive, based on shared_buffers.	Robert Haas	2012-01-06
\| \| \| \| \| \| \| \| \| \| \| \|	Previously, this was hardcoded: we always had 8. Performance testing shows that isn't enough, especially on big SMP systems, so we allow it to scale up as high as 32 when there's adequate memory. On the flip side, when shared_buffers is very small, drop the number of CLOG buffers down to as little as 4, so that we can start the postmaster even when very little shared memory is available. Per extensive discussion with Simon Riggs, Tom Lane, and others on pgsql-hackers.
*	Improve ALTER DOMAIN / DROP CONSTRAINT with nonexistent constraint	Peter Eisentraut	2012-01-05
\| \| \| \| \| \| \|	ALTER DOMAIN / DROP CONSTRAINT on a nonexistent constraint name did not report any error. Now it reports an error. The IF EXISTS option was added to get the usual behavior of ignoring nonexistent objects to drop.
*	Use a non-locking initial test in TAS_SPIN on PPC.	Tom Lane	2012-01-03
\| \| \| \| \| \| \| \|	Further testing convinces me that this is helpful at sufficiently high contention levels, though it's still worrisome that it loses slightly at lower contention levels. Per Manabu Ori.
*	Support for building with MS Visual Studio 2010.	Andrew Dunstan	2012-01-03
\| \| \| \|	Brar Piening, reviewed by Craig Ringer.
*	Use LWSYNC in place of SYNC/ISYNC in PPC spinlocks, where possible.	Tom Lane	2012-01-02
\| \| \| \| \| \| \| \| \| \| \| \| \|	This is allegedly a win, at least on some PPC implementations, according to the PPC ISA documents. However, as with LWARX hints, some PPC platforms give an illegal-instruction failure. Use the same trick as before of assuming that PPC64 platforms will accept it; we might need to refine that based on experience, but there are other projects doing likewise according to google. I did not add an assembler compatibility test because LWSYNC has been around much longer than hint bits, and it seems unlikely that any toolchains currently in use don't recognize it.
*	Use 4-byte slock_t on both PPC and PPC64.	Tom Lane	2012-01-02
\| \| \| \| \| \| \|	Previously we defined slock_t as 8 bytes on PPC64, but the TAS assembly code uses word-wide operations regardless, so that the second word was just wasted space. There doesn't appear to be any performance benefit in adding the second word, so get rid of it to simplify the code.
*	Use mutex hint bit in PPC LWARX instructions, where possible.	Tom Lane	2012-01-02
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The hint bit makes for a small but measurable performance improvement in access to contended spinlocks. On the other hand, some PPC chips give an illegal-instruction failure. There doesn't seem to be a completely bulletproof way to tell whether the hint bit will cause an illegal-instruction failure other than by trying it; but most if not all 64-bit PPC machines should accept it, so follow the Linux kernel's lead and assume it's okay to use it in 64-bit builds. Of course we must also check whether the assembler accepts the command, since even with a recent CPU the toolchain could be old. Patch by Manabu Ori, significantly modified by me.
*	Update copyright notices for year 2012.	Bruce Momjian	2012-01-01
\|
*	Send new protocol keepalive messages to standby servers.	Simon Riggs	2011-12-31
\| \| \| \| \|	Allows streaming replication users to calculate transfer latency and apply delay via internal functions. No external functions yet.
*	Remove support for on_exit()	Peter Eisentraut	2011-12-27
\| \| \| \| \| \|	All supported platforms support the C89 standard function atexit() (SunOS 4 probably being the last one not to), and supporting both makes the code clumsy.
*	Rethink representation of index clauses' mapping to index columns.	Tom Lane	2011-12-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In commit e2c2c2e8b1df7dfdb01e7e6f6191a569ce3c3195 I made use of nested list structures to show which clauses went with which index columns, but on reflection that's a data structure that only an old-line Lisp hacker could love. Worse, it adds unnecessary complication to the many places that don't much care which clauses go with which index columns. Revert to the previous arrangement of flat lists of clauses, and instead add a parallel integer list of column numbers. The places that care about the pairing can chase both lists with forboth(), while the places that don't care just examine one list the same as before. The only real downside to this is that there are now two more lists that need to be passed to amcostestimate functions in case they care about column matching (which btcostestimate does, so not passing the info is not an option). Rather than deal with 11-argument amcostestimate functions, pass just the IndexPath and expect the functions to extract fields from it. That gets us down to 7 arguments which is better than 11, and it seems more future-proof against likely additions to the information we keep about an index path.
*	Improve planner's handling of duplicated index column expressions.	Tom Lane	2011-12-23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It's potentially useful for an index to repeat the same indexable column or expression in multiple index columns, if the columns have different opclasses. (If they share opclasses too, the duplicate column is pretty useless, but nonetheless we've allowed such cases since 9.0.) However, the planner failed to cope with this, because createplan.c was relying on simple equal() matching to figure out which index column each index qual is intended for. We do have that information available upstream in indxpath.c, though, so the fix is to not flatten the multi-level indexquals list when putting it into an IndexPath. Then we can rely on the sublist structure to identify target index columns in createplan.c. There's a similar issue for index ORDER BYs (the KNNGIST feature), so introduce a multi-level-list representation for that too. This adds a bit more representational overhead, but we might more or less buy that back by not having to search for matching index columns anymore in createplan.c; likewise btcostestimate saves some cycles. Per bug #6351 from Christian Rudolph. Likely symptoms include the "btree index keys must be ordered by attribute" failure shown there, as well as "operator MMMM is not a member of opfamily NNNN". Although this is a pre-existing problem that can be demonstrated in 9.0 and 9.1, I'm not going to back-patch it, because the API changes in the planner seem likely to break things such as index plugins. The corner cases where this matters seem too narrow to justify possibly breaking things in a minor release.
*	Add bytea_agg, parallel to string_agg.	Robert Haas	2011-12-23
\| \| \| \|	Pavel Stehule
*	Catversion bump for commit 0e4611c0234d89e288a53351f775c59522baed7c.	Robert Haas	2011-12-22
\| \| \| \|	It changed the format of stored rules.
*	Add a security_barrier option for views.	Robert Haas	2011-12-22
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a view is marked as a security barrier, it will not be pulled up into the containing query, and no quals will be pushed down into it, so that no function or operator chosen by the user can be applied to rows not exposed by the view. Views not configured with this option cannot provide robust row-level security, but will perform far better. Patch by KaiGai Kohei; original problem report by Heikki Linnakangas (in October 2009!). Review (in earlier versions) by Noah Misch and others. Design advice by Tom Lane and myself. Further review and cleanup by me.