postgresql - postgresql mirror

	Commit message (Collapse)	Author	Age
...
*	Have REINDEX use NOTICE, so it can be suppressed.	Bruce Momjian	2006-06-07
\|
*	Remove "fuzzy comparison" logic in qsort comparison function for	Tom Lane	2006-06-07
\| \| \| \| \| \| \| \| \| \| \| \|	choose_bitmap_and(). It was way too fuzzy --- per comment, it was meant to be 1% relative difference, but was actually coded as 0.01 absolute difference, thus causing selectivities of say 0.001 and 0.000000000001 to be treated as equal. I believe this thinko explains Maxim Boguk's recent complaint. While we could change it to a relative test coded like compare_fuzzy_path_costs(), there's a bigger problem here, which is that any fuzziness at all renders the comparison function non-transitive, which could confuse qsort() to the point of delivering completely wrong results. So forget the whole thing and just do an exact comparison.
*	Use INFO-level for reindex messages, rather than NOTICE.	Bruce Momjian	2006-06-07
\| \| \| \|	Euler Taveira de Oliveira
*	Make the planner estimate costs for nestloop inner indexscans on the basis	Tom Lane	2006-06-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	that the Mackert-Lohmann formula applies across all the repetitions of the nestloop, not just each scan independently. We use the M-L formula to estimate the number of pages fetched from the index as well as from the table; that isn't what it was designed for, but it seems reasonably applicable anyway. This makes large numbers of repetitions look much cheaper than before, which accords with many reports we've received of overestimation of the cost of a nestloop. Also, change the index access cost model to charge random_page_cost per index leaf page touched, while explicitly not counting anything for access to metapage or upper tree pages. This may all need tweaking after we get some field experience, but in simple tests it seems to be giving saner results than before. The main thing is to get the infrastructure in place to let cost_index() and amcostestimate functions take repeated scans into account at all. Per my recent proposal. Note: this patch changes pg_proc.h, but I did not force initdb because the changes are basically cosmetic --- the system does not look into pg_proc to decide how to call an index amcostestimate function, and there's no way to call such a function from SQL at all.
*	While making the seq_page_cost changes, I was struck by the fact that	Tom Lane	2006-06-05
\| \| \| \| \| \| \| \| \| \| \| \| \|	cost_nonsequential_access() is really totally inappropriate for its only remaining use, namely estimating I/O costs in cost_sort(). The routine was designed on the assumption that disk caching might eliminate the need for some re-reads on a random basis, but there's nothing very random in that sense about sort's access pattern --- it'll always be picking up the oldest outputs. If we had a good fix on the effective cache size we might consider charging zero for I/O unless the sort temp file size exceeds it, but that's probably putting much too much faith in the parameter. Instead just drop the logic in favor of a fixed compromise between seq_page_cost and random_page_cost per page of sort I/O.
*	Increase the default value of cpu_index_tuple_cost from 0.001 to 0.005.	Tom Lane	2006-06-05
\| \| \| \| \| \|	This shouldn't affect simple indexscans much, while for bitmap scans that are touching a lot of index rows, this seems to bring the estimates more in line with reality. Per recent discussion.
*	Add a GUC parameter seq_page_cost, and use that everywhere we formerly	Tom Lane	2006-06-05
\| \| \| \| \| \| \| \|	assumed that a sequential page fetch has cost 1.0. This patch doesn't in itself change the system's behavior at all, but it opens the door to people adopting other units of measurement for EXPLAIN costs. Also, if we ever decide it's worth inventing per-tablespace access cost settings, this change provides a workable intellectual framework for that.
*	Don't choke during startup if the environment offers an invalid value	Tom Lane	2006-06-03
\| \| \| \| \| \| \| \| \| \|	for LC_MESSAGES; instead, just press forward, leaving the effective setting at 'C'. There is not any very good reason to complain when we are going to replace the value soon with whatever postgresql.conf says. This change should solve the occasionally-reported problem of initdb failing with 'failed to initialize lc_messages'; the current theory is that that is a reflection of either wrong LANG/LC_MESSAGES or completely broken locale support.
*	Fix copy-and-pasteo in Russian translation: message complaining about	Tom Lane	2006-06-03
\| \| \| \| \|	HAVE_INT64_TIMESTAMP was mentioning PG_CONTROL_VERSION instead. Victor Snezhko
*	Make PG_MODULE_MAGIC required in shared libraries that are loaded into	Tom Lane	2006-05-31
\| \| \| \| \|	the server. Per discussion, there seems no point in a waiting period before making this required.
*	Magic blocks don't do us any good unless we use 'em ... so install one	Tom Lane	2006-05-30
\| \| \| \|	in every shared library.
*	Code review for magic-block patch. Remove separate header file pgmagic.h,	Tom Lane	2006-05-30
\| \| \| \| \| \| \| \| \| \|	as this seems only likely to create headaches for module developers. Put the macro in the pre-existing fmgr.h file instead. Avoid being too cute about how many fields we can cram into a word, and avoid trying to fetch from a library we've already unlinked. Along the way, it occurred to me that the magic block really ought to be 'const' so it can be stored in the program text area. Do the same for the existing data blocks for PG_FUNCTION_INFO_V1 functions.
*	Code review for EXPLAIN patch. Fix some typos, make it behave sanely	Tom Lane	2006-05-30
\| \| \| \| \|	across multiple loops, get rid of the shaky assumption that exactly one tuple is returned per node iteration.
*	PostmasterIsAlive test really ought to be in the inner loop for safety.	Tom Lane	2006-05-30
\|
*	Add pgmagic header block to store compile-time constants:	Bruce Momjian	2006-05-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It now only checks four things: Major version number (7.4 or 8.1 for example) NAMEDATALEN FUNC_MAX_ARGS INDEX_MAX_KEYS The three constants were chosen because: 1. We document them in the config page in the docs 2. We mark them as changable in pg_config_manual.h 3. Changing any of these will break some of the more popular modules: FUNC_MAX_ARGS changes fmgr interface, every module uses this NAMEDATALEN changes syscache interface, every PL as well as tsearch uses this INDEX_MAX_KEYS breaks tsearch and anything using GiST. Martijn van Oosterhout
*	Make EXPLAIN sampling smarter, to avoid excessive sampling delay.	Bruce Momjian	2006-05-30
\| \| \| \|	Martijn van Oosterhout
*	Clarify the HINT for 'checkpoint request failed', per recent complaint	Tom Lane	2006-05-30
\| \| \| \|	demonstrating that its intent wasn't obvious.
*	Move conversion dependency patch to the proper branch, out of 8.1.X,	Bruce Momjian	2006-05-30
\| \| \| \|	into HEAD.
*	Prevent multiple archivers from starting. Backpatch to 8.1.X.	Bruce Momjian	2006-05-30
\| \| \| \|	Simon Riggs
*	Fix printf mask for SizeVfdCache	Bruce Momjian	2006-05-30
\| \| \| \|	Qingqing Zhou
*	Fix ancient misdescription of namegt/namege in comment. Greg Stark	Tom Lane	2006-05-30
\|
*	Patch reverted because of random buildfarm failures:	Bruce Momjian	2006-05-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	--------------------------------------------------------------------------- Delay write of pg_stats file to once every five minutes, during shutdown, or when requested by a backend: It changes so the file is only written once every 5 minutes (changeable of course, I just picked something) instead of once every half second. It's still written when the stats collector shuts down, just as before. And it is now also written on backend request. A backend requests a rewrite by simply sending a special stats message. It operates on the assumption that the backends aren't actually going to read the statistics file very often, compared to how frequent it's written today. Magnus Hagander
*	Som improve page split in multicolumn GiST index.	Teodor Sigaev	2006-05-29
\| \| \| \| \| \|	If user picksplit on n-th column generate equals left and right unions then it calls picksplit on n+1-th column.
*	Correct cheking in findParents(). i	Teodor Sigaev	2006-05-29
\| \| \| \|	From Andreas Seltenreich <andreas+pg@gate450.dyndns.org>
*	Fix up pg_dump to do string escaping fully correctly for client encoding	Tom Lane	2006-05-28
\| \| \| \| \| \| \| \| \|	and standard_conforming_strings; likewise for the other client programs that need it. As per previous discussion, a pg_dump dump now conforms to the standard_conforming_strings setting of the source database. We don't use E'' syntax in the dump, thereby improving portability of the SQL. I added a SET escape_strings_warning = off command to keep the dumps from getting a lot of back-chatter from that.
*	Remove traces of otherwise unused RELKIND_SPECIAL symbol. Leave the psql bits	Alvaro Herrera	2006-05-28
\| \| \| \| \| \|	in place though, so that it plays nicely with older servers. Per discussion.
*	Re-introduce the yylex filter function formerly used to support UNION	Tom Lane	2006-05-27
\| \| \| \| \| \| \|	JOIN, which I removed in a recent fit of over-optimism that we wouldn't have any future use for it. Now it's needed to support disambiguating WITH CHECK OPTION from WITH TIME ZONE. As proof of concept, add stub grammar productions for WITH CHECK OPTION.
*	Use E'' strings internally only when standard_conforming_strings =	Bruce Momjian	2006-05-26
\| \| \| \| \| \| \| \| \|	'off'. This allows pg_dump output with standard_conforming_strings = 'on' to generate proper strings that can be loaded into other databases without the backslash doubling we typically do. I have added the dumping of the standard_conforming_strings value to pg_dump. I also added standard backslash handling for plpgsql.
*	Further hacking on performance of COPY OUT. It seems that fwrite()'s	Tom Lane	2006-05-26
\| \| \| \| \| \|	per-call overhead is quite significant, at least on Linux: whatever it's doing is more than just shoving the bytes into a buffer. Buffering the data so we can call fwrite() just once per row seems to be a win.
*	Fix findParents() in case of multiple levels to find.	Teodor Sigaev	2006-05-26
\| \| \| \|	By Andreas Seltenreich <andreas+pg@gate450.dyndns.org>
*	Reduce per-character overhead in COPY OUT by combining calls to	Tom Lane	2006-05-25
\| \| \| \|	CopySendData.
*	* Add support NULL to GiST.	Teodor Sigaev	2006-05-24
\| \| \| \| \| \| \| \|	* some refactoring and simplify code int gistutil.c and gist.c * now in some cases it can be called used-defined picksplit method for non-first column in index, but here is a place to do more. * small fix of docs related to support NULL.
*	Tweak writetup_heap/readtup_heap to avoid storing the tuple identity	Tom Lane	2006-05-23
\| \| \| \| \| \| \| \|	and transaction visibility fields of tuples being sorted. These are always uninteresting in a tuple being sorted (if the fields were actually selected, they'd have been pulled out into user columns beforehand). This saves about 24 bytes per row being sorted, which is a useful savings for any but the widest of sort rows. Per recent discussion.
*	Remove CXT_printf/CXT1_printf macros. If anyone had found them to be of	Tom Lane	2006-05-23
\| \| \| \| \| \| \| \| \|	any use in the past many years, we'd have made some effort to include them in all executor node types; but in fact they were only in nodeAppend.c and nodeIndexscan.c, up until I copied nodeIndexscan.c's occurrence into the new bitmap node types. Remove some other unused macros in execdebug.h, too. Some day the whole header probably ought to go away in favor of better-designed facilities.
*	Add a new GUC parameter backslash_quote, which determines whether the SQL	Tom Lane	2006-05-21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	parser will allow "\'" to be used to represent a literal quote mark. The "\'" representation has been deprecated for some time in favor of the SQL-standard representation "''" (two single quote marks), but it has been used often enough that just disallowing it immediately won't do. Hence backslash_quote allows the settings "on", "off", and "safe_encoding", the last meaning to allow "\'" only if client_encoding is a valid server encoding. That is now the default, and the reason is that in encodings such as SJIS that allow 0x5c (ASCII backslash) to be the last byte of a multibyte character, accepting "\'" allows SQL-injection attacks as per CVE-2006-2314 (further details will be published after release). The "on" setting is available for backward compatibility, but it must not be used with clients that are exposed to untrusted input. Thanks to Akio Ishida and Yasuo Ohgaki for identifying this security issue.
*	Change the backend to reject strings containing invalidly-encoded multibyte	Tom Lane	2006-05-21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	characters in all cases. Formerly we mostly just threw warnings for invalid input, and failed to detect it at all if no encoding conversion was required. The tighter check is needed to defend against SQL-injection attacks as per CVE-2006-2313 (further details will be published after release). Embedded zero (null) bytes will be rejected as well. The checks are applied during input to the backend (receipt from client or COPY IN), so it no longer seems necessary to check in textin() and related routines; any string arriving at those functions will already have been validated. Conversion failure reporting (for characters with no equivalent in the destination encoding) has been cleaned up and made consistent while at it. Also, fix a few longstanding errors in little-used encoding conversion routines: win1251_to_iso, win866_to_iso, euc_tw_to_big5, euc_tw_to_mic, mic_to_euc_tw were all broken to varying extents. Patches by Tatsuo Ishii and Tom Lane. Thanks to Akio Ishida and Yasuo Ohgaki for identifying the security issues.
*	Add last-vacuum/analyze-time columns to the stats collector, both manual and	Alvaro Herrera	2006-05-19
\| \| \| \| \| \| \| \| \|	issued by autovacuum. Add accessor functions to them, and use those in the pg_stat_*_tables system views. Catalog version bumped due to changes in the pgstat views and the pgstat file. Patch from Larry Rosenman, minor improvements by me.
*	Call MarkBufferDirty() before XLogInsert() during completion of insert	Teodor Sigaev	2006-05-19
\|
*	Simplify gistSplit() and some refactoring related code.	Teodor Sigaev	2006-05-19
\|
*	Have autovacuum report its activities to the stat collector.	Alvaro Herrera	2006-05-19
\|
*	Rework completion of incomplete inserts. Now it writes	Teodor Sigaev	2006-05-19
\| \| \| \|	WAL log during inserts.
*	Fix choose_bitmap_and() so that partial index predicates are considered when	Tom Lane	2006-05-18
\| \| \| \| \| \| \| \| \| \| \|	deciding whether a potential additional indexscan is redundant or not. As now coded, any use of a partial index that was already used in a previous AND arm will be rejected as redundant. This might be overly restrictive, but not considering the point at all is definitely bad, as per example in bug #2441 from Arjen van der Meijden. In particular, a clauseless scan of a partial index was never considered redundant by the previous coding, and that's surely wrong. Being more flexible would also require some consideration of how not to double-count the index predicate's selectivity.
*	When a bitmap indexscan is using a partial index, it is necessary to include	Tom Lane	2006-05-18
\| \| \| \| \| \| \|	the partial index predicate in the scan's "recheck condition". Otherwise, if the scan becomes lossy for lack of bitmap memory, we would fail to enforce that returned rows satisfy the predicate. Noted while studying bug #2441 from Arjen van der Meijden.
*	Allow sparcv8plus to use "cas".	Bruce Momjian	2006-05-18
\| \| \| \|	Theo Schlossnagle
*	Fix thinko in recent changes to handle ScalarArrayOpExpr as an indexable	Tom Lane	2006-05-18
\| \| \| \| \| \| \| \| \| \| \|	condition: when there are multiple possible index paths involving ScalarArrayOpExprs, they are logically to be ANDed together not ORed. This thinko was a direct consequence of trying to put the processing inside generate_bitmap_or_paths(), which I now see was a bit too cute. So pull it out and make the callers do it separately (there are only two that need it anyway). Partially responds to bug #2441 from Arjen van der Meijden. There are some additional infelicities exposed by his example, but they are also in 8.1.x, while this mistake is not.
*	Add more sparcv8plus comments.	Bruce Momjian	2006-05-18
\|
*	Add mention of -xarch=v8plus for "cas" usage on Solaris/sparc.	Bruce Momjian	2006-05-18
\|
*	Add simplified sparc8 ASM for solaris_sparc.s, from Alan Stange.	Bruce Momjian	2006-05-18
\|
*	Add comments that Solaris Sun compiler only supports sparc9 ASM,	Bruce Momjian	2006-05-17
\|
*	Change spaces to tabs, for consistency. (Caused by email cut/paste.)	Bruce Momjian	2006-05-17
\|