Tweak genericcostestimate's fudge factor for index size.

To provide some bias against using a large index when a small one would do as well, genericcostestimate adds a "fudge factor", which for a long time was random_page_cost * index_pages/10000. However, this can grow to be the dominant term in indexscan cost estimates when the index involved is large enough, a behavior that was never intended. Change to a ln(1 + n/10000) formulation, which has nearly the same behavior up to a few hundred pages but tails off significantly thereafter. (A log curve seems correct on first principles, since what we're trying to account for here is index descent costs, which are typically logarithmic.)

Per bug #7619 from Niko Kiirala. Possibly this change should get back-patched, but I'm hesitant to mess with cost estimates in stable branches.
author: Tom Lane <tgl@sss.pgh.pa.us> 2012-10-24 16:25:40 -0400
committer: Tom Lane <tgl@sss.pgh.pa.us> 2012-10-24 16:25:40 -0400
commit: bf01e34b556ff37982ba2d882db424aa484c0d07 (patch)
tree: 8cefcb14ad27cce62cebc90f46493a2cc2d76b7a
parent: a4e8680a6c337955c021177457147f4b4d9a5df5 (diff)
1 file changed, 7 insertions, 5 deletions
diff --git a/src/backend/utils/adt/selfuncs.c b/src/backend/utils/adt/selfuncs.c
index 61100aec4ae..60000aaf347 100644
--- a/src/backend/utils/adt/selfuncs.c
+++ b/src/backend/utils/adt/selfuncs.c
@@ -6130,12 +6130,14 @@ genericcostestimate(PlannerInfo *root,
 	 * index would have better selectivity.)
 	 *
 	 * We can deal with this by adding a very small "fudge factor" that
-	 * depends on the index size.  The fudge factor used here is one
-	 * spc_random_page_cost per 10000 index pages, which should be small
-	 * enough to not alter index-vs-seqscan decisions, but will prevent
-	 * indexes of different sizes from looking exactly equally attractive.
+	 * depends on the index size, so that indexes of different sizes won't
+	 * look exactly equally attractive.  To ensure the fudge factor stays
+	 * small even for very large indexes, use a log function.  (We previously
+	 * used a factor of one spc_random_page_cost per 10000 index pages, which
+	 * grew too large for large indexes.  This expression has about the same
+	 * growth rate for small indexes, but tails off quickly.)
 	 */
-	*indexTotalCost += index->pages * spc_random_page_cost / 10000.0;
+	*indexTotalCost += log(1.0 + index->pages / 10000.0) * spc_random_page_cost;
 
 	/*
 	 * CPU cost: any complex expressions in the indexquals will need to be