diff options
author | Heikki Linnakangas <heikki.linnakangas@iki.fi> | 2017-07-12 22:03:38 +0300 |
---|---|---|
committer | Heikki Linnakangas <heikki.linnakangas@iki.fi> | 2017-07-12 22:04:03 +0300 |
commit | 852902822e04f842244aaf67f2e7fede23f5b6b0 (patch) | |
tree | 5edb13edc744b72598ed2bd82c024104598b900c /src/tutorial/funcs.source | |
parent | e439bbe9996f508f584cda9075d0bb3d5fbd7d97 (diff) | |
download | postgresql-852902822e04f842244aaf67f2e7fede23f5b6b0.tar.gz postgresql-852902822e04f842244aaf67f2e7fede23f5b6b0.zip |
Reduce memory usage of tsvector type analyze function.
compute_tsvector_stats() detoasted and kept in memory every tsvector value
in the sample, but that can be a lot of memory. The original bug report
described a case using over 10 gigabytes, with statistics target of 10000
(the maximum).
To fix, allocate a separate copy of just the lexemes that we keep around,
and free the detoasted tsvector values as we go. This adds some palloc/pfree
overhead, when you have a lot of distinct lexemes in the sample, but it's
better than running out of memory.
Fixes bug #14654 reported by James C. Reviewed by Tom Lane. Backport to
all supported versions.
Discussion: https://www.postgresql.org/message-id/20170514200602.1451.46797@wrigleys.postgresql.org
Diffstat (limited to 'src/tutorial/funcs.source')
0 files changed, 0 insertions, 0 deletions