From 6f6d8632583353d60fe3bb1694b39f6124f6e5d1 Mon Sep 17 00:00:00 2001 From: Tom Lane Date: Mon, 14 Jul 2008 00:51:46 +0000 Subject: Create a type-specific typanalyze routine for tsvector, which collects stats on the most common individual lexemes in place of the mostly-useless default behavior of counting duplicate tsvectors. Future work: create selectivity estimation functions that actually do something with these stats. (Some other things we ought to look at doing: using the Lossy Counting algorithm in compute_minimal_stats, and using the element-counting idea for stats on regular arrays.) Jan Urbanski --- doc/src/sgml/catalogs.sgml | 8 +++++--- 1 file changed, 5 insertions(+), 3 deletions(-) (limited to 'doc/src') diff --git a/doc/src/sgml/catalogs.sgml b/doc/src/sgml/catalogs.sgml index de8e3456e15..76198b0f832 100644 --- a/doc/src/sgml/catalogs.sgml +++ b/doc/src/sgml/catalogs.sgml @@ -1,4 +1,4 @@ - + @@ -6516,6 +6516,8 @@ A list of the most common values in the column. (NULL if no values seem to be more common than any others.) + For some datatypes such as tsvector, this is a list of + the most common element values rather than values of the type itself. @@ -6524,10 +6526,10 @@ real[] - A list of the frequencies of the most common values, + A list of the frequencies of the most common values or elements, i.e., number of occurrences of each divided by total number of rows. (NULL when most_common_vals is.) - + -- cgit v1.2.3