diff options
| author | Tom Lane <tgl@sss.pgh.pa.us> | 2007-10-23 20:46:12 +0000 |
|---|---|---|
| committer | Tom Lane <tgl@sss.pgh.pa.us> | 2007-10-23 20:46:12 +0000 |
| commit | dbaec70c153239224c0288d865b96c2f939fbdf5 (patch) | |
| tree | a2309acc315e5d4b9f9b0cd8b2ad60dc999ba93d /src/tools/msvc/Install.pm | |
| parent | 344d0cae64dbf398559b855806fc7338ec0a2e64 (diff) | |
| download | postgresql-dbaec70c153239224c0288d865b96c2f939fbdf5.tar.gz | |
Rename and slightly redefine the default text search parser's "word"
categories, as per discussion. asciiword (formerly lword) is still
ASCII-letters-only, and numword (formerly word) is still the most general
mixed-alpha-and-digits case. But word (formerly nlword) is now
any-group-of-letters-with-at-least-one-non-ASCII, rather than all-non-ASCII as
before. This is no worse than before for parsing mixed Russian/English text,
which seems to have been the design center for the original coding; and it
should simplify matters for parsing most European languages. In particular
it will not be necessary for any language to accept strings containing digits
as being regular "words". The hyphenated-word categories are adjusted
similarly.
Diffstat (limited to 'src/tools/msvc/Install.pm')
| -rw-r--r-- | src/tools/msvc/Install.pm | 8 |
1 files changed, 4 insertions, 4 deletions
diff --git a/src/tools/msvc/Install.pm b/src/tools/msvc/Install.pm index 57815a2dc8..798810343e 100644 --- a/src/tools/msvc/Install.pm +++ b/src/tools/msvc/Install.pm @@ -3,7 +3,7 @@ package Install; # # Package that provides 'make install' functionality for msvc builds # -# $PostgreSQL: pgsql/src/tools/msvc/Install.pm,v 1.24 2007/10/16 16:00:00 tgl Exp $ +# $PostgreSQL: pgsql/src/tools/msvc/Install.pm,v 1.25 2007/10/23 20:46:12 tgl Exp $ # use strict; use warnings; @@ -258,7 +258,7 @@ sub GenerateTsearchFiles while ($#pieces > 0) { my $lang = shift @pieces || last; - my $latlang = shift @pieces || last; + my $asclang = shift @pieces || last; my $txt = $tmpl; my $stop = ''; @@ -269,8 +269,8 @@ sub GenerateTsearchFiles $txt =~ s#_LANGNAME_#${lang}#gs; $txt =~ s#_DICTNAME_#${lang}_stem#gs; $txt =~ s#_CFGNAME_#${lang}#gs; - $txt =~ s#_LATDICTNAME_#${latlang}_stem#gs; - $txt =~ s#_NONLATDICTNAME_#${lang}_stem#gs; + $txt =~ s#_ASCDICTNAME_#${asclang}_stem#gs; + $txt =~ s#_NONASCDICTNAME_#${lang}_stem#gs; $txt =~ s#_STOPWORDS_#$stop#gs; print $F $txt; print "."; |
