git.postgresql.org Git - postgresql.git/commit

parent: 71d02dc
Update to latest Snowball sources.
19 Feb 2025 02:13:46 +0000 (21:13 -0500)
19 Feb 2025 02:13:54 +0000 (21:13 -0500)
commit b464e51ab32fbf09cf5d9c911a8e26f491ad1f44

It's been some time since we did this, partly because the upstream
snowball project hasn't formally tagged a new release since 2021.
The main motivation for doing it now is to absorb a bug fix
(their commit e322673a841d9abd69994ae8cd20e191090b6ef4), which
prevents a null pointer dereference crash if SN_create_env() gets
a malloc failure at just the wrong point. We'll patch the back
branches with only that change, but we might as well do the full
sync dance on HEAD.
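
The class of bug described above can be sketched in C as follows. This is
an illustration only, not the upstream code: demo_env, demo_create_env(),
demo_close_env(), demo_calloc() and demo_fail_on() are hypothetical names,
and the failure-injecting allocator exists only so the error path can be
exercised. The point is that a constructor which bails out to a shared
cleanup routine needs that routine to tolerate members that were never
allocated.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical environment owning an array of buffers; NOT the real SN_env. */
struct demo_env {
    char **slots;   /* n_slots heap buffers, or NULL if never allocated */
    char *buf;      /* one extra owned buffer */
};

/* Failure injection: make the Nth allocation (1-based) fail; 0 disables. */
static int fail_at = 0;
static int alloc_count = 0;

void demo_fail_on(int n)
{
    fail_at = n;
    alloc_count = 0;
}

static void *demo_calloc(size_t n, size_t size)
{
    if (fail_at != 0 && ++alloc_count == fail_at)
        return NULL;            /* simulated out-of-memory */
    return calloc(n, size);
}

/* Cleanup that is safe to call on a partially constructed environment. */
void demo_close_env(struct demo_env *z, int n_slots)
{
    if (z == NULL)
        return;
    if (z->slots != NULL) {     /* without this guard, z->slots[i] would
                                 * dereference NULL when the array
                                 * allocation itself had failed */
        for (int i = 0; i < n_slots; i++)
            free(z->slots[i]);  /* free(NULL) is a no-op for unset slots */
        free(z->slots);
    }
    free(z->buf);
    free(z);
}

/* Constructor: any allocation failure funnels into demo_close_env(). */
struct demo_env *demo_create_env(int n_slots)
{
    assert(n_slots >= 0);

    struct demo_env *z = demo_calloc(1, sizeof *z);
    if (z == NULL)
        return NULL;

    z->buf = demo_calloc(1, 16);
    if (z->buf == NULL)
        goto error;

    if (n_slots > 0) {
        z->slots = demo_calloc((size_t) n_slots, sizeof *z->slots);
        if (z->slots == NULL)
            goto error;
        for (int i = 0; i < n_slots; i++) {
            z->slots[i] = demo_calloc(1, 16);
            if (z->slots[i] == NULL)
                goto error;
        }
    }
    return z;

error:
    demo_close_env(z, n_slots);
    return NULL;
}
```

Calling demo_fail_on(3) before demo_create_env(4) makes the allocation of
the slots array fail, which is the "just the wrong point" case: the
constructor returns NULL and the cleanup path never touches the missing
array.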

Aside from a bunch of mostly-minor tweaks to existing stemmers, this
update adds a new stemmer for Estonian. It also removes the existing
stemmer for Romanian using ISO-8859-2 encoding. Upstream apparently
concluded that ISO-8859-2 doesn't provide an adequate representation
of some Romanian characters, and the UTF-8 implementation should be
used instead.
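
For background on the encoding issue: standard Romanian orthography writes
S and T with a comma below (U+0218, U+0219, U+021A, U+021B), whereas
ISO-8859-2 only offers the cedilla variants (U+015E, U+015F, U+0162,
U+0163). Whether this is exactly the reasoning upstream followed is an
assumption here. The sketch below uses a hypothetical helper,
has_comma_below(), that is not part of Snowball or PostgreSQL; it detects
the comma-below letters in a UTF-8 string, that is, text that has no
direct ISO-8859-2 encoding.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Return 1 if the NUL-terminated UTF-8 string s contains S or T with comma
 * below (U+0218..U+021B), else 0.  In UTF-8 these code points are encoded
 * as the two-byte sequences C8 98 .. C8 9B.  The input is assumed to be
 * valid UTF-8, in which 0xC8 can only appear as a lead byte.
 */
int has_comma_below(const char *s)
{
    assert(s != NULL);

    const unsigned char *p = (const unsigned char *) s;
    for (; *p != '\0'; p++) {
        /* p[1] is at worst the terminating NUL, so this read is in bounds */
        if (p[0] == 0xC8 && p[1] >= 0x98 && p[1] <= 0x9B)
            return 1;
    }
    return 0;
}
```

A string using the cedilla form (for example the bytes C5 9F for U+015F)
is reported as 0, while the comma-below form (C8 99 for U+0219) is
reported as 1.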

While at it, update the README's instructions for doing a sync,
which were not adjusted when meson tooling was added.

Thanks to Maksim Korotkov for discovering the null-pointer
bug and submitting the fix to upstream snowball.

Reported-by: Maksim Korotkov <m.korotkov@postgrespro.ru>
Discussion: https://postgr.es/m/1d1a46-67ab1000-21-80c451@83151435
61 files changed:
doc/src/sgml/textsearch.sgml
src/backend/snowball/Makefile
src/backend/snowball/README
src/backend/snowball/dict_snowball.c
src/backend/snowball/libstemmer/api.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_basque.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_catalan.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_danish.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_dutch.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_english.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_finnish.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_french.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_german.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_indonesian.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_irish.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_italian.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_norwegian.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_porter.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_portuguese.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_spanish.c
src/backend/snowball/libstemmer/stem_ISO_8859_1_swedish.c
src/backend/snowball/libstemmer/stem_ISO_8859_2_hungarian.c
src/backend/snowball/libstemmer/stem_ISO_8859_2_romanian.c [deleted file]
src/backend/snowball/libstemmer/stem_KOI8_R_russian.c
src/backend/snowball/libstemmer/stem_UTF_8_arabic.c
src/backend/snowball/libstemmer/stem_UTF_8_armenian.c
src/backend/snowball/libstemmer/stem_UTF_8_basque.c
src/backend/snowball/libstemmer/stem_UTF_8_catalan.c
src/backend/snowball/libstemmer/stem_UTF_8_danish.c
src/backend/snowball/libstemmer/stem_UTF_8_dutch.c
src/backend/snowball/libstemmer/stem_UTF_8_english.c
src/backend/snowball/libstemmer/stem_UTF_8_estonian.c [new file with mode: 0644]
src/backend/snowball/libstemmer/stem_UTF_8_finnish.c
src/backend/snowball/libstemmer/stem_UTF_8_french.c
src/backend/snowball/libstemmer/stem_UTF_8_german.c
src/backend/snowball/libstemmer/stem_UTF_8_greek.c
src/backend/snowball/libstemmer/stem_UTF_8_hindi.c
src/backend/snowball/libstemmer/stem_UTF_8_hungarian.c
src/backend/snowball/libstemmer/stem_UTF_8_indonesian.c
src/backend/snowball/libstemmer/stem_UTF_8_irish.c
src/backend/snowball/libstemmer/stem_UTF_8_italian.c
src/backend/snowball/libstemmer/stem_UTF_8_lithuanian.c
src/backend/snowball/libstemmer/stem_UTF_8_nepali.c
src/backend/snowball/libstemmer/stem_UTF_8_norwegian.c
src/backend/snowball/libstemmer/stem_UTF_8_porter.c
src/backend/snowball/libstemmer/stem_UTF_8_portuguese.c
src/backend/snowball/libstemmer/stem_UTF_8_romanian.c
src/backend/snowball/libstemmer/stem_UTF_8_russian.c
src/backend/snowball/libstemmer/stem_UTF_8_serbian.c
src/backend/snowball/libstemmer/stem_UTF_8_spanish.c
src/backend/snowball/libstemmer/stem_UTF_8_swedish.c
src/backend/snowball/libstemmer/stem_UTF_8_tamil.c
src/backend/snowball/libstemmer/stem_UTF_8_turkish.c
src/backend/snowball/libstemmer/stem_UTF_8_yiddish.c
src/backend/snowball/libstemmer/utilities.c
src/backend/snowball/meson.build
src/backend/snowball/snowball_create.pl
src/bin/initdb/initdb.c
src/include/snowball/libstemmer/header.h
src/include/snowball/libstemmer/stem_ISO_8859_2_romanian.h [deleted file]
src/include/snowball/libstemmer/stem_UTF_8_estonian.h [new file with mode: 0644]
This is the main PostgreSQL git repository.