790 questions
- Bountied 0
- Unanswered
- Frequent
- Score
- Trending
- Week
- Month
- Unanswered (my tags)
4
votes
2
answers
129
views
Sort respecting diacritics (PostgreSQL)
Can I get PostgreSQL to sort rows by a string column respecting the accents?
I found out that it's possible to define a custom collation having "ks" (colStrength) set to "level2", ...
0
votes
1
answer
231
views
macos install pyicu==2.15 icu=77.1
Resolved 388 packages in 7ms
Built cbor==1.0.0
Built volcengine==×ばつ Failed to build pyicu==2.15
├─▶ The build backend returned an error
╰─▶ Call to setuptools.build_meta.build_wheel failed (...
0
votes
0
answers
70
views
PyICU import halts Python interpreter on Windows 11 23H2
I was experiencing problems similar to well-known issues while installing PyICU. By following community hints I was able to succeed in installing the corresponding wheel. Finally, after doing so (i.e. ...
1
vote
1
answer
174
views
How does postgres know which collation to use during database creation if there are two with the same locale?
I have a Postgres DB created on Azure (probably not very important, added for context).
That DB has predefined set of collations. All deterministic.
Among them there is a collation with name en-GB-x-...
0
votes
0
answers
42
views
tesseract 5 building: it finds icu 74 but says undefined reference to icu 72 files
Following the indications found here: https://tesseract-ocr.github.io/tessdoc/Compiling
I'm trying to compile, build and install tesseract 5 in Ubuntu 24.04 :
(base) raphy@raohy:~$ git clone --...
5
votes
1
answer
214
views
Correct sorting in Java for all languages (including e.g. CS)
I need correct sorting + collator that sorts all European languages correctly (including e.g. czech characters.
I've tried various approaches and it still messed up accented U.
I rather not go for a ...
1
vote
1
answer
38
views
RuleBasedCollator rule ignored
I'm trying to use the icu RuleBasedCollator in python.
In my code I specify a rule wherby "ä" should sort before "a" as a secondary (accent) difference
from icu import ...
1
vote
1
answer
73
views
R quanteda kwic not matching negative look behind pattern
I'm trying to use a regex pattern with kwic that doesn't match word preceded by in, of or and (using a negative lookbehind), it works in regex101 but not in kwic (which uses stringi's ICU regex ...
0
votes
0
answers
105
views
com.ibm.icu plugin conflict in Eclipse Kepler after upgrading
I am developing a desktop application based on Eclipse Kepler plugins. Initially, I used Kepler 4.3.2 (2014) with JDK 1.8.0_231. Later, I use Kepler 4.33 (2024)and JDK 17. During this upgrade, I ...
2
votes
1
answer
214
views
Can't get Microsoft's ICU package working on AWS Linux 2 AMI
Background: Initial issue was some computers had 4 letter month abbreviations and some 3
https://stackoverflow.com/a/79232095/1305736
We want to eliminate future inconsistencies by moving to app-local ...
0
votes
0
answers
49
views
Encounters ld error when compiling a simple C++ source file which involves icu::UnicodeString on Windows [duplicate]
I want to use icu::UnicodeString in ICU to handle unicode strings. And I wrote a simple demo .cpp file to test if I could compile with ICU libraries.
The content of demo.cpp is below:
#include <...
3
votes
0
answers
232
views
python collation sort "shift-trimmed"
How would I make this test pass?
names = [
"cote",
"coté",
"côte",
"côté",
"ReasonE",
"Reason1",
"ReasonĔ&...
1
vote
0
answers
303
views
Cannot find icuucd75.dll
I'm tring to use International Componenets for Unicode (ICU) APIs in C building with CMake and using as vcpkg as package manager, but when I execute a simple code to test the effective presence of ICU ...
3
votes
0
answers
49
views
Why does Locale::acceptFromHttp give different result for same input?
I have a PHP (Symfony) project. This currently runs on fpm-alpine base, which all the files mounted in the /app dir.
I'm making a complete image, Nginx+phpFpm + the project in one go.
In the current ...
Martijn's user avatar
- 16.2k
3
votes
3
answers
262
views
How to capture all items in one array in YAML?
I want to be able to capture all tags in Obsidian front matter, which is YAML. The format is
---
tags:
- recipe
- cooking
---
but note that other data can appear before or after the tags section.
...