Regular Expression Unicode Version Reference

This reference page lists which version of Unicode is used by regex flavors that support Unicode properties, or that have tokens based on Unicode properties, such as shorthand classes and POSIX classes when they match Unicode characters. Newer versions of Unicode not only add characters to existing properties but can also move characters between properties. For example, the section sign § is matched by \p{So} in flavors based on Unicode 6.0.0 or earlier, but by \p{Po} in flavors based on Unicode 6.1.0 or later.
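The section sign example can be checked against the Unicode database that ships with a given runtime. The following is a minimal sketch in Python, not a test of any regex engine: the standard `re` module has no \p{...} syntax, so the sketch queries the standard `unicodedata` module instead. The helper name `general_category` is an illustrative assumption.

```python
import unicodedata


def general_category(ch: str) -> str:
    """Return the two-letter Unicode general category of one character."""
    return unicodedata.category(ch)


# Version of the Unicode Character Database bundled with this interpreter.
print("Unicode version:", unicodedata.unidata_version)

# U+00A7 SECTION SIGN: category So up to Unicode 6.0.0, Po from 6.1.0 onward.
print("Category of U+00A7:", general_category("\u00a7"))
```

The printed category depends on the interpreter's bundled Unicode version, so no fixed output is shown here.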

| Feature | Syntax | Description | Example | JGsoft | Python | JavaScript | VBScript | XRegExp | .NET | Java | ICU | RE2 | Perl | PCRE | PCRE2 | PHP | Delphi | R | Ruby | std::regex | Boost | Tcl | POSIX | GNU | Oracle | XML | XPath |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Unicode version | | Unicode properties are based on Unicode 3.0.0 or later. | | YES | YES | with /u | n/a | YES | YES | YES | YES | default | YES | 5.0 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 3.1.0 or later. | | YES | YES | with /u | n/a | YES | 2.0 | 5 | YES | default | YES | 5.0 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 3.2.0 or later. | | YES | n/a | with /u | n/a | YES | 2.0 | 5 | YES | default | YES | 5.0 | YES | YES | YES | YES | 1.9 | n/a | n/a | n/a | n/a | n/a | n/a | YES | YES |
| Unicode version | | Unicode categories for Latin-1 characters U+0080 to U+00FF are based on Unicode 3.2.0 regardless of which Unicode version is used for other characters or properties. | \p{So} matches § | no | no | no | n/a | no | 1.0–4.8 | 4 only | no | no | no | no | no | no | no | no | no | no | no | no | n/a | n/a | no | no | no |
| Unicode version | | Unicode properties are based on Unicode 4.0.0 or later. | | YES | 2.5 | with /u | n/a | YES | 2.0 | 5 | YES | default | YES | 5.0 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 4.0.1 or later. | | no | 2.5 | with /u | n/a | YES | 2.0 | 7 | YES | default | YES | 6.5 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES |
| Unicode version | | Unicode blocks are based on Unicode 4.0.1 regardless of which Unicode version is used for other Unicode properties. | | no | n/a | n/a | n/a | n/a | 2.0 | no | no | n/a | no | n/a | n/a | n/a | n/a | n/a | no | n/a | n/a | n/a | n/a | n/a | n/a | no | no |
| Unicode version | | Unicode properties are based on Unicode 4.1.0 or later. | | no | 2.5 | with /u | n/a | YES | 2.0 | 7 | YES | default | YES | 6.5 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 5.0.0 or later. | | no | 2.6 | with /u | n/a | YES | 2.0 | 7 | YES | default | YES | 7.0 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 5.1.0 or later. | | no | 2.6 | with /u | n/a | YES | 4.0 | 7 | YES | default | YES | 7.8 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | 12c | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 5.2.0 or later. | | no | 2.7 | with /u | n/a | YES | 4.0 | 7 | YES | default | YES | 8.02 | YES | YES | XE7 | YES | 1.9 | no | no | YES | n/a | n/a | 12c | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 6.0.0 or later. | | no | 3.2 | with /u | n/a | YES | 4.0 | 7 | YES | default | YES | 8.13 | YES | YES | XE7 | YES | 1.9 | no | no | YES | n/a | n/a | 12c | YES | YES |
| Unicode version | | Unicode blocks are based on Unicode 6.0.0 regardless of which Unicode version is used for other Unicode properties. | | no | n/a | n/a | n/a | n/a | no | no | no | n/a | no | n/a | n/a | n/a | n/a | n/a | no | n/a | n/a | n/a | n/a | n/a | n/a | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 6.1.0 or later. | | no | 3.3 | with /u | n/a | YES | 4.0 | 8 | YES | default | 5.16 | 8.31 | YES | YES | XE7 | 2.15.2 | 2.0 | no | no | YES | n/a | n/a | 12c | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 6.2.0 or later. | | no | 3.4 | with /u | n/a | YES | 4.0 | 8 | YES | default | 5.18 | 8.32 | YES | YES | XE7 | 2.15.3 | 2.2 | no | no | 8.5 | n/a | n/a | 12c | YES | YES |
| Unicode version | | Unicode properties are based on Unicode 6.3.0 or later. | | no | 3.4 | with /u | n/a | YES | 4.0 | 9 | YES | default | 5.20 | 8.34 | YES | 5.5.10 | XE7 | 3.0.3 | 2.2 | no | no | 8.5 | n/a | n/a | 12c | no | no |
| Unicode version | | Unicode properties are based on Unicode 8.0.0 or later. | | no | 3.5 | with /u | n/a | YES | 4.0 | 9 | 56 | default | 5.24 | no | 10.21 | 7.3.0 | no | 4.0.0 | 2.3 | no | no | no | n/a | n/a | 18c | no | no |
| Unicode version | | Unicode properties are based on Unicode 9.0.0 or later. | | no | 3.6 | with /u | n/a | YES | 5.0 | 11 | 58 | default 2017/11 | 5.26 | no | 10.30 | 7.3.0 | no | 4.0.0 | 2.4 | no | no | no | n/a | n/a | 18c | no | no |
| Unicode version | | Unicode properties are based on Unicode 10.0.0 or later. | | no | 3.7 | with /u | n/a | YES | 5.0 | 11 | 60 | default 2017/11 | 5.28 | no | 10.30 | 7.3.0 | no | 4.0.0 | 2.5 | no | no | no | n/a | n/a | 21c | no | no |
| Unicode version | | U+20BB through U+20BF are all assigned as currency symbols. U+20BB through U+20BD were added in Unicode 7.0.0, U+20BE in 8.0.0, and U+20BF in 10.0.0. But some flavors assign these despite being based on an older version of Unicode. | \p{Sc} matches U+20BF (Bitcoin sign) | no | 3.7 | with /u | n/a | YES | 5.0 | 8 | 60 | default 2017/11 | 5.28 | no | 10.30 | 7.3.0 | no | 4.0.0 | 2.5 | no | no | no | n/a | n/a | 21c | no | no |
| Unicode version | | Unicode properties are based on Unicode 11.0.0 or later. | | no | 3.7 | with /u | n/a | YES | 5.0 | 12 | 62 | default 2018/07 | 5.30 | no | 10.32 | 7.3.0 | no | 4.0.0 | 2.6 | no | no | no | n/a | n/a | 21c | no | no |
| Unicode version | | U+9FCD through U+9FEF are all assigned as ideographs. These were added in Unicode 7.0.0 through 11.0.0. Some flavors assign these despite being based on an older version of Unicode in order to support GB18030-2022. | \p{Lo} matches U+9FEF | no | 3.7 | with /u | n/a | YES | 5.0 | 8 | 62 | default 2018/07 | 5.30 | no | 10.32 | 7.3.0 | no | 4.0.0 | 2.6 | no | no | no | n/a | n/a | 21c | no | no |
| Unicode version | | Unicode properties are based on Unicode 12.0.0 or later. | | no | 3.8 | with /u | n/a | YES | 5.0 | 13 | 64 | default 2019/08 | 5.30 | no | 10.34 | 7.4.6 | no | 4.2.0 | 2.6 | no | no | no | n/a | n/a | 21c | no | no |
| Unicode version | | U+32FF is assigned as square era name Reiwa. This assignment is the only change in Unicode 12.1.0. Some flavors assign it despite being based on an older version of Unicode. | \p{So} matches U+32FF | no | 3.9 | with /u | n/a | YES | 5.0 | 8 | 64 | default 2019/08 | 5.30 | no | 10.34 | 7.4.6 | no | 4.2.0 | 2.6 | no | no | no | n/a | n/a | 21c | no | no |
| Unicode version | | Unicode properties are based on Unicode 13.0.0 or later. | | no | 3.9 | with /u | n/a | YES | 5.0 | 15 | 66 | default 2020/04 | 5.32 | no | 10.35 | 7.4.12 | no | 4.2.0 | 3.1 | no | no | no | n/a | n/a | no | no | no |
| Unicode version | | Unicode properties are based on Unicode 14.0.0 or later. | | no | 3.11 | with /u | n/a | YES | 7.0 | 19 | 70 | default 2021/11 | 5.36 | no | 10.39 | 8.1.1 | no | 4.2.2 | 3.2 | no | no | no | n/a | n/a | no | no | no |
| Unicode version | | Unicode properties are based on Unicode 15.0.0 or later. | | no | 3.12 | with /u | n/a | no | 8.0 | 20 | 72 | default 2023/03 | 5.38 | no | 10.43 | no | no | 4.4.0 | 3.2 | no | no | no | n/a | n/a | no | no | no |
| Unicode version | | Unicode properties are based on Unicode 15.1.0 or later. | | no | 3.13 | with /u | n/a | no | no | 22 | 74 | default 2023/11 | no | no | 10.45 | no | no | no | no | no | no | no | n/a | n/a | no | no | no |
| Unicode version | | Unicode properties are based on Unicode 16.0.0 or later. | | no | no | no | n/a | no | no | 24 | 76 | no | no | no | 10.45 | no | no | no | no | no | no | no | n/a | n/a | no | no | no |
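The three probe rows in the table (U+20BF, U+9FEF, and U+32FF) suggest a quick way to see which of these late assignments a runtime knows about. The sketch below again uses Python's standard `unicodedata` module rather than a regex engine. The names `PROBES` and `probe_assignments` are illustrative assumptions. The expected categories are taken from the Example column.

```python
import unicodedata

# Probe code points from the table, each with the general category the
# Example column expects. An unassigned code point reports category "Cn".
PROBES = {
    "U+20BF Bitcoin sign": ("\u20bf", "Sc"),
    "U+9FEF ideograph": ("\u9fef", "Lo"),
    "U+32FF square era name Reiwa": ("\u32ff", "So"),
}


def probe_assignments() -> dict:
    """Map each probe name to True if the runtime assigns the expected category."""
    return {
        name: unicodedata.category(ch) == expected
        for name, (ch, expected) in PROBES.items()
    }


print("Unicode version:", unicodedata.unidata_version)
for name, assigned in probe_assignments().items():
    print(name, "->", "assigned" if assigned else "not assigned")
```

A runtime that reports an older Unicode version yet still shows a probe as assigned would correspond to the "despite being based on an older version" cases described in those rows.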
