This reference page indicates which version of Unicode that is used by regex flavors that support Unicode properties or that have tokens that are based on Unicode properties such as shorthand classes and POSIX classes when they match Unicode characters. Newer versions of Unicode can not only add characters to existing properties but can also move characters between properties. For example, the section sign § is matched by \p{So} in flavors based on Unicode 6.0.0 or prior but by \p{Po} in flavors based on Unicode 6.1.0 or later.
| Feature | Syntax | Description | Example | JGsoft | Python | JavaScript | VBScript | XRegExp | .NET | Java | ICU | RE2 | Perl | PCRE | PCRE2 | PHP | Delphi | R | Ruby | std::regex | Boost | Tcl | POSIX | GNU | Oracle | XML | XPath |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Unicode version | Unicode properties are based on Unicode 3.0.0 or later. | YES | YES | with /u | n/a | YES | YES | YES | YES | default | YES | 5.0 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 3.1.0 or later. | YES | YES | with /u | n/a | YES | 2.0 | 5 | YES | default | YES | 5.0 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 3.2.0 or later. | YES | n/a | with /u | n/a | YES | 2.0 | 5 | YES | default | YES | 5.0 | YES | YES | YES | YES | 1.9 | n/a | n/a | n/a | n/a | n/a | n/a | YES | YES | ||
| Unicode version | Unicode categories for Latin-1 characters U+0080 to U+00FF are based on Unicode 3.2.0 regardless of which Unicode version is used for other characters or properties. | \p{So} matches § | no | no | no | n/a | no | 1.0–4.8 | 4 only | no | no | no | no | no | no | no | no | no | no | no | no | n/a | n/a | no | no | no | |
| Unicode version | Unicode properties are based on Unicode 4.0.0 or later. | YES | 2.5 | with /u | n/a | YES | 2.0 | 5 | YES | default | YES | 5.0 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 4.0.1 or later. | no | 2.5 | with /u | n/a | YES | 2.0 | 7 | YES | default | YES | 6.5 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES | ||
| Unicode version | Unicode blocks are based on Unicode 4.0.1 regardless of which Unicode version is used for other Unicode properties. | no | n/a | n/a | n/a | n/a | 2.0 | no | no | n/a | no | n/a | n/a | n/a | n/a | n/a | no | n/a | n/a | n/a | n/a | n/a | n/a | no | no | ||
| Unicode version | Unicode properties are based on Unicode 4.1.0 or later. | no | 2.5 | with /u | n/a | YES | 2.0 | 7 | YES | default | YES | 6.5 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 5.0.0 or later. | no | 2.6 | with /u | n/a | YES | 2.0 | 7 | YES | default | YES | 7.0 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | YES | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 5.1.0 or later. | no | 2.6 | with /u | n/a | YES | 4.0 | 7 | YES | default | YES | 7.8 | YES | YES | YES | YES | 1.9 | YES | YES | YES | n/a | n/a | 12c | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 5.2.0 or later. | no | 2.7 | with /u | n/a | YES | 4.0 | 7 | YES | default | YES | 8.02 | YES | YES | XE7 | YES | 1.9 | no | no | YES | n/a | n/a | 12c | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 6.0.0 or later. | no | 3.2 | with /u | n/a | YES | 4.0 | 7 | YES | default | YES | 8.13 | YES | YES | XE7 | YES | 1.9 | no | no | YES | n/a | n/a | 12c | YES | YES | ||
| Unicode version | Unicode blocks are based on Unicode 6.0.0 regardless of which Unicode version is used for other Unicode properties. | no | n/a | n/a | n/a | n/a | no | no | no | n/a | no | n/a | n/a | n/a | n/a | n/a | no | n/a | n/a | n/a | n/a | n/a | n/a | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 6.1.0 or later. | no | 3.3 | with /u | n/a | YES | 4.0 | 8 | YES | default | 5.16 | 8.31 | YES | YES | XE7 | 2.15.2 | 2.0 | no | no | YES | n/a | n/a | 12c | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 6.2.0 or later. | no | 3.4 | with /u | n/a | YES | 4.0 | 8 | YES | default | 5.18 | 8.32 | YES | YES | XE7 | 2.15.3 | 2.2 | no | no | 8.5 | n/a | n/a | 12c | YES | YES | ||
| Unicode version | Unicode properties are based on Unicode 6.3.0 or later. | no | 3.4 | with /u | n/a | YES | 4.0 | 9 | YES | default | 5.20 | 8.34 | YES | 5.5.10 | XE7 | 3.0.3 | 2.2 | no | no | 8.5 | n/a | n/a | 12c | no | no | ||
| Unicode version | Unicode properties are based on Unicode 8.0.0 or later. | no | 3.5 | with /u | n/a | YES | 4.0 | 9 | 56 | default | 5.24 | no | 10.21 | 7.3.0 | no | 4.0.0 | 2.3 | no | no | no | n/a | n/a | 18c | no | no | ||
| Unicode version | Unicode properties are based on Unicode 9.0.0 or later. | no | 3.6 | with /u | n/a | YES | 5.0 | 11 | 58 | default 2017/11 | 5.26 | no | 10.30 | 7.3.0 | no | 4.0.0 | 2.4 | no | no | no | n/a | n/a | 18c | no | no | ||
| Unicode version | Unicode properties are based on Unicode 10.0.0 or later. | no | 3.7 | with /u | n/a | YES | 5.0 | 11 | 60 | default 2017/11 | 5.28 | no | 10.30 | 7.3.0 | no | 4.0.0 | 2.5 | no | no | no | n/a | n/a | 21c | no | no | ||
| Unicode version | U+20BB through U+20BF are all assigned as currency symbols. U+20BB through U+20BD were added in Unicode 7.0.0, U+20BE in 8.0.0, and U+20BF in 10.0.0. But some flavors assign these despite being based on an older version of Unicode. | \p{Sc} matches ₿ (Bitcoin sign U+20BF) | no | 3.7 | with /u | n/a | YES | 5.0 | 8 | 60 | default 2017/11 | 5.28 | no | 10.30 | 7.3.0 | no | 4.0.0 | 2.5 | no | no | no | n/a | n/a | 21c | no | no | |
| Unicode version | Unicode properties are based on Unicode 11.0.0 or later. | no | 3.7 | with /u | n/a | YES | 5.0 | 12 | 62 | default 2018/07 | 5.30 | no | 10.32 | 7.3.0 | no | 4.0.0 | 2.6 | no | no | no | n/a | n/a | 21c | no | no | ||
| Unicode version | U+9FCD through U+9FEF are all assigned as ideographs. These were added in Unicode 7.0.0 through 11.0.0. Some flavors assign these despite being based on an older version of Unicode in order to support GB18030-2022. | \p{Lo} matches 鿯 (U+9FEF) | no | 3.7 | with /u | n/a | YES | 5.0 | 8 | 62 | default 2018/07 | 5.30 | no | 10.32 | 7.3.0 | no | 4.0.0 | 2.6 | no | no | no | n/a | n/a | 21c | no | no | |
| Unicode version | Unicode properties are based on Unicode 12.0.0 or later. | no | 3.8 | with /u | n/a | YES | 5.0 | 13 | 64 | default 2019/08 | 5.30 | no | 10.34 | 7.4.6 | no | 4.2.0 | 2.6 | no | no | no | n/a | n/a | 21c | no | no | ||
| Unicode version | U+32FF is assigned as square era name Reiwa. This assignment is the only change in Unicode 12.1.0. Some flavors assign it despite being based on an older version of Unicode. | \p{So} matches ㋿ (U+32FF) | no | 3.9 | with /u | n/a | YES | 5.0 | 8 | 64 | default 2019/08 | 5.30 | no | 10.34 | 7.4.6 | no | 4.2.0 | 2.6 | no | no | no | n/a | n/a | 21c | no | no | |
| Unicode version | Unicode properties are based on Unicode 13.0.0 or later. | no | 3.9 | with /u | n/a | YES | 5.0 | 15 | 66 | default 2020/04 | 5.32 | no | 10.35 | 7.4.12 | no | 4.2.0 | 3.1 | no | no | no | n/a | n/a | no | no | no | ||
| Unicode version | Unicode properties are based on Unicode 14.0.0 or later. | no | 3.11 | with /u | n/a | YES | 7.0 | 19 | 70 | default 2021/11 | 5.36 | no | 10.39 | 8.1.1 | no | 4.2.2 | 3.2 | no | no | no | n/a | n/a | no | no | no | ||
| Unicode version | Unicode properties are based on Unicode 15.0.0 or later. | no | 3.12 | with /u | n/a | no | 8.0 | 20 | 72 | default 2023/03 | 5.38 | no | 10.43 | no | no | 4.4.0 | 3.2 | no | no | no | n/a | n/a | no | no | no | ||
| Unicode version | Unicode properties are based on Unicode 15.1.0 or later. | no | 3.13 | with /u | n/a | no | no | 22 | 74 | default 2023/11 | no | no | 10.45 | no | no | no | no | no | no | no | n/a | n/a | no | no | no | ||
| Unicode version | Unicode properties are based on Unicode 16.0.0 or later. | no | no | no | n/a | no | no | 24 | 76 | no | no | no | 10.45 | no | no | no | no | no | no | no | n/a | n/a | no | no | no | ||
| Feature | Syntax | Description | Example | JGsoft | Python | JavaScript | VBScript | XRegExp | .NET | Java | ICU | RE2 | Perl | PCRE | PCRE2 | PHP | Delphi | R | Ruby | std::regex | Boost | Tcl | POSIX | GNU | Oracle | XML | XPath |
| Quick Start | Tutorial | Search & Replace | Tools & Languages | Examples | Reference |
| Introduction | Table of Contents | Quick Reference | Characters | Basic Features | Character Classes | Shorthands | Anchors | Word Boundaries | Quantifiers | Capturing Groups & Backreferences | Named Groups & Backreferences | Special Groups | Unicode Characters and Properties | Unicode Versions | Unicode Categories | Unicode Scripts | Unicode Blocks | Unicode Binary Properties | Unicode Property Sets | Unicode Boundaries | Mode Modifiers | Recursion & Balancing Groups | Backtracking Control Verbs |
| Characters | Matched Text & Backreferences | Case Conversion | Context | Conditionals |
Page URL: https://www.regular-expressions.info/refunicodeage.html
Page last updated: 29 August 2025
Site last updated: 24 October 2025
Copyright © 2003-2025 Jan Goyvaerts. All rights reserved.