Wolfram Language & System Documentation Center

RegularExpression["regex"]

represents the generalized regular expression specified by the string "regex".

Details

RegularExpression can be used to represent classes of strings in functions like StringMatchQ, StringReplace, StringCases, and StringSplit.
RegularExpression supports standard regular expression syntax of the kind used in typical string manipulation languages.
The following basic elements can be used in regular expression strings:
c              the literal character c
.              any character except newline
[c₁c₂…]        any of the characters cᵢ
[c₁-c₂]        any character in the range c₁–c₂
[^c₁c₂…]       any character except the cᵢ
p*             p repeated zero or more times
p+             p repeated one or more times
p?             zero or one occurrence of p
p{m,n}         p repeated between m and n times
p*?, p+?, p??  the shortest consistent strings that match
(p₁p₂…)        strings matching the sequence p₁, p₂, …
p₁|p₂          strings matching p₁ or p₂
The following represent classes of characters:
\\d            digit 0–9
\\D            nondigit
\\s            space, newline, tab, or other whitespace character
\\S            non-whitespace character
\\w            word character (letter, digit, or _)
\\W            nonword character
[[:class:]]    characters in a named class
[^[:class:]]   characters not in a named class
The following named classes can be used: alnum, alpha, ascii, blank, cntrl, digit, graph, lower, print, punct, space, upper, word, xdigit.
The following represent positions in strings:
^              the beginning of the string (or line)
$              the end of the string (or line)
\\b            word boundary
\\B            anywhere except a word boundary
The following set options for all regular expression elements that follow them:
(?i)           treat uppercase and lowercase as equivalent (ignore case)
(?m)           make ^ and $ match start and end of lines (multiline mode)
(?s)           allow . to match newline
(?-c)          unset options
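A minimal sketch of how these option prefixes behave. The sample strings are illustrative assumptions, not inputs taken from this page:

```wolfram
(* (?i) makes the rest of the pattern case-insensitive *)
StringCases["Abc ABC abc", RegularExpression["(?i)abc"]]
(* {"Abc", "ABC", "abc"} *)

(* without (?s), . does not match the newline, so the match fails *)
StringMatchQ["one\ntwo", RegularExpression["one.two"]]
(* False *)

(* (?s) lets . match the newline *)
StringMatchQ["one\ntwo", RegularExpression["(?s)one.two"]]
(* True *)
```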
\\., \\[, etc. represent literal characters ., [, etc.
Analogs of named Wolfram Language patterns such as x:expr can be set up in regular expression strings using (regex).
Within a regular expression string, \\gn represents the substring matched by the nth parenthesized regular expression object (regex). The shorter \\n is often equivalent to \\gn.
For the purpose of functions such as StringReplace and StringCases, any $n appearing in the right‐hand side of a rule RegularExpression["regex"] -> rhs is taken to correspond to the substring matched by the nth parenthesized regular expression object in regex. $0 represents the whole matched string.
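As a rough illustration of $n and $0 on the right-hand side of a rule, the following sketch reorders the parts of a date and wraps a whole match. The input strings are assumed for the example:

```wolfram
(* $1, $2, $3 refer to the three parenthesized groups, in order *)
StringReplace["2004-11-30",
 RegularExpression["(\\d+)-(\\d+)-(\\d+)"] -> "$3/$2/$1"]
(* "30/11/2004" *)

(* $0 stands for the whole matched substring *)
StringReplace["abc 123", RegularExpression["\\d+"] -> "<$0>"]
(* "abc <123>" *)
```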

Examples

Basic Examples (2)

Find words involving the characters a, b, c, d, e:
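A minimal sketch with an assumed sample string; the word boundaries \\b are added here so that only whole words built from a through e are returned:

```wolfram
StringCases["bead, cab, dog, ace", RegularExpression["\\b[a-e]+\\b"]]
(* {"bead", "cab", "ace"} *)
```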

Equivalent form using string patterns:
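One possible string-pattern counterpart, using the same assumed sample string as a regular expression of the form \\b[a-e]+\\b:

```wolfram
StringCases["bead, cab, dog, ace",
 WordBoundary ~~ (("a" | "b" | "c" | "d" | "e") ..) ~~ WordBoundary]
(* {"bead", "cab", "ace"} *)
```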

Decide whether the string consists of words and whitespace:
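A short sketch on an assumed input string:

```wolfram
(* every character must be a word character or whitespace *)
StringMatchQ["the quick brown fox", RegularExpression["(\\w|\\s)+"]]
(* True *)

StringMatchQ["the quick, brown fox", RegularExpression["(\\w|\\s)+"]]
(* False, because of the comma *)
```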

Equivalent form using string patterns:
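A sketch of the string-pattern counterpart of (\\w|\\s)+ on an assumed input. Note that WordCharacter covers letters and digits only, so it differs from \\w on the underscore:

```wolfram
StringMatchQ["the quick brown fox",
 (WordCharacter | WhitespaceCharacter) ..]
(* True *)
```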

Scope (22)

Basic Constructs (17)

Extract any character except newline:
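For instance, with an assumed two-line string:

```wolfram
(* . skips the newline between "b" and "c" *)
StringCases["ab\ncd", RegularExpression["."]]
(* {"a", "b", "c", "d"} *)
```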

Either of the characters "a" and "b":
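A small sketch on an illustrative string:

```wolfram
StringCases["abcabc", RegularExpression["[ab]"]]
(* {"a", "b", "a", "b"} *)
```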

Any character between "a" and "e", including "a" and "e":
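A small sketch on an illustrative string:

```wolfram
StringCases["abcdefgh", RegularExpression["[a-e]"]]
(* {"a", "b", "c", "d", "e"} *)
```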

Any character except "a" and "1":
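A small sketch on an illustrative string:

```wolfram
StringCases["a1b2", RegularExpression["[^a1]"]]
(* {"b", "2"} *)
```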

Any digit repeated one or more times:
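For example, on an assumed mixed string of letters and digits:

```wolfram
(* each maximal run of digits is returned as one match *)
StringCases["a1b22c333", RegularExpression["\\d+"]]
(* {"1", "22", "333"} *)
```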

The character "a" repeated 2 or 3 times:
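A sketch on an assumed input. The lone "a" is skipped, and the run of four yields a single greedy match of three:

```wolfram
StringCases["a aa aaa aaaa", RegularExpression["a{2,3}"]]
(* {"aa", "aaa", "aaa"} *)
```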

Any digit:
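A small sketch on an illustrative string:

```wolfram
StringCases["a1b22", RegularExpression["\\d"]]
(* {"1", "2", "2"} *)
```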

Nondigit characters:
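A small sketch on an illustrative string:

```wolfram
StringCases["a1b22", RegularExpression["\\D"]]
(* {"a", "b"} *)
```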

Space, newline, tab, or other whitespace character:
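A sketch on an assumed string containing a space, a tab, and a newline:

```wolfram
StringCases["a b\tc\nd", RegularExpression["\\s"]]
(* {" ", "\t", "\n"} *)
```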

Non-whitespace characters:
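A sketch on an assumed string containing a space, a tab, and a newline:

```wolfram
StringCases["a b\tc\nd", RegularExpression["\\S"]]
(* {"a", "b", "c", "d"} *)
```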

Word characters:
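A sketch on an illustrative string; note that the underscore counts as a word character:

```wolfram
StringCases["a_1, b!", RegularExpression["\\w"]]
(* {"a", "_", "1", "b"} *)
```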

Nonword characters:
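A sketch on an illustrative string; punctuation and the space are returned:

```wolfram
StringCases["a_1, b!", RegularExpression["\\W"]]
(* {",", " ", "!"} *)
```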

Find all uppercase letters:
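One way to do this is with the named class upper, shown here on an assumed sample string:

```wolfram
StringCases["AaBbCc", RegularExpression["[[:upper:]]"]]
(* {"A", "B", "C"} *)
```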

Split a string at the beginning of a new line:

Split a string at the end of a new line:

Insert a character at the boundary of each word:

Split a string at every character except at the boundary of a word:

Compound Constructs (5)

StringExpression can contain RegularExpression objects:
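A minimal sketch that mixes a symbolic string pattern with a regular expression; the input string is an assumption:

```wolfram
(* a letter followed by one or more digits *)
StringCases["a1 b22 c333", LetterCharacter ~~ RegularExpression["\\d+"]]
(* {"a1", "b22", "c333"} *)
```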

Conditional patterns:
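A possible sketch, assuming a named pattern x wrapped around the regular expression and a condition on the matched string:

```wolfram
(* keep only digit runs longer than one character *)
StringCases["a1 b22 c333",
 x : RegularExpression["\\d+"] /; StringLength[x] > 1]
(* {"22", "333"} *)
```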

Use alternatives to match one or more line breaks:
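A sketch with an assumed string that mixes \n and \r\n line endings and contains an empty line:

```wolfram
(* the alternatives cover \r\n, \n, and \r; + merges consecutive breaks *)
StringSplit["a\nb\r\nc\n\nd", RegularExpression["(\\r\\n|\\n|\\r)+"]]
(* {"a", "b", "c", "d"} *)
```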

Non-greedy matches are done by appending a question mark "?" to the quantifiers:
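A short comparison of greedy and non-greedy matching on an assumed input:

```wolfram
(* greedy: .+ extends to the last ">" *)
StringCases["<a><b>", RegularExpression["<.+>"]]
(* {"<a><b>"} *)

(* non-greedy: .+? stops at the first ">" *)
StringCases["<a><b>", RegularExpression["<.+?>"]]
(* {"<a>", "<b>"} *)
```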

The $1 refers to the letter matched by (.):
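A minimal sketch that doubles every character of an assumed input:

```wolfram
StringReplace["abc", RegularExpression["(.)"] -> "$1$1"]
(* "aabbcc" *)
```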

Numbered subpatterns:
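A sketch using a backreference inside the regular expression itself to find doubled characters; the input is an assumption:

```wolfram
(* \\1 must repeat whatever the first group (.) matched *)
StringCases["hello noon abba", RegularExpression["(.)\\1"]]
(* {"ll", "oo", "bb"} *)
```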

Properties & Relations (3)

Use StringMatchQ to determine string pattern matches:
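A sketch on an assumed list of strings; the whole string has to match, not just a part of it:

```wolfram
StringMatchQ[{"2004", "5.1", "x12"}, RegularExpression["\\d+"]]
(* {True, False, False} *)
```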

Use StringCases to find matching substrings:
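A sketch on an assumed sentence:

```wolfram
StringCases["v5.1 released in 2004", RegularExpression["\\d+"]]
(* {"5", "1", "2004"} *)
```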

Use StringSplit to split a string into substrings using a delimiter pattern:
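A sketch that treats runs of digits as delimiters, on an assumed input:

```wolfram
StringSplit["a1b22c333d", RegularExpression["\\d+"]]
(* {"a", "b", "c", "d"} *)
```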

Tech Notes

▪ Regular Expressions
▪ Special Characters
▪ String Patterns
▪ Working with String Patterns

Related Guides

▪ String Patterns
▪ Programmable Linguistic Interface
▪ Text Content Types
▪ Scientific Data Analysis
▪ Natural Language Processing

History

Introduced in 2004 (5.1)

Wolfram Research (2004), RegularExpression, Wolfram Language function, https://reference.wolfram.com/language/ref/RegularExpression.html.
