Is there a RegExp.escape function in JavaScript?

Question 1

I just want to create a regular expression out of any possible string.

var usersString = "Hello?!*`~World()[]";
var expression = new RegExp(RegExp.escape(usersString))
var matches = "Hello".match(expression);

Is there a built-in method for that? If not, what do people use? Ruby has Regexp.escape. I don't feel like I should need to write my own; there has got to be something standard out there.

Question 2

Just wanted to update you fine folk that RegExp.escape is currently being worked on, and anyone who thinks they have valuable input is very welcome to contribute. core-js and other polyfills offer it.

Question 3

According to the recent update of this answer this proposal was rejected: See the issue

Question 4

Yeah I believe @BenjaminGruenbaum may be the one who put forward the proposal. I tried to get code examples plus the es-shim npm module into an answer on stack overflow here: [ stackoverflow.com/a/63838890/5979634 ] because the proposal was eventually, unfortunately, rejected. Hopefully they change their minds or someone implements 'template tags' before I retire.

Question 5

The aforementioned proposal has just advanced to stage 2

Question 6

2023 is coming to an end, but the most popular string-focused language still doesn't have a built-in for regexp escaping. This never ceases to amuse me.

Question 7

The function linked in another answer is insufficient. It fails to escape ^ or $ (start and end of string), or -, which in a character group is used for ranges.

Use this function:

function escapeRegex(string) {
 return string.replace(/[/\-\\^$*+?.()|[\]{}]/g, '\\$&');
}

While it may seem unnecessary at first glance, escaping - (as well as ^) makes the function suitable for escaping characters to be inserted into a character class as well as the body of the regex.

Escaping / makes the function suitable for escaping characters to be used in a JavaScript regex literal for later evaluation.

As there is no downside to escaping either of them, it makes sense to escape to cover wider use cases.

And yes, it is a disappointing failing that this is not part of standard JavaScript.
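
As a rough usage sketch (not part of the original answer), here is the function applied to the string from the question. The variable names are only illustrative.

```javascript
// Same function as in the answer above.
function escapeRegex(string) {
  return string.replace(/[/\-\\^$*+?.()|[\]{}]/g, '\\$&');
}

// Illustrative input taken from the question.
const usersString = 'Hello?!*`~World()[]';
const escaped = escapeRegex(usersString);
console.log(escaped); // Hello\?!\*`~World\(\)\[\]

// The escaped text now matches itself literally inside a larger string.
const expression = new RegExp(escaped);
console.log(expression.test('say Hello?!*`~World()[] now')); // true
```

Only ?, *, the parentheses and the brackets get a backslash here; !, the backtick and ~ are not in the character list, so they pass through unchanged.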

Question 8

actually, we don't need to escape / at all

Question 9

@Paul: Perl quotemeta (\Q), Python re.escape, PHP preg_quote, Ruby Regexp.quote...

Question 10

If you are going to use this function in a loop, it's probably best to make the RegExp object its own variable var e = /[\-\[\]\/\{\}\*\+\?\.\\\^\$\|]/g; and then your function is return s.replace(e, '\\$&'); This way you only instantiate the RegExp once.

Question 11

bobince cares not for eslint's opinion

Question 12

But maybe you want to escape characters to put them inside a character range. IMO better to harmlessly overescape than to underescape and cause problems in niche cases. FWIW personally I'd rather see the characters explicitly here; we're not playing code golf.

Question 13

For anyone using Lodash, since v3.0.0 a _.escapeRegExp function is built-in:

_.escapeRegExp('[lodash](https://lodash.com/)');
// → '\[lodash\]\(https:\/\/lodash\.com\/\)'

And, in the event that you don't want to require the full Lodash library, you may require just that function!

Question 14

there's even an npm package of just this! npmjs.com/package/lodash.escaperegexp

Question 15

This imports loads of code that really doesn't need to be there for such a simple thing. Use bobince's answer... it works for me, and it's so many fewer bytes to load than the lodash version!

Question 16

@RobEvans my answer starts with "For anyone using lodash", and I even mention that you can require only the escapeRegExp function.

Question 17

@gustavohenke Sorry I should have been slightly more clear, I included the module linked to in your "just that function" and that is what I was commenting on. If you take a look it's quite a lot of code for what should effectively be a single function with a single regexp in it. Agree if you are already using lodash then it makes sense to use it, but otherwise use the other answer. Sorry for the unclear comment.

Question 18

@maddob I cannot see that \x3 you mentioned: my escaped strings are looking good, just what I expect

Question 19

Most of the expressions here solve single specific use cases.

That's okay, but I prefer an "always works" approach.

function regExpEscape(literal_string) {
 return literal_string.replace(/[-[\]{}()*+!<=:?.\/\\^$|#\s,]/g, '\\$&');
}

This will "fully escape" a literal string for any of the following uses in regular expressions:

Insertion in a regular expression. E.g. new RegExp(regExpEscape(str))
Insertion in a character class. E.g. new RegExp('[' + regExpEscape(str) + ']')
Insertion in integer count specifier. E.g. new RegExp('x{1,' + regExpEscape(str) + '}')
Execution in non-JavaScript regular expression engines.
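
A small sketch of the first two uses listed above, assuming the regExpEscape function from this answer. The sample input and variable names are made up for illustration.

```javascript
// Function copied from the answer above.
function regExpEscape(literal_string) {
  return literal_string.replace(/[-[\]{}()*+!<=:?.\/\\^$|#\s,]/g, '\\$&');
}

// Illustrative input containing characters that are special inside [...].
const input = 'a-z]^';

// Use 1: the whole input as a literal pattern.
const whole = new RegExp('^' + regExpEscape(input) + '$');
console.log(whole.test('a-z]^')); // true

// Use 2: the input as the members of a character class.
// Without escaping, "a-z" would be a range and "]" would end the class early.
const anyOf = new RegExp('^[' + regExpEscape(input) + ']+$');
console.log(anyOf.test('z-a')); // true
console.log(anyOf.test('b'));   // false
```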

Special Characters Covered:

-: Creates a character range in a character class.
[ / ]: Starts / ends a character class.
{ / }: Starts / ends a numeration specifier.
( / ): Starts / ends a group.
* / + / ?: Specifies repetition type.
.: Matches any character.
\: Escapes characters, and starts entities.
^: Specifies start of matching zone, and negates matching in a character class.
$: Specifies end of matching zone.
|: Specifies alternation.
#: Specifies comment in free spacing mode.
\s: Ignored in free spacing mode.
,: Separates values in numeration specifier.
/: Starts or ends expression.
:: Completes special group types, and part of Perl-style character classes.
!: Negates zero-width group.
< / =: Part of zero-width group specifications.

Notes:

/ is not strictly necessary in any flavor of regular expression. However, it protects in case someone (shudder) does eval("/" + pattern + "/");.
, ensures that if the string is meant to be an integer in the numerical specifier, it will properly cause a RegExp compiling error instead of silently compiling wrong.
#, and \s do not need to be escaped in JavaScript, but do in many other flavors. They are escaped here in case the regular expression will later be passed to another program.

If you also need to future-proof the regular expression against potential additions to the JavaScript regex engine capabilities, I recommend using the more paranoid:

function regExpEscapeFuture(literal_string) {
 return literal_string.replace(/[^A-Za-z0-9_]/g, '\\$&');
}

This function escapes every character except those explicitly guaranteed not to be used for syntax in future regular expression flavors.

For the truly sanitation-keen, consider this edge case:

var s = '';
new RegExp('(choice1|choice2|' + regExpEscape(s) + ')');

This should compile fine in JavaScript, but will not in some other flavors. If intending to pass to another flavor, the null case of s === '' should be independently checked, like so:

var s = '';
new RegExp('(choice1|choice2' + (s ? '|' + regExpEscape(s) : '') + ')');

Question 20

The / doesn't need to be escaped in the [...] character class.

Question 21

Most of these don't need to be escaped. "Creates a character range in a character class" - you are never in a character class inside of the string. "Specifies comment in free spacing mode, Ignored in free spacing mode" - not supported in JavaScript. "Separates values in numeration specifier" - you are never in a numeration specifier inside of the string. Also, you can't write arbitrary text inside of a numeration specifier. "Starts or ends expression" - no need to escape. Eval is not a case, as it would require much more escaping. [will be continued in the next comment]

Question 22

"Completes special group types, and part of Perl-style character classes" - seems not available in javascript. "Negates zero-width group, Part of zero-width group specifications" - you never have groups inside of the string.

Question 23

@Qwertiy The reason for these extra escapes is to eliminate edge cases which could cause problems in certain use cases. For instance, the user of this function may want to insert the escaped regex string into another regex as part of a group, or even for use in another language besides Javascript. The function does not make assumptions like "I will never be part of a character class", because it's meant to be general. For a more YAGNI approach, see any of the other answers here.

Question 24

Very good. Why is _ not escaped though? What ensures it probably won't become regex syntax later?

Question 25

Mozilla Developer Network's Guide to Regular Expressions provides this escaping function:

function escapeRegExp(string) {
 return string.replace(/[.*+?^${}()|[\]\\]/g, '\\$&'); // $& means the whole matched string
}

Question 26

There is an ES7 proposal for RegExp.escape at https://github.com/benjamingr/RexExp.escape/, with a polyfill available at https://github.com/ljharb/regexp.escape.

Question 27

Looks like this didn't make it into ES7. It also looks like it was rejected in favor of looking for a template tag.

Question 28

@John yeah this looks like the case, at which point the entire concept has been abandoned for at least 5 years. I've added an example here, as it probably should have been implemented and TC39 still hasn't implemented their 'tag' based solution. This seems more in-line with getting what you expect, although I could also see it as a String.prototype method. At some point they should reconsider and implement this, even if they get around to parameterized regex. Most other languages impl escape though, even if they have parameterized queries, so we'll see.

Question 29

I have added code examples based on this proposal. Thank you for adding this answer that led me to the proposal. I attempted to edit this answer to add exact examples, but this was rejected by the mods. Here is the answer with code examples: [ stackoverflow.com/a/63838890/5979634 ]

Question 30

RegExp.escape will soon be a part of official JavaScript (ECMAScript). You can start using the reference implementation here: npmjs.com/package/regexp.escape. Track the progress of the inclusion of this function in major runtimes here: github.com/tc39/proposal-regex-escaping/issues/58. Read the spec here: tc39.es/proposal-regex-escaping. WebKit has already accepted the PR.

Question 31

In jQuery UI's autocomplete widget (version 1.9.1) they use a slightly different regular expression (line 6753), here's the regular expression combined with bobince's approach.

RegExp.escape = function( value ) {
 return value.replace(/[\-\[\]{}()*+?.,\\\^$|#\s]/g, "\\$&");
}

Question 32

The only difference is that they escape , (which is not a metacharacter), and # and whitespace, which only matter in free-spacing mode (which is not supported by JavaScript). However, they do get it right not to escape the forward slash.

Question 33

If you want to reuse jquery UI's implementation rather than paste the code locally, go with $.ui.autocomplete.escapeRegex(myString).

Question 34

lodash has this too: _.escapeRegExp and npmjs.com/package/lodash.escaperegexp

Question 35

v1.12 the same, ok!

Question 36

Nothing should prevent you from just escaping every non-alphanumeric character:

usersString.replace(/(?=\W)/g, '\\');

You lose a certain degree of readability when doing re.toString() but you win a great deal of simplicity (and security).

According to ECMA-262, on the one hand, regular expression "syntax characters" are always non-alphanumeric, such that the result is secure, and special escape sequences (\d, \w, \n) are always alphanumeric such that no false control escapes will be produced.

Question 37

Simple and effective. I like this much better than the accepted answer. For (really) old browsers, .replace(/[^\w]/g, '\\$&') would work in the same way.

Question 38

This fails in Unicode mode. For example, new RegExp('🍎'.replace(/(?=\W)/g, '\\'), 'u') throws exception because \W matches each code unit of a surrogate pair separately, resulting in invalid escape codes.
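
To make the Unicode-mode problem concrete, here is a hedged sketch comparing the escape-everything approach with a syntax-characters-only list (the MDN-style list quoted elsewhere on this page). The helper names escapeAllNonWord, escapeSyntaxOnly and compiles are made up for this example.

```javascript
// Escape-everything approach from the answer above.
function escapeAllNonWord(s) {
  return s.replace(/(?=\W)/g, '\\');
}

// Syntax-character-only approach (MDN-style character list).
function escapeSyntaxOnly(s) {
  return s.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
}

// Reports whether a pattern source compiles with the given flags.
function compiles(source, flags) {
  try {
    new RegExp(source, flags);
    return true;
  } catch (e) {
    return false;
  }
}

const input = '🍎 costs $1.50';
console.log(compiles(escapeAllNonWord(input), ''));  // true
console.log(compiles(escapeAllNonWord(input), 'u')); // false
console.log(compiles(escapeSyntaxOnly(input), 'u')); // true
console.log(new RegExp(escapeSyntaxOnly(input), 'u').test(input)); // true
```

With the u flag, a backslash may only precede a syntax character (or /), so blanket escaping breaks even on a plain space, not just on surrogate pairs.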

Question 39

alternative: .replace(/\W/g, "\\$&");

Question 40

@AlexeyLebedev Has the answer been fixed to handle Unicode mode? Or is there a solution elsewhere which does, while maintaining this simplicity?

Question 41

There is an ES7 proposal for RegExp.escape at https://github.com/benjamingr/RexExp.escape/, with a polyfill available at https://github.com/ljharb/regexp.escape.

An example based on the rejected ES proposal, includes checks if the property already exists, in the case that TC39 backtracks on their decision.

Code:

if (!Object.prototype.hasOwnProperty.call(RegExp, 'escape')) {
 RegExp.escape = function(string) {
 // https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Regular_Expressions#Escaping
 // https://github.com/benjamingr/RegExp.escape/issues/37
 return string.replace(/[.*+\-?^${}()|[\]\\]/g, '\\$&'); // $& means the whole matched string
 };
}

Code Minified:

Object.prototype.hasOwnProperty.call(RegExp,"escape")||(RegExp.escape=function(e){return e.replace(/[.*+\-?^${}()|[\]\\]/g,"\\$&")});

// ...
var assert = require('assert');
 
var str = 'hello. how are you?';
var regex = new RegExp(RegExp.escape(str), 'g');
assert.equal(String(regex), '/hello\\. how are you\\?/g'); // backslashes doubled so the string literal keeps them

There is also an npm module at: https://www.npmjs.com/package/regexp.escape

One can install this and use it as so:

npm install regexp.escape

or

yarn add regexp.escape

var escape = require('regexp.escape');
var assert = require('assert');
 
var str = 'hello. how are you?';
var regex = new RegExp(escape(str), 'g');
assert.equal(String(regex), '/hello\\. how are you\\?/g'); // backslashes doubled so the string literal keeps them

In the GitHub && NPM page are descriptions of how to use the shim/polyfill for this option, as well. That logic is based on return RegExp.escape || implementation;, where implementation contains the regexp used above.

The NPM module is an extra dependency, but it also makes it easier for an external contributor to identify logical parts added to the code. ¯\_(ツ)_/¯

Question 42

This answer begins identically to [ stackoverflow.com/a/30852428/5979634 ], I had hoped to edit their answer to include this information, but a simpler version of this was considered too different from the original answer. I figured I offered actual code examples within the website, but I'm not gonna argue. Instead, I've offered this as a new, expanded answer, seeing as it is too different from the one other answer like this.

Question 43

RegExp.escape will soon be a part of official JavaScript (ECMAScript). You can start using the reference implementation here: npmjs.com/package/regexp.escape. Track the progress of the inclusion of this function in major runtimes here: github.com/tc39/proposal-regex-escaping/issues/58. Read the spec here: tc39.es/proposal-regex-escaping. WebKit has already accepted the PR.

Question 44

Another (much safer) approach is to escape all the characters (and not just a few special ones that we currently know) using the unicode escape format \u{code}:

function escapeRegExp(text) {
 return Array.from(text)
 .map(char => `\\u{${char.codePointAt(0).toString(16)}}`) // codePointAt keeps astral characters (e.g. emoji) whole
 .join('');
}
console.log(escapeRegExp('a.b')); // '\u{61}\u{2e}\u{62}'

Please note that you need to pass the u flag for this method to work:

var expression = new RegExp(escapeRegExp(usersString), 'u');
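
A hedged end-to-end sketch of this idea follows. It uses codePointAt so that characters outside the Basic Multilingual Plane become a single \u{...} escape; with charCodeAt only the first half of a surrogate pair would be emitted. The name escapeByCodePoint and the sample string are made up for illustration.

```javascript
// Turn every code point of the input into a \u{...} escape.
function escapeByCodePoint(text) {
  return Array.from(text) // Array.from iterates by code point, not code unit
    .map(ch => `\\u{${ch.codePointAt(0).toString(16)}}`)
    .join('');
}

const usersString = 'a.b🍎';
const source = escapeByCodePoint(usersString);
console.log(source); // \u{61}\u{2e}\u{62}\u{1f34e}

// The u flag is required for \u{...} to be read as a code point escape.
const expression = new RegExp(source, 'u');
console.log(expression.test('xa.b🍎y')); // true
console.log(expression.test('aXb🍎'));   // false, the dot is literal
```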

Question 45

Much safer! And ready for future Regex implementations!

Question 46

This is a shorter version.

RegExp.escape = function(s) {
 return s.replace(/[$-\/?[-^{|}]/g, '\\$&');
}

This includes the non-meta characters of %, &, ', and ,, but the JavaScript RegExp specification allows this.
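
Since the ranges hide which characters are actually covered, here is a quick sketch that expands them by testing every printable ASCII character against the same class. The variable names are illustrative.

```javascript
// Same character class as above, without the g flag so test() keeps no state.
const shortPattern = /[$-\/?[-^{|}]/;

// Collect every printable ASCII character the class matches.
let covered = '';
for (let code = 0x20; code <= 0x7e; code++) {
  const ch = String.fromCharCode(code);
  if (shortPattern.test(ch)) covered += ch;
}

console.log(covered); // $%&'()*+,-./?[\]^{|}
```

So . and the parentheses are covered by the $-/ range, and [-^ spans [, \, ] and ^.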

Question 47

I wouldn't use this "shorter" version, since the character ranges hide the list of characters, which makes it harder to verify the correctness at first glance.

Question 48

@nhahtdh I probably wouldn't either, but it is posted here for information.

Question 49

@kzh: posting "for information" helps less than posting for understanding. Would you not agree that my answer is clearer?

Question 50

At least, . is missed. And (). Or not? [-^ is strange. I don't remember what is there.

Question 51

@nhahtdh Why must this be "readable"? We're talking regex, the most unreadable code in tarnation. It takes an hour, a magnifying glass, and constant reference to a manual to analyse any complex regex pattern. I think "readability" is NOT a priority when it comes to regex. Escape a complex string containing many reserved chars, and the result will be even more unreadable. If you know how to analyze regex that well, then the range-style used here should not be a problem for you. If you DON'T analyze regex that well, then the full list of chars isn't going to help you much anyway.

Question 52

XRegExp has an escape function:

XRegExp.escape('Escaped? <.>'); // -> 'Escaped\?\ <\.>'

More on: http://xregexp.com/api/#escape

Question 53

escapeRegExp = function(str) {
 if (str == null) return '';
 return String(str).replace(/([.*+?^=!:${}()|[\]\/\\])/g, '\\$1');
};

Question 54

Rather than only escaping characters which will cause issues in your regular expression (e.g.: a blacklist), consider using a whitelist instead. This way each character is considered tainted unless it matches.

For this example, assume the following expression:

RegExp.escape('be || ! be');

This whitelists letters, numbers and spaces:

RegExp.escape = function (string) {
 return string.replace(/([^\w\d\s])/gi, '\\$1');
}

Returns:

"be \|\| \! be"

This may escape characters which do not need to be escaped, but this doesn't hinder your expression (maybe some minor time penalties - but it's worth it for safety).

Question 55

How is this different than @filip's answer? stackoverflow.com/a/40562456/209942

Question 56

RegExp.escape will soon be a part of official JavaScript (ECMAScript).
It's in Firefox and WebKit (Safari) already.

Specification: https://tc39.es/proposal-regex-escaping

Reference Implementation: https://npmjs.com/package/regexp.escape

Spec acceptance progress:
https://github.com/tc39/proposal-regex-escaping/issues/58
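
Until the function is available everywhere you run, a feature-detecting wrapper is one option. This is only a sketch: escapeForRegExp is a made-up name, and the fallback is the hand-written replace from the accepted answer, not the reference implementation. As far as I know the native function escapes more characters than the fallback, so the two can return different strings for the same input, but either result should match the input literally.

```javascript
// Hypothetical helper: prefer the native RegExp.escape when it exists,
// otherwise fall back to a hand-written replace.
const escapeForRegExp =
  typeof RegExp.escape === 'function'
    ? (s) => RegExp.escape(s)
    : (s) => s.replace(/[/\-\\^$*+?.()|[\]{}]/g, '\\$&');

const input = 'price: $1.50 (approx.)';
const pattern = new RegExp('^' + escapeForRegExp(input) + '$');

console.log(pattern.test(input));                    // true
console.log(pattern.test('price: $1x50 (approx.)')); // false
```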

Question 57

It's not in Chrome as of v134.

Question 58

It's in v142 now.

Question 59

The functions in the other answers are overkill for escaping entire regular expressions (they may be useful for escaping parts of regular expressions that will later be concatenated into bigger regexps).

If you escape an entire regexp and are done with it, quoting the metacharacters that are either standalone (., ?, +, *, ^, $, |, \) or start something ((, [, {) is all you need:

String.prototype.regexEscape = function regexEscape() {
 return this.replace(/[.?+*^$|({[\\]/g, '\\$&');
};

And yes, it's disappointing that JavaScript doesn't have a function like this built-in.

Question 60

Let's say you escape the user input (text)next and insert it in: (?: + input + ). Your method will give the resulting string (?:\(text)next) which fails to compile. Note that this is quite a reasonable insertion, not some crazy one like re\ + input + re (in this case, the programmer can be blamed for doing something stupid)
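
The closing-parenthesis problem can be checked directly. In this sketch the answer's method is rewritten as a plain function with the made-up name minimalEscape, and compiles is a small illustrative helper.

```javascript
// The answer's character list, as a plain function instead of a
// String.prototype method.
function minimalEscape(s) {
  return s.replace(/[.?+*^$|({[\\]/g, '\\$&');
}

// Reports whether a pattern source compiles with the given flags.
function compiles(source, flags) {
  try {
    new RegExp(source, flags);
    return true;
  } catch (e) {
    return false;
  }
}

const escaped = minimalEscape('(text)next');
console.log(escaped);                         // \(text)next
console.log(compiles('(?:' + escaped + ')')); // false, stray ")"
console.log(compiles(escaped));               // false, even on its own

// A stray "]" is tolerated without the u flag, but not with it.
console.log(compiles(minimalEscape('[x]')));      // true
console.log(compiles(minimalEscape('[x]'), 'u')); // false
```

So an unescaped ) is rejected by JavaScript whether or not the escaped text is embedded in a larger pattern, while ] and } only get by in non-Unicode mode.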

Question 61

@nhahtdh: my answer specifically mentioned escaping entire regular expressions and "being done" with them, not parts (or future parts) of regexps. Kindly undo the downvote?

Question 62

It's rarely the case that you would escape the entire expression - there are string operations, which are much faster than regex if you want to work with a literal string.

Question 63

Please address the part about closing )

Question 64

It would be right to escape closing braces too, even if they are allowed by some dialect. As I remember, that's an extension, not a rule.

bobince · Accepted Answer · 2010-08-24 23:09:07Z

