Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

davidsusu/regexbee

Repository files navigation

🐝 RegexBee

Honeycomb cells for building regular expressions in a fluent way. Even complex ones.

Issues, recommendations and pull request are welcome.

Main features:

  • fluent API
  • immutable
  • concise, declarative, readable
  • limitlessly composable
  • has built-in basic fragments and compositions
  • character class nesting
  • integer range match
  • support for different greediness types
  • support for named groups and references
  • support for lookaround, modifiers and other special groups
  • flexible templating
  • and many more...

Examples

So, you can build complex patterns in a fluent, declarative way:

BeeFragment myFragment = Bee
 .then(Bee.BEGIN)
 .then(Bee.ASCII_WORD)
 .then(Bee.WHITESPACE.any())
 .then(Bee.ASCII_WORD.as("nameX"))
 .then(Bee.intBetween(-4, 1359)
 .or(Bee.fixed("-").more(Greediness.POSSESSIVE)))
 .then(Bee.ref("nameX"))
 .then(Bee.fixed("??)fixed?text. ").optional())
 .then(Bee.ASCII_WORD.optional(Greediness.LAZY))
 .then(Bee.END);

BeeFragments can be converted to java.util.regex.Pattern with calling .toPattern():

Pattern myPattern = myFragment.toPattern();
Matcher myMatcher = myPattern.matcher(someString);
if (myMatcher.matches()) {
 String nameX = myMatcher.group("nameX");
 System.out.println(String.format("We have a nice day, %s!", nameX));
}

Let's see an other example with a simple log processor:

private static final Pattern LOG_ENTRY_PATTERN = Bee
 .then(Bee.BEGIN)
 .then(Bee.TIMESTAMP.as(TIMESTAMP_NAME))
 .then(Bee.WHITESPACE.more())
 .then(Bee.oneFixedOf("INFO", "WARN", "ERROR").as(SEVERITY_NAME))
 .then(Bee.WHITESPACE.more())
 .then(Bee.ANYTHING.as(MESSAGE_NAME))
 .then(Bee.END)
 .toPattern();

Then, our pattern can be used like this:

private void processLine(String line) {
 System.out.println("--------------------");
 
 Matcher matcher = LOG_ENTRY_PATTERN.matcher(line);
 if (!matcher.matches()) {
 System.out.println(String.format("Unparseable line: %s", line));
 return;
 }
 System.out.println(String.format("timestamp: %s", matcher.group(TIMESTAMP_NAME)));
 System.out.println(String.format("severity: %s", matcher.group(SEVERITY_NAME)));
 System.out.println(String.format("message: %s", matcher.group(MESSAGE_NAME)));
}

Templating

You can insert placeholders at any place of a structure with Bee.placeholder() or Bee.placeholder("someParam"), then you can construct a template with .toTemplate(). Parameters can be substituted by calling .substitute(...).

You can create a single-parameter template with the following method:

BeeTemplate template = Bee
 .then(Bee.fixed("((("))
 .then(Bee.placeholder())
 .then(Bee.fixed(")))"))
 .toTemplate();

Then, you can create any number of substituted fragments:

BeeFragment substitutedWithWord = template.substitute(Bee.WORD);
BeeFragment substitutedWithNumber = template.substitute(Bee.UNSIGNED_INT);

Alternatively, explicitly named placeholders can be used. Multiple placeholders with the same name are also supported. For example:

BeeTemplate template = Bee
 .then(Bee.placeholder("p1"))
 .then(Bee.SPACE)
 .then(Bee.placeholder("p2"))
 .then(Bee.SPACE)
 .then(Bee.placeholder("p1"))
 .then(Bee.SPACE)
 .then(Bee.placeholder("p1")
 .or(Bee.placeholder("p2")))
 .then(Bee.SPACE)
 .then(Bee.placeholder("p3"))
 .toTemplate();

In this example 'p1' and 'p2' are used multiple times.

You can use a java.util.Map for substituting named parameters:

Map<String, BeeFragment> parameters = new HashMap<String, BeeFragment>();
parameters.put("p1", Bee.fixed("*").more());
parameters.put("p2", Bee.UNSIGNED_INT);
parameters.put("p3", Bee.fixed("%").more());
BeeFragment substitutedFragment1 = template.substitute(parameters);

Of course, with Java 9+ you can use Map.of to make this a little bit cleaner:

BeeFragment substitutedFragment1 = template.substitute(Map.of(
 "p1", Bee.fixed("*").more(),
 "p2", Bee.UNSIGNED_INT,
 "p3", Bee.fixed("%").more()));

You can use placeholders just as any fragment. For example they accept quantifiers too:

BeeTemplate somethingMoreTemplate = Bee.placeholder().more().toTemplate();

For more examples see the examples package or the tests.

Custom fragments

To create a custom fragment, just implement the BeeFragment interface:

public class SeparatedByCommaFragment implements BeeFragment {
 
 private final List<BeeFragment> fragments;
 
 
 public SeparatedByCommaFragment(BeeFragment... fragments) {
 this(Arrays.asList(fragments));
 }
 
 public SeparatedByCommaFragment(Collection<BeeFragment> fragments) {
 this.fragments = new ArrayList<>(fragments);
 }
 
 
 @Override
 public String get() {
 return fragments.stream().map(BeeFragment::get).collect(Collectors.joining(","));
 }
 
}

Now SeparatedByCommaFragment can be used as any other fragments:

BeeFragment fragment = Bee
 .then(Bee.BEGIN)
 .then(new SeparatedByCommaFragment(
 Bee.UNSIGNED_INT.as("number"),
 Bee.ASCII_WORD.as("word"),
 Bee.oneFixedOf("lorem", "ipsum").as("keyword")))
 .then(Bee.END);
// ...

AltStyle γ«γ‚ˆγ£γ¦ε€‰ζ›γ•γ‚ŒγŸγƒšγƒΌγ‚Έ (->γ‚ͺγƒͺγ‚ΈγƒŠγƒ«) /