Top Qs
Timeline
Chat
Perspective
Comparison of regular expression engines
From Wikipedia, the free encyclopedia
Remove ads
This is a comparison of regular expression engines.
Libraries
Summarize
Perspective
Remove ads
Languages
- "Dotnet/Corefx". GitHub. 16 February 2022.
- "Dotnet/Corefx". GitHub. 16 February 2022.
Remove ads
Language features
Summarize
Perspective
NOTE: An application using a library for regular expression support does not necessarily support the full set of features of the library, e.g., GNU grep uses PCRE, but supports no lookahead, though PCRE does.
Part 1
- Shy groups, also called non-capturing groups cannot be referred to with backreferences; non-capturing groups are used to speed up matching where the group's content does not need to be accessed later.
- Backreferences enable referring to previously matched groups in later parts of the regex and/or replacement string (where applicable). For instance, ([ab]+)\1 matches "abab" but not "abaab".
- FREJ have no repetitive quantifiers, but have "optional" element which behaves similar to simple "?" quantifier.
- As of ES2018
- Lua's only non-greedy quantifier is
-
, which is a non-greedy version of*
. It does not have non-greedy versions of+
or?
; in the former case, the non-greedy effect can be achieved by repeating the token followed by-
, but in the latter case, there is no equivalent.
Part 2
- Also known as flags modifiers, modes modifiers or option letters. Example pattern: "(?i:test)".
- Also called independent sub-expressions.
- Similar to back references, but with names instead of indices.
- Special feature allowing to match balanced constructs without recursion.
- Refers to the possibility of including quantifiers in look-behinds, thus making their length unpredictable.
- Unicode property support may be incomplete (products are continuously updated!). All will be incomplete when a new Unicode revision is released until they are updated to comply.
- Available as of ICU55.
- Available as of JDK7.
- The support and range of properties is dependent on implementation.
- Experimental support added in v5.29.9.
- May only be available in the regex library when used with Python versions after 3.3.
Remove ads
API features
- Means the format can be used internally without explicit conversion.
- Since version 8.30.
- Partial matching is performed implicitly, requiring a separate call to matchedLength() if an exact match fails.
- Tcl includes facilities to convert to and from UTF-8.
- wxRegEx uses any system supplied POSIX library or if not available and for Unicode mode uses Henry Spencer's library.
Remove ads
See also
References
External links
Wikiwand - on
Seamless Wikipedia browsing. On steroids.
Remove ads