Skip to content

Commit 2700d22

Browse files
committed
Update README.md
1 parent 29e2c9b commit 2700d22

File tree

1 file changed

+8
-1
lines changed

1 file changed

+8
-1
lines changed

README.md

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -308,7 +308,8 @@ whole in memory, but rather buffered in parts in a sliding window of a few KB.
308308
The window size may grow to fit a pattern match. UTF-16/32 file input with a
309309
UTF BOM is automatically normalized and matched as UTF-8.
310310

311-
For example, using Boost.Regex (alternatively use PCRE2 `reflex::PCRE2Matcher`):
311+
For example, using Boost.Regex (alternatively use PCRE2 `reflex::PCRE2Matcher`
312+
or `reflex::PCRE2UTFMatcher` to match Unicode UTF-8 input):
312313

313314
```{.cpp}
314315
#include <reflex/boostmatcher.h> // reflex::BoostMatcher, reflex::Input, boost::regex
@@ -367,6 +368,12 @@ following methods:
367368
- `[0]` operator returns `std::pair<const char*,size_t>(begin(),size())`
368369
- `[n]` operator returns n'th capture `std::pair<const char*,size_t>`
369370

371+
Note: POSIX matchers do not generally support group capturing, e.g.
372+
`BoostPosixMatcher` and `StdPosixMatcher` do not. RE/flex is an efficient
373+
backtrack-free DFA-based POSIX engine that supports a limited form of
374+
capturing, limited to outermost gouping, such as `(abc)|(def)` which has two
375+
groups. This may be extended in a future release to full capturing.
376+
370377
To search a string for words `\w+` to display with the column number of each
371378
word found:
372379

0 commit comments

Comments
 (0)