Indraft
and macros, Escaping
Insert excerpt | ||||||||
---|---|---|---|---|---|---|---|---|
|
Characters
Construct | Matches |
---|---|
x | The character x |
\\\\ | The backslash character |
\\/ | The forward slash character |
\\\" | The double quote character |
\\0n | The character with octal value 0n (0 <= n <= 7) |
\\0nn | The character with octal value 0nn (0 <= n <= 7) |
\\0mnn | The character with octal value 0mnn (0 <= m <= 3, 0 <= n <= 7) |
\\xhh | The character with hexadecimal value 0xhh |
\uhhhh | The character with hexadecimal value 0xhhhh |
\\t | The tab character ('\u0009') |
\\n | The newline (line feed) character ('\u000A') |
\\r | The carriage-return character ('\u000D') |
\\f | The form-feed character ('\u000C') |
\\a | The alert (bell) character ('\u0007') |
\\e | The escape character ('\u001B') |
\\cx | The control character corresponding to x |
Character Classes
Construct | Matches |
---|---|
[abc] | a, b, or c (simple class) |
[^abc] | Any character except a, b, or c (negation) |
[a-zA-Z] | a through z or A through Z, inclusive (range) |
[a-d[m-p]] | a through d, or m through p: [a-dm-p] (union) |
[a-z&&[def]] | d, e, or f (intersection) |
[a-z&&[^bc]] | a through z, except for b and c: [ad-z] (subtraction) |
[a-z&&[^m-p]] | a through z, and not m through p: [a-lq-z](subtraction) |
Predefined | |
. | Any character wildcard This may or may not match line terminators |
\\d | A digit: [0-9] |
\\D | A non-digit: [^0-9] |
\\s | A whitespace character: [ \\t\\n\\x0B\\f\\r] |
\\S | A non-whitespace character: [^\\s] |
\\w | A word character: [a-zA-Z_0-9] |
\\W | A non-word character: [^\\w] |
Line Terminators
Anchor | ||||
---|---|---|---|---|
|
A one- or two-character sequence that marks the end of a line of the input character sequence.
Construct | Indicates |
---|---|
\\n | newline (line feed) |
\\r | carriage-return |
\\r\\n | carriage-return followed immediately by a newline |
\\u0085 | next-line |
\\u2028 | line-separator |
\\u2029 | paragraph-separator |
POSIX Classes
(US-ASCII only)
Construct | Matches |
---|---|
\\p{Lower} | A lower-case alphabetic character: [a-z] |
\\p{Upper} | An upper-case alphabetic character:[A-Z] |
\\p{ASCII} | All ASCII:[\\x00-\\x7F] |
\\p{Alpha} | An alphabetic character:[\\p{Lower}\\p{Upper}] |
\\p{Digit} | A decimal digit: [0-9] |
\\p{Alnum} | An alphanumeric character:[\\p{Alpha}\\p{Digit}] |
\\p{Punct} | Punctuation: One of !"#$%&'()*+,-./:;<=>?@[\]^_`{|}~ |
\\p{Graph} | A visible character: [\\p{Alnum}\\p{Punct}] |
\\p{Print} | A printable character: [\\p{Graph}] |
\\p{Blank} | A space or a tab: [ \\t] |
\\p{Cntrl} | A control character: [\\x00-\\x1F\\x7F] |
\\p{XDigit} | A hexadecimal digit: [0-9a-fA-F] |
\\p{Space} | A whitespace character: [ \\t\\n\\x0B\\f\\r] |
Unicode Classes
Construct | Matches |
---|---|
\\p{InGreek} | A character in the Greek block (simple block) |
\\p{Lu} | An uppercase letter (simple category) |
\\p{Sc} | A currency symbol |
\\P{InGreek} | Any character except one in the Greek block (negation) |
[\\p{L}&&[^\\p{Lu}]] | Any letter except an uppercase letter (subtraction) |
Unicode Characters
Construct | Matches |
---|---|
\x{20ac} | The Euro symbol € |
\x{1f914} | An emoji character 🤔 |
Boundaries
Construct | Matches |
---|---|
^ | The beginning of a line |
$ | The end of a line |
\\b | A word boundary |
\\B | A non-word boundary |
\\A | The beginning of the input |
\\G | The end of the previous match |
\\Z | The end of the input but for the final, if any |
\\z | The end of the input |
Back References
Construct | Matches |
---|---|
$n | Whatever the nth capturing group matched; see Groups and Capturing, above. |
Quantifiers
Construct | Matches | ||
---|---|---|---|
Greedy longest possible match | Reluctant (lazy) | ||
X? | X?? | X?+ | X, once or not at all |
X* | X*? | X*+ | X, zero or more times |
X+ | X+? | X++ | X, one or more times |
X{n} | X{n}? | X{n}+ | X, exactly n times |
X{n,} | X{n,}? | X{n,}+ | X, at least n times |
X{n,m} | X{n,m}? | X{n,m}+ | X, at least n but not more than m times |
Logical Operators
Construct | Matches |
---|---|
XY | X followed by Y |
X|Y | Either X or Y |
(X) | X, as a capturing group; see Groups and Capturing, above. |
Special Constructs
(Non-capturing)
Construct | Matches |
---|---|
(?:X) | X, as a non-capturing group |
(?idmsux-idmsux) | Nothing, but turns match flags on - off |
(?idmsux-idmsux:X) | X, as a non-capturing group with the given flags on - off; see Groups and Capturing, above. |
(?=X) | X, via zero-width positive lookahead |
(?!X) | X, via zero-width negative lookahead |
(?<=X) | X, via zero-width positive lookbehind |
(?<!X) | X, via zero-width negative lookbehind |
(?>X) | X, as an independent, non-capturing group |
Quotation
Construct | Matches |
---|---|
\\ | Nothing, but quotes the following character |
\\Q | Nothing, but quotes all characters until \\E |
\\E | Nothing, but ends quoting started by \\Q |