RegEx Cheat Sheet

Every pattern, anchor, quantifier, group, and flag. Searchable, copy-ready, with live match testing.

Anchors

Position Anchors

JS, PY, PCRE

beginner

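Start of string

^

^Hello matches Hello in "Hello world"

End of string

$

world$ matches world in "Hello world"

Word boundary

\b

\bcat\b matches cat but not the cat in concatenate

Non-word boundary

\B

\Bcat\B matches the cat inside concatenate, but not the standalone word cat

Absolute start of string (multiline-safe; Python and PCRE only, not JS)

\A

\AHello matches Hello only at the very start of the string, even in multiline mode

Absolute end of string (multiline-safe; Python and PCRE only, not JS)

\Z

world\Z matches world only at the very end of the string (in PCRE, \Z also matches before a final newline; use \z for the strict end)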
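
A minimal Python sketch of how these anchors behave, assuming the standard re module. The sample strings and variable names are made up for illustration. It contrasts ^ with \A under the multiline flag and shows what \b and \B accept.

```python
import re

text = "Hello world\nHello again"

# Without MULTILINE, ^ matches only at the very start of the string.
start_only = re.findall(r"^Hello", text)

# With MULTILINE, ^ also matches right after each newline.
each_line = re.findall(r"^Hello", text, re.MULTILINE)

# \A ignores MULTILINE and still matches only at the absolute start.
absolute = re.findall(r"\AHello", text, re.MULTILINE)

# \b needs a word boundary on both sides, so "concatenate" is skipped.
whole_word = [m.start() for m in re.finditer(r"\bcat\b", "cat concatenate")]

# \B needs the absence of a boundary, so only the embedded "cat" matches.
embedded = [m.start() for m in re.finditer(r"\Bcat\B", "cat concatenate")]

print(start_only, each_line, absolute, whole_word, embedded)
```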

Character Classes

Built-in Classes

JS, PY, PCRE

beginner

Any digit (0 through 9)

\d

\d+ matches 42 in "42 hello"

Any non-digit

\D

\D+ matches non-numeric runs

Word character (a-z, A-Z, 0-9, _)

\w

\w+ matches hello_42

Non-word character

\W

\W matches ! in "hello!world"

Whitespace (space, tab, newline)

\s

\s+ matches spaces between words

Non-whitespace

\S

\S+ matches each word token

Any character except newline

.

c.t matches cat, cut, c4t
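
The built-in classes can be tried out with a short Python sketch like the one below. It assumes the standard re module; the sample text and variable names are illustrative only. The last two lines show one engine difference worth knowing: Python 3 string patterns are Unicode-aware unless re.ASCII is set.

```python
import re

sample = "Order 42: hello_42 shipped!"

digits = re.findall(r"\d+", sample)      # runs of digits
words = re.findall(r"\w+", sample)       # runs of letters, digits, underscore
non_words = re.findall(r"\W", sample)    # single non-word characters

# "." does not cross a newline unless the dot-all flag is set.
dot_hits = re.findall(r"c.t", "cat cut c4t c\nt")

# \u0663 is the Arabic-Indic digit three. Python's \d accepts it by
# default and rejects it when re.ASCII is given.
unicode_digit = re.findall(r"\d", "\u0663")
ascii_only = re.findall(r"\d", "\u0663", re.ASCII)

print(digits, words, non_words, dot_hits)
```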

Custom Character Classes

JS, PY, PCRE

beginner

Match any character in set

[aeiou]

[aeiou] matches vowels in "hello"

Match any character NOT in set

[^aeiou]

[^aeiou] matches consonants and spaces

Character range

[a-z]

[a-z]+ matches lowercase words

Alphanumeric range

[a-zA-Z0-9]

Matches any letter or digit

Literal dot inside class

[.]

Inside [], dot is literal. No escaping needed

POSIX Classes (PCRE)

PCRE only (Python's built-in re module does not support POSIX classes)

intermediate

Any letter (locale-aware)

[[:alpha:]]

Equivalent to [a-zA-Z] in basic locales

Any digit

[[:digit:]]

Equivalent to [0-9]

Alphanumeric

[[:alnum:]]

Equivalent to [a-zA-Z0-9]

Whitespace

[[:space:]]

Includes space, tab, newline, form feed

Uppercase letters

[[:upper:]]

Equivalent to [A-Z] in C locale

Lowercase letters

[[:lower:]]

Equivalent to [a-z] in C locale

Punctuation characters

[[:punct:]]

Matches . , ! ? ; : etc.

Quantifiers

Greedy Quantifiers

JS, PY, PCRE

beginner

Zero or more (greedy)

*

a* matches "", "a", "aaa"

One or more (greedy)

+

a+ matches "a", "aaa" but not ""

Zero or one (optional)

?

colou?r matches "color" and "colour"

Exactly n times

{n}

\d{4} matches "2024"

At least n times

{n,}

\d{2,} matches "42", "123"

Between n and m times

{n,m}

\d{2,4} matches "42", "123", "2024"

Lazy Quantifiers

JS, PY, PCRE

intermediate

Zero or more (lazy, shortest match)

*?

<.*?> matches <b> in "<b>bold</b>", not the whole string

One or more (lazy)

+?

a+? matches only the first "a" in "aaa"

Zero or one (lazy)

??

Prefers the shorter match (zero occurrences) when possible

Between n and m (lazy)

{n,m}?

Stops as soon as minimum is met
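
The difference between greedy and lazy matching is easiest to see side by side. The following is a small Python sketch using the standard re module; the HTML fragment and variable names are invented for the example.

```python
import re

html = "<b>bold</b> and <i>italic</i>"

# Greedy: .+ runs to the last ">" it can reach, swallowing the whole string.
greedy = re.findall(r"<.+>", html)

# Lazy: .+? stops at the first ">" that lets the pattern succeed.
lazy = re.findall(r"<.+?>", html)

# A lazy quantifier takes as few repetitions as the pattern allows.
one_a = re.match(r"a+?", "aaa").group()
two_digits = re.match(r"\d{2,4}?", "2024").group()

print(greedy, lazy, one_a, two_digits)
```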

Groups

Capturing & Non-Capturing

JS, PY, PCRE

beginner

Capturing group

(abc)

(foo) captures "foo" in group 1

Non-capturing group

(?:abc)

(?:foo)+ groups without capturing

Named capturing group

(?<name>abc)

(?<year>\d{4}): access via match.groups.year in JS; Python's re module writes the group as (?P<year>\d{4}) and reads it with match.group("year")

Alternation (OR)

cat|dog

cat|dog matches "cat" or "dog"

Backreference to group 1

\1

(\w+) \1 matches "hello hello"

Named backreference

\k<name>

(?<w>\w+) \k<w> matches repeated words (Python's re module uses (?P<w>\w+) (?P=w) instead)
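
Python's re module spells named groups and named backreferences differently from JS, so here is a brief sketch of the group features above in Python syntax. The input strings and variable names are assumptions made for the example.

```python
import re

# Named groups use (?P<name>...) in Python.
m = re.search(r"(?P<year>\d{4})-(?P<month>\d{2})", "Released 2024-01-15")
year_by_name = m.group("year")
year_by_index = m.group(1)          # named groups are still numbered
as_dict = m.groupdict()

# Numbered backreference: \1 must repeat whatever group 1 captured.
doubled = re.search(r"\b(\w+) \1\b", "say hello hello world").group()

# Named backreference: (?P=name) in Python rather than \k<name>.
doubled_named = re.search(r"\b(?P<w>\w+) (?P=w)\b", "it is is fine").group()

# With a capturing group, findall returns the group; with a
# non-capturing group it returns the whole match.
capturing = re.findall(r"(foo)+", "foofoo bar foo")
non_capturing = re.findall(r"(?:foo)+", "foofoo bar foo")

print(as_dict, doubled, doubled_named, capturing, non_capturing)
```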

Lookaround

Lookahead & Lookbehind

JS, PY, PCRE

advanced

Positive lookahead: followed by

(?=abc)

\d+(?= dollars) matches "100" in "100 dollars"

Negative lookahead: NOT followed by

(?!abc)

\d+(?! dollars) matches "200" in "200 euros" (in "100 dollars" it still matches "10", because \d+ backtracks until the lookahead passes)

Positive lookbehind: preceded by

(?<=abc)

(?<=\$)\d+ matches "100" in "$100"

Negative lookbehind: NOT preceded by

(?<!abc)

(?<!\$)\d+ matches digits not preceded by $

Lookaround Tips

note

Lookarounds are zero-width; they don't consume characters

(?=\d)\w+

Checks next char without moving the cursor

Chain multiple lookaheads for validation

(?=.*\d)(?=.*[A-Z]).{8,}

Validates: 8+ chars, has digit, has uppercase
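
The lookaround entries above can be checked with a short Python sketch. It assumes the standard re module; the sample sentences and the is_strong helper are hypothetical names introduced for illustration. Note how the negative lookahead needs a \b to avoid a partial match.

```python
import re

prices = "100 dollars, 200 euros"

followed = re.findall(r"\d+(?= dollars)", prices)

# Backtracking pitfall: "10" is not followed by " dollars", so it matches.
not_followed_naive = re.findall(r"\d+(?! dollars)", prices)

# Word boundaries force the whole number to be tested.
not_followed = re.findall(r"\b\d+\b(?! dollars)", prices)

preceded = re.findall(r"(?<=\$)\d+", "cost $100 or 50")

# Chained lookaheads: each one scans from the same start position.
STRONG = re.compile(r"(?=.*\d)(?=.*[A-Z]).{8,}")

def is_strong(password: str) -> bool:
    """Return True if the password has 8+ chars, a digit and an uppercase letter."""
    return STRONG.fullmatch(password) is not None

print(followed, not_followed_naive, not_followed, preceded)
```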

Flags

Regex Flags / Modifiers

JS, PY, PCRE

beginner

Case-insensitive matching

i

/hello/i matches "Hello", "HELLO"

Global: find all matches (JS)

g

/\d/g finds every digit in the string

Multiline: ^ and $ match line boundaries

m

/^\w+/gm matches the first word of each line

Dot-all: . matches newlines too

s

/a.b/s matches "a\nb"

Unicode mode

u

/\u{1F600}/u matches the emoji code point U+1F600

Sticky: match only at lastIndex (JS)

y

Anchors the match to the regex.lastIndex position

Verbose: allow whitespace & comments (Python/PCRE)

x

re.compile(r"\d+ # digits", re.VERBOSE)

Combine multiple flags

/hello/gi (global, case-insensitive)
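
In Python the same flags are passed as constants rather than suffix letters. The sketch below maps the flags above onto the standard re module; the test strings and variable names are made up for the example.

```python
import re

# i -> re.IGNORECASE
ci = re.findall(r"hello", "Hello HELLO", re.IGNORECASE)

# g has no Python flag: findall/finditer already return every match.
all_digits = re.findall(r"\d", "a1b2c3")

# m -> re.MULTILINE
first_words = re.findall(r"^\w+", "one two\nthree four", re.MULTILINE)

# s -> re.DOTALL
no_dotall = re.search(r"a.b", "a\nb")
with_dotall = re.search(r"a.b", "a\nb", re.DOTALL)

# x -> re.VERBOSE: whitespace and # comments inside the pattern are ignored.
date_re = re.compile(r"""
    (?P<year>\d{4})   # four-digit year
    -
    (?P<month>\d{2})  # two-digit month
""", re.VERBOSE)
month = date_re.search("2024-01").group("month")

# Flags combine with the bitwise OR operator.
combined = re.findall(r"^hello", "Hello\nHELLO", re.IGNORECASE | re.MULTILINE)

print(ci, all_digits, first_words, month, combined)
```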

Escapes

Special Characters & Escapes

JS, PY, PCRE

beginner

Must-escape metacharacters

\ ^ $ . | ? * + ( ) [ ] { }

Escape with \ to match literally: \. \* \+

Tab character

\t

\t matches a tab character

Newline

\n

\n matches a line feed

Carriage return

\r

\r\n matches Windows line endings

Unicode code point (JS and Python; PCRE uses \x{XXXX})

\uXXXX

\u0041 matches "A"

Hex character

\xXX

\x41 matches "A"
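
When the text to match comes from user input, escaping each metacharacter by hand is error-prone. One option in Python is re.escape, sketched below with the standard re module; the literal and haystack strings are invented for the example.

```python
import re

literal = "1+1=2 (approx.)"
haystack = "so 1+1=2 (approx.) holds"

# Used raw, "+", "(", ")" and "." act as metacharacters and the search fails.
unescaped = re.search(literal, haystack)

# re.escape() backslash-escapes the metacharacters for a literal match.
escaped = re.search(re.escape(literal), haystack)

# \x41 and \u0041 both denote the character "A".
hex_a = re.fullmatch(r"\x41", "A")
uni_a = re.fullmatch(r"\u0041", "A")

# \r\n|\n splits on Windows and Unix line endings alike.
lines = re.split(r"\r\n|\n", "a\r\nb\nc")

print(unescaped, escaped.group(), lines)
```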

Common Patterns

Email & URLs

JS, PY, PCRE

pattern

Email address (basic)

[\w.+-]+@[\w-]+\.[a-zA-Z]{2,}

Matches addresses such as user@example.com

URL (http / https)

https?://[\w-]+(\.[\w-]+)+([\w.,@?^=%&:/~+#-]*)?

Matches https://example.com/path?q=1

URL slug (kebab-case)

[a-z0-9]+(?:-[a-z0-9]+)*

Matches "my-blog-post", "product-v2"

Domain name

(?:[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?\.)+[a-zA-Z]{2,}

Dates & Times

JS, PY, PCRE

pattern

ISO date (YYYY-MM-DD)

\d{4}-(?:0[1-9]|1[0-2])-(?:0[1-9]|[12]\d|3[01])

Matches 2024-01-15

Time (HH:MM or HH:MM:SS)

(?:[01]\d|2[0-3]):[0-5]\d(?::[0-5]\d)?

Matches 14:30, 09:05:22
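
The date pattern above checks month and day ranges but still accepts impossible dates such as 2024-02-31. A possible approach, sketched here in Python, is to use the regex for shape and the standard datetime module for calendar validity. The is_real_date helper and the constant names are hypothetical.

```python
import re
from datetime import date

ISO_DATE = re.compile(r"\d{4}-(?:0[1-9]|1[0-2])-(?:0[1-9]|[12]\d|3[01])")
TIME = re.compile(r"(?:[01]\d|2[0-3]):[0-5]\d(?::[0-5]\d)?")

def is_real_date(text: str) -> bool:
    """Return True if text is YYYY-MM-DD and names a real calendar day."""
    if not ISO_DATE.fullmatch(text):
        return False
    year, month, day = map(int, text.split("-"))
    try:
        date(year, month, day)   # raises ValueError for days like Feb 31
    except ValueError:
        return False
    return True

print(is_real_date("2024-01-15"), is_real_date("2024-02-31"))
```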

Numbers & IDs

JS, PY, PCRE

pattern

Integer (positive or negative)

-?\d+

Matches -42, 0, 1000

Decimal number

-?\d+(?:\.\d+)?

Matches -3.14, 42, 0.5

Hex color code

#(?:[0-9a-fA-F]{3}){1,2}

Matches #fff, #1a2b3c

UUID v4

[0-9a-f]{8}-[0-9a-f]{4}-4[0-9a-f]{3}-[89ab][0-9a-f]{3}-[0-9a-f]{12}

Matches 550e8400-e29b-41d4-a716-446655440000

IPv4 address

(?:\d{1,3}\.){3}\d{1,3}

Matches 192.168.1.1

Semantic version (semver)

\d+\.\d+\.\d+(?:-[a-zA-Z0-9.]+)?

Matches 1.0.0, 2.3.1-beta.1

File extension

\.\w+$

Matches .jpg in "photo.jpg"; for "app.min.js" or "archive.tar.gz" it matches only the last extension (.js, .gz)

Password strength: 8+ chars, digit, uppercase

(?=.*\d)(?=.*[A-Z]).{8,}

Validates password complexity

Phone number (international)

\+?[\d\s\-().]{7,20}

Matches +1 (555) 123-4567

Credit card number (basic format)

\b(?:\d{4}[\s-]?){3}\d{4}\b

Matches 4111 1111 1111 1111, 4111-1111-1111-1111
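
Several patterns in this group check shape only. The IPv4 pattern, for instance, accepts 999.1.1.1. A minimal Python sketch of pairing the regex with a numeric range check follows; the is_ipv4 helper and IPV4_SHAPE name are assumptions for illustration, and re.ASCII is added so that \d is limited to 0-9.

```python
import re

# Same shape as the cheat-sheet pattern, restricted to ASCII digits.
IPV4_SHAPE = re.compile(r"(?:\d{1,3}\.){3}\d{1,3}", re.ASCII)

def is_ipv4(text: str) -> bool:
    """Return True if text is four dot-separated numbers, each 0 to 255."""
    if not IPV4_SHAPE.fullmatch(text):
        return False
    return all(int(part) <= 255 for part in text.split("."))

print(is_ipv4("192.168.1.1"), is_ipv4("999.1.1.1"))
```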

Substitution Syntax

JS, PY, PCRE

intermediate

Replace using group reference (JS)

"hello world".replace(/(\w+) (\w+)/, "$2 $1")

Returns "world hello"

Replace using named group (JS)

"2024-01-15".replace(/(?<y>\d{4})-(?<m>\d{2})-(?<d>\d{2})/, "$<d>/$<m>/$<y>")

Returns "15/01/2024"

Replace using group reference (Python)

re.sub(r"(\w+) (\w+)", r"\2 \1", "hello world")

Returns "world hello"

Replace using named group (Python)

re.sub(r"(?P<first>\w+) (?P<last>\w+)", r"\g<last> \g<first>", s)

Returns "Smith John" from "John Smith"

Replace using group reference (PCRE/PHP)

preg_replace('/(\w+) (\w+)/', '$2 $1', $str)

Returns "world hello"
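
For replacements that need logic rather than a fixed template, Python's re.sub also accepts a function. The sketch below uses the standard re module; the to_us_date helper and the sample strings are hypothetical names chosen for the example.

```python
import re

# Template replacement with numbered and named group references.
swapped = re.sub(r"(\w+) (\w+)", r"\2 \1", "hello world")
renamed = re.sub(r"(?P<first>\w+) (?P<last>\w+)",
                 r"\g<last> \g<first>", "John Smith")

def to_us_date(match: re.Match) -> str:
    """Build MM/DD/YYYY from the named groups of an ISO date match."""
    return f"{match['m']}/{match['d']}/{match['y']}"

# Function replacement: called once per match with the match object.
us_date = re.sub(r"(?P<y>\d{4})-(?P<m>\d{2})-(?P<d>\d{2})",
                 to_us_date, "2024-01-15")

# count limits how many matches are replaced.
first_only = re.sub(r"\d", "#", "a1b2c3", count=1)

print(swapped, renamed, us_date, first_only)
```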

Character	Meaning
`.`	Matches any single character except newline
`^`	Start of string (or line in multiline mode)
`$`	End of string (or line in multiline mode)
`*`	Zero or more repetitions
`+`	One or more repetitions
`?`	Zero or one repetition (also makes quantifiers lazy)
`\`	Escape character
`|`	Alternation (OR)
`()`	Capturing group
`[]`	Character class
`{}`	Quantifier range

Sequence	Matches
`\.`	A literal dot
`\*`	A literal asterisk
`\(`	A literal opening parenthesis
`\\`	A literal backslash
`\n`	Newline
`\t`	Tab
`\r`	Carriage return

Class	Description	Matches
`\d`	Digit	`0-9`
`\D`	Non-digit	Anything except `0-9`
`\w`	Word character	`a-z`, `A-Z`, `0-9`, `_`
`\W`	Non-word character	Anything `\w` won't match
`\s`	Whitespace	Space, tab, newline
`\S`	Non-whitespace	Anything `\s` won't match
`.`	Any character	Except newline (by default)

Quantifier	Meaning	Example
`*`	0 or more	`a*` matches "", "a", "aaa"
`+`	1 or more	`a+` matches "a", "aaa" but not ""
`?`	0 or 1	`colou?r` matches "color" and "colour"
`{n}`	Exactly n times	`\d{4}` matches exactly 4 digits
`{n,}`	n or more times	`\d{2,}` matches 2 or more digits
`{n,m}`	Between n and m times	`\d{2,4}` matches 2, 3, or 4 digits

Quantifier	Meaning
`*?`	0 or more (lazy)
`+?`	1 or more (lazy)
`??`	0 or 1 (lazy)
`{n,m}?`	Between n and m (lazy)

Quantifier	Meaning
`*+`	0 or more (possessive)
`++`	1 or more (possessive)
`?+`	0 or 1 (possessive)

Anchor	Matches
`^`	Start of string (or line with `m` flag)
`$`	End of string (or line with `m` flag)
`\A`	Start of string (ignores `m` flag)
`\Z`	End of string (ignores `m` flag)

Flag	Name	Effect	Example
`i`	Case insensitive	`a` matches `A` and `a`	`/cat/i` matches "Cat", "CAT"
`g`	Global	Find all matches, not just the first	`/\d+/g` returns every number
`m`	Multiline	`^` and `$` match per line	`/^\w+/gm` matches first word per line
`s`	Dotall	`.` matches newlines too	`/a.b/s` matches "a\nb"
`x`	Extended	Allows whitespace and comments in patterns	Supported in Python, PHP, Ruby
`u`	Unicode	Enables full Unicode support	`/\p{L}+/u` matches Unicode letters

Use Case	Pattern	Notes
Email	`^[\w.+-]+@[\w-]+\.[a-zA-Z]{2,}$`	Basic validation; RFC 5322 is far more complex
URL	`https?://[\w./-]+`	Simplified - use a library for production
Phone (US)	`\(?\d{3}\)?[-.\s]?\d{3}[-.\s]?\d{4}`	Handles common US formats
IP Address (IPv4)	`\b(?:\d{1,3}\.){3}\d{1,3}\b`	Does not validate the 0–255 range
Date (YYYY-MM-DD)	`\d{4}-(?:0[1-9]|1[0-2])-(?:0[1-9]|[12]\d|3[01])`	Validates month and day ranges
HTML tag	`<[^>]+>`	Matches any tag - not for parsing full HTML
Whitespace (trim)	`^\s+|\s+$`	Matches leading and trailing whitespace
Password	`^(?=.*[A-Z])(?=.*\d)(?=.*[\W_]).{8,}$`	8+ chars, uppercase, digit, special char
Hex color	`#(?:[0-9a-fA-F]{3}){1,2}`	Matches 3 or 6 digit hex codes
Username	`^[a-zA-Z0-9_]{3,16}$`	3–16 alphanumeric characters or underscores
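
Validation patterns like those in the table depend on the ^ and $ anchors. In Python, one way to get the same effect is fullmatch, which also avoids a subtle gap: $ matches just before a trailing newline. The sketch below assumes the standard re module; the USERNAME constant and sample inputs are illustrative.

```python
import re

USERNAME = re.compile(r"[a-zA-Z0-9_]{3,16}")

# search() finds the pattern anywhere, so junk around it is accepted.
loose = USERNAME.search("!!bad name!!")

# fullmatch() requires the entire string to fit the pattern.
strict_bad = USERNAME.fullmatch("!!bad name!!")
strict_ok = USERNAME.fullmatch("alice_01")

# $ also matches just before a trailing newline, so the anchored form
# lets "alice\n" through; fullmatch() does not.
dollar_newline = re.search(r"^[a-zA-Z0-9_]{3,16}$", "alice\n")
full_newline = USERNAME.fullmatch("alice\n")

print(loose.group(), strict_bad, strict_ok.group())
```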
