PK Systems PK Systems
Text tools

Text Extractor

Pull every email, URL, phone number, hashtag, mention, date and IPv4 address out of a block of messy text — instantly.

Text Extractor

Emails 0
URLs 0
Phone numbers 0
Hashtags 0
Mentions 0
Dates 0
IPv4 addresses 0

What this tool does

Drop in any text — an email thread, a chat log, a website you copied, a CSV, a stack trace — and the extractor pulls out structured fragments for you. Each category gets its own card with a count and a copy button, so you can grab just the emails (or just the URLs) without dredging through the rest. Works equally well on small snippets and on multi-megabyte logs.

How to use the extractor

Paste your text. Each category card on the right updates as you type, showing what was found. Toggle Remove duplicates to keep one copy of each unique value (useful for emails and URLs from a long thread). Toggle Sort alphabetically if you want the results in a predictable order rather than in source order. Click Copy on any card to put just that category on your clipboard.

What the patterns recognise

Emails: anything matching local@domain.tld with at least a 2-letter TLD. URLs: starts with http://, https://, or www.. Phones: international, with optional +, parentheses, dashes, dots, spaces — minimum 7 digits to avoid catching ZIP codes. Hashtags / mentions: standard #tag / @handle with letters/digits/underscores. Dates: ISO, slashed (US/EU), and month-name forms. IPv4: dotted quads with each octet validated to 0-255.

Patterns and examples

Category What it picks up
Emailsjane@example.com, support+sales@acme.co.uk
URLshttps://example.com/path?q=1, www.acme.io
Phone numbers+1 (415) 555-0123, +44 20 7946 0958
Hashtags#launch, #summer2026
Mentions@nasa, @jane.doe
Dates2026-05-06, 06/05/2026, May 6, 2026
IPv4 addresses192.168.1.1, 10.0.0.255

Frequently asked questions

Will it find every email in my text?
Yes for any reasonable email format — local@domain.tld with a 2+ letter TLD, plus dots, dashes, pluses and percent signs in the local part. Obfuscated forms like jane (at) example dot com won't match — paste them with the literal @ first if you need them out.
Are international phone numbers detected?
Yes. The pattern allows an optional +, country code, parentheses, and any mix of spaces, dots and dashes. Numbers must have at least 7 digits to be considered phone-like, which filters out things like ZIP codes, order numbers and version strings.
Why are short hashtags ignored?
The pattern requires at least 2 characters after the # to keep noise low. Single-character hashtags like #1 are usually pagination, footnote markers or section numbers, not actual tags. If you need to capture them, use the regex find tool with a custom pattern.
What date formats does it pick up?
ISO (2026-05-06), slashed forms (06/05/2026 or 05/06/2026), dashed forms, and month-name styles like May 6, 2026. The extractor doesn't try to guess US-vs-EU order — it captures the substring as it appears, leaving disambiguation to you.
Are duplicates removed by default?
Yes — that's the most common request, especially for emails and URLs in long threads. Toggle Remove duplicates off if you need a frequency count or the original order. For emails, deduplication is case-insensitive (since email locals are technically case-sensitive but addresses are usually treated as identical when lowercased).
Does my text leave the browser?
No. All extraction is regex matching that runs locally on your machine. You can confirm with DevTools > Network — typing into the input fires no requests. Safe for chat logs, customer data, internal documents, anything sensitive.