ConvertCaseTool
Recipe

Extract & Clean URLs

Paste any text — HTML source, chat logs, spreadsheet cells, prose — and get a clean, deduplicated URL list with tracking parameters stripped in one click. Perfect for link audits, content archiving, and sharing clean URLs. 100% browser-based.

PasteExtractStrip trackingNormalizeDedupeExport
Your clean URL list will appear here...

Cleanup

Removes 32 params: utm_*, fbclid, gclid, mc_cid, and more

Domain Filter

Export Format

🔗 What This Recipe Does

The Extract & Clean URLs recipe chains five operations in one click: it finds every URL in your pasted text, removes duplicates, strips tracking parameters (UTM, fbclid, gclid, mc_cid, and 25+ more), normalizes the URLs (lowercase host, remove trailing slashes), and sorts them alphabetically. You end up with a canonical, shareable URL list free of tracking cruft.

Before this recipe, getting a clean URL list meant opening a URL extractor, running dedupe separately, manually editing out UTM params, and then sorting. Now it's one paste and one click.

🧼 What Gets Stripped

The tracking parameter stripper removes these categories of URL parameters:

UTM parameters
utm_source, utm_medium, utm_campaign, utm_term, utm_content, utm_id, utm_name
Click IDs
fbclid, gclid, dclid, yclid, msclkid, twclid, ttclid, igshid
Email tracking
mc_cid, mc_eid, _hsenc, _hsmi, hsctatracking
Affiliate & referral
ref, referrer, source, src, aff, affid, partner_id

How to Use

  1. Paste your text — from HTML source, chat logs, emails, documents, or any source
  2. Tweak the cleanup options (optional) — toggle tracking strip, normalize, dedupe, sort
  3. Add a domain filter (optional) — keep only a specific domain or exclude one
  4. Pick your export format — Plain, CSV, JSON, Markdown bullets, or HTML anchor tags
  5. Copy or download — your clean URL list is ready

🎯 Who This Is For

🔍 SEO specialists

Extract outbound links from a page's HTML source, dedupe, and audit. Get a clean CSV in seconds.

✍️ Content creators

Share clean URLs in your newsletter without UTM spam. Get a Markdown bullet list ready to paste.

📚 Researchers & archivists

Turn messy reference lists into canonical URLs for citations, bibliographies, or web archive submissions.

🧑‍💻 Developers

Extract URLs from logs or server output, filter by domain, export as JSON array for scripts.

🔒 Privacy

  • Runs entirely in your browser — no server processing
  • No data is sent, stored, or logged anywhere
  • No file uploads — just paste text
  • Your pasted text never leaves your device

Frequently Asked Questions

What does this recipe do?

It chains five operations in one click: extract every http or https URL from your pasted text, remove duplicate URLs, strip common tracking parameters (UTM, fbclid, gclid, mc_cid and 20+ more), normalize the URLs (lowercase host, remove trailing slashes), and sort them alphabetically. You end up with a clean list ready to share, audit, or archive.

What tracking parameters get stripped?

All UTM parameters (utm_source, utm_medium, utm_campaign, utm_term, utm_content), Facebook click IDs (fbclid), Google click IDs (gclid), Mailchimp tracking (mc_cid, mc_eid), Yandex (yclid), Instagram (igshid), Twitter (twclid), Microsoft (msclkid), HubSpot tracking, and common referral params like ref, source, and affiliate_id. The canonical URL is preserved — paths, query params you keep, and fragments stay intact.

What URL formats are detected?

Any URL starting with http:// or https://, including those with paths, query parameters, fragments, subdomains, ports, and international characters. Plain domains without a scheme (e.g. "example.com") are not detected by design, to avoid false positives on text.

Is my data private?

Yes. Everything runs in your browser using JavaScript. No URLs or text are ever sent to any server. Your pasted data never leaves your device.

What export formats are supported?

Plain text (one per line), CSV (with "url" header), JSON array, Markdown bullet list, and HTML anchor list (ready to paste into a webpage).

Can I extract from pasted HTML or source code?

Yes. The tool finds URLs inside any text — HTML source, JavaScript, server logs, chat transcripts, spreadsheet cells, or plain prose. Paste whatever you have.

🔗 Related Tools