prompt

Regex Data Cleaning Pattern Library

Build reusable regex patterns and cleaning functions for common {{data_type}} quality issues.

Updated June 2026
The prompt
Create a regex pattern library for {{data_type}} cleaning:

1. Common malformats:
   - Phone numbers (extra spaces, country codes, dashes)
   - Email addresses (+ addressing, domains)
   - URLs (trailing slashes, query params)
   - Dates (format variations, timezones)
   - Postal codes (spaces, hyphens)

2. For each pattern, provide:
   - Regex expression (with comments)
   - Python / SQL implementation
   - Test cases (valid & invalid examples)
   - Caveats (locale-specific issues, false positives)

3. Edge cases:
   - Null handling
   - Encoding issues (UTF-8)
   - Special characters

Format as code snippet library (Python function or SQL UDF).
Did it work? Rate this prompt

Variables

Field type (e.g. 'phone numbers', 'email addresses', 'URLs')

Details

Author

AI Khazna

License

Security

Type

prompt

Related assets

More curated picks in Data & Analytics.

Audit before you install

Run any source through our checks - AI visibility, security, performance, and stack detection.

More in Data & Analytics