Data Cleaning & Transformation Plan

Create a step-by-step data cleaning strategy including deduplication, handling missing values, and validation rules.

Updated June 2026

The prompt

Create a data cleaning plan for {{dataset_name}}:

Data profile:
- Size: {{record_count}} rows, {{column_count}} columns
- Types: {{data_types}}
- Quality issues: {{known_issues}}
- Purpose: {{intended_use}}

Address:
1. Duplicates: {{duplication_problem}} (How should duplicates be identified, and which record should be kept?)
2. Missing values: {{missing_problem}} (What patterns are present? When to impute and when to delete?)
3. Outliers: {{outlier_problem}} (Are they valid values or errors?)
4. Format/type mismatches: {{format_issue}}
5. Validation rules (business logic checks)
6. Lineage & audit trail (Who changed what, and when?)

Provide SQL or pseudocode for each step, plus validation queries to confirm the result.
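
To give a feel for the kind of answer the prompt asks for, here is a minimal sketch of steps 1 to 3 run as SQL through Python's built-in sqlite3 module. The `customers` table, its columns, the sample rows, and the 0 to 120 age range are all hypothetical choices made for illustration. A real plan would use the rules that fit your own dataset.

```python
import sqlite3

# Hypothetical sample rows: (id, email, age, updated_at). Illustrative only.
ROWS = [
    (1, "a@example.com", 34, "2024-01-01"),
    (2, "a@example.com", 35, "2024-03-01"),    # same email as id 1, but newer
    (3, "b@example.com", None, "2024-02-01"),  # missing age
    (4, "c@example.com", 212, "2024-02-15"),   # implausible age
]


def clean(rows):
    """Load rows into an in-memory table, deduplicate, and run validation queries."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE customers (id INTEGER, email TEXT, age INTEGER, updated_at TEXT)"
    )
    conn.executemany("INSERT INTO customers VALUES (?, ?, ?, ?)", rows)

    # Step 1: duplicates. Assumed rule: same email means same customer,
    # and the most recently updated row wins. Rows that tie on updated_at
    # are both kept and would need a further tie-breaker.
    conn.execute(
        """
        DELETE FROM customers
        WHERE EXISTS (
            SELECT 1 FROM customers AS newer
            WHERE newer.email = customers.email
              AND newer.updated_at > customers.updated_at
        )
        """
    )

    # Step 2: missing values. Count them first; impute or delete afterwards.
    missing_age = conn.execute(
        "SELECT COUNT(*) FROM customers WHERE age IS NULL"
    ).fetchone()[0]

    # Step 3: outliers. Flag rows outside the assumed valid range rather than
    # deleting them, so a person can decide whether they are errors.
    invalid_age_ids = [
        row[0]
        for row in conn.execute(
            "SELECT id FROM customers WHERE age NOT BETWEEN 0 AND 120 ORDER BY id"
        )
    ]

    remaining_ids = [
        row[0] for row in conn.execute("SELECT id FROM customers ORDER BY id")
    ]
    conn.close()
    return {
        "remaining_ids": remaining_ids,
        "missing_age": missing_age,
        "invalid_age_ids": invalid_age_ids,
    }


result = clean(ROWS)
```

The sketch reports missing and out-of-range values instead of silently fixing them, which leaves the choice between imputation and deletion to the plan itself.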

Variables

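{{dataset_name}}: Dataset name
{{record_count}}: Number of records
{{column_count}}: Number of columns
{{data_types}}: Data types present
{{known_issues}}: Known quality issues
{{intended_use}}: Intended use
{{duplication_problem}}: Duplication issue
{{missing_problem}}: Missing value issue
{{outlier_problem}}: Outlier issue
{{format_issue}}: Format/type issue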
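
The `{{...}}` placeholders in the prompt can be filled in by hand or with a few lines of code. The following is a minimal sketch, assuming the double-brace syntax shown in the prompt. The helper name `fill_template` and the sample values are hypothetical and not part of the original asset.

```python
import re

# Matches placeholders such as {{dataset_name}}.
PLACEHOLDER = re.compile(r"\{\{(\w+)\}\}")


def fill_template(template: str, values: dict) -> str:
    """Replace every {{name}} in template with values[name].

    Raises KeyError if the template uses a variable that has no value,
    so a half-filled prompt is never sent by accident.
    """
    missing = sorted(set(PLACEHOLDER.findall(template)) - set(values))
    if missing:
        raise KeyError("missing values for: " + ", ".join(missing))
    return PLACEHOLDER.sub(lambda m: str(values[m.group(1)]), template)


# Shortened excerpt of the prompt, with made-up sample values.
template = (
    "Create a data cleaning plan for {{dataset_name}}:\n"
    "- Size: {{record_count}} rows, {{column_count}} columns"
)
filled = fill_template(
    template,
    {"dataset_name": "orders_2024", "record_count": 120000, "column_count": 18},
)
```

Failing on a missing variable is a deliberate choice here: an unfilled `{{...}}` left in the prompt tends to produce a generic answer.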

Details

Author

AI Khazna

Type

prompt
