Overview
A correction pipeline is a saved sequence of cleanup steps — null fills, format standardization, deduplication, custom transformations — that you can apply to a file in one shot and reuse across other files later. If auto-remediation is “fix this one file now” and the Fix Inbox is “approve AI-suggested fixes as they appear,” correction pipelines are the missing third piece: the cleanup recipes you’ve already decided work, ready to run on demand.When to use a pipeline
Use a pipeline when:- You have a recurring file format (monthly transaction exports, weekly sales dumps) that always needs the same cleanup
- You’ve manually fixed the same set of issues on three or more files and want to stop repeating yourself
- You want to apply the same correction sequence to a different file and be able to roll it back
Pipeline anatomy
Every pipeline consists of:| Field | Description |
|---|---|
| Name | Human-readable label (e.g. “Monthly orders cleanup”) |
| File ID | The file the pipeline was authored against (used for the column schema) |
| Status | active (runnable) or archived |
| Steps | Ordered list of correction actions |
| Version | Auto-incremented on every edit so you can roll back |
- A correction type (null fill, format fix, dedupe, regex replace, custom)
- Configuration (which column, which strategy, parameters)
- Status —
proposed(suggested by AI, awaiting approval),active(approved, will run),applied(already run),rejected - Source —
auto(engine-generated) oruser(manually added) - Confidence score (only for AI-generated steps)
Creating a pipeline
There are two ways to create a pipeline:From the Fix Inbox (recommended)
The fastest path. The Fix Inbox already groups AI-suggested fixes by file into pipeline-shaped objects. Approve the steps you want, optionally add manual ones, and save.Open Fix Inbox
Navigate to Fix Inbox in the sidebar. Each file with proposed fixes appears as a draft pipeline.
Review and approve steps
Inspect each proposed step’s before/after preview. Approve the ones you trust, reject the ones you don’t.
From scratch (Correction Wizard)
For files where you want to author the steps by hand:Add steps
Add corrections one at a time. The wizard suggests strategies based on detected issues, but you can override every choice.
Running a pipeline
Once a pipeline is saved, you can apply it to any file with a compatible schema.Versioning and rollback
Every edit to a pipeline creates a new version. The pipeline detail page shows the version history with timestamps and a diff between versions. To roll back:Bulk approve and reject
For pipelines with many proposed steps (common after the AI suggester runs), bulk actions speed things up:API reference
Tips
- Name pipelines after the file pattern, not the issue. “Monthly orders cleanup” beats “Null fills” — when you have ten pipelines, you’ll search by what they apply to, not what they do.
- Start by promoting Fix Inbox approvals. The fastest way to your first pipeline is to clean up one representative file in the Fix Inbox and save the result.
- Use preview liberally. Pipelines are reusable, which means a bad step gets repeated N times. Always preview against a fresh file before scheduled apply.
- Pair with contracts. Once a pipeline reliably produces clean output, define a contract that enforces the post-cleanup state — so any future file that needs the same pipeline trips a contract violation if you forget to run it.
- Archive, don’t delete. Setting
status = archivedkeeps the version history available for audit while removing the pipeline from active lists.
What’s next?
- Auto-remediation — one-shot file cleanup without saving a pipeline
- Fix Inbox — the suggestion queue that feeds new pipelines
- Data contracts — enforce the cleaned-up state going forward