Overview
The AI readiness score is a weighted composite score (0-100) that measures how suitable a dataset is for AI/ML applications. It evaluates 7 quality dimensions, each contributing to the overall score based on its relative importance for AI/ML workloads.Looking for the full methodology, standards alignment, and known limits? See the AI Readiness methodology page — it documents what we measure, what we deliberately don’t measure, and how we map to NIST AI RMF v1.0, ISO/IEC 25012, and ISO/IEC 5259.
The 7 dimensions
Each dimension targets a different category of AI/ML risk. Together they form a single composite score per file.- Completeness — proportion of non-null values across all columns.
- Consistency — format uniformity and pattern adherence within columns.
- Referential integrity — absence of orphaned cross-column references and severely malformed values.
- Compliance — GDPR readiness and sensitive data handling.
- Uniqueness — absence of duplicate rows and redundant records.
- Schema quality — column naming conventions, type consistency, structural integrity.
- Stability — data drift and distribution changes (when historical data is available).
Completeness
Completeness
One of the most heavily weighted dimensions. Measures the proportion of non-null values across all columns. Datasets with substantial missing values in critical columns score significantly lower than those with complete data.
Consistency
Consistency
Evaluates whether values within a column follow consistent formats. For example: mixed date formats (2024-01-15 vs 01/15/2024), inconsistent casing, or varying phone number formats.
Referential integrity
Referential integrity
Detects orphaned cross-column references (values pointing at records that no longer exist) and severely malformed values that break the column’s expected format. Maps to ISO/IEC 25012’s credibility characteristic — not the standard’s accuracy characteristic, which would require comparison against a true reference value the platform does not have.
Compliance
Compliance
Checks for GDPR-sensitive columns (emails, phone numbers, national IDs, health data) and evaluates whether appropriate handling is in place. Considers the GDPR toggle setting and data retention policy.
Uniqueness
Uniqueness
Measures the proportion of unique rows. Exact duplicate rows reduce this score. Near-duplicates may also be flagged.
Schema quality
Schema quality
Evaluates column naming conventions (consistency, descriptiveness), type detection accuracy, and structural integrity (e.g., mixed types within a single column).
Stability
Stability
When historical scores exist, stability measures how much the data distribution has changed between scans. Stable data that doesn’t drift unexpectedly scores higher. On the first scan, stability defaults to a neutral baseline.
How dimensions combine
The seven dimensions are combined into a single 0-100 score per file using a weighted average. Weights reflect AI/ML use-case priorities — for example, completeness and compliance carry more weight than uniqueness, because missing values and unhandled PII are more catastrophic for AI training than duplicate records. Weights are not fixed across all use cases. ORCA carries a small set of named profiles (classification, regression, NLP, time-series, anomaly detection) that re-balance the weights to reflect what matters most for that family of model — a time-series profile gives more weight to stability than a classification profile does, because temporal drift dominates time-series risk. The exact weight values, the thresholds each dimension applies internally, and the per-issue penalty schedules are implementation details we do not publish. For the full methodology, standards alignment, and limits, see the AI Readiness methodology page.Grade scale
| Grade | Score range | Meaning |
|---|---|---|
| A | 90-100 | Excellent — production-ready for AI/ML use cases |
| B | 75-89 | Good — eligible for an assessment report; some improvements recommended |
| C | 60-74 | Fair — significant issues need addressing before training |
| D | 40-59 | Poor — major remediation required |
| F | 0-39 | Failing — fundamental data quality problems |
Assessment
Datasets scoring at Grade B or higher are eligible for an AI readiness assessment report. Assessment reports include:- Overall score and grade at time of issuance
- Per-dimension breakdown
- SHA-256 verification hash for authenticity
- Public verification URL
Use-case readiness
Beyond the overall score, ORCA evaluates readiness for 8 specific ML use cases:- Churn prediction — requires customer identifiers, temporal data, engagement metrics
- Fraud detection — requires transaction data, amounts, timestamps, categorical flags
- Recommendation engine — requires user-item interactions, ratings or implicit signals
- Customer segmentation — requires demographic and behavioral features
- Demand forecasting — requires time series data, quantities, seasonal indicators
- Sentiment analysis — requires text fields with sufficient length and variety
- Price optimization — requires pricing data, competitor info, demand signals
- Risk scoring — requires financial data, credit indicators, outcome labels