Drop your VCF file here
or click to browse · .vcf format
SteppeDNA
Multi-Gene HR Variant Classifier • 120 Features from 8 Data Sources • ESM-2 Protein LM Embeddings • SHAP Explained
Overall AUC (0.985) is weighted by gene representation. BRCA2 comprises 52% of the test set. Evaluated on a held-out 20% stratified test split.
Non-BRCA2 temporal AUCs (0.51–0.61) indicate limited generalization over time for data-scarce genes.
or click to browse · .vcf format
Prediction based on EVE, MAVE, PhyloP, ESM-2, SpliceAI, AlphaFold 3D structure, gnomAD frequencies, and 120 engineered features.
| Gene | SteppeDNA | Best SOTA | Winner |
|---|---|---|---|
| BRCA2 | 0.994 | 0.949 (BayesDel) | SteppeDNA |
| RAD51D | 0.824 | 0.461 (REVEL) | SteppeDNA |
| RAD51C | 0.785 | 0.703 (CADD) | SteppeDNA |
| BRCA1 | 0.747 | 0.646 (BayesDel) | SteppeDNA |
| PALB2 | 0.605 | 0.732 (REVEL) | REVEL |
* All tools evaluated on SteppeDNA’s held-out test set (methodological advantage). REVEL/BayesDel/CADD scored 72–73% of variants. Independent benchmark AUC: 0.750–0.801 (no training overlap).
Per-gene SOTA AUCs evaluated on SteppeDNA v5.4 test set. Scores sourced from dbNSFP; tools that could not score a variant were excluded from that gene's comparison.
AlphaMissense (Cheng et al. 2023) was partially trained on ClinVar labels, creating indirect label circularity. Ablation testing revealed that removing AlphaMissense features improved AUC for 3 of 5 genes. AlphaMissense was removed in v5.4.
Ablation study conducted with XGBoost-only on the pre-v5.4 model configuration. Per-gene AUC values differ from the final v5.4 XGBoost+MLP ensemble.
Proxy assignment by highest gnomAD sub-population AF — not self-reported ancestry. 84% of variants had no assignable population. Small sample sizes limit interpretation.
| Population | Variants meeting PM2 | Difference vs NFE |
|---|---|---|
| EAS | 3,753 | +245 |
| AMR | 3,713 | +205 |
| AFR | 3,684 | +176 |
| NFE | 3,508 | — |
PM2 (absent from population databases) is triggered more often for non-European populations due to lower representation in gnomAD. This can inflate pathogenicity evidence for underrepresented groups.
| Variant | Gene | SteppeDNA | REVEL | CADD | Freq (KZ) |
|---|---|---|---|---|---|
| p.Cys61Gly | BRCA1 | 0.991 | 0.948 | 24.5 | 0.004 |
| p.Met1Thr | BRCA1 | 0.991 | — | 25.0 | 0.006 |
| 5382insC | BRCA1 | Frameshift — N/A | — | 0.035 | |
| c.5278-2del | BRCA1 | Splice site — N/A | — | 0.008 | |
| c.4035del | BRCA1 | Frameshift — N/A | — | 0.005 | |
| c.9409dup | BRCA2 | Frameshift — N/A | — | 0.012 | |
| c.9253del | BRCA2 | Frameshift — N/A | — | 0.008 | |
CADD scores are on PHRED scale (not directly comparable to probability scores). REVEL and BayesDel evaluate missense variants only; SteppeDNA requires amino acid input and cannot score frameshifts or splice variants. CADD can score all variant types but scores for indel/splice founders were not retrieved.
| # | Type | HGVS | cDNA | AA Change | Mutation | Prediction | Probability |
|---|
Multi-gene HR Deep Neural Network & XGBoost ensemble trained on 19,000+ ClinVar & gnomAD HR missense variants with isotonic probability calibration. Resolves 5 genes seamlessly.
BLOSUM62 substitution scores, ESM-2 protein language model embeddings, PhyloP conservation, MAVE functional assays, EVE evolutionary scores, SpliceAI, AlphaFold 3D structures, and gnomAD population frequencies.
Every prediction shows which features pushed it toward pathogenic or benign, using SHAP values extracted from the XGBoost model.
Research-grade variant classification for missense mutations in 5 Homologous Recombination DNA repair genes (BRCA1, BRCA2, PALB2, RAD51C, RAD51D). Intended as a decision-support tool, not a standalone diagnostic.
19,223 variants: 18,738 from ClinVar + 485 gnomAD proxy-benign. Per gene: BRCA2 (10,085) | BRCA1 (5,432) | PALB2 (2,621) | RAD51C (675) | RAD51D (410). 60/20/20 split with gene × label stratification.
XGBoost + Multi-Layer Perceptron blended ensemble with gene-adaptive weights and isotonic calibration trained on a held-out calibration set. 120 engineered features from 8 data sources.
ROC-AUC: 0.985 · MCC: 0.928 · Balanced Accuracy: 96.5%. 10-fold CV: 0.9858 ± 0.0021. Outperforms REVEL (0.725), BayesDel (0.721), CADD (0.539) on same test set.*
* Evaluated on SteppeDNA test set. General-purpose tools not trained on same distribution.