Forecasting Breast Cancer Risk From a Pink-and-Purple Slide

When the forecast says “possible storm,” you do not want a poet with a barometer - you want the best possible clue about whether to bring the umbrella, cancel the picnic, or hide indoors with soup.

That is roughly the job of the OncotypeDX recurrence score in certain early breast cancers. For people with hormone receptor-positive, HER2-negative breast cancer, this 21-gene test helps doctors estimate recurrence risk and decide whether chemotherapy is likely to help. Think of it like a tumor weather report: not perfect prophecy, but useful enough that treatment plans may change.

The catch? Genomic tests can be expensive, slow, and hard to access in some health systems. Meanwhile, almost every tumor already gets a hematoxylin and eosin slide, the classic pink-and-purple microscope image pathologists inspect every day. So Cohen, Shamai, Sabo, and colleagues asked a very practical question: can AI read that ordinary slide and estimate the OncotypeDX score without waiting for the gene test? Their 2026 study in npj Breast Cancer says: maybe, and the “maybe” is doing some real work here [1].

The Slide Already Knows More Than It Says

OK, think of a pathology slide like a classroom drawing made by the tumor. It shows cell shapes, crowding, tissue structure, immune cells, messy nuclei, and all sorts of microscopic behavior. A pathologist sees many of these patterns. A deep learning model sees pixel patterns at absurd scale, because apparently computers enjoy homework measured in gigapixels.

The researchers used whole slide images, meaning huge digital scans of tissue slides. Since these images are too large to feed into a model all at once, the model chops them into small tiles, kind of like turning a giant mural into puzzle pieces. Then it learns how the pieces add up to a patient-level prediction.

Here is the neat part: the team started with Prov-GigaPath, a pathology foundation model pre-trained on 171,189 slides using self-supervised learning [1,2]. Think of self-supervised learning like giving a kid a mountain of picture books and asking them to notice patterns before anyone quizzes them. No one has to label every tiny cell. The model learns general “pathology vision” first, then gets fine-tuned for the OncotypeDX task.

The Numbers, Without the Lab Coat Fog Machine

The study included 4,227 patients across five cohorts from Israel, the United States, and Australia. Three cohorts were external validation sets, which matters because AI models can look brilliant at home and then fall apart the moment they meet a new scanner, staining protocol, or hospital workflow. Very relatable. I, too, perform worse under fluorescent lighting.

On two external cohorts with OncotypeDX scores, the model achieved AUCs of 0.836 and 0.817 for identifying high genomic risk, using recurrence score 26 or higher as the cutoff [1]. AUC is a ranking score: 1.0 is perfect separation, 0.5 is coin-flipping in a lab coat.

The model was especially tuned to identify low-risk patients safely. It labeled 22% and 16.3% of patients as low-risk in the two external cohorts, with sensitivity of 0.97 in both, and negative predictive values of 0.97 and 0.96 [1]. In plain English: when the model said “this looks low-risk,” it was usually right in these datasets.

The researchers also checked outcomes. Patients classified as high-risk by the model had worse prognosis than those classified as low-risk, with hazard ratios of 4.1 and 2.0 in two external outcome cohorts [1]. That does not mean the AI has magical cancer goggles. It means the slide seems to contain visual clues connected to biology and future risk.

Why This Is Not Just “AI Does Microscope Stuff”

This work sits inside a larger wave of computational pathology. Recent studies have used whole-slide images and clinicopathologic data to predict breast cancer recurrence risk [3], benchmarked pathology foundation models for recurrence-related prediction [4], and reviewed the fast-growing world of multimodal pathology foundation models [5]. The field is moving from “can AI spot tumor?” to “can AI estimate molecular behavior, prognosis, and treatment-relevant risk?”

That is a much harder trick. It is one thing to say “there is cancer here.” It is another to infer something like gene-expression-derived recurrence risk from cell architecture. That is like guessing a restaurant’s supply chain by looking at the soup. Possible? Sometimes. Easy? Absolutely not.

Where the Grown-Ups Still Need to Be Annoying

This model should not be treated as a replacement for OncotypeDX tomorrow morning. The authors are clear that digital pathology still needs slide scanners, storage, computation, quality control, and clinical validation [1]. Also, interpretability remains tricky. The model may find real patterns, but doctors need to know when to trust it, when to ignore it, and what failure looks like.

There is also the problem of boundaries. The study focused on hormone receptor-positive, HER2-negative breast cancer. Models trained for one setting can behave badly when dragged into another, like a toaster asked to do taxes.

Still, the promise is easy to understand. If future studies reproduce these results, an AI slide model could become a fast triage tool: flag patients very likely to be low-risk, prioritize genomic testing for uncertain cases, and expand access in places where gene assays are too costly or slow. Think of it like a first weather alert, not the final climate report.

And honestly, that is a good role for medical AI: not replacing the doctor, not pretending to be a crystal ball, but helping sort the pile faster and more consistently. A very fancy assistant with no coffee breaks, questionable bedside manner, and a deep emotional attachment to image tiles.

References

Cohen S, Shamai G, Sabo E, et al. “Prediction of OncotypeDX recurrence score using hematoxylin and eosin-stained whole slide images.” npj Breast Cancer (2026). DOI: 10.1038/s41523-026-00937-w. PMID: 42115215
Xu H, et al. “A whole-slide foundation model for digital pathology from real-world data.” Nature (2024). DOI: 10.1038/s41586-024-07441-w
Goyal M, Marotti JD, Workman AA, et al. “A multi-model approach integrating whole-slide imaging and clinicopathologic features to predict breast cancer recurrence risk.” npj Breast Cancer 10, 93 (2024). DOI: 10.1038/s41523-024-00700-z
Kaczmarzyk JR, Van Alsten SC, Cozzo AJ, et al. “Towards interpretable prediction of recurrence risk in breast cancer using pathology foundation models.” npj Digital Medicine (2026). DOI: 10.1038/s41746-025-02334-2
“Multi-Modal Foundation Models for Computational Pathology: A Survey.” arXiv: 2503.09091

Disclaimer: This blog post is a simplified summary of published research for educational purposes. The accompanying illustration is artistic and does not depict actual model architectures, data, or experimental results. Always refer to the original paper for technical details.