At 9:12 a.m., your single-cell pipeline staggers into work carrying two cursed backpacks - one full of gene expression counts, the other full of chromatin accessibility peaks - and both are leaking mysterious biological confetti onto the floor. By lunch, it has to decide which cells are which, why they behave that way, and whether the loudest genes are actually important or just the dataset equivalent of that bard who never stops singing.
That is the dungeon SEAGALL walks into.
In a Genome Biology paper published on April 23, 2026, Gabriele Malagoli and colleagues introduce SEAGALL - short for Single-cell Explainable Geometry-Aware Graph Attention Learning pipeLine - to explain how molecular features shape cell identity across single-cell omics data (Malagoli et al., 2026). The party build is pretty good: a geometry-regularized autoencoder to place cells into a more trustworthy latent map, then an explainable graph attention network to learn labels and reveal which features matter most.
Meet the Adventuring Party
First, a quick monster manual entry. ATAC-seq measures which parts of DNA sit open and accessible - basically, which doors in the genome are unlocked enough for regulatory proteins to walk through (Wikipedia: ATAC-seq). Pair that with single-cell RNA-seq, which tells you which genes are being expressed, and you get a richer picture of cell state. You also get noise, sparsity, and enough dimensions to make Euclid file a formal complaint.
SEAGALL’s first character class is the geometry-regularized autoencoder. Its job is not just compression. It tries to preserve the shape of the data while squeezing cells into a latent space, so nearby cells in biology stay nearby in the model. Think of it as the party cartographer who draws a map that still has roads instead of turning the kingdom into spilled spaghetti.
Then comes the graph attention network. In graph neural networks, cells become nodes and relationships become edges. A graph attention network does not treat every neighbor equally - it learns which neighboring cells deserve more attention (Wikipedia: Graph neural network). If a plain graph model is the guard who questions everyone at the tavern the same way, attention is the rogue who notices one suspicious traveler covered in dragon ash.
According to the paper’s abstract, SEAGALL uses explainable AI to identify the features driving predictions and extracts “specific and stable signatures” that go beyond ordinary differential marker genes (Malagoli et al., 2026). Translation: instead of just shouting “this gene is higher over here,” it tries to answer the more useful question: which molecular clues consistently matter for this cell identity, and how much?
Why This Boss Fight Matters
Single-cell biology has a classic side quest problem. You can measure a ridiculous amount of stuff, but connecting it into an explanation is harder than it looks. Marker genes help, but they can be noisy, coverage-biased, or too blunt for subtle regulatory states. SEAGALL is aiming for something more like a detective campaign than a box-score summary.
That fits a broader trend. A 2025 review in Briefings in Bioinformatics surveyed 107 applications of graph neural networks across single-cell omics, showing how fast this design pattern is spreading (Li and Hua, 2025). Tools such as DeepMAPS used heterogeneous graph transformers to infer cell-type-specific biological networks from multi-omics data (Ma et al., 2023), while HyGAnno applied hybrid graph neural networks to cell-type annotation in single-cell ATAC-seq (Zhang et al., 2024). Cellograph took graph methods into multi-condition single-cell RNA-seq analysis (Molaei et al., 2024).
SEAGALL’s angle is that it wants both performance and explanation. That is a good instinct, because in biology, “the model got the label right” is often the opening scene, not the finale. The actual treasure chest is mechanistic insight.
Roll for Real-World Impact
If SEAGALL holds up in broader benchmarking and reproducible downstream studies, it could help researchers spot more reliable regulatory signatures in development, disease, and treatment response. That matters in fields where cell identity is slippery - cancer, immunology, neurobiology, all the fun places where cells keep changing costumes backstage.
It also lands at a moment when the ecosystem is scaling fast. Arc Institute launched its Arc Virtual Cell Atlas in June 2025 with data from more than 300 million cells, and Illumina announced Illumina Connected Multiomics on January 6, 2026 to analyze multimodal datasets at scale. In other words, the dungeon is getting bigger, not smaller. Tools that can explain their reasoning are no longer a luxury item dropped by a rare miniboss.
If you need to sketch these cell-state relationship graphs for actual humans, something like mapb2.io would make more sense than drawing arrows on a napkin like a wizard who lost access to PowerPoint.
The catch, of course, is that explainable AI in biology still has to survive the usual traps: noisy measurements, batch effects, fragile preprocessing, and the eternal menace of models that look insightful while merely rediscovering technical artifacts. The paper is also currently available as an early-access, unedited version, so the final polished article may clarify some details.
Still, SEAGALL feels like a sensible party composition for a hard campaign: preserve geometry, build a cell graph, apply attention, and then actually ask the model to explain itself. In AI-for-biology, that last move is less a bonus action and more the whole quest.
References
Malagoli G, Hanel P, Danese A, Wolf G, Colomé-Tatché M. Geometry-aware graph attention networks to explain single-cell chromatin states and gene expression with SEAGALL. Genome Biology. 2026. DOI: 10.1186/s13059-026-04066-2
Li S, Hua H. Graph neural networks for single-cell omics data: a review of approaches and applications. Briefings in Bioinformatics. 2025;26(2):bbaf109. DOI: 10.1093/bib/bbaf109
Ma A, McDermaid A, Xu J, et al. Single-cell biological network inference using a heterogeneous graph transformer. Nature Communications. 2023;14:679. DOI: 10.1038/s41467-023-36559-0
Zhang W, Cui Y, Liu B, et al. HyGAnno: hybrid graph neural network-based cell type annotation for single-cell ATAC sequencing data. Briefings in Bioinformatics. 2024;25(3):bbae152. DOI: 10.1093/bib/bbae152. PubMed: 38581422
Molaei S, et al. Cellograph: a semi-supervised approach to analyzing multi-condition single-cell RNA-sequencing data using graph neural networks. BMC Bioinformatics. 2024;25:25. DOI: 10.1186/s12859-024-05641-9
Wikipedia contributors. ATAC-seq. Wikipedia. https://en.wikipedia.org/wiki/ATAC-seq
Wikipedia contributors. Graph neural network. Wikipedia. https://en.wikipedia.org/wiki/Graph_neural_network
Arc Institute. Arc Virtual Cell Atlas launches, combining data from over 300 million cells. Published June 2025. https://arcinstitute.org/news/news/arc-virtual-cell-atlas-launch
Illumina. Illumina launches powerful software for connected, intuitive, and scalable multiomic analysis. Published January 6, 2026. https://www.illumina.com/company/news-center/press-releases/2026/97798370-799c-4e86-b76d-7ef38b42766f.html
Disclaimer: This blog post is a simplified summary of published research for educational purposes. The accompanying illustration is artistic and does not depict actual model architectures, data, or experimental results. Always refer to the original paper for technical details.