Key takeaways
Google Research has launched DeepSomatic, an artificial intelligence model designed to transform how scientists detect cancer-causing genetic mutations, potentially reshaping cancer diagnosis and treatment worldwide.
The groundbreaking tool, developed in collaboration with the University of California, Santa Cruz Genomics Institute and Children's Mercy hospital, was announced on October 16, 2025, with findings published in the prestigious journal Nature Biotechnology.
Advanced AI technology tackles complex cancer cases
DeepSomatic employs convolutional neural networks to distinguish between inherited genetic variants and acquired somatic mutations that drive cancer development.
The system converts genomic sequencing data into images, which it then analyzes to differentiate between normal genetic variations, cancer-causing mutations, and sequencing errors.
The tool demonstrates particular strength in identifying insertions and deletions, complex genetic changes that conventional methods often miss.
In testing across multiple cancer types, DeepSomatic achieved a 90% accuracy rate for detecting these challenging mutations using Illumina sequencing data, compared to approximately 80% for existing tools.
With PacBio long-read data, the performance gap widened even further, with DeepSomatic exceeding 80% accuracy while traditional methods scored below 50%.
The AI model has shown promising results in some of the most difficult cancer cases.
When analyzing pediatric leukemia samples—where obtaining normal blood cells for comparison proves particularly challenging—DeepSomatic identified 10 new mutations that had been missed by state-of-the-art tools.
The system also successfully detected key genetic mutations in glioblastoma, an aggressive and often fatal brain tumor.
These discoveries were made possible by training DeepSomatic on the Cancer Standards Long-read Evaluation (CASTLE) dataset, a comprehensive collection of tumor and normal cell samples from breast and lung cancers sequenced across three major platforms: Illumina, PacBio HiFi, and Oxford Nanopore Technologies.
Versatility across multiple cancer research applications
DeepSomatic's adaptability extends beyond traditional tumor-normal paired analyses.
The tool can process whole-genome sequencing, whole-exome sequencing, tumor-only datasets, and even formalin-fixed paraffin-embedded samples, archived tissue specimens that typically present significant analytical challenges but are abundant in pathology departments.
Technical leads Kishwar Shafin and Andrew Carroll from Google Research emphasized the tool's flexibility in working across different sequencing technologies and experimental setups, addressing a critical need in both research and clinical contexts.
Open-source release to accelerate global research
In a move that underscores Google's commitment to advancing cancer research, both DeepSomatic and the CASTLE dataset are being released under open-source licenses.
This decision allows researchers worldwide to access cutting-edge AI technology without financial barriers, potentially accelerating discoveries in cancer genetics and precision medicine.
The open-source approach aligns with Google's broader strategy in genomics.
DeepSomatic builds on the foundation of DeepVariant, Google's widely adopted variant calling tool released in 2018, which contributed to creating the first complete human genome sequence.
The company has now spent a decade developing AI tools for genomics, from accurately reading DNA sequences to predicting gene expression and understanding disease-causing variants.
Implications for precision cancer treatment
The development arrives at a crucial moment for cancer medicine, as clinicians increasingly rely on genetic sequencing of tumor biopsies to inform treatment decisions.
By identifying the specific mutations driving each patient's cancer, doctors can select therapies that target those particular genetic vulnerabilities.
DeepSomatic's ability to detect mutations across diverse cancer types—from breast and lung cancers used in training to entirely different malignancies like brain tumors and leukemia- suggests broad applicability in precision oncology.
The tool's superior performance in detecting complex mutations could reveal treatment targets that current methods overlook.
As cancer research continues to evolve toward personalized medicine, tools like DeepSomatic represent a significant step forward in accurately characterizing tumor genetics.
Read more: