Research & Development World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • R&D Index
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE

Drowning in variants? Mount Sinai tool adds phenotype filter

By Brian Buntz | December 23, 2025

llustrative image of a genetics lab workstation." Credit: "Image: Mount Sinai Health System"

llustrative image of a genetics lab workstation.” Credit: “Image: Mount Sinai Health System”

A research team at the Icahn School of Medicine at Mount Sinai has built a machine-learning variant scoring system that combines a general “pathogenic vs. benign” estimate with broad phenotype-linked probabilities, with the goal of narrowing the list of candidate disease variants returned by sequencing.

The tool, Variant-to-Phenotype (V2P), is described in “Expanding the utility of variant effect predictions with phenotype-specific models,” published Nov. 28, 2025 in Nature Communications. The news was also the focus of a Mount Sinai news release.

Why phenotype context matters

Clinical sequencing can produce thousands of variants to triage. Many computational predictors focus on a single output: how likely a variant is to be damaging in general. V2P conditions its predictions on phenotype groupings from the Human Phenotype Ontology (HPO), so the output can be used to prioritize variants that match a patient’s presentation, a common bottleneck in rare-disease workups.

How V2P works

V2P is an ensemble, multi-task model that ingests variant annotations spanning gene-level features (disease and pathway associations), protein-sequence and structure features, protein interaction network features, and variant-level measures such as conservation.

Its output comprises 24 values from 0 to 1: one general pathogenicity probability plus 23 probabilities tied to first-level “phenotypic abnormality” classes in HPO (examples in the paper include abnormalities of the nervous system and neoplasms).

For training, the authors report using 252,125 pathogenic variants from HGMD and 244,231 putatively benign variants from gnomAD v2.1.1 exomes, spanning 6,620 genes.

Benchmarking highlights

To test whether the model aligns with experimental evidence, the authors compare predictions with results from deep mutational scanning assays across dozens of proteins, and with massively parallel reporter assay (MPRA) datasets for regulatory elements. They report V2P achieves the highest median correlation across the MPRA sets they tested, with performance in the same range as CADD and above FATHMM on the reported comparisons.

To approximate a diagnostic workflow, the team ranked variants in patient exomes using the patient’s pathology alongside V2P phenotype scores, including an evaluation in 116 exomes from patients with rare immune disorders.

In the Mount Sinai Health System release, the team also says V2P often placed the true disease-causing variant among the top 10 candidates in tests on de-identified patient data.

R&D angle: from diagnosis to targets

Beyond case review, the researchers argue phenotype-conditioned scoring could help connect genes and pathways to disease classes, information that can be useful when selecting targets or prioritizing mechanisms for follow-up.

The phenotype outputs are intentionally broad (23 top-level HPO categories), and the authors note that limited labeled data makes finer-grained, supervised phenotype prediction difficult today.

They also discuss evaluation pitfalls such as circularity between training and test data as a general issue in this area. The paper says the team precomputed V2P scores for all possible SNVs in hg38 and for gnomAD indels, and they provide a web framework for scoring user-specified variants.

Related Articles Read More >

Pancreas or pancreatic cancer with organs and tumors or cancerous cells 3D rendering illustration with male body. Anatomy, oncology, disease, medical, biology, science, healthcare concepts.
AI tool used to detect pancreatic cancer in routine CT scans in China 
Drones help diagnose deadly whale viruses 
Oral Wegovy approval charts a new course for blockbuster drug development
Los Alamos’ R&D 100-winning EpiEarth platform helps predict the global outbreaks early
rd newsletter
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest info on technologies, trends, and strategies in Research & Development.
RD 25 Power Index

R&D World Digital Issues

Fall 2025 issue

Browse the most current issue of R&D World and back issues in an easy to use high quality format. Clip, share and download with the leading R&D magazine today.

R&D 100 Awards
Research & Development World
  • Subscribe to R&D World Magazine
  • Sign up for R&D World’s newsletter
  • Contact Us
  • About Us
  • Drug Discovery & Development
  • Pharmaceutical Processing
  • Global Funding Forecast

Copyright © 2026 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search R&D World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • R&D Index
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE