Research & Development World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • Call for Nominations: The 2025 R&D 100 Awards
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
    • Explore the 2024 R&D 100 award winners and finalists
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • R&D Index
    • Subscribe
    • Video
    • Webinars
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE

When data goes missing: How poor data management can undermining research reproducibility

By Brian Buntz | May 22, 2025

Detailed view of a PCR testing kit for SARS-CoV-2 with an epidemiologist in protective gear analyzing samples to detect specific viral areas causing COVID-19 pneumonia --ar 16:9 --style raw --v 6.1 Job ID: 7e698e8b-3ac0-4058-9c69-cb803819f39e

[Adobe Stock]

Imagine you’re midway through a time-sensitive lab procedure when a Slack ping hits your laptop: “Hey, do you still have those high-res .tiff files from last quarter’s microscopy experiments? A journal editor says he needs them by the end of the day. Sorry, I didn’t let you know sooner.”

You don’t recall where they are. You freeze up, and then pull yourself together. Maybe they are on the very laptop you are using. You fire a few plausible-sounding names into a string of search queries and get back. No results.

A longstanding challenge

Such searches are not uncommon. And the problem has persisted for years. When Molecular Brain editor-in-chief Tsuyoshi Miyakawa asked 41 manuscript authors for raw data underlying their results, 97% couldn’t deliver, he wrote in 2020. The authors of 21 of 41 withdrew them rather than provide raw data. Of the remaining 20, Miyakawa rejected 19 for insufficient data quality.

Tired of data chaos in your lab?

Learn a practical “Crawl-Walk-Run” blueprint to unify R&D data, strategies for getting an ROI for your data efforts, and pave the way for advanced automation and AI. Join experts from Labcorp, Parallel Bio, Pfizer, and the ISS National Lab in our upcoming webinar.

Wednesday, June 11, 2025 | 2:00 PM EDT

Register for the Free Webinar Here

Even when data is available, replication can fail. A $2-million, 8-year effort to replicate influential cancer studies found that fewer than half stood up to scrutiny. The Reproducibility Project: Cancer Biology originally planned to repeat 193 experiments from 53 high-profile papers, but couldn’t proceed. Barriers included uncooperative authors and vague protocols, which forced researchers to complete just 50 experiments at an average cost of $53,000 and 197 weeks per study.

The problem extends beyond life sciences. A 2011 study published in PLOS One, focusing on articles within that journal, found that in chemistry, 0% of articles made data publicly available, though 5.7% indicated data was available on request. In physics, results were even lower: none of the articles with original data made the data publicly available in a repository. The study’s authors noted that physics journals “do little to share research data in a systematic way, at least in the top journals by impact factor,” and that some authors may “print many graphics that summarize the research data, but do not provide direct access to the underlying data.” Meanwhile, a 2016 Nature survey found that more than “70% of researchers have tried and failed to reproduce another scientist’s experiments, and more than half have failed to reproduce their own experiments.”

Some signs of improvement

In the positive column, the recently published FAIR Business Survey suggests that some R&D heavy organizations are getting better at treating data as a reusable asset rather than a byproduct of research. The Pistoia Alliance surveyed 36 life-science organizations and conducted follow-up interviews with 12 companies to understand why they’re investing in FAIR (Findable, Accessible, Interoperable, Reusable) data principles. The results reveal four primary business drivers: “trusted data” that enables AI capabilities and ensures compliance; “cost savings” through reduced duplication and higher resource productivity; “speed” improvements in time-to-market and decision-making; and “effectiveness” gains that unlock insights and innovations previously impossible with fragmented datasets.

The companies seeing the biggest returns started their FAIR journeys five or more years ago, and are “now realizing tangible benefits.” These more mature organizations tend to have “high-level executive support,” and more mature “data management processes” and “enhanced operational efficiency,” to go with it.

One organization cut data search time from three days to hours. Another “shortened the duration of clinical trials.” Several respondents noted that FAIR data management eliminated the manual curation work.

Yet the survey also revealed a persistent challenge: the biggest barrier isn’t technical infrastructure but cultural. Improving data maturity requires “a fundamental mindset change within the business is required to realize the full value of data-driven transformation,” the report concluded.

Related Articles Read More >

5 R&D jobs that may be lost to AI and 5 that it could create
Dinner plate-sized chips with trillions of transistors could give traditional GPUs a run for their money
FDA’s AI tool Elsa signals new era for regulatory review, says QuantHealth CEO
Sonar Screen For Submarines And Ships. Radar Sonar With Object On Map. Futuristic HUD Navigation monitor
Pentagon places big bets on frontier AI, quantum sensing and next-gen avionics in nearly $3 billion in defense technology contracts 
rd newsletter
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest info on technologies, trends, and strategies in Research & Development.
RD 25 Power Index

R&D World Digital Issues

Fall 2024 issue

Browse the most current issue of R&D World and back issues in an easy to use high quality format. Clip, share and download with the leading R&D magazine today.

Research & Development World
  • Subscribe to R&D World Magazine
  • Enews Sign Up
  • Contact Us
  • About Us
  • Drug Discovery & Development
  • Pharmaceutical Processing
  • Global Funding Forecast

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search R&D World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • Call for Nominations: The 2025 R&D 100 Awards
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
    • Explore the 2024 R&D 100 award winners and finalists
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • R&D Index
    • Subscribe
    • Video
    • Webinars
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE