Research & Development World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE

University of Pennsylvania Applies SPSS Data and Text Mining to Accelerate Disease Research

By R&D Editors | January 31, 2003

The University of Pennsylvania’s Abramson Family Cancer Research Institute has adopted SPSS’s integrated data mining and text mining solution. The Abramson Family Cancer Research Institute will be using Clementine and LexiQuest Mine solutions to analyze disease processes, with the objective of improving patient outcomes.

“The goal of our research is to understand how the interaction of genetic and environmental factors turns a genetic predisposition into active disease,” said Dr. Michael N. Liebman, the institute’s director of Computational Biology and BioMedical Informatics. “Based on our findings, we want to develop a methodology to improve patient care and gain a better understanding of the process of developing new diagnostics and therapies.”

Advances in analytical software have redefined research methodologies for the life sciences. As scientists have leveraged new technologies for capturing and evaluating large volumes of data, the speed and accuracy of their research has increased. Dr. Liebman selected SPSS’s data mining and text mining technology because the ability to combine both types of data provides the best chance for advances in research.

Dr. Liebman and his colleagues are using the Clementine data mining workbench to help identify early risk and clinical factors that could lead to more effectively diagnosing and treating cancer. The research is focused on discovering the relationships between factors that determine the onset of a disease.

By using these computational models, the institute can assess the impact of different therapeutic interventions. As Dr. Liebman’s team discovers successful processes for treating cancer, they will convert these methods into applications that can be generalized so that other researchers may use them. “We want to build an infrastructure that will enable us to automate the process and apply it to other diseases,” said Dr. Liebman.

“The combination of Clementine’s modeling techniques and its outstanding flexibility for integrating different types of data — biochemical, genetic, clinical, and family history data, among others — makes it the perfect solution for scientific research, where predictive analysis can be utilized for pattern discovery and recognition,” said Dr. Petra Scheffer, senior marketing manager for SPSS Inc.’s science practice.

SPSS Inc.’s text mining solution, LexiQuest Mine, is being used to quickly extract and analyze important concepts contained in thousands of scientific articles and patents in order to better understand the relationships between diseases. Since the intelligence is presented graphically, LexiQuest Mine facilitates easy, efficient discovery of obscure or emerging patterns hidden within massive amounts of information. Dr. Liebman’s long-term goal is to use LexiQuest Mine to build an ontology of disease progression.

“Previously, we could not process all existing disease research literature in a uniform manner,” explained Dr. Liebman. “Through its visual-map interface, LexiQuest Mine provides us with insight that goes beyond standard search tools. By leveraging text mining, we’re confident that we will discover previously overlooked relationships and make observations about issues that haven’t been fully explored in the past.”

With the integration of Clementine and LexiQuest Mine, Dr. Liebman and his researchers can access text mining from the Clementine user interface and incorporate text data into the regular flow of the data mining process. The Clementine-LexiQuest Mine combination will assist his researchers with uncovering unknown patterns and associations, and enable them to make better predictions based on all of the available information. According to Dr. Liebman, “Major research advances won’t be coming from just genomics, they’ll arise from the clinical side, as well.”

Related Articles Read More >

Abstract of modern high tech internet data center room with rows of racks with network and server hardware. 3d rendering
A startup says it found hidden memory behavior in NVIDIA GPUs and is building a security layer around it
Bioptimus launches massive patient data atlas to train its biology AI
Basecamp Research partners with Anthropic, NVIDIA to build the world’s largest genomic database
Could AI smell cancer? Science says yes
rd newsletter
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest info on technologies, trends, and strategies in Research & Development.

R&D World Digital Issues

Fall 2025 issue

Browse the most current issue of R&D World and back issues in an easy to use high quality format. Clip, share and download with the leading R&D magazine today.

R&D 100 Awards
Research & Development World
  • Subscribe to R&D World Magazine
  • Sign up for R&D World’s newsletter
  • Contact Us
  • About Us
  • Drug Discovery & Development
  • Pharmaceutical Processing
  • Global Funding Forecast

Copyright © 2026 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search R&D World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE