Research & Development World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • Call for Nominations: The 2025 R&D 100 Awards
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
    • Explore the 2024 R&D 100 award winners and finalists
  • Resources
    • Research Reports
    • Digital Issues
    • R&D Index
    • Subscribe
    • Video
    • Webinars
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE

Researcher tests performance of diverse HPC architecture

By R&D Editors | March 29, 2012

/sites/rdmag.com/files/legacyimages/RD/News/2012/03/bokhariFigurelrgx500.jpg

click to enlarge

Ohio State University researcher Saniyah Bokhari compared performance measures of parallel supercomputers that employ various architectures, testing systems located at the Ohio Supercomputer Center and Pacific Northwest Laboratory: (1) Cray XMT, 128 proc, 500 MHz, 1-TB; (2) IBM x3755, 2.4-GHz Opterons,16-core, 64-GB; (3) NVIDIA FX 5800 GPU, 1.296 GHz, 240 cores, 4-GB device memory. Source: Bokhari

Surveying the wide range of parallel system architectures offered in the
supercomputer market, an Ohio
State University
researcher recently sought to establish some side-by-side performance
comparisons.

The journal Concurrency and
Computation: Practice and Experience
, in February, published “Parallel
solution of the subset-sum problem: an empirical study.” The paper is
based upon a master’s thesis written last year by computer science and
engineering graduate student Saniyah Bokhari.

“We explore the parallelization of the subset-sum problem on
three contemporary but very different architectures, a 128-processor Cray
massively multithreaded machine, a 16-processor IBM shared memory machine, and
a 240-core NVIDIA graphics processing unit,” said Bokhari. “These experiments
highlighted the strengths and weaknesses of these architectures in the context
of a well-defined combinatorial problem.”

Bokhari evaluated the conventional central processing unit
architecture of the IBM 1350 Glenn Cluster at the Ohio Supercomputer Center
(OSC) and the less-traditional general-purpose graphic processing unit (GPGPU)
architecture, available on the same cluster. She also evaluated the
multithreaded architecture of a Cray Extreme Multithreading (XMT) supercomputer
at the Pacific Northwest National Laboratory’s (PNNL) Center for Adaptive
Supercomputing Software.

“Ms. Bokhari’s work provides valuable insights into
matching the best high-performance computing architecture with the
computational needs of a given research community,” noted Ashok
Krishnamurthy, interim co-executive director of OSC. “These systems are
continually evolving to incorporate new technologies, such as GPUs, in order to
achieve new, higher-performance measures, and we must understand exactly what
each new innovation offers.”

Each of the architectures Bokhari tested fall in the area of
parallel computing, where multiple processors are used to tackle pieces of
complex problems “in parallel.” The subset-sum problem she used for her study
is an algorithm with known solutions that is solvable in a period of time that
is proportional to the number of objects entered, multiplied by the sum of
their sizes. Also, she carefully timed the code runs for solving a comprehensive
range of problem sizes.

Bokhari concluded that the GPU performs well for problems
whose tables fit within the limitations of the device memory. Because GPUs
typically have memory sizes in the range of 10 gigabytes (GB), such
architectures are best for small problems that have table sizes of
approximately thirty billion bits.

She found that the IBM x3755 performed very well on
medium-sized problems that fit within its 64-GB memory, but had poor
scalability as the number of processors increased and was unable to sustain its
performance as the problem size increased. The machine tended to saturate for
problem with table sizes of 300 billion bits.

The Cray XMT showed very good scaling for large problems and
demonstrated sustained performance as the problem size increased, she said.
However, the Cray had poor scaling for small problem sizes, performing best
with table sizes of a trillion bits or more.

“In conclusion, we can state that the NVIDIA GPGPU is best
suited to small problem sizes; the IBM x3755 performs well for medium sizes,
and the Cray XMT is the clear choice for large problems,” Bokhari said. “For
the XMT, we expect to see better performance for large problem sizes, should
memory larger than 1 TB become available.”

Ohio Supercomputer Center

Related Articles Read More >

2025 R&D layoffs tracker tops 92,000
Eli Lilly facility
9 R&D developments this week: Lilly builds major R&D center, Stratolaunch tests hypersonic craft, IBM chief urges AI R&D funding
Five cases where shaky science snowballed into public confusion
Caltech, Fermilab, and collaborators test quantum sensors for future particle physics experiments
rd newsletter
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest info on technologies, trends, and strategies in Research & Development.
RD 25 Power Index

R&D World Digital Issues

Fall 2024 issue

Browse the most current issue of R&D World and back issues in an easy to use high quality format. Clip, share and download with the leading R&D magazine today.

Research & Development World
  • Subscribe to R&D World Magazine
  • Enews Sign Up
  • Contact Us
  • About Us
  • Drug Discovery & Development
  • Pharmaceutical Processing
  • Global Funding Forecast

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search R&D World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • Call for Nominations: The 2025 R&D 100 Awards
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
    • Explore the 2024 R&D 100 award winners and finalists
  • Resources
    • Research Reports
    • Digital Issues
    • R&D Index
    • Subscribe
    • Video
    • Webinars
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE