Research & Development World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • R&D Index
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE

Algorithms let Machines Understand Speech with Human-like Speed, Accuracy

By R&D Editors | March 3, 2016

In the game, users describe pictures they see on their computer screen, and the agent tries to guess which picture they are talking about as fast and accurately as it can. Using its real-time, "incremental" (word-by-word) speech processing algorithms, the agent's speed of understanding and response is so fast that its game performance rivals that of human-human teams playing the same game. Courtesy of David DeVault, USC Institute for Creative TechnologiesToday, talking to a computer or an app can be an infuriating experience. But is it possible that one day we will have conversations with computers that feel as fluid and natural as talking to another person?

David DeVault, a research assistant professor at the University of Southern California’s Institute for Creative Technologies, believes so — and his research is developing high-speed language processing systems whose speed and efficiency can rival that of human speakers in specific settings.

It turns out that one of the key challenges in creating more human-like voice interfaces is the rapid speed with which human speakers understand and respond to each other in a live conversation.

“While we human speakers can often understand and respond to what someone is saying to us in a fraction of a second, a typical voice interface will require much longer — often a second or two — to try to understand what you have said and respond in an appropriate way,” DeVault says.

Interacting with the relatively slow pace of current voice interfaces is one reason many people continue to find them inefficient and frustrating to use.

With support from the National Science Foundation, DeVault and his students are studying new techniques that can streamline human-machine conversations by enabling the system to perform all of the necessary computer processing steps in real-time while the user is talking. The resulting systems are often able to figure out what the speaker means and how it should respond well before that person finishes speaking.

In a recent research paper, DeVault and his students, Ramesh Manuvinakurike and Maike Paetzel, described the creation and evaluation of a high-performance game-playing agent called Eve. 

In the game, users describe the pictures they see on their computer screen and the agent tries to guess which picture they are talking about as fast and accurately as it can. By using “incremental” (word-by-word) speech processing algorithms, the agent’s speed of understanding and response is so fast that its game performance rivals that of human-human teams playing the same game.

When compared to alternative versions of the agent that wait until a user’s speech is finished to try to understand and respond, users rate their interactions with the more incremental version of Eve as more efficient, more natural, and having better shared understanding.

“These findings underscore the importance of enabling systems to not only understand what users are saying, but to do so as quickly as a human would,” DeVault said.

The research received a Best Paper Award at the 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015). It points toward the creation of voice interfaces for other applications that users may find more natural and efficient to use.

As voice interfaces continue to become faster, better at understanding what we mean and less frustrating to talk to, they will be increasingly adopted in a wide range of important applications including information access, education, healthcare, entertainment and training.

“We are at the beginning of a sea change in what we can achieve through conversation with computers,” DeVault says. “At this point we are seeing just a glimmer of what is about to become possible.”

Related Articles Read More >

Lab automation is “vaporizing”: Why the hottest innovation is invisible
Google on how AI will extend researchers
Kythera Labs’ Wayfinder remasters incomplete medical data for AI analysis
Adviser Labs raises $1M to simplify cloud HPC for in AI and scientific computing
rd newsletter
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest info on technologies, trends, and strategies in Research & Development.
RD 25 Power Index

R&D World Digital Issues

Fall 2024 issue

Browse the most current issue of R&D World and back issues in an easy to use high quality format. Clip, share and download with the leading R&D magazine today.

Research & Development World
  • Subscribe to R&D World Magazine
  • Sign up for R&D World’s newsletter
  • Contact Us
  • About Us
  • Drug Discovery & Development
  • Pharmaceutical Processing
  • Global Funding Forecast

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search R&D World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • R&D Index
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE