April 08, 2024
The Great Dictation Boom Is Here
As a little girl, I often found myself in my family’s basement, doing battle with a dragon. I wasn’t gaming or playing pretend: My dragon was a piece of enterprise voice-dictation software called Dragon Naturally Speaking, launched in 1997 (and purchased by my dad, an early adopter). As a kid, I was enchanted by the idea of a computer that could type for you. The premise was simple: Wear a headset, pull up the software, and speak. Your words would fill a document on-screen without your hands having to bear the indignity of actually typing. But no matter how much I tried to enunciate, no matter how slowly I spoke, the program simply did not register my tiny, high-pitched voice. The page would stay mostly blank, occasionally transcribing the wrong words. Eventually, I’d get frustrated, give up, and go play with something else. Much has changed in the intervening decades. Voice recognition—the computer-science term for the ability of a machine to accurately transcribe what is being said—is improving rapidly thanks in part to recent advances in AI. Today, I’m a voice-texting wizard, often dictating obnoxiously long paragraphs on my iPhone to Friends and family while walking my dog or driving. I find myself speaking into my phone’s text box all the time now, simply because I feel like it. Apple updated its dictation software last year, and it’s great. So are many other programs. The dream of accurate speech-to-text—long held not just in my parents’ basement but by people all over the world—is coming together. The dragon has nearly been slain. “All of these things that we’ve been working on are suddenly working,” Mark Hasegawa-Johnson, a professor of electrical and computer engineering at the University of Illinois Urbana-Champaign, told me. Scientists have been researching speech-recognition tools since at least the mid-20th century; early examples include the IBM Shoebox, a rudimentary computer housed within a wooden box that could measure sounds from a microphone and associate them with 16 different preprogrammed words. By the end of the 1980s, voice-dictation models could process thousands of words. And by the late ’90s, as the personal-computing boom was in full swing, dictation software was beginning to reach consumers. These programs were joined in the 2010s by digital assistants such as Siri, but even these more advanced tools were far from perfect. “For a long time, we were making gradual, incremental progress, and then suddenly things started to get better much faster,” Hasegawa-Johnson said. Experts pointed me to a few different factors that helped accelerate this technology over the past decade. First, researchers had more digitized speech to work with. Large open-source data sets were compiled, including LibriSpeech, which contains 1,000 hours of recorded speech from public-domain audiobooks. Consumers also started regularly using voice tools such as Alexa and Siri, which likely gave private companies more data to train on. Data is key to quality: The more speech data that a model has access to, the better it can recognize what’s being said—“water,” say, not “daughter” or “squatter.” Models were once trained on just a few thousand hours of speech; now they are trained on a lifetime’s worth. The models themselves also got more sophisticated as part of larger, industry-wide advancements in machine learning and AI. The rise of end-to-end neural networks—networks that could directly pair audio with words rather than trying to transcribe by BREAKING them down into syllables—has also accelerated models’ accuracy. And hardware has improved to allow more units of processing power on our personal devices, which allows bigger and fancier models to run in the palm of your hand. Of course, the tools are not yet perfect. For starters, their quality can depend on who is speaking: Voice-recognition models have been found to have higher error rates for Black speakers compared with white speakers, and they also sometimes struggle to understand people with dysarthric, or irregular, speech, such as those with Parkinson’s disease. (Hasegawa-Johnson, who compiles stats related to these issues, is the principal researcher at the Speech Accessibility Project, which aims to train models on more dysarthric speech to improve their outputs.) The future of voice dictation will also be further complicated by the rise of generative AI. Large language models of the sort that power ChatGPT can also be used with audio, which would allow a program to better predict which word should come next in a sequence. For example, when transcribing, such an audio tool might reason that, based on the context, a person is likely saying that their dog—not their frog—needs to go for its morning walk. Yet like their text counterparts, voice-recognition tools that use large language models can “hallucinate,” transcribing words that were never actually spoken. A team of scholars that recently documented violent and unsavory hallucinations, as well as those that perpetuate harmful stereotypes, coming from OpenAI’s new audio model, Whisper. (In response to a request for comment about this research, a spokesperson for OpenAI said, in part, “We continually conduct research on how we can improve the accuracy of our models, including how we can reduce hallucinations.”) So goes the AI boom: The technology is both creating impressive new things and introducing new problems. In voice dictation, the chasm between two once-distinct mediums, audio and text, is closing, leaving us to appreciate the marvel available in our hands—and to proceed with caution. Publisher
Related Stories
Latest News
Top news around the world
Academy Awards

‘Oppenheimer’ Reigns at Oscars With Seven Wins, Including Best Picture and Director

Get the latest news about the 2024 Oscars, including nominations, winners, predictions and red carpet fashion at 96th Academy Awards

Around the World

Celebrity News

> Latest News in Media

Watch It
JoJo Siwa Reveals She Spent $50k on This Cosmetic Procedure
April 08, 2024
tilULujKDIA
Gypsy Rose Blanchard Files for Divorce from Ryan Anderson
April 08, 2024
kjqE93AL4AM
Bachelor Nation’s Trista Sutter Shares Update on Husband’s Battle With Lyme Disease | E! News
April 08, 2024
mNBxwEpFN4Y
Alan Tudyk Does All His Disney Voices
April 08, 2024
fkqBY4E9QPs
Bob Iger responds to critics who call Disney "too woke"
April 06, 2024
loZMrwBYVbI
Kirsten Dunst recites a classic cheer from 'Bring it On'
April 06, 2024
VHAca3r0t-k
Dr. Paul Nassif Offers Up Plastic Surgery Warning for Gypsy Rose Blanchard | TMZ
April 09, 2024
cXIyPm8mKGY
Reba McEntire Laughs at Joy Behar's Suggestion 'Jolene' is Anti-Feminist | TMZ TV
April 08, 2024
11Cyp1sH14I
NeNe Leakes Says She's Okay with Cheating If It's Done Respectfully | TMZ TV
April 08, 2024
IsjAeJFgwhk
Ben Affleck and Jennifer Lopez’s wedding was 20 years in the making
April 08, 2024
BU8hh19xtzA
Bianca Censori wears completely sheer tube dress and knee-high stockings for Kanye West outing
April 08, 2024
IkbdMacAuhU
Kelsea Ballerini tells trolls to ‘shut up’ about pantsless CMT Music Awards 2024 performance #shorts
April 08, 2024
G4OSTYyXcOc
TV Schedule
Late Night Show
Watch the latest shows of U.S. top comedians

Sports

Latest sport results, news, videos, interviews and comments
Latest Events
08
Apr
ITALY: Serie A
Udinese - Inter Milan
07
Apr
ENGLAND: Premier League
Manchester United - Liverpool
07
Apr
ENGLAND: Premier League
Tottenham Hotspur - Nottingham Forest
07
Apr
ITALY: Serie A
Juventus - Fiorentina
07
Apr
ENGLAND: Premier League
Sheffield United - Chelsea
07
Apr
ITALY: Serie A
Monza - Napoli
07
Apr
GERMANY: Bundesliga
Wolfsburg - Borussia Monchengladbach
07
Apr
ITALY: Serie A
Verona - Genoa
07
Apr
ITALY: Serie A
Cagliari - Atalanta
07
Apr
GERMANY: Bundesliga
Hoffenheim - Augsburg
07
Apr
ITALY: Serie A
Frosinone - Bologna
06
Apr
GERMANY: Bundesliga
Heidenheim - Bayern Munich
06
Apr
GERMANY: Bundesliga
Borussia Dortmund - Stuttgart
06
Apr
ENGLAND: Premier League
Brighton - Arsenal
06
Apr
ITALY: Serie A
Roma - Lazio
06
Apr
ENGLAND: Premier League
Crystal Palace - Manchester City
06
Apr
ITALY: Serie A
AC Milan - Lecce
04
Apr
ENGLAND: Premier League
Chelsea - Manchester United
04
Apr
ENGLAND: Premier League
Liverpool - Sheffield United
03
Apr
ENGLAND: Premier League
Arsenal - Luton
03
Apr
ENGLAND: Premier League
Manchester City - Aston Villa
02
Apr
ENGLAND: Premier League
West Ham United - Tottenham Hotspur
01
Apr
SPAIN: La Liga
Villarreal - Atletico Madrid
01
Apr
ITALY: Serie A
Lecce - Roma
01
Apr
ITALY: Serie A
Inter Milan - Empoli
31
Mar
ENGLAND: Premier League
Manchester City - Arsenal
31
Mar
SPAIN: La Liga
Real Madrid - Athletic Bilbao
31
Mar
ENGLAND: Premier League
Liverpool - Brighton
30
Mar
SPAIN: La Liga
Barcelona - Las Palmas
30
Mar
ENGLAND: Premier League
Brentford - Manchester United
30
Mar
ITALY: Serie A
Fiorentina - AC Milan
Find us on Instagram
at @feedimo to stay up to date with the latest.
Featured Video You Might Like
zWJ3MxW_HWA L1eLanNeZKg i1XRgbyUtOo -g9Qziqbif8 0vmRhiLHE2U JFCZUoa6MYE UfN5PCF5EUo 2PV55f3-UAg W3y9zuI_F64 -7qCxIccihU pQ9gcOoH9R8 g5MRDEXRk4k
Copyright © 2020 Feedimo. All Rights Reserved.