April 05, 2024
Can Chatbots Hold Meaningful Conversations?
rseny Moskvichev dreams of the day he can have a meaningful conversation with Artificial intelligence. “By meaningful, I mean a conversation that has the power to change you,” says the cognitive and computer scientist. “The problem,” says Moskvichev, “is that LLMs are complete amnesiacs. They only have so much context they can attend to. If you’re out of this context, they forget everything you spoke about with them.” Even the most advanced chatbots can only process about 16,000 words of text within a prompt when in conversation with a human user. This is called a “context window.” And they can’t connect the information they receive during different “conversations” with a human, or build a storyline. To help chatbots learn to hold life-changing conversations, and to improve their comprehension of the deep complexities of context—of the webs of relationships between people, events, and timelines that govern human lives—Moskvichev and his colleagues are teaching them to read novels, the way we might learn to read them in high school literature classes. The act of reading a novel might seem like a relaxing pastime, but it . We use memory and complex, layered comprehension to follow multiple characters through twisting plots, scene changes, and narrative. And while we might not think about it, the average novel averages around 80,000 words. , by Oscar Wilde, for instance, at 82,000 words, while by W.E. DuBois, totals around 72,000. , a children’s book, by Antoine de Saint-Exupéry, has around 17,000 words. All those words gradually build a story we hold and examine in our minds. But such skills are currently out of reach to Large Language Models (LLMs) like Open AI’s ChatGPT, which can process text but cannot be said to the way we do. The problem is that LLMs are complete amnesiacs. Together with Ky-Vinh Mai, who studies complexity at the University of California, Irvine, Moskvichev built a new database that trains LLMs to read and analyze long stretches of text as part of a postdoc at the Santa Fe Institute, an independent non-profit theoretical research institute. The pair unveiled their new tool, , at the Empirical Methods in Natural Language Processing in Singapore late last year. “It’s a dataset with very long contexts,” Moskvichev says. As a training tool it offers more than a million questions for LLMs to practice on—“way above everything that was there before.” It is a “supervised” dataset, he says, which means there are gold standard answers the AI is expected to score correctly on, making assessment possible. Moskvichev and Mai built Narrative XL from 1,500 books publicly available through Project Gutenberg. They train LLMs in reading comprehension by having the AI read a book in the database, then asking it to find a correct scene summary from a pool of options. Some summaries are accurate, while others include decoy scenes or are “corrupted” with characters from other books—say, Dracula moonlighting as a protagonist in . To train memory, Moskvichev and Mai compiled read-along questions, where the AI is expected to know more of the plot if it has read further into the book—but nothing beyond that point. For example, if the AI has only read half-way through , it should still “worry” over whether Miss Bennett marries Mr. Darcy. (Spoiler alert—she does, but Austin saves the nuptials for a grand finale!) Because memory is a function of time, where the sequence of events matters, accidental knowledge of future events, even when factually correct, counts as a false memory. Any reference to the pair as married in a summary of the book, should be identified by the AI as false, unless it has been shown the entire text in its context window. As part of its training on NarrativeXL an LLM can also be asked to identify and correct the mistakes in corrupted book scenes. So not only would Dracula have to be removed from , an LLM in training would need to replace him with the correct character. This requires nuanced comprehension, an understanding of how the characters function in relation to one another. The 1,500 books and nearly 1 million questions in Narrative XL surpass existing book training databases in size by at least twofold. BookSum is a training database released in 2021 that compiles 405 books for training AIs. The average context window for all available training databases remains under 15,000 words. One of the roadblocks for developing such training tools for AI is human resources: To accurately summarize a book requires for someone to read it, and design the training questions. “It’s very expensive to collect a supervised data set of this type,” Moskvichev says. It is here that the pair used a stroke of ingenuity—they built Narrative XL by getting ChatGPT3.5 to summarize short book sections, a task at which LLMs excel. It is an example of a machine being used to train a more powerful machine. While Moskvichev and Mai validated the use of NarrativeXL as a training tool and observed significant improvement in trained LLM performance versus untrained, the LLMs still make mistakes approximately half of the time. “AI models are very tricky. They’re sneaky, and they cheat when they can,” Moskvichev says. The heart of the problem is that chatbots are designed to identify the quickest path to an answer, which rarely involves reading everything. It is simply never a wild guess to predict that the heroine of a 19th-century romance will get the guy, a shortcut that can land the AI in trouble. “There’s still room for improvement,” Moskvichev says. Posted on April 5, 2024 Elena Kazamia is a science writer from Greece. She has a master’s degree in conservation from University College London and a Ph.D. in plant sciences from the University of Cambridge in the U.K. Cutting-edge science, unraveled by the very brightest living thinkers.
Related Stories
Latest News
Top news around the world
Academy Awards

‘Oppenheimer’ Reigns at Oscars With Seven Wins, Including Best Picture and Director

Get the latest news about the 2024 Oscars, including nominations, winners, predictions and red carpet fashion at 96th Academy Awards

Around the World

Celebrity News

> Latest News in Media

Watch It
JoJo Siwa Reveals She Spent $50k on This Cosmetic Procedure
April 08, 2024
tilULujKDIA
Gypsy Rose Blanchard Files for Divorce from Ryan Anderson
April 08, 2024
kjqE93AL4AM
Bachelor Nation’s Trista Sutter Shares Update on Husband’s Battle With Lyme Disease | E! News
April 08, 2024
mNBxwEpFN4Y
Alan Tudyk Does All His Disney Voices
April 08, 2024
fkqBY4E9QPs
Bob Iger responds to critics who call Disney "too woke"
April 06, 2024
loZMrwBYVbI
Kirsten Dunst recites a classic cheer from 'Bring it On'
April 06, 2024
VHAca3r0t-k
Dr. Paul Nassif Offers Up Plastic Surgery Warning for Gypsy Rose Blanchard | TMZ
April 09, 2024
cXIyPm8mKGY
Reba McEntire Laughs at Joy Behar's Suggestion 'Jolene' is Anti-Feminist | TMZ TV
April 08, 2024
11Cyp1sH14I
NeNe Leakes Says She's Okay with Cheating If It's Done Respectfully | TMZ TV
April 08, 2024
IsjAeJFgwhk
Ben Affleck and Jennifer Lopez’s wedding was 20 years in the making
April 08, 2024
BU8hh19xtzA
Bianca Censori wears completely sheer tube dress and knee-high stockings for Kanye West outing
April 08, 2024
IkbdMacAuhU
Kelsea Ballerini tells trolls to ‘shut up’ about pantsless CMT Music Awards 2024 performance #shorts
April 08, 2024
G4OSTYyXcOc
TV Schedule
Late Night Show
Watch the latest shows of U.S. top comedians

Sports

Latest sport results, news, videos, interviews and comments
Latest Events
08
Apr
ITALY: Serie A
Udinese - Inter Milan
07
Apr
ENGLAND: Premier League
Manchester United - Liverpool
07
Apr
ENGLAND: Premier League
Tottenham Hotspur - Nottingham Forest
07
Apr
ITALY: Serie A
Juventus - Fiorentina
07
Apr
ENGLAND: Premier League
Sheffield United - Chelsea
07
Apr
ITALY: Serie A
Monza - Napoli
07
Apr
GERMANY: Bundesliga
Wolfsburg - Borussia Monchengladbach
07
Apr
ITALY: Serie A
Verona - Genoa
07
Apr
ITALY: Serie A
Cagliari - Atalanta
07
Apr
GERMANY: Bundesliga
Hoffenheim - Augsburg
07
Apr
ITALY: Serie A
Frosinone - Bologna
06
Apr
GERMANY: Bundesliga
Heidenheim - Bayern Munich
06
Apr
GERMANY: Bundesliga
Borussia Dortmund - Stuttgart
06
Apr
ENGLAND: Premier League
Brighton - Arsenal
06
Apr
ITALY: Serie A
Roma - Lazio
06
Apr
ENGLAND: Premier League
Crystal Palace - Manchester City
06
Apr
ITALY: Serie A
AC Milan - Lecce
04
Apr
ENGLAND: Premier League
Chelsea - Manchester United
04
Apr
ENGLAND: Premier League
Liverpool - Sheffield United
03
Apr
ENGLAND: Premier League
Arsenal - Luton
03
Apr
ENGLAND: Premier League
Manchester City - Aston Villa
02
Apr
ENGLAND: Premier League
West Ham United - Tottenham Hotspur
01
Apr
SPAIN: La Liga
Villarreal - Atletico Madrid
01
Apr
ITALY: Serie A
Lecce - Roma
01
Apr
ITALY: Serie A
Inter Milan - Empoli
31
Mar
ENGLAND: Premier League
Manchester City - Arsenal
31
Mar
SPAIN: La Liga
Real Madrid - Athletic Bilbao
31
Mar
ENGLAND: Premier League
Liverpool - Brighton
30
Mar
SPAIN: La Liga
Barcelona - Las Palmas
30
Mar
ENGLAND: Premier League
Brentford - Manchester United
30
Mar
ITALY: Serie A
Fiorentina - AC Milan
Find us on Instagram
at @feedimo to stay up to date with the latest.
Featured Video You Might Like
zWJ3MxW_HWA L1eLanNeZKg i1XRgbyUtOo -g9Qziqbif8 0vmRhiLHE2U JFCZUoa6MYE UfN5PCF5EUo 2PV55f3-UAg W3y9zuI_F64 -7qCxIccihU pQ9gcOoH9R8 g5MRDEXRk4k
Copyright © 2020 Feedimo. All Rights Reserved.