What's up in Large language models

Latest Articles

How ‘Embeddings’ Encode What Words Mean — Sort Of

September 18, 2024

Machines work with words by embedding their relationships with other words in a string of numbers.

Will AI Ever Have Common Sense?

July 18, 2024

Common sense has long been viewed as one of the hardest challenges in AI. That said, ChatGPT-4 has acquired what some believe is an impressive sense of humanity. How is this possible? Listen to this week’s “The Joy of Why” with co-host Steven Strogatz.

Q&A

Does AI Know What an Apple Is? She Aims to Find Out.

April 25, 2024

The computer scientist Ellie Pavlick is translating philosophical concepts such as “meaning” into concrete, testable ideas.

How Chain-of-Thought Reasoning Helps Neural Networks Compute

March 21, 2024

Large language models do better at solving problems when they show their work. Researchers are beginning to understand why.

How Quickly Do Large Language Models Learn Unexpected Skills?

February 13, 2024

A new study suggests that so-called emergent abilities actually develop gradually and predictably, depending on how you measure them.

New Theory Suggests Chatbots Can Understand Text

January 22, 2024

Far from being “stochastic parrots,” the biggest large language models seem to learn enough skills to understand the words they’re processing.

Tiny Language Models Come of Age

October 5, 2023

To better understand how neural networks learn to simulate writing, researchers trained simpler versions on synthetic children’s stories.

Some Neural Networks Learn Language Like Humans

May 22, 2023

Researchers uncover striking parallels in the ways that humans and machine learning models acquire language skills.

Chatbots Don’t Know What Stuff Isn’t

May 12, 2023

Today’s language models are more sophisticated than ever, but they still struggle with the concept of negation. That’s unlikely to change anytime soon.