large language models

Why Do Researchers Care About Small Language Models?

March 10, 2025

Larger models can pull off greater feats, but the accessibility and efficiency of smaller models make them attractive tools.

genomics

The Poetry Fan Who Taught an LLM to Read and Write DNA

By Ingrid Wickelgren

February 5, 2025

By treating DNA as a language, Brian Hie’s “ChatGPT for genomes” could pick up patterns that humans can’t see, accelerating biological design.

artificial intelligence

Chatbot Software Begins to Face Fundamental Limitations

By Anil Ananthaswamy

January 31, 2025

Recent results show that large language models struggle with compositional tasks, suggesting a hard limit to their abilities.

natural language processing

Can AI Models Show Us How People Learn? Impossible Languages Point a Way.

By Ben Brubaker

January 13, 2025

Certain grammatical rules never appear in any known language. By constructing artificial languages that have these rules, linguists can use neural networks to explore how people learn.

natural language processing

Debate May Help AI Models Converge on Truth

By Stephen Ornes

November 8, 2024

How do we know if a large language model is lying? Letting AI systems argue with each other may help expose the truth.

machine learning

How ‘Embeddings’ Encode What Words Mean — Sort Of

By John Pavlus

September 18, 2024

Machines work with words by embedding their relationships with other words in a string of numbers.

The Joy of Why

Will AI Ever Have Common Sense?

By Steven Strogatz

July 18, 2024

Common sense has been viewed as one of the hardest challenges in AI. That said, ChatGPT4 has acquired what some believe is an impressive sense of humanity. How is this possible? Listen to this week’s “The Joy of Why” with co-host Steven Strogatz.

Ellie Pavlick in a blue scarf stands on a stairwell next to a shiny machine

Q&A

Does AI Know What an Apple Is? She Aims to Find Out.

By John Pavlus

April 25, 2024

The computer scientist Ellie Pavlick is translating philosophical concepts such as “meaning” into concrete, testable ideas.

natural language processing

How Chain-of-Thought Reasoning Helps Neural Networks Compute

By Ben Brubaker

March 21, 2024

Large language models do better at solving problems when they show their work. Researchers are beginning to understand why.

Saved Articles

Log out

Change password

Why Do Researchers Care About Small Language Models?

The Poetry Fan Who Taught an LLM to Read and Write DNA

Chatbot Software Begins to Face Fundamental Limitations

Can AI Models Show Us How People Learn? Impossible Languages Point a Way.

Debate May Help AI Models Converge on Truth

How ‘Embeddings’ Encode What Words Mean — Sort Of

Will AI Ever Have Common Sense?

Does AI Know What an Apple Is? She Aims to Find Out.

How Chain-of-Thought Reasoning Helps Neural Networks Compute

Large language models

Latest Articles

Why Do Researchers Care About Small Language Models?

The Poetry Fan Who Taught an LLM to Read and Write DNA

Chatbot Software Begins to Face Fundamental Limitations

Can AI Models Show Us How People Learn? Impossible Languages Point a Way.

Debate May Help AI Models Converge on Truth

How ‘Embeddings’ Encode What Words Mean — Sort Of

Will AI Ever Have Common Sense?

Does AI Know What an Apple Is? She Aims to Find Out.

How Chain-of-Thought Reasoning Helps Neural Networks Compute