Emergent Abilities in Large Language Models: An Explainer
Credibility Rating
4/5
High (4): High quality. Established institution or organization with editorial oversight and accountability.
Rating inherited from publication venue: CSET Georgetown
Accessible policy-oriented explainer from CSET aimed at non-technical audiences; useful for understanding why scaling unpredictability is a concern for both AI safety researchers and policymakers.
Metadata
Importance: 62/100 · blog post · educational
Summary
This CSET explainer breaks down the concept of emergent abilities in large language models—capabilities that appear suddenly and unpredictably as models scale up. It explains why these emergent behaviors pose challenges for AI forecasting, evaluation, and safety planning, and discusses implications for policy and governance.
Key Points
- Emergent abilities are capabilities that appear abruptly in LLMs at certain scale thresholds, rather than improving gradually.
- These unpredictable capability jumps make it difficult to anticipate what future models will be able to do, complicating safety planning.
- The phenomenon challenges standard evaluation frameworks, since tests may show near-zero performance until a sudden threshold is crossed.
- Emergence has policy implications: regulators and developers may not foresee dangerous capabilities before they manifest.
- Debate exists over whether emergence is a genuine phenomenon or an artifact of how performance metrics are measured.
Cited by 1 page
| Page | Type | Quality |
|---|---|---|
| Emergent Capabilities | Risk | 61.0 |
Cached Content Preview
HTTP 200 · Fetched Mar 20, 2026 · 27 KB
# Emergent Abilities in Large Language Models: An Explainer
Thomas Woodside
April 16, 2024
A recent topic of contention among artificial intelligence researchers has been whether large language models can exhibit unpredictable ("emergent") jumps in capability as they are scaled up. These arguments have found their way into policy circles and the popular press, often in simplified or distorted ways that have created confusion. This blog post explores the disagreements around emergence and their practical relevance for policy.
#### Summary
Claims of “emergent capabilities” in large language models (LLMs) have been discussed by [journalists](https://www.wired.com/story/how-quickly-do-large-language-models-learn-unexpected-skills/), [researchers](https://arxiv.org/abs/2206.07682), and even [members of Congress](https://republicans-science.house.gov/_cache/files/8/a/8a9f893d-858a-419f-9904-52163f22be71/191E586AF744B32E6831A248CD7F4D41.2023-12-14-aisi-scientific-merit-final-signed.pdf). But these conversations have often been hampered by confusion about what exactly the term means and what it implies.
The idea of “emergence” comes from the study of complex systems. It describes systems that cannot be explained simply by looking at their parts, such as complex social networks. The field of deep learning, including LLMs, inherently involves emergence, since the internal properties of neural networks are difficult to predict simply by looking at their millions or billions of individual parameters.
A related—but distinct—definition of emergence has become common in the context of LLMs. According to this definition, emergence refers to the capabilities of LLMs that appear suddenly and unpredictably as model size, computational power, and training data scale up. Researchers have found many examples of such “emergent capabilities.” The topic has garnered attention because of the potential for the unpredictable emergence of risky capabilities (e.g., effective autonomous hacking) in the future. It has also been exaggerated in the popular press.
Some researchers dispute that LLMs exhibit sudden and unpredictable changes in their capabilities as they increase in scale. They have devised alternative metrics for the same capabilities; these metrics show smooth increases with scale and could potentially help predict capability increases into the future. However, devising such metrics is often not straightforward, especially for complex tasks, making it unclear how reliable this approach will be for making predictions.
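To make the metric-choice argument concrete, here is a minimal toy sketch (not from the CSET post). It assumes per-token accuracy improves smoothly with model scale (a logistic curve chosen purely for illustration) and shows how an all-or-nothing exact-match score over a multi-token answer can still appear to jump suddenly. The function name, the 10-token answer length, and the token-independence assumption are all hypothetical.

```python
import math

# Toy model: assume per-token accuracy improves smoothly with scale.
# The logistic curve (centered at 10^9 parameters) is an illustrative
# assumption, not an empirical result.
def per_token_accuracy(log_params: float) -> float:
    return 1 / (1 + math.exp(-(log_params - 9)))

ANSWER_LEN = 10  # tokens in the target answer (hypothetical task)

for log_params in range(6, 13):  # 10^6 .. 10^12 parameters
    p = per_token_accuracy(log_params)
    # Exact match requires every token to be right; assuming token
    # independence, that is p ** ANSWER_LEN. This all-or-nothing metric
    # stays near zero, then rises sharply, even though p grows smoothly.
    exact_match = p ** ANSWER_LEN
    print(f"10^{log_params} params: per-token={p:.3f}  exact-match={exact_match:.3f}")
```

Run as written, per-token accuracy climbs gradually from about 0.05 to 0.95 across the scale range, while exact match stays near zero until late and then rises steeply. That steep late rise is the shape often read as "emergence," which is why critics argue the phenomenon can be an artifact of the metric rather than of the model.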
While there are some methods for estimating the capabilities of LLMs before they are trained—and these will hopefully improve so that researchers can predict even seemingly sudden jumps in capabilities—researchers' and policymakers' ability to predict the future of LLMs is far from assured. Previous attempts to predict
... (truncated, 27 KB total)
Resource ID: 9926d26da9a3d761 | Stable ID: MGEwZDYwND