Skip to content
Longterm Wiki
Back

The AI Scientist: Fully Automated Scientific Discovery

web

Relevant to AI safety because a system that autonomously conducts AI research could dramatically accelerate capability gains, compress alignment timelines, and pose recursive self-improvement risks if deployed without adequate oversight.

Metadata

Importance: 72/100tool pageprimary source

Summary

Sakana AI introduces The AI Scientist, the first comprehensive system for fully automated scientific discovery, enabling LLMs to independently conduct the entire research lifecycle—from generating ideas and writing code to running experiments and producing peer-reviewed papers—at roughly $15 per paper. The system also includes an automated peer review process and iteratively builds a growing archive of knowledge.

Key Points

  • Automates the full research pipeline: idea generation, coding, experimentation, visualization, manuscript writing, and peer review.
  • Demonstrated on ML subfields including diffusion models, transformers, and grokking, producing novel contributions autonomously.
  • Automated peer review process evaluates generated papers with near-human accuracy and provides iterative feedback.
  • Highly cost-efficient at ~$15 per paper, raising questions about the future pace and democratization of scientific research.
  • Raises significant AI safety concerns: an AI system that can autonomously advance AI research accelerates recursive self-improvement risks.

Cited by 2 pages

Cached Content Preview

HTTP 200Fetched Mar 20, 202623 KB
![](https://sakana.ai/assets/ai-scientist/cover.jpeg)

At Sakana AI, we have pioneered the use of nature-inspired methods to advance cutting-edge foundation models. Earlier this year, we developed methods to automatically [merge the knowledge of multiple LLMs](https://sakana.ai/evolutionary-model-merge/). In more recent work, we harnessed [LLMs to _discover_ new objective functions](https://sakana.ai/llm-squared/) for tuning other LLMs. Throughout these projects, we have been continuously surprised by the creative capabilities of current frontier models. This led us to dream even bigger: Can we use foundation models to automate the entire process of research itself?

## Introduction

One of the grand challenges of artificial intelligence is developing agents capable of conducting scientific research and discovering new knowledge. While frontier models have already been used to aid human scientists, e.g. for brainstorming ideas or writing code, they still require extensive manual supervision or are heavily constrained to a specific task.

Today, we’re excited to introduce **The AI Scientist**, the first comprehensive system for fully automatic scientific discovery, enabling Foundation Models such as Large Language Models (LLMs) to perform research independently. In collaboration with the Foerster Lab for AI Research at the University of Oxford and Jeff Clune and Cong Lu at the University of British Columbia, we’re excited to release our new paper, [The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery](https://arxiv.org/abs/2408.06292/).

In our report:

- We propose and run a fully AI-driven system for automated scientific discovery, applied to machine learning research.
- The AI Scientist automates the entire research lifecycle, from generating novel research ideas, writing any necessary code, and executing experiments, to summarizing experimental results, visualizing them, and presenting its findings in a full scientific manuscript.
- We also introduce an automated peer review process to evaluate generated papers, write feedback, and further improve results. It is capable of evaluating generated papers with near-human accuracy.
- The automated scientific discovery process is repeated to iteratively develop ideas in an open-ended fashion and add them to a growing archive of knowledge, thus imitating the human scientific community.
- In this first demonstration, The AI Scientist conducts research in diverse subfields within machine learning research, discovering novel contributions in popular areas, such as _diffusion models_, _transformers_, and _grokking_.

The AI Scientist is designed to be compute efficient. Each idea is implemented and developed into a full paper at a cost of approximately $15 per paper. While there are still occasional flaws in the papers produced by this first version (discussed below and in the report), this cost and the promise the system shows so far illustrate the potential of The AI Scientist to de

... (truncated, 23 KB total)
Resource ID: 757e9f278d8837d1 | Stable ID: Zjc2MGFmOW