AlphaEvolve: A Gemini-Powered Coding Agent for Designing Advanced Algorithms
webCredibility Rating
High quality. Established institution or organization with editorial oversight and accountability.
Rating inherited from publication venue: Google DeepMind
Relevant to AI safety discussions around recursive self-improvement and capability acceleration, as AlphaEvolve has already been used to optimize the training of the very LLMs that power it, representing a practical instance of AI-assisted capability gain in a deployed system.
Metadata
Summary
AlphaEvolve is Google DeepMind's evolutionary coding agent that combines Gemini LLMs with automated evaluators and evolutionary algorithms to discover and optimize complex algorithms. Deployed across Google's infrastructure, it has improved data center efficiency, chip design, AI training, and has found novel solutions to open mathematical problems including faster matrix multiplication algorithms.
Key Points
- •Combines Gemini Flash (breadth) and Gemini Pro (depth) to propose algorithmic solutions as code, then uses automated evaluators to score and iteratively evolve the best candidates.
- •Has been deployed in production across Google's computing ecosystem, improving data centers, hardware design, and AI training pipelines including training the models underlying AlphaEvolve itself.
- •Discovered faster matrix multiplication algorithms and novel solutions to open mathematical problems, demonstrating capability for genuine scientific discovery.
- •Represents a significant step beyond single-function discovery toward evolving entire codebases, raising questions about AI-assisted recursive capability improvement.
- •System's effectiveness is bounded to domains with clear, automated evaluation metrics, limiting but also grounding its application scope.
Cited by 1 page
| Page | Type | Quality |
|---|---|---|
| Self-Improvement and Recursive Enhancement | Capability | 69.0 |
Cached Content Preview
AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms — Google DeepMind
Skip to main content
May 14, 2025
Science
AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms
AlphaEvolve team
Share
New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators
Large language models (LLMs) are remarkably versatile. They can summarize documents, generate code or even brainstorm new ideas. And now we’ve expanded these capabilities to target fundamental and highly complex problems in mathematics and modern computing.
Today, we’re announcing AlphaEvolve , an evolutionary coding agent powered by large language models for general-purpose algorithm discovery and optimization. AlphaEvolve pairs the creative problem-solving capabilities of our Gemini models with automated evaluators that verify answers, and uses an evolutionary framework to improve upon the most promising ideas.
AlphaEvolve enhanced the efficiency of Google's data centers, chip design and AI training processes — including training the large language models underlying AlphaEvolve itself. It has also helped design faster matrix multiplication algorithms and find new solutions to open mathematical problems, showing incredible promise for application across many areas.
Designing better algorithms with large language models
In 2023, we showed for the first time that large language models can generate functions written in computer code to help discover new and provably correct knowledge on an open scientific problem. AlphaEvolve is an agent that can go beyond single function discovery to evolve entire codebases and develop much more complex algorithms.
AlphaEvolve leverages an ensemble of state-of-the-art large language models: our fastest and most efficient model, Gemini Flash , maximizes the breadth of ideas explored, while our most powerful model, Gemini Pro , provides critical depth with insightful suggestions. Together, these models propose computer programs that implement algorithmic solutions as code.
Diagram showing how the prompt sampler first assembles a prompt for the language models, which then generate new programs. These programs are evaluated by evaluators and stored in the programs database. This database implements an evolutionary a
... (truncated, 12 KB total)1b5c7b499756dd8f | Stable ID: MTAyZmMzND