CyberSecEval: Meta's Cybersecurity Evaluation Benchmark for LLMs

web

GitHub·github.com/facebookresearch/CyberSecEval

Credibility Rating

3/5

Good(3)

Good quality. Reputable source with community review or editorial standards, but less rigorous than peer-reviewed venues.

Rating inherited from publication venue: GitHub

Useful reference for researchers and practitioners assessing dual-use risks of AI code-generation systems; part of Meta's broader responsible AI evaluation toolkit and directly relevant to understanding how LLMs can be misused for cyberattacks.

Metadata

Importance: 62/100tool pagetool

Summary

CyberSecEval is an open-source benchmark suite from Meta (Facebook Research) designed to evaluate the cybersecurity risks and capabilities of large language models, particularly code-generating AI. It tests both the propensity of LLMs to assist with cyberattacks and their ability to generate insecure code, providing a standardized framework for assessing AI safety in security-sensitive contexts.

Key Points

•Evaluates LLMs on two key dimensions: tendency to generate insecure code and willingness to assist with cyberattacks or provide harmful security guidance
•Provides standardized, reproducible benchmarks enabling comparison across different LLMs for cybersecurity-relevant risk assessment
•Developed by Meta (Facebook Research) as part of responsible AI deployment efforts for code-generation models like Code Llama
•Covers a range of attack categories including cyberattack assistance, insecure code generation, and exploitation guidance
•Relevant for AI red-teaming and safety evaluations, helping developers identify and mitigate dual-use risks in coding assistants

Cited by 1 page

Page	Type	Quality
Autonomous Coding	Capability	63.0

Cached Content Preview

HTTP 200Fetched Mar 20, 20261 KB

[Skip to content](https://github.com/facebookresearch/CyberSecEval#start-of-content)

You signed in with another tab or window. [Reload](https://github.com/facebookresearch/CyberSecEval) to refresh your session.You signed out in another tab or window. [Reload](https://github.com/facebookresearch/CyberSecEval) to refresh your session.You switched accounts on another tab or window. [Reload](https://github.com/facebookresearch/CyberSecEval) to refresh your session.Dismiss alert

{{ message }}

![](<Base64-Image-Removed>)

![404 “This is not the web page you are looking for”](<Base64-Image-Removed>)![](<Base64-Image-Removed>)![](<Base64-Image-Removed>)![](<Base64-Image-Removed>)![](<Base64-Image-Removed>)![](<Base64-Image-Removed>)![](<Base64-Image-Removed>)

Find code, projects, and people on GitHub:

Search

[Contact Support](https://support.github.com/?tags=dotcom-404) —
[GitHub Status](https://githubstatus.com/) —
[@githubstatus](https://x.com/githubstatus)

You can’t perform that action at this time.

Resource ID: 9d6f51d4b8105682 | Stable ID: sid_bn742qePGd