1 evidence check
Last checked: 4/3/2026
The claim states that 95% AI R&D automation corresponds to AI systems achieving a 14-year "time horizon" on METR's coding task suite, and that the benchmark measures the length of tasks an AI can complete with 80% reliability. The source does not mention the 95% figure, the 14-year time horizon, or the 80% reliability threshold.
Evidence — 1 source, 1 check