TruthfulQA

Safety

A benchmark of 817 questions designed to test whether language models generate truthful answers, specifically targeting common misconceptions and falsehoods that models tend to reproduce.

Models Tested

Best Score

Median Score

Scoring: accuracy

Introduced: 2021-09

Maintainer: Oxford

Leaderboard (1 model)

#	Model	Developer	Score
🥇	GPT-3.5 Turbo	OpenAI	47