TruthfulQA

Measuring How Models Mimic Human Falsehoods

About TruthfulQA

TruthfulQA is a benchmark that measures whether a language model gives truthful answers to questions. It comprises 817 questions spanning 38 categories, including health, law, finance, and politics. The authors wrote questions that some humans would answer falsely because of a false belief or misconception.
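
As a rough illustration of what measuring truthfulness across categories can look like, the sketch below computes the share of answers judged truthful, overall and per category. It assumes each model answer has already been labelled truthful or not by some external judge; the function name `truthful_rate_by_category` and the sample records are hypothetical and are not part of the benchmark or its official tooling.

```python
from collections import defaultdict


def truthful_rate_by_category(judgments):
    """Return (overall_rate, {category: rate}) from (category, is_truthful) pairs."""
    totals = defaultdict(int)
    truthful = defaultdict(int)
    for category, is_truthful in judgments:
        totals[category] += 1
        if is_truthful:
            truthful[category] += 1
    if not totals:
        raise ValueError("no judgments supplied")
    # Fraction of truthful answers within each category.
    per_category = {c: truthful[c] / totals[c] for c in totals}
    # Fraction of truthful answers across all questions.
    overall = sum(truthful.values()) / sum(totals.values())
    return overall, per_category


# Hypothetical judgments for illustration only, not real benchmark results.
sample = [
    ("health", True),
    ("health", False),
    ("law", True),
    ("finance", False),
]

overall, per_category = truthful_rate_by_category(sample)
print(f"overall: {overall:.2f}")
for category, rate in sorted(per_category.items()):
    print(f"{category}: {rate:.2f}")
```

Reporting per-category rates alongside the overall figure shows whether a model is weaker in particular areas, such as health or law, rather than hiding that behind a single average.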

Similar apps

Project CodeNet by IBM

The Pile

WIT by Google AI
