Introducing SimpleQA
SimpleQA is a factuality benchmark that measures the ability of language models to answer short, fact-seeking questions.