OpenAI tested GPT-4.5 on the SimpleQA benchmark, a tool that evaluates the factual accuracy of AI models in answering short, ...