Jan 9, 2025
Zhipu AI and Gemini Models Have The Lowest Hallucination Rates 🤖
What We're Showing
The top 15 AI large language models with the lowest hallucination rates.
The hallucination rate is the frequency with which an LLM generates false or unsupported information in its outputs.
The data comes from Vectara and is current as of Dec. 11, 2024. Hallucination rates were calculated by having each LLM summarize 1,000 short documents and then using a model to detect hallucinations, yielding the percentage of summaries that were factually inconsistent.
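As a rough illustration of the arithmetic behind that percentage, here is a minimal Python sketch. It assumes each summary has already been judged consistent or inconsistent by a detection model; the function name `hallucination_rate` and the sample labels are hypothetical and are not Vectara's code or data.

```python
from typing import Sequence


def hallucination_rate(is_consistent: Sequence[bool]) -> float:
    """Return the percentage of summaries judged factually inconsistent.

    `is_consistent` holds one verdict per summary (True = consistent).
    The verdicts are assumed to come from a separate detection model.
    """
    if not is_consistent:
        raise ValueError("need at least one judged summary")
    inconsistent = sum(1 for ok in is_consistent if not ok)
    return 100.0 * inconsistent / len(is_consistent)


# Made-up verdicts for illustration only: 1,000 summaries, 13 flagged.
labels = [True] * 987 + [False] * 13
print(f"{hallucination_rate(labels):.1f}%")
```

With 1,000 documents, each flagged summary moves the rate by 0.1 percentage points, so small gaps between models correspond to only a handful of summaries.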
Key Takeaways
- Smaller or more specialized models, such as Zhipu AI GLM-4-9B-Chat, OpenAI-o1-mini, and OpenAI-4o-mini, have some of the lowest hallucination rates among all models.
- Among foundation models, Google's Gemini 2.0 slightly outperforms OpenAI GPT-4, with a difference in hallucination rate of just 0.2 percentage points.
- Overall, however, several variants of GPT-4 (e.g., Turbo, Mini, Standard) fall within the 1.5%–1.8% range, which suggests a strong focus on accuracy across different tiers of the same architecture.