Jan 9, 2025

Zhipu AI and Gemini Models Have The Lowest Hallucination Rates 🤖

What We're Showing

The top 15 AI large language models with the lowest hallucination rates.

The hallucination rate is the frequency that an LLM generates false or unsupported information in its outputs.

The data comes from Vectara and is updated as of Dec. 11, 2024. Hallucination rates were calculated by summarizing 1000 short documents with each LLM and using a model to detect hallucinations, yielding a percentage of factually inconsistent summaries.

Key Takeaways

Smaller or more specialized models, such as Zhipu AI GLM-4-9B-Chat, OpenAI-o1-mini, and OpenAI-4o-mini have some of the lowest hallucination rates among all models
In terms of foundational models, Google's Gemini 2.0 slightly outperforms OpenAI GPT-4 with a hallucination rate difference of just 0.2%.
However overall, several variants of GPT-4 (e.g., Turbo, Mini, Standard) fall within the 1.5%–1.8% range, highlighting a strong focus on accuracy across different tiers of the same architecture.

Where Data Tells the Story

Home

Support

Creators

Legal

Zhipu AI and Gemini Models Have The Lowest Hallucination Rates 🤖

What We're Showing

Key Takeaways