Benchmark

Claude Code is 4.2x faster & 3.2x cheaper with CustomGPT.ai plugin. See the report →

CustomGPT.ai Blog

RAG Benchmark: CustomGPT.ai Outperforms OpenAI in Latest RAG Benchmark

These are exciting developments at CustomGPT.ai!

We’re ecstatic to share our recent news that we’ve beaten OpenAI and other industry giants in the latest Retrieval-Augmented Generation (RAG) benchmarks from Tonic.ai. We’re here to share what this means and how setting new standards for accuracy and answer quality is having an effect on the industry as a whole. Let’s just dive straight into what’s new and groundbreaking.

image 29

CustomGPT.ai Sets New Standard for Answer Accuracy

The benchmark from Tonic.ai measures answer accuracy, assessing systems on their ability to retrieve and generate accurate, quality answers from an established set of documents.

“CustomGPT.ai is the clear winner in this RAG face-off,” shares Adam Kamor, PhD, regarding how CustomGPT.ai works.

benchmark

Source: Tonic

Kamor says, “both systems perform admirably with generally high scores.” But adds:

“CustomGPT.ai wins on a few fronts in accuracy against OpenAI. First, its aggregates are better, with a mean score of 4.4 vs OpenAI’s score of 3.5. Additionally, CustomGPT.ai only provided 6 answers with a score below 4, which is really fantastic and generally performs better than most systems we have reviewed in the past. 

Of note, CustomGPT.ai’s median score was a 5, which is not something seen before by the RAG assistants we’ve evaluated.”

CustomGPT.ai’s “secret sauce,” he notes, is its:

“Proprietary embedding and retrieval models and prompt engineering strategies, which you get out-of-the-box with little configuration needed.”

Tonic has now published its RAG Evaluation Leaderboard with CustomGPT.ai as the clear leader ahead of OpenAI Assistants, Google’s Vertex Search and Conversation, Amazon Titan, and Cohere.

Alden do Rosario, CustomGPT.ai CEO and Founder, says:

 “This achievement validates our mission to democratize generative AI by empowering organizations of all sizes and of all technical acumen. Thank you, Tonic, for advancing responsible AI by proving AI answer accuracy.”

Read the full press release: CustomGPT.ai Outperforms OpenAI and Sets New Industry Standard for Answer Accuracy in Industry RAG Benchmark

Experience Results, Not Just Benchmarks

In the end, it’s truly about results. So while we’re ecstatic about leading the pack in terms of accuracy, we’re most excited to help business achieve results! Sign up now and try it for yourself!

Curious to learn more about the benchmark? Watch this highly engaging video to hear why this benchmark matters, from Alden Do Rosario, CEO of CustomGPT.ai.

Frequently Asked Questions

What did the latest RAG benchmark report about CustomGPT.ai vs OpenAI?

The benchmark published by Tonic.ai reported stronger aggregate accuracy for CustomGPT.ai than OpenAI in this comparison. The published figures include a mean score of 4.4 for CustomGPT.ai versus 3.5 for OpenAI, with a reported median score of 5 for CustomGPT.ai.

Which accuracy metrics were highlighted in the benchmark results?

The results highlighted three core indicators: aggregate mean score, median score, and the number of low-scoring answers. The published summary states CustomGPT.ai had a mean of 4.4, a median of 5, and only 6 answers scoring below 4.

What does this RAG benchmark evaluate?

According to the published description, the benchmark evaluates answer accuracy by testing how well systems retrieve and generate accurate, high-quality answers from an established document set.

Related Resources

These resources expand on the benchmark findings and the core ideas behind accurate RAG systems.

  • Anti-Hallucination Benchmark Results — See how independent benchmark results validate CustomGPT.ai’s performance in reducing hallucinations.
  • Business AI Answer Accuracy — Get clear answers to common questions about what drives reliable AI responses in business settings.
  • What Is RAG AI — Learn how retrieval-augmented generation works and why it matters for factual, source-grounded outputs.

3x productivity.
Cut costs in half.

Launch a custom AI agent in minutes.

Instantly access all your data.
Automate customer service.
Streamline employee training.
Accelerate research.
Gain customer insights.

Try 100% free. Cancel anytime.