Experts from Massachusetts Institute of Technology (MIT), CustomGPT.ai, and AI answer accuracy evaluator Tonic.ai will discuss topics highly relevant to any business hoping to leverage the benefits of generative AI in this upcoming webinar. Read on to learn why your attendance could be vital to ensure AI answer accuracy and prevent damaging hallucinations.
On April 10th, Alden Do Rosario – CEO and Founder of CustomGPT.ai, Adam Kamor – Head of Engineering and Co-Founder of Tonic.ai and Doug Williams of the Martin Trust Center for MIT Entrepreneurship will discuss how AI hallucinations can erode trust, mislead customers and compromise business integrity. The free webinar will be organized and monitored by Bret Kinsella, CEO and Founder of VoiceBot.ai and center on:
– Measuring answer accuracy
– Dealing with hallucinations
– Importance of guardrails
– Real world case studies
Why AI Answer Accuracy is Critical?
If you’re using generative AI to serve your customers it’s critical that this interface, often a chatbot, delivers error free answers and insights. If it doesn’t, you risk impacting your brand’s reputation and reliability and even your bottom line profits.
Earlier this year, Air Canada lost a small claims court case after a grieving passenger said they were misled by a chatbot on the airline’s rules around bereavement fares. The chatbot “hallucinated,” providing an answer that wasn’t inline with airline policy that was actually delivered coupled with a link to the correct policy page. The passenger was awarded $812.02 in damages and court fees. The impact to Air Canada’s reputation, with such a topical issue going viral globally, is of course far more severe than that financial penalty.
Air Canada isn’t alone amongst big brands suffering for their chatbots errors. In January it emerged that parcel firm DPD’s “DPD Chat” had devolved into swearing, rudimentary jokes and producing a poem “about a useless chatbot for a parcel delivery firm.” In December 2023, A Chevrolet dealership’s AI chatbot also “went rogue” offering to sell a 2024 Chevrolet Tahoe for $1 and adding “That’s a legally binding offer—no takesie backsies.”
RAG Technology as the Answer to Accuracy
Retrieval Augmented Generation (RAG) technology addresses some of the limitations of using foundational large language models (LLMs) such as OpenAI’s ChatGPT in creating chatbots for business. LLMs on their own can rely purely on the data they were trained on to deliver answers.
RAG allows chatbot builders to leverage the strengths of generative AI models but also to only use external, or specific, knowledge sources such as their own company data. The technology enables generative AI chatbots to deliver more precise and contextually relevant answers. These bots can often be configured either to use company data in conjunction with pre-trained foundational databases or to only use company data, adding important guardrails to AI’s answers.
RAG can reduce the likelihood of a chatbot hallucinating or generating false information. CustomGPT.ai’s collaboration with the Martin Trust Center for MIT Entrepreneurship to produce the center’s ChatMTC is an informative illustration of an organization’s concerns and RAG as a solution.
CustomGPT.ai Outperforms OpenAI for Answer Accuracy in Tonic.ai’s RAG Benchmark
CustomGPT.ai came out ahead of OpenAI’s Assistants, Cohere, Google Vertex and Amazon Titan in Tonic.ai’s latest RAG evaluations, setting new standards for AI answer accuracy. The benchmark from Tonic.ai measures answer accuracy, assessing systems on their ability to retrieve and generate accurate, quality answers from an established set of documents.
“CustomGPT.ai is the clear winner in this RAG face-off,” says Adam Kamor, PhD.
Adam Kamor, as well as Doug Williams from MIT, join CustomGPT.ai’s Alden Do Rosario in next Wednesday’s (April 10th, 2024) webinar:
Register with Zoom for Beating OpenAI – LLM Accuracy & RAG Benchmark with CustomGPT.ai, Tonic and MIT
The experts will delve into the importance of answer accuracy and why benchmarks matter. Attendees can discover:
- Insights on Accuracy: Hear from Alden DoRosario, Founder, CEO of CustomGPT.ai as he discusses the recent benchmark and why answer quality is critical for businesses.
- In-depth Benchmark Insights: Learn about Tonic.ai’s rigorous benchmark methodology and approach for measuring accuracy with Adam Kamor, Co-Founder, Head of Engineering of Tonic.ai.
- Real-world Impact: Listen as MIT’s Doug Williams discusses how MIT leveraged CustomGPT.ai and selected it because of its highly accurate responses.
You can also join the pre-event discussion and meet the experts and attendees on LinkedIn.
Alternatively, discover CustomGPT.ai’s zero-code custom GPT chatbot solution.