9senses Chatbot Audit makes Gen-AI quality measurable
A structured evaluation process for AI chatbots makes it possible for the first time to benchmark generative AI systems -independently, cost-effectively, and quickly
More and more companies are using AI chatbots - but hardly any of them know how well they actually perform. 9senses, the newly founded consulting firm specializing in value-creating AI with Swiss roots, launches its first product in March 2026: the 9senses Chatbot Audit. For the first time, this will enable the performance of generative AI systems to be evaluated and certified using a standardized, vendor-neutral process.
What matters is whether a chatbot understands user concerns, provides meaningful answers, conducts conversations in a structured manner - and helps users achieve their goals. This is also where its contribution to business success lies: representing the company positively around the clock, solving problems cost-effectively, and relieving the burden on other channels. The first audits conducted clearly show that this goal is often missed.
Four Dimensions, One Objective Score
The Level 1 Audit is the first stage in the 9senses Audit Framework and evaluates chatbots from the user’s perspective in real-world use - without system access. It analyzes four weighted dimensions:

- Response Quality:
Relevance, Accuracy, and Completeness - Speed:
Response Times Under Real-World Conditions - User Interface:
Design and Usability - Conversation Quality:
Structure, Flow, and Focus on Resolution
In addition, the audit evaluates the estimated business value and provides initial insights into compliance, ethics, and - if desired - multilingual capabilities.
If serious weaknesses are identified in a Level 1 audit, an in-depth Level 2 audit allows for a more detailed analysis of all aspects of a generative AI system using an open-box procedure in close collaboration with the client.
Availability and Pricing
The 9senses Chatbot Audit (Level 1) is now available for booking directly at www.9senses.ai/chatbot-audit, starting at 599 EUR; various options can be configured. Delivery takes just five business days. This makes the Chatbot Audit a cost-effective and readily available tool for evaluating - even repeatedly - a generative AI solution used in customer interactions. Level 2 audits are available only upon request and are tailored individually to customer needs.
Talk to Eliza
This is a faithful representation of the 1966 Eliza version created by Joseph Weizenbaum. It was reproduced by Anthony Hay in C++ based on the original 1965 code and updated by behavior transcripts of the final version.
Note: The paper version emulates Joseph Weizenbaum's original 1966 ELIZA as it ran on the CTSS time-sharing system (IBM 7094) at MIT, accessed via an IBM Selectric-based hardcopy terminal. On CTSS the question mark served as the line-delete (line-kill) control character, so it could not appear in typed input — and the DOCTOR script accordingly produced no question marks. They are therefore suppressed here, on both sides of the conversation. The green "terminal" version enables question marks instead; it represents a glowing CRT display of a kind that did not exist for ELIZA in 1966 and evokes a later era of computing.
Play Chess like 1997 (Deep Blue Style)
Here's our simulation of Deep Blue. You can play against Stockfish (able to run on a laptop today with similar strength compared to Deep Blue). Bonus: you can replay the legendary 1997 rematch where Deep Blue won against Garry Kasparov.
New AI consulting firm focuses on measurable AI results and establishes quality standards…