9senses in the
News
9senses Chatbot Audit makes Gen-AI quality measurable

A structured evaluation process for AI chatbots makes it possible for the first time to benchmark generative AI systems -independently, cost-effectively, and quickly
More and more companies are using AI chatbots - but hardly any of them know how well they actually perform. 9senses, the newly founded consulting firm specializing in value-creating AI with Swiss roots, launches its first product in March 2026: the 9senses Chatbot Audit. For the first time, this will enable the performance of generative AI systems to be evaluated and certified using a standardized, vendor-neutral process.
What matters is whether a chatbot understands user concerns, provides meaningful answers, conducts conversations in a structured manner - and helps users achieve their goals. This is also where its contribution to business success lies: representing the company positively around the clock, solving problems cost-effectively, and relieving the burden on other channels. The first audits conducted clearly show that this goal is often missed.
Four Dimensions, One Objective Score
The Level 1 Audit is the first stage in the 9senses Audit Framework and evaluates chatbots from the user’s perspective in real-world use - without system access. It analyzes four weighted dimensions:

- Response Quality:
Relevance, Accuracy, and Completeness - Speed:
Response Times Under Real-World Conditions - User Interface:
Design and Usability - Conversation Quality:
Structure, Flow, and Focus on Resolution
In addition, the audit evaluates the estimated business value and provides initial insights into compliance, ethics, and - if desired - multilingual capabilities.
If serious weaknesses are identified in a Level 1 audit, an in-depth Level 2 audit allows for a more detailed analysis of all aspects of a generative AI system using an open-box procedure in close collaboration with the client.
Availability and Pricing
The 9senses Chatbot Audit (Level 1) is now available for booking directly at www.9senses.ai/chatbot-audit, starting at 599 EUR; various options can be configured. Delivery takes just five business days. This makes the Chatbot Audit a cost-effective and readily available tool for evaluating - even repeatedly - a generative AI solution used in customer interactions. Level 2 audits are available only upon request and are tailored individually to customer needs.
More News
A confident confabulator
AI hallucinations aren’t random — they cluster, systematically and predictably, in the topics you cannot independently verify.
Lost in Translation
Conversational AI is dominated by English, with serious consequences for other languages that are structural and can only be resolved with significant effort.
9senses Chatbot Audit makes Gen-AI quality measurable
A structured evaluation process for AI chatbots makes it possible for the first time to benchmark generative AI systems -independently, cost-effectively, and quickly
Can AI end humanity?
Renowned AI experts attribute a significant chance to AI that it could lead to the end of humanity. Is that a realistic prediction or doomsday talk? Short answer: it’s not AI that’s the problem, it’s us.
9senses: AI veterans to clear hype around AI
New AI consulting firm focuses on measurable AI results and
establishes quality standards for the responsible use of AI
Talk to Eliza
This is a faithful representation of the 1966 Eliza version created by Joseph Weizenbaum. It was reproduced by Anthony Hay in C++ based on the original 1965 code and updated by behavior transcripts of the final version.
Note: The paper version emulates Joseph Weizenbaum's original 1966 ELIZA as it ran on the CTSS time-sharing system (IBM 7094) at MIT, accessed via an IBM Selectric-based hardcopy terminal. On CTSS the question mark served as the line-delete (line-kill) control character, so it could not appear in typed input — and the DOCTOR script accordingly produced no question marks. They are therefore suppressed here, on both sides of the conversation. The green "terminal" version enables question marks instead; it represents a glowing CRT display of a kind that did not exist for ELIZA in 1966 and evokes a later era of computing.
Play Chess like 1997 (Deep Blue Style)
Here's our simulation of Deep Blue. You can play against Stockfish (able to run on a laptop today with similar strength compared to Deep Blue). Bonus: you can replay the legendary 1997 rematch where Deep Blue won against Garry Kasparov.
AI hallucinations aren't random — they cluster, systematically and predictably, in the topics you cannot independently verify.
Conversational AI is dominated by English, with serious consequences for other languages that are structural and can only be resolved with significant effort.
Renowned AI experts attribute a significant chance to AI that it could lead to the end of humanity. Is that a realistic prediction or doomsday talk? Short answer: it's not AI that's the problem, it's us.



