How to Evaluate an AI Tool Without Getting Sold To
AI demos are designed to impress. Your job is to find out whether it actually works on your problem, with your data, in your business.
The pressure to adopt AI is real. Every supplier is adding it to their product. Every trade publication is covering it. And business owners who haven't done anything yet are starting to wonder whether they're falling behind. That pressure is exactly what makes AI a difficult thing to evaluate clearly — because the emotional pressure to decide is running ahead of the evidence that it works.
Start With the Problem, Not the Tool
The first question in any AI evaluation is not "what does this tool do?" It's "what specific problem in my business am I trying to solve, and how much is that problem currently costing me?"
If you can't answer that question clearly before you book the demo, you're not ready to evaluate the tool. You're ready to be sold to.
"AI to improve my operations" is not a problem statement. "We spend 12 hours per week manually reconciling purchase orders against supplier invoices" is a problem statement. The difference matters because the second one gives you a way to assess whether the tool actually helps — and whether the help is worth the cost.
What a Demo Tells You (and Doesn't)
A vendor demo tells you what the tool can do under ideal conditions, with curated data, presented by someone who knows the system extremely well. It's designed to be impressive. It usually is.
A demo does not tell you:
- Whether the tool works with your actual data in its current state
- How long implementation will take and how much disruption it will cause
- How your staff will interact with it when the vendor isn't in the room
- What happens when the AI makes a mistake — who notices, who corrects it, and who is accountable
- What the commercial relationship looks like in three years when you're dependent on the tool
All of those questions need answers before you make a decision.
The Evaluation Questions That Matter
Bring these questions to every AI vendor conversation. Not to be awkward — to find out whether the tool actually works for your situation.
On the problem fit:
- Can I see this tool working with a data set that looks like mine — specifically the format, volume, and inconsistencies I have?
- What does the tool do when the input data is incomplete, inconsistent, or formatted differently from what it expects?
- Can you show me a customer in the same sector who has used this for more than 12 months?
On error handling:
- When the AI gets something wrong, how does a user know?
- What is the process for correcting an AI error, and how long does it take?
- Is there an audit trail for AI decisions that a human can review?
On data and commercial terms:
- Who owns the data I put into this system?
- Is my data used to train models that benefit other customers?
- What does off-boarding look like if I decide to leave? Can I export everything?
- What happens to my data when I cancel?
Defensive or vague responses to these questions are information. A vendor that can't explain what happens when the AI is wrong is a vendor that hasn't thought through the operational implications of their tool.
The Trial: What to Test
If a vendor offers a trial, use it properly. Don't use the sample data they provide — use your own. The most informative test is giving the tool the messiest, most realistic version of your actual data and seeing what happens.
The things to test:
- Does it actually solve the problem you defined? Go back to the specific use case. Did it work?
- Can your least technical team member use it? Not your most technical. The person who will actually be using it day to day.
- How many errors did it make on your real data? Count them. What was the error rate? Is that acceptable for the use case?
- What does the setup actually require? How clean does your data need to be? What integrations are needed?
When AI Isn't the Right Answer
AI amplifies what's already there. If your underlying processes are clear, documented, and consistent, AI can make them faster and reduce manual effort. If they're not — if the process is different depending on who does it, if your data is inconsistent, if nobody has mapped what happens between order and despatch — AI will automate the chaos rather than fix it.
Most decorated goods businesses I work with aren't ready for AI yet. Not because they're too small or too unsophisticated, but because the process layer AI needs to work with isn't in place. The right question before adopting AI is: "are my processes clean enough for a tool to act on them reliably?" If the answer is no, process improvement comes first.
That's not a popular thing to say when everyone is talking about AI. But it's the honest answer, and it saves a lot of money on tools that don't deliver.
Common Questions
How do you evaluate whether an AI tool is right for your business?
Start with the problem, not the tool. Define what the AI is supposed to solve and measure how much it's currently costing you. Then ask whether the tool actually solves it with your real data. A demo is not an evaluation.
What questions should you ask an AI vendor?
What happens when the AI is wrong and who is responsible? Can you speak to a reference customer in my sector who has been using it for 12 months? Who owns my data? What does off-boarding look like? Vague or defensive answers tell you something.
Is AI right for most small decoration businesses?
Most aren't ready yet — not because of size, but because the processes AI needs to work with aren't clean enough. AI amplifies what's already there. Fix the processes first.
Before you evaluate AI, evaluate your processes. The second task is harder — and more important.
A Clarity Audit maps your current operations, identifies what's clear and what isn't, and tells you honestly whether your business is ready for the tools you're being pitched — or whether there's groundwork to do first.
See Clarity