How to Evaluate an AI Tool Without Getting Sold To

Craig Blackman·2 June 2026·5 min read

The pressure to adopt AI is real. Every supplier is adding it to their product. Every trade publication is covering it. And business owners who haven't done anything yet are starting to wonder whether they're falling behind. That pressure is exactly what makes AI difficult to evaluate clearly: the urgency to decide is running ahead of the evidence that any particular tool works.

Start With the Problem, Not the Tool

The first question in any AI evaluation is not "what does this tool do?" It's "what specific problem in my business am I trying to solve, and how much is that problem currently costing me?"

If you can't answer that question clearly before you book the demo, you're not ready to evaluate the tool. You're ready to be sold to.

"AI to improve my operations" is not a problem statement. "We spend 12 hours per week manually reconciling purchase orders against supplier invoices" is a problem statement. The difference matters because the second one gives you a way to assess whether the tool actually helps — and whether the help is worth the cost.

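To make "how much is that problem currently costing me" concrete, here is a rough sketch of the arithmetic in Python. The 12 hours per week comes from the example above; the hourly cost, the 48 working weeks, the share of time saved, and the tool price are all made-up figures for illustration, so swap in your own.

```python
def annual_cost(hours_per_week: float, hourly_cost: float, weeks_per_year: int = 48) -> float:
    """Yearly cost of a manual task, in the same currency as hourly_cost."""
    return hours_per_week * hourly_cost * weeks_per_year


def net_annual_benefit(
    hours_per_week: float,
    hourly_cost: float,
    share_of_time_saved: float,
    tool_cost_per_year: float,
    weeks_per_year: int = 48,
) -> float:
    """Value of the time a tool saves in a year, minus what the tool costs."""
    saved = annual_cost(hours_per_week, hourly_cost, weeks_per_year) * share_of_time_saved
    return saved - tool_cost_per_year


# Hypothetical inputs: 12 hours/week at 20.00 per hour, 48 working weeks.
problem_cost = annual_cost(12, 20.0)

# Hypothetical tool: removes half the manual work and costs 3,000 per year.
benefit = net_annual_benefit(12, 20.0, share_of_time_saved=0.5, tool_cost_per_year=3000.0)

print(f"Problem costs about {problem_cost:,.0f} per year")
print(f"Net benefit of the tool: about {benefit:,.0f} per year")
```

The share of time saved is the number a vendor is most likely to overstate. Treat it as a guess until a trial on your own data gives you a measured figure.
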
What a Demo Tells You (and Doesn't)

A vendor demo tells you what the tool can do under ideal conditions, with curated data, presented by someone who knows the system extremely well. It's designed to be impressive. It usually is.

A demo does not tell you:

Whether the tool works with your actual data in its current state
How long implementation will take and how much disruption it will cause
How your staff will interact with it when the vendor isn't in the room
What happens when the AI makes a mistake — who notices, who corrects it, and who is accountable
What the commercial relationship looks like in three years when you're dependent on the tool

All of those questions need answers before you make a decision.

The Evaluation Questions That Matter

Bring these questions to every AI vendor conversation. Not to be awkward — to find out whether the tool actually works for your situation.

On the problem fit:

Can I see this tool working with a data set that looks like mine — specifically the format, volume, and inconsistencies I have?
What does the tool do when the input data is incomplete, inconsistent, or formatted differently from what it expects?
Can you show me a customer in the same sector who has used this for more than 12 months?

On error handling:

When the AI gets something wrong, how does a user know?
What is the process for correcting an AI error, and how long does it take?
Is there an audit trail for AI decisions that a human can review?

On data and commercial terms:

Who owns the data I put into this system?
Is my data used to train models that benefit other customers?
What does off-boarding look like if I decide to leave? Can I export everything?
What happens to my data when I cancel?

Defensive or vague responses to these questions are information. A vendor that can't explain what happens when the AI is wrong is a vendor that hasn't thought through the operational implications of their tool.

The Trial: What to Test

If a vendor offers a trial, use it properly. Don't use the sample data they provide — use your own. The most informative test is giving the tool the messiest, most realistic version of your actual data and seeing what happens.

The things to test:

Does it actually solve the problem you defined? Go back to the specific use case. Did it work?
Can your least technical team member use it? Not your most technical. The person who will actually be using it day to day.
How many errors did it make on your real data? Count them. What was the error rate? Is that acceptable for the use case?
What does the setup actually require? How clean does your data need to be? What integrations are needed?
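
Counting errors only helps if you turn the count into a rate and compare it with a limit you set before the trial started. A minimal sketch in Python, where the sample size, the error count, and the acceptable limit are hypothetical numbers rather than recommendations:

```python
def error_rate(errors: int, items_checked: int) -> float:
    """Share of checked items the tool got wrong, as a fraction between 0 and 1."""
    if items_checked <= 0:
        raise ValueError("items_checked must be greater than zero")
    if not 0 <= errors <= items_checked:
        raise ValueError("errors must be between 0 and items_checked")
    return errors / items_checked


def is_acceptable(errors: int, items_checked: int, max_rate: float) -> bool:
    """True if the measured error rate is at or below the limit you agreed in advance."""
    return error_rate(errors, items_checked) <= max_rate


# Hypothetical trial: 200 of your own invoices checked by hand, 10 found wrong.
rate = error_rate(errors=10, items_checked=200)
print(f"Error rate: {rate:.1%}")

# Hypothetical limit: no more than 2% of invoices may be wrong.
print("Acceptable:", is_acceptable(errors=10, items_checked=200, max_rate=0.02))
```

What counts as an acceptable limit depends on the use case: a wrong product description is an annoyance, a wrong invoice total is a cost. Decide the limit before you see the results, not after.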

When AI Isn't the Right Answer

AI amplifies what's already there. If your underlying processes are clear, documented, and consistent, AI can make them faster and reduce manual effort. If they're not — if the process is different depending on who does it, if your data is inconsistent, if nobody has mapped what happens between order and despatch — AI will automate the chaos rather than fix it.

Most decorated goods businesses I work with aren't ready for AI yet. Not because they're too small or too unsophisticated, but because the process layer AI needs to work with isn't in place. The right question before adopting AI is: "are my processes clean enough for a tool to act on them reliably?" If the answer is no, process improvement comes first.

That's not a popular thing to say when everyone is talking about AI. But it's the honest answer, and it saves a lot of money on tools that don't deliver.

Common Questions

How do you evaluate whether an AI tool is right for your business?

Start with the problem, not the tool. Define the problem the AI is supposed to solve and measure how much that problem is currently costing you. Then test whether the tool actually solves it with your real data. A demo is not an evaluation.

What questions should you ask an AI vendor?

What happens when the AI is wrong and who is responsible? Can you speak to a reference customer in my sector who has been using it for 12 months? Who owns my data? What does off-boarding look like? Vague or defensive answers tell you something.

Is AI right for most small decoration businesses?

Most aren't ready yet — not because of size, but because the processes AI needs to work with aren't clean enough. AI amplifies what's already there. Fix the processes first.
