The Missing Layer: Why Great AI Tools Still Leave Work Undone
By Polly Barnfield, OBE, CEO of Maybe*
There are more AI tools than anyone can count. And most of them are genuinely useful for something.
But there's a question worth asking before you commit to any of them: does it answer, or does it act?
Most AI tools, including some very well-known and well-funded ones, answer. They generate text, search the web, write code, summarise documents. They're good at it. Often very good.
Maybe* does something different. It completes tasks. End to end. Inside the tools your business already runs on. And it does it using the best available AI models, including the ones described in this post, so you're not choosing between capability and execution. You get both.
This post explains the difference clearly, without putting anyone down, because the tools below are all worth knowing about. The question is what job you're hiring them for.
The tools and what they're actually for
ChatGPT (and large language models generally) The world's best question-answering machine
What it does well
Drafts content, emails, proposals, summaries at high speed
Answers complex questions with impressive depth
Generates ideas, frameworks, and first drafts
Works well for individuals handling their own tasks
What it doesn't do
Doesn't connect to your CRM, Slack, Teams or business data by default
Doesn't take the next step, it stops when the answer ends
Doesn't run on a schedule or trigger from business events
Every session starts from scratch, no business context carried forward
The output lands in a chat window, not in your workflow
The honest summary ChatGPT is a brilliant thinking partner and drafting tool. But you are the one who has to take what it produces and do something with it. The work doesn't complete itself; you complete it, with ChatGPT's help.
Claude (by Anthropic) A thoughtful, capable AI model, excellent at reasoning and writing
What it does well
Strong at nuanced writing, long documents, and careful reasoning
Handles sensitive topics and complex briefs with more care than most
Claude Projects lets you give it context that persists across conversations
Excellent at synthesis, pulling themes from large amounts of information
What it doesn't do
Like ChatGPT, it's a conversational model that responds; it doesn't act
Doesn't connect to all of your business tools or trigger workflows
Doesn't update your CRM, send your emails, or schedule follow-ups
Doesn't run tasks on a schedule or manage multi-step processes independently
Output stays in the conversation; getting it into your workflow is still your job
The honest summary Claude is one of the most capable AI models available. It's an excellent place to think, draft, and reason. But the gap between a well-reasoned answer and completed business work is still yours to close.
Worth knowing about: Claude Cowork and Claude Code Anthropic have been closing that gap with two products worth naming. Claude Cowork is a desktop agent that automates file and task management for non-developers, taking Claude meaningfully closer to execution. Claude Code is a terminal-based agent that autonomously works across your codebase, a step beyond autocomplete for developer teams who want AI that acts rather than just suggests. Neither replaces Maybe* for business execution, but both are worth understanding if you're mapping the Claude ecosystem.
→ Read our full breakdown of Claude Cowork
→ Read our full breakdown of Claude Code
GitHub Copilot (and code-focused AI tools) AI built specifically for software developers
What it does well
Autocompletes and generates code inside development environments
Speeds up repetitive coding tasks significantly
Understands the context of a codebase and suggests relevant completions
Useful for technical teams building and maintaining software
What it doesn't do
Built for developers, not designed for business or operational users
Doesn't handle non-technical business tasks like follow-ups or reporting
Doesn't connect to CRM, project management, or communication tools
Completing a task still requires a developer to implement and deploy
Not relevant to most day-to-day business workflow automation
The honest summary Copilot is the right tool if your job is writing code. If your job is running a business or a team, it's solving a different problem.
Perplexity AI-powered research and search
What it does well
Searches the web and synthesises answers with cited sources
Faster than a traditional search engine for research questions
Good at summarising what's known about a topic in real time
Useful for market research, fact-checking, and staying current
What it doesn't do
Research tool only, it finds and summarises, it doesn't act
No connection to your business tools, data, or workflows
Doesn't draft emails, update records, or schedule anything
Output is a summary; what you do with it is still up to you
Doesn't run tasks, manage processes, or trigger business actions
The honest summary, Perplexity is genuinely good at answering "what is happening in the world right now?" It's a research tool. The moment you need to do something with that research, follow up, report, or act, you're on your own.
Worth knowing about: Perplexity Compute is a separate product and a different kind of conversation. It's infrastructure for teams running AI workloads and evaluating where to run models at scale. If that's your question, it's relevant. If your question is how to get business work done more consistently, it's solving something else entirely.
→ Read our full breakdown of Perplexity Compute
So what does Maybe* actually do?
Maybe* is not a smarter chatbot. It is not a search engine. It is not a writing tool.
Maybe* is an AI execution layer that runs on the world's leading AI models, including Claude, GPT-5, and others. It takes instructions from you, from triggers, and from scheduled events, and completes real business work inside the tools your team already uses.
Tell Maybe* what needs doing. It completes the work.
That means the work doesn't stop at a chat window. It lands in your CRM, your inbox, your Slack, your project board, finished, assigned, tracked.
Here's what that looks like in practice
You say: "@Maybe Follow up on this lead"
Maybe* handles:
Tailored follow-up email drafted and ready to send
CRM contact record updated
Next step scheduled
You say: "Summarise this meeting and assign actions"
Maybe* handles:
Clear summary created
Actions assigned to the right people
Follow-up drafted
Project record updated
You say: "@Maybe What needs attention this week?"
Maybe* handles:
Open tasks and overdue actions surfaced
Risks and blockers flagged
Weekly view delivered, without you having to go looking
None of these tasks stop at a chat window. None of them require you to copy the output somewhere else. The work is done.
ChatGPT answers your question. Maybe* completes your work.
How they compare at a glance
This is a direct comparison across the capabilities that matter most for business execution, not general AI capability, which most of these tools have in abundance.
| Capability | ChatGPT / LLMs | Claude | GitHub Copilot | Perplexity | Maybe* |
|---|---|---|---|---|---|
| Completes multi-step tasks end to end | No | No | No | No | Yes |
| Connects to your business tools (CRM, Slack, Teams) | No | No | Dev only | No | Yes |
| Updates CRM, drafts emails, schedules follow-ups automatically | No | No | No | No | Yes |
| Runs scheduled, repeating tasks without prompting | No | No | No | No | Yes |
| Works inside your existing tools (no new interface to learn) | No | Projects only | No | No | Yes |
| Governed, auditable output with clear rules | No | No | No | No | Yes |
| Built for non-technical business users | Yes | Yes | No | Yes | Yes |
| Generates answers and content | Yes | Yes | Yes | Yes | Yes, via leading models |
| Searches the web | No | No | No | Yes | Via integrations |
| Runs on leading AI models (Claude, GPT-4, and others) | N/A | N/A | N/A | N/A | Yes |
This is a direct comparison across the capabilities that matter most for business execution, not general AI capability, which most of these tools have in abundance.Note: This comparison reflects standard, out-of-the-box capability. Developer integrations and custom configurations can extend some tools beyond their defaults. Claude Cowork, Claude Code, and Perplexity Compute are distinct products from their parent platforms and are covered separately in the linked posts above. Maybe* runs on leading AI models, including Claude and GPT-4, combining their capabilities with an execution layer that connects to your business tools and completes work end to end.
Use the right tool for the right job
None of the tools in this post is bad. Most of them are remarkable. The question isn't which is best; it's what job you need done.
Use ChatGPT or Claude when:
You need a first draft, a framework, or a thought partner
You're researching, writing, or reasoning through a problem
You want to move faster on creative or analytical work
Use Perplexity when:
You need current, cited information from the web
You're researching a topic, a market, or a competitor
You want search results that come back synthesised, not listed
Use GitHub Copilot or Claude Code when:
Your team writes code, and you want it to go faster
You're building internal tools or software products
You want AI that works across your codebase, not just suggests completions
Use Claude Cowork when:
You want Claude doing more than answering
You need file and task management to be automated for non-technical users
You're exploring desktop AI agents for knowledge workers
Use Maybe* when:
You need to work to complete, not just start
Tasks fall through the cracks or get picked up inconsistently
Your team spends time on admin that should happen automatically
You want AI to work inside your existing tools, not alongside them
You need one instruction to turn into a finished, tracked outcome
You want access to the best AI models without having to manage them yourself
The other tools on this list help you think and draft. Maybe* gets the work done. And it uses those same models to do it.
Start with one task
You don't need to replace anything in your current stack. You just need one task that takes too long, gets done inconsistently, or falls through the cracks. Tell Maybe* to handle it. See what changes.
Read the research behind the product
1,500+ interviews. The unfiltered truth of how AI is being used in business and where the gap between adoption and execution really sits.
[→ Read The Big AI Secret