AI Workforce Track

Module 6 of 7

Tool of the Month: Practical AI in Your Workflow

Practical, category-by-category guidance on which AI tools to try in your week -- meetings, drafting, research, support -- with anti-hype framing and what to look for.

13 min -- Last updated 2026-05-25

There are too many AI tools, every week brings a new one, and the marketing language is largely interchangeable. This module is a category-first guide rather than a brand-first one: here are the four or five workflow categories where AI tools deliver real value for working professionals, what to look for in each category, two or three options worth trying, and the failure modes that the demos do not show. No category here requires a paid subscription to test honestly; all of them have free tiers good enough to evaluate.

The honest framing before you pick anything

A useful AI tool meets three criteria: it sits at a real bottleneck in your week (not a small annoyance); the output is verifiable in less time than producing it from scratch; and the failure modes are understood and survivable. A tool that satisfies all three is worth integrating. A tool that fails on any one of them is usually a waste of attention even if the demo is impressive.

A second filter: prefer tools that integrate with what you already use over tools that demand a new workflow. The integration tax of a separate dashboard, a new login, and a separate inbox is usually higher than the productivity gain. The MIT Sloan / BCG 2024 study on how people actually use generative AI found that adoption sticks where the tool sits inside existing surfaces (the email client, the document editor, the meeting platform) and falls off where it requires switching contexts.

Category 1: Meeting capture and follow-up

This is the highest-leverage category for most knowledge workers. AI meeting assistants record, transcribe, summarise, and surface action items from your video calls. The realistic time saving is 30-60 minutes per meeting-heavy day across status meetings, customer calls, and internal planning sessions. The verifiable failure mode is that summaries occasionally hallucinate action items that nobody actually agreed to; check action lists against your own notes before sending them on.

Options worth testing (free tiers exist for each): Otter.ai, Fireflies.ai, Microsoft Teams Premium's built-in copilot, Zoom AI Companion, Granola. Otter and Fireflies are stand-alone and work across platforms. Microsoft and Zoom integrate natively with the meeting platform you may already use. Granola is a more recent entrant focused on letting you take live notes while the AI fills in around them. Test two against a normal meeting day and pick the one whose summary you would actually send to a colleague.

Privacy note worth taking seriously: if the meeting includes confidential or personal data, your organisation may have policies about which tools can record. Check before testing on customer calls. In some EU/UK contexts, recording requires explicit consent from all participants.

Category 2: Drafting and editing

General-purpose chat assistants (Claude, ChatGPT, Gemini, Copilot) handle drafting tasks well across emails, documents, slide outlines, and structured plans. The realistic productivity claim is not 10x or 100x; it is closer to 30-50% on first-draft tasks where you have a clear brief and the output style is conventional. Tasks where you do not have a clear brief, or where the style is genuinely yours, see much less gain.

A practical pattern: use the assistant for the first draft only, then edit substantially. Drafts from these tools tend to be plausible but generic. The editing step is where your taste, context, and audience knowledge gets injected, and is what makes the output actually yours. Skipping the edit step is how AI-generated work develops the bland sameness that experienced readers can spot in under a paragraph.

Options: Claude (Anthropic) is widely considered strongest at long-document work, coding, and nuanced instruction-following. ChatGPT (OpenAI) has the largest plugin ecosystem and the most familiar UX. Gemini (Google) integrates natively with Google Workspace. Microsoft 365 Copilot is the obvious choice if your organisation runs Microsoft. Pick the one that integrates with your existing document workflow rather than the one with the best benchmark scores; the integration gain dominates the model gap at this point in the market.

Category 3: Research and synthesis

This category covers asking specific questions, summarising long documents, and pulling structured information from messy sources. The category has two distinct sub-tools: answer engines (live web-grounded answers with citations) and document-synthesis tools (you supply the sources, the tool summarises them).

For answer engines, Perplexity and ChatGPT Search are the main options; both cite their sources, which is non-negotiable for any research use case. Treat citations as a starting point for verification, not as a finishing point -- AI answer engines still occasionally cite real-looking sources that turn out to be lightly wrong or out of date.

For document synthesis, NotebookLM (Google) is a strong free option that takes your uploaded sources and answers questions only from those sources, with explicit citations back to the source document. The constraint -- it will not invent material from outside your sources -- is the feature, not a limitation, and it makes the tool genuinely useful for analysing reports, policy documents, or research papers. Claude with Projects performs a similar function with broader file support.

Worked example

A pricing manager needs to understand how five competitors have moved on pricing in the last six months. Approach: gather the five public pricing pages and any press releases, upload to NotebookLM, ask structured questions ("which competitors moved on enterprise tier pricing"). The tool answers with direct citation to each pricing page. The work that used to be three hours of manual reading is now 30 minutes of structured questioning, and the citation trail makes verification fast.

Category 4: Code and data automation

For non-developers, this category mostly means writing small scripts to automate repetitive tasks: extracting data from a spreadsheet into a different shape, scraping a known website on a schedule, reformatting a CSV. GitHub Copilot, Cursor, and Claude (via chat) all do this well and the threshold to use them is much lower than the equivalent skill of writing code from scratch. If your week has any meaningful share of "I do the same fiddly thing in Excel every Monday", this category has real value for you.

A grounded caveat: code that an AI writes works often enough to be useful and not often enough to be trusted blindly. Treat anything safety-critical, financial, or customer-facing as requiring the same review as code you wrote yourself. The tool removes the typing, not the thinking.

Category 5: Internal knowledge and search

This is the slow-burn category that delivers value six months in rather than next week. AI tools that index your organisation's internal documents (Glean, Notion AI, Microsoft Copilot for SharePoint, Slack AI search) make it possible to ask "what was the decision on X" or "who worked on Y" and get a usable answer. The deployment is usually decided at organisation level rather than by individual users, so this is mostly a question of using what is offered well rather than picking a tool yourself.

What to ignore

Three categories worth deprioritising at the working-professional level. (1) AI agents that promise full task autonomy: the failure modes are still too frequent for production work, and the time saved on the happy path is lost three times over when the agent goes wrong silently. (2) AI image and video generation, unless your role specifically needs visual output: the quality is impressive for demos and middling for actual marketing or design work. (3) AI tools that promise to "personalise outreach at scale": these mostly produce the spam that recipients can now spot, and the trust cost outweighs the time saving.

How to test before you commit

A four-week test cycle works well. Week one: use the free tier on real work, not demos. Week two: try the alternative leading option in the same category. Week three: pick the better one and integrate it fully into your workflow. Week four: review honestly -- is the time saved real, or has the tool added new busywork? Reject any tool that fails the week-four honesty test, even if the demo was impressive.

What to do this month

  • Pick one category from this module where you have a real bottleneck. Do not start in two categories.
  • Test two leading options on real work for one week each. Free tiers only.
  • Week three, integrate the winner into your workflow. Notice where it adds friction as well as where it saves time.
  • Week four, do the honest review. Keep the tool only if the answer to "did this save real time" is yes without qualification.

Next in this track

AI capability changes fast

One email per month when published benchmarks or labour-market research materially shifts the picture.

Monthly update on AI capability changes and what they mean for your role.