Microsoft is expanding Microsoft 365 Copilot with two agent-based features designed to move office work from single prompts toward longer, more conversational task flows. The company calls the approach "Vibe Working," a label that applies the back-and-forth style of "vibe coding" to spreadsheets, documents, presentations, and chat-based productivity work.
The shift matters because these tools are not limited to drafting a paragraph or suggesting a formula. Agent Mode and Office Agent are built to plan work, take actions, check outputs, and continue across multiple steps. Microsoft is applying models from OpenAI and Anthropic in different parts of the system, with each feature aimed at a separate kind of workplace task.
What Microsoft means by Vibe Working
"Vibe Working" describes a more interactive way to use Microsoft 365 Copilot. Instead of treating AI as a one-shot assistant, the user works with the system through a continuing dialog. The article connects this idea to "vibe coding," where developers and AI assistants collaborate through repeated instructions, edits, and checks.
That comparison is useful, but it also points to the main risk. In software development, some developers warn that this workflow can create mistakes that are difficult to detect. Microsoft is now adapting the same pattern to office work, where hidden spreadsheet errors, weak document structure, or inconsistent presentation design can also be hard to notice at first glance.
The new setup has two main parts:
- Agent Mode, which adds AI agent functions directly inside Office apps such as Excel and Word.
- Office Agent, which runs as a separate multi-agent system inside Copilot chat.
Together, they show Microsoft moving Copilot beyond basic assistance and toward systems that can take on larger office workflows with less step-by-step user input.
Agent Mode brings autonomous work into Excel and Word
Agent Mode is integrated into Excel and Word, with PowerPoint support on the way. In Excel, the feature uses OpenAI's latest reasoning models and follows a loop that plans, acts, and checks its work while processing a spreadsheet.
The Excel agent can analyze data, build financial models, and generate charts. It can summarize workbook context, plan the next steps, run code, and review results. To understand the workbook, a "Document Context Producer" creates a compact blueprint that includes layout, values, objects, and formula dependencies. If the agent needs more detail, it can request it.
Microsoft’s argument is that spreadsheet mistakes can be especially dangerous because a formula may appear to function while still hiding a flawed assumption or broken dependency. Agent Mode addresses this by running lightweight tests before each action. The calculations happen directly in the spreadsheet, which means users can follow the process, inspect formulas, and verify the result instead of receiving only a final answer.
The source article also gives benchmark figures for Excel. In the SpreadsheetBench benchmark, Agent Mode in Excel reached 57.2 percent accuracy across 912 tasks. Human testers scored 71.3 percent. Shortcut.ai, a third-party AI Excel add-on, reached 46.6 percent, similar to ChatGPT Agent.
In Word, Agent Mode powers "Vibe Writing." The feature lets users draft, refine, and query documents in a more interactive workflow. The agent can update reports, format text, and summarize content from emails and other sources. The source notes that details on the Word integration are limited, but says it likely draws on earlier work with a Word-focused "Large Action Model."
Office Agent uses multiple agents inside Copilot chat
Office Agent is a different system. It is powered by Anthropic's Claude models and runs as a standalone experience in Copilot chat. Instead of being embedded directly inside one Office app, it uses a central planner that coordinates specialized agents for code, finance, search, and other roles.
This structure is aimed at larger outputs. Office Agent can create PowerPoint presentations and Word documents from scratch. It can run its own web searches, analyze sources, and organize the final result. The source article says it can assemble business plans, trend reports, or topic-specific documents without user input.
The system likely builds on Anthropic's new capability for Claude to create and edit office files. That connection is important because Office Agent is not only answering questions; it is producing complete office documents and presentations as deliverables.
Microsoft also uses a design-oriented process called "Button-Driven Development" (TDD). Instead of only generating code, which the source says can lead to messy layouts, the system relies on reusable "flavor blueprints" drawn from high-quality content. Its auto-theming tool analyzes content and produces matching designs, going beyond static templates.
For evaluation, Microsoft says Office Agent scored 88.7 percent at Level 1, 76.7 percent at Level 2, and 60 percent at Level 3 on the GAIA agent intelligence benchmark. The source also says competitors like Manus are close. Microsoft additionally uses TDDEval, which checks content quality and a "Taste Score" for design.
Availability and what to watch
Agent Mode and Office Agent are available through the Frontier program for Microsoft 365 Copilot users and Personal or Family subscribers. Agent Mode for Excel is available on the web, with desktop support coming. Office Agent is available for US-based Personal and Family subscribers and can generate PowerPoint and Word documents, with Excel support in development.
Microsoft plans to expand "Vibe Working" to more Copilot features, but it has not shared a timeline. For now, the rollout shows a clear direction: Copilot is becoming less of a passive assistant and more of a system that can coordinate work across files, formats, and task types.
The practical question is how much trust users should place in these agents. Microsoft is adding checks, benchmarks, workbook visibility, and design evaluation, but the source also makes clear that accuracy still varies by task and system. The strongest use case may be workflows where users can review each step, inspect the source material, and treat the agent’s work as a structured draft rather than an unquestioned final product.