2025 in Agents: 10 Things That Actually Mattered

Welcome back!

Most year-end takes will tell you 2025 was “the year of agents.”

That’s lazy.

2025 was the year agents stopped being a concept and started being constrained systems that either worked — or didn’t.
No fireworks. No AGI moments. Just a lot of quiet, important shifts.

Here are the 10 things that actually shaped the agent ecosystem this year.

1) Agents Entered Production as Background Systems

Waymo deployed Google’s Gemini as an in-vehicle agent handling passenger interaction, cabin controls, and trip context — explicitly separated from driving logic. This wasn’t a demo; it shipped inside a live robotaxi fleet.

Why this matters:
Agents entered production without branding. They became invisible infrastructure.

Example: Waymo testing Gemini as an in-car agent

2) Narrow, Task-Bound Agents Outperformed General Ones

Klarna publicly reported that its agents handled specific commerce and support workflows rather than open-ended tasks, replacing hundreds of customer service roles while staying tightly scoped.

Why this matters:
The highest-ROI agents in 2025 were intentionally limited.

3) Agent Evaluation Shifted to Task-Level Benchmarks

Z.ai released GLM-4.7 with explicit agent-style benchmarks (BrowseComp, τ²-Bench, Code Arena) focused on tool use and multi-step task completion, not chat quality.

Why this matters:
Agents started being measured like systems, not conversations.
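
To make "measured like systems" concrete, here is a minimal sketch of task-level scoring. It is not how any of the benchmarks named above are implemented; the episode fields (`goal_state`, `final_state`, `tool_calls`, `max_tool_calls`) are hypothetical names chosen for illustration. The point is that the score depends on the end state and the steps taken, not on how good the reply text sounds.

```python
def task_success_rate(episodes):
    """Fraction of episodes that reach the goal state within the tool-call limit."""
    passed = 0
    for ep in episodes:
        reached_goal = ep["final_state"] == ep["goal_state"]
        within_limit = len(ep["tool_calls"]) <= ep["max_tool_calls"]
        if reached_goal and within_limit:
            passed += 1
    return passed / len(episodes) if episodes else 0.0


# Hypothetical episodes: one succeeds, one ends in the wrong state.
episodes = [
    {"goal_state": {"ticket": "closed"}, "final_state": {"ticket": "closed"},
     "tool_calls": ["search", "update"], "max_tool_calls": 5},
    {"goal_state": {"ticket": "closed"}, "final_state": {"ticket": "open"},
     "tool_calls": ["search"], "max_tool_calls": 5},
]
print(task_success_rate(episodes))  # 0.5
```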

4) Agent Marketplaces Became Enterprise Infrastructure

Oracle launched an enterprise marketplace for partner-built, certified AI agents embedded directly into Fusion Applications — with security, vetting, and support guarantees.

Why this matters:
Discovery moved from social proof to institutional distribution.

5) Tool Reliability Beat Raw Model Intelligence

Anthropic open-sourced Agent Skills, standardizing how agents interact with tools from partners like Atlassian and Figma — focusing on predictable execution, not smarter reasoning.

Why this matters:
Tool correctness became more valuable than clever prompts.

6) “Agentic” Was Redefined as Supervised Autonomy

Microsoft emphasized human-in-the-loop checkpoints, auditability, and bounded autonomy in Copilot Studio, moving away from fully autonomous positioning.

Why this matters:
The industry accepted that unchecked autonomy is a liability.
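
For a feel of what a human-in-the-loop checkpoint with an audit trail can look like, here is a rough sketch. It is not Copilot Studio's API; the `NEEDS_APPROVAL` policy, the `execute` function, and the `approve` callback are all assumptions made up for this example.

```python
from datetime import datetime, timezone

# Hypothetical policy: actions the agent may not take without human sign-off.
NEEDS_APPROVAL = {"send_refund", "delete_record"}


def execute(action, payload, approve, audit_log):
    """Run an action, asking a human reviewer first when the policy requires it."""
    needs_review = action in NEEDS_APPROVAL
    approved = approve(action, payload) if needs_review else True
    # Every decision is recorded, whether or not a human was involved.
    audit_log.append({
        "time": datetime.now(timezone.utc).isoformat(),
        "action": action,
        "reviewed": needs_review,
        "approved": approved,
    })
    return "executed" if approved else "blocked"


log = []
# Stand-in for a human reviewer: only approves refunds of 100 or less.
reviewer = lambda action, payload: payload.get("amount", 0) <= 100

print(execute("lookup_order", {"id": 1}, reviewer, log))       # executed
print(execute("send_refund", {"amount": 500}, reviewer, log))  # blocked
```

Low-risk actions run straight through, risky ones wait for a decision, and both leave a log entry behind.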

7) Enterprises Adopted Faster Than Startups

SOMPO Holdings deployed internal AI agents to ~30,000 employees, standardizing workflows across insurance operations.

Why this matters:
Once risk was controlled, enterprises scaled faster than startups.

8) Multi-Agent Systems Stayed Mostly Experimental

Despite heavy research output, most production deployments in 2025 avoided large agent swarms due to debugging, cost, and evaluation complexity.

Why this matters:
Coordination remains unsolved at scale.

Example: Academic & enterprise agent research

9) Cost Became a Core Design Constraint

As inference costs scaled, agent teams redesigned systems around bounded reasoning, selective tool calls, and fallback logic to keep per-task costs predictable.

Why this matters:
Agents became economically engineered systems.

Example: OpenAI & Anthropic pricing pressure
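
Here is one way the three ideas above (bounded reasoning, selective tool calls, fallback logic) could fit together. Treat it as a sketch under assumptions: the `Step` type, the cost figures in cents, and the default limits are all hypothetical, not taken from any vendor's pricing or SDK.

```python
from dataclasses import dataclass


@dataclass
class Step:
    name: str
    cost_cents: int         # estimated cost of this call (hypothetical numbers)
    optional: bool = False  # optional tool calls can be skipped to save money


def run_task(steps, max_steps=4, max_cost_cents=5):
    """Run steps in order while staying under a step cap and a cost cap."""
    spent = 0
    executed = []
    for step in steps:
        over_budget = spent + step.cost_cents > max_cost_cents
        out_of_steps = len(executed) >= max_steps  # bounded reasoning
        if over_budget or out_of_steps:
            if step.optional:
                continue  # selective tool calls: drop what is not essential
            # Fallback: a required step does not fit, so stop and hand off.
            return {"status": "fallback", "executed": executed, "spent_cents": spent}
        executed.append(step.name)
        spent += step.cost_cents
    return {"status": "done", "executed": executed, "spent_cents": spent}


steps = [
    Step("plan", 1),
    Step("web_search", 3, optional=True),
    Step("enrich", 2, optional=True),
    Step("answer", 1),
]
print(run_task(steps))
# {'status': 'done', 'executed': ['plan', 'web_search', 'answer'], 'spent_cents': 5}
```

The optional `enrich` call is skipped because it would break the 5-cent cap, and the task still finishes. If a required step does not fit, the function returns a `fallback` status so the caller can route the task elsewhere.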

10) Agents Replaced Workflows, Not Chat Interfaces

Across finance, support, and research, agents increasingly replaced scripts, dashboards, and manual ops, not chat interfaces.

Why this matters:
That’s when agents stopped being “AI features” and became economic actors.

Example: Internal enterprise agents replacing SaaS workflows

Featured Agents You Shouldn’t Miss

  • AIMakeSong - Transform your ideas into music with just a few clicks

  • TeammatesAI - Hire teammates that don’t sleep

  • Mailmodo AI - Complete Email Marketing Automation With AI Agents

  • Nano Banana - Prompt-based photo editing with character consistency

  • Jason - The SDR who scales infinitely

  • TheLibrarian.io - Your WhatsApp AI sidekick

  • Sara - Interviews with zero bias

  • Rashed - Sales agent who never forgets a lead

  • Raya - Customer service that runs itself

  • Agentverse - Discover and compare thousands

What to Watch this Week

🎙️ AI on Fire: Real Builders. Real Heat.

Stories from founders building AI Agents. Explore all episodes here

⚔️ AArena: The Battleground for AI

Stop demo-hopping. One workspace. Every agent. Real results.

🏆 The Leaderboard Never Sleeps

The global ranking of AI agents is shifting every day. Who’s on top? Who just dropped?

🗺️ The Map of AI Agents (Live & Growing)

We’re charting the entire AI agent ecosystem — thousands of options across categories.
Your next agent is already on the map.

THANK YOU

Visit website 

Follow us on LinkedIn and X

I appreciate your time.

OP & Team
