
We keep talking about AI agents, but do we ever know what they are?


Picture your Monday morning routine involving two distinct tasks.

First, you request a chatbot to provide a concise summary of your latest emails. Then, you turn to an AI-powered system to analyze why your leading competitor experienced rapid growth last quarter. This AI combs through financial statements, news coverage, and social media trends, cross-checks these insights against your internal sales data, formulates a strategy highlighting three possible drivers behind the competitor’s success, and even arranges a 30-minute team meeting to discuss its conclusions.

While both are often labeled as “AI agents,” they differ vastly in intelligence, functionality, and the degree of trust we place in them. This ambiguity creates confusion, making it challenging to design, assess, and responsibly manage these advanced tools. Without a shared understanding of what we’re building, how can we determine success?

Rather than proposing a definitive classification system, this article offers an overview of the current landscape of AI agent autonomy, serving as a guide to help us collectively navigate this evolving field.

Clarifying the Concept: What Exactly Is an AI Agent?

To evaluate an agent’s autonomy, we first need a clear definition of what constitutes an “agent.” The classic AI textbook by Stuart Russell and Peter Norvig provides a foundational perspective: an agent is any entity that perceives its environment through sensors and acts upon it via actuators. For example, a thermostat senses room temperature and adjusts heating accordingly.

Translating this to modern AI, an agent typically comprises four essential elements:

  1. Perception (Sensing): The mechanism by which the agent gathers information about its environment, whether digital or physical, enabling it to understand the current context relevant to its objectives.
  2. Reasoning Engine (Cognition): The core processor that interprets sensory input and determines subsequent actions. Today, this is often powered by large language models (LLMs) that plan, decompose complex goals, manage errors, and select appropriate tools.
  3. Action (Execution): The means by which the agent influences its surroundings to achieve its goals, typically through interacting with external tools or systems.
  4. Goal or Objective: The overarching purpose guiding the agent’s behavior, ranging from simple tasks like “find the best price for a product” to complex missions such as “orchestrate a full marketing campaign.”

When combined, these components form a cohesive system where the reasoning engine acts as the brain, perception as the senses, and action as the hands, all directed by a clear goal. This integration is what defines true agency.
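The four components above can be sketched as a minimal perceive-reason-act loop. This is a hedged illustration in plain Python, not the API of any particular agent framework: `run_agent`, the `Counter` environment, and the toy `reason` function are all hypothetical stand-ins.

```python
# A minimal perceive-reason-act loop showing the four components of an agent.
# All names here are illustrative, not taken from any specific framework.

def run_agent(environment, tools, goal_reached, reason, max_steps=10):
    """Drive the agent until its goal is met or the step budget runs out."""
    for _ in range(max_steps):
        observation = environment.perceive()          # 1. Perception
        if goal_reached(observation):                 # 4. Goal check
            return observation
        tool_name, args = reason(observation, tools)  # 2. Reasoning engine
        environment.act(tools[tool_name], args)       # 3. Action
    return environment.perceive()

# Toy environment: the "world" is just a counter the agent can increment.
class Counter:
    def __init__(self):
        self.value = 0
    def perceive(self):
        return self.value
    def act(self, tool, args):
        self.value = tool(self.value, *args)

tools = {"increment": lambda value, step: value + step}
result = run_agent(
    Counter(),
    tools,
    goal_reached=lambda obs: obs >= 3,
    reason=lambda obs, tools: ("increment", (1,)),  # stand-in for an LLM
)
assert result == 3
```

In a real system the `reason` callback would be an LLM call and the environment would expose browsers, databases, or APIs, but the loop structure stays the same: the goal directs the reasoning engine, which turns perceptions into actions.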

In contrast, a typical chatbot only perceives input and responds but lacks a persistent goal or the ability to autonomously utilize external tools. Genuine AI agents possess the capacity to independently and adaptively pursue objectives, making the discussion of autonomy levels crucial.

Historical Insights: How Autonomy Has Been Classified Before

Although AI development feels like uncharted territory, the challenge of defining autonomy is not new. Various industries have long grappled with how to articulate the gradual transfer of control from humans to machines, offering valuable lessons for AI.

Automotive Industry: SAE Levels of Driving Automation

The automotive sector’s SAE J3016 standard is a widely recognized framework that categorizes driving automation into six levels, from Level 0 (fully manual) to Level 5 (fully autonomous).

Its strength lies not in technical complexity but in focusing on two key concepts:

  • Dynamic Driving Task (DDT): The real-time activities involved in driving, such as steering, braking, and monitoring the environment.
  • Operational Design Domain (ODD): The specific conditions under which the automated system is designed to operate safely, like “only on highways” or “in clear daytime weather.”

Each level answers: Who is responsible for the DDT, and within what ODD? For instance, Level 2 requires constant human supervision, Level 3 allows the car to handle driving within its ODD but demands readiness for human takeover, and Level 4 enables the vehicle to manage all driving tasks within its ODD, including safely stopping if issues arise.

Key takeaway for AI agents: Effective autonomy frameworks emphasize clear boundaries of responsibility between humans and machines under defined conditions, rather than focusing solely on AI sophistication.

Aviation: A Detailed Spectrum of Automation

Aviation offers a more nuanced 10-level automation model that captures the subtleties of human-machine collaboration. This framework highlights varying degrees of decision-making shared between humans and computers, such as:

  • At Level 3, the system narrows options for the human to select from.
  • At Level 6, the system executes actions after a limited human veto period.
  • At Level 9, the system acts independently, informing the human only if it chooses.

Key takeaway for AI agents: Most AI agents today function as collaborative partners, co-pilots rather than fully autonomous entities, operating somewhere along this spectrum.
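Level 6 of the aviation model, execution after a limited human veto period, can be made concrete with a small gate: the system queues an action, and it runs only if no veto arrives before the deadline. The `VetoGate` class below is a hypothetical sketch using injected timestamps so the behavior is deterministic.

```python
class VetoGate:
    """Execute a proposed action unless a human vetoes it before a deadline.

    Illustrative sketch of aviation-style "limited veto period" automation;
    timestamps are passed in explicitly to keep the example deterministic.
    """

    def __init__(self, veto_window):
        self.veto_window = veto_window  # seconds the human has to object
        self.pending = None

    def propose(self, action, now):
        self.pending = {"action": action, "proposed_at": now, "vetoed": False}

    def veto(self, now):
        # A veto only counts if it arrives inside the window.
        if self.pending and now - self.pending["proposed_at"] < self.veto_window:
            self.pending["vetoed"] = True

    def tick(self, now):
        """Run the pending action once its veto window has elapsed un-vetoed."""
        p = self.pending
        if p and not p["vetoed"] and now - p["proposed_at"] >= self.veto_window:
            self.pending = None
            return p["action"]()
        return None

gate = VetoGate(veto_window=5)
gate.propose(lambda: "rerouted", now=0)
assert gate.tick(now=3) is None        # still inside the veto window
assert gate.tick(now=5) == "rerouted"  # window elapsed, action executes
```

The same pattern transfers directly to digital agents: "send this email in 30 seconds unless I cancel" is a Level 6 interaction.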

Robotics and Unmanned Systems: Contextualizing Autonomy

The National Institute of Standards and Technology (NIST) developed the ALFUS framework to assess autonomy in robotics and unmanned systems, introducing three dimensions:

  1. Human Independence: The degree of human supervision required.
  2. Mission Complexity: The difficulty and unpredictability of the task.
  3. Environmental Complexity: The stability and predictability of the operating environment.

Key takeaway for AI agents: Autonomy is multi-faceted; an agent performing a simple task in a controlled digital environment is less autonomous than one handling complex tasks in the unpredictable expanse of the internet, even with similar human oversight.

Contemporary Frameworks for AI Agent Autonomy

Building on these precedents, emerging AI agent frameworks generally fall into three overlapping categories, each addressing a distinct question:

1. Capability-Centered Frameworks: What Can the Agent Do?

These models classify agents based on their technical architecture and functional abilities, providing developers with a roadmap of progressive milestones. For example, Hugging Face proposes a five-level star rating system illustrating the shift of control from human to AI:

  • Zero stars (Simple Processor): AI processes data but does not influence program flow; humans retain full control.
  • One star (Router): AI makes basic decisions directing flow between predefined paths; humans define the process.
  • Two stars (Tool Caller): AI selects and uses predefined tools with specified parameters; humans set available tools.
  • Three stars (Multi-step Agent): AI manages iterative processes, deciding tool usage and task continuation.
  • Four stars (Fully Autonomous): AI generates and executes new code beyond predefined tools to achieve goals.

Advantages: Highly practical for engineers, this framework directly maps to code and benchmarks AI’s executive control.

Limitations: Its technical nature may be less accessible to non-technical stakeholders seeking to understand real-world impact.
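The step from one star ("Router") to two stars ("Tool Caller") is the subtlest: at one star the AI only picks a branch between human-written paths, while at two stars it also fills in a tool's parameters. A hedged sketch, with `fake_model` standing in for an LLM call (all names here are illustrative, not Hugging Face's API):

```python
# Contrast of the one-star "Router" and two-star "Tool Caller" levels.
# `fake_model` is a deterministic stand-in for an LLM; a real system
# would call a model API here.

def fake_model(prompt):
    if "refund" in prompt:
        return {"route": "billing", "tool": "issue_refund", "args": {"amount": 20}}
    return {"route": "general", "tool": "send_reply", "args": {"text": "Hi!"}}

# One star: the AI only chooses which predefined path to take.
def router(request, paths):
    decision = fake_model(request)
    return paths[decision["route"]](request)   # humans wrote both paths

# Two stars: the AI also chooses a tool *and* its parameters.
def tool_caller(request, tools):
    decision = fake_model(request)
    return tools[decision["tool"]](**decision["args"])

paths = {"billing": lambda req: "escalated to billing",
         "general": lambda req: "answered directly"}
tools = {"issue_refund": lambda amount: f"refunded ${amount}",
         "send_reply": lambda text: f"sent: {text}"}

assert router("please refund my order", paths) == "escalated to billing"
assert tool_caller("please refund my order", tools) == "refunded $20"
```

Three stars would wrap `tool_caller` in a loop where the model also decides when to stop; four stars would let the model write the tool bodies themselves.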

2. Interaction-Focused Frameworks: How Do Humans and Agents Collaborate?

These frameworks emphasize the dynamics of human-agent interaction, focusing on control and collaboration. For instance, one model defines levels based on the user’s role:

  • Level 1 – User as Operator: Human maintains direct control, akin to using AI-assisted tools in creative software.
  • Level 4 – User as Approver: Agent proposes complete plans requiring human approval before execution.
  • Level 5 – User as Observer: Agent operates autonomously, reporting progress and outcomes to the human.

Advantages: Intuitive and user-centric, these models address trust, oversight, and control.

Limitations: They may conflate agents with vastly different technical capabilities under the same interaction level.
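The "User as Approver" level can be made concrete with a gate that separates planning from execution: the agent produces a complete plan, but nothing runs without sign-off. The function below is a minimal sketch; the plan format and the `approve` callback (here a stand-in for a human decision) are assumptions for illustration.

```python
def run_with_approval(propose_plan, approve, execute):
    """Level 4 interaction: the agent plans, but nothing runs without approval."""
    plan = propose_plan()
    if not approve(plan):          # human reviews the complete plan
        return {"status": "rejected", "plan": plan, "results": []}
    results = [execute(step) for step in plan]
    return {"status": "executed", "plan": plan, "results": results}

outcome = run_with_approval(
    propose_plan=lambda: ["draft email", "schedule meeting"],
    approve=lambda plan: len(plan) <= 3,   # stand-in for a human decision
    execute=lambda step: f"done: {step}",
)
assert outcome["status"] == "executed"
assert outcome["results"] == ["done: draft email", "done: schedule meeting"]
```

Moving to "User as Observer" would mean dropping the `approve` gate and replacing it with after-the-fact reporting, which is exactly why these two levels demand very different degrees of trust.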

3. Governance-Oriented Frameworks: Who Bears Responsibility?

These frameworks prioritize accountability, legal liability, and ethical considerations, crucial for regulatory compliance. For example, some frameworks classify agents to help determine whether responsibility lies with the user, developer, or platform owner: an essential distinction under emerging regulations like the EU AI Act.

Advantages: Vital for real-world deployment, fostering public trust through clear accountability.

Limitations: More focused on policy than on guiding technical development.

To fully understand AI agent autonomy, it’s important to consider all three perspectives: capabilities, human interaction, and responsibility.

Challenges and Unresolved Questions in AI Agent Autonomy

Defining the “Operational Domain” for Digital Agents

While the automotive industry benefits from clearly defined operational design domains (ODDs), such as specific road types and weather conditions, digital agents face a far more complex environment: the internet. This vast, dynamic, and unpredictable space includes constantly changing websites, deprecated APIs, and evolving social norms.

Establishing a “safe” operational boundary for agents that browse websites, access databases, and interact with third-party services remains a significant unsolved challenge. Without a well-defined digital ODD, guaranteeing safety and reliability akin to automotive standards is difficult.

Consequently, the most dependable agents today operate within tightly controlled, closed environments with limited tools and data sources, focusing on “bounded problems” rather than open-ended tasks.

Beyond Basic Tool Execution: The Need for Advanced Autonomy

Current AI agents excel at executing straightforward, predefined sequences, such as retrieving a price and scheduling a meeting, but true autonomy demands more sophisticated capabilities, including:

  • Long-Term Strategic Planning: Developing and adapting complex, multi-step plans amid uncertainty, rather than following fixed instructions.
  • Robust Self-Correction: Diagnosing failures like API errors or unexpected responses and autonomously adjusting strategies without human intervention.
  • Collaborative Composability: Coordinating multiple specialized agents to share information, delegate tasks, and resolve conflicts, a formidable engineering challenge.
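Of these, robust self-correction is the easiest to make concrete: instead of aborting on the first error, the agent records the failure and switches strategy before retrying. A minimal sketch with hypothetical names (`fetch_price` and the strategy labels are invented for illustration):

```python
def call_with_self_correction(action, strategies, max_attempts=3):
    """Try an action; on failure, diagnose and switch strategy instead of giving up."""
    errors = []
    for strategy in strategies[:max_attempts]:
        try:
            return action(strategy), errors
        except Exception as exc:                 # e.g. an API error or bad response
            errors.append((strategy, str(exc)))  # diagnose, then adapt
    raise RuntimeError(f"all strategies failed: {errors}")

# Toy action: only the "fallback_api" strategy succeeds.
def fetch_price(strategy):
    if strategy != "fallback_api":
        raise ConnectionError(f"{strategy} unavailable")
    return 19.99

result, errors = call_with_self_correction(
    fetch_price, ["primary_api", "cached_scrape", "fallback_api"])
assert result == 19.99
assert len(errors) == 2   # two failed strategies were diagnosed first
```

Real agents replace the fixed strategy list with an LLM that proposes the next approach from the error messages, which is precisely where today's systems remain brittle.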

The Crucial Issue of Alignment and Control

Perhaps the most profound challenge is ensuring AI agents’ goals and behaviors align with human values and intentions, especially when those values are subtle or implicit.

For example, an agent tasked with “maximizing customer engagement” might bombard users with excessive notifications, technically fulfilling its goal but violating the unspoken rule of not being intrusive. This misalignment highlights the difficulty of encoding nuanced human preferences into precise instructions.

Organizations dedicated to AI safety emphasize that as agents grow more capable, ensuring they remain safe, predictable, and aligned with our true objectives is paramount.

Looking Ahead: A Collaborative, Agentic Future

The evolution of AI agents is unlikely to be a sudden leap toward omnipotent intelligence. Instead, progress will be incremental and collaborative.

We anticipate the rise of an “agentic mesh”: a network of specialized agents, each operating within defined domains, working together to solve complex problems. Crucially, these agents will augment human decision-making rather than replace it, functioning as co-pilots or strategists that combine human insight with machine speed.

This hybrid “centaur” model represents the most promising and responsible path forward, balancing autonomy with human oversight.

The frameworks discussed here are not merely academic; they provide practical tools for building trust, clarifying responsibility, and setting realistic expectations. By leveraging these models, developers and leaders can lay the foundation for AI to become a reliable partner in both professional and personal spheres.
