How AI agents connect to systems: A technical guide
From REST APIs to message queues and the Model Context Protocol—discover the building blocks that power system-aware AI.
AI agents become truly powerful when they can do more than just reason—they need to act. An agent that can understand a task is useful; an agent that can log into your CRM, pull recent customer tickets, update a dashboard, and trigger a follow-up email is game-changing.
This leap from isolated intelligence to system-level orchestration is where the real transformation happens. By connecting to external systems—reading from databases, calling APIs, processing documents, triggering alerts, and coordinating workflows—AI agents step out of the sandbox and into the real world. They stop being tools and start becoming collaborators.
But how exactly do agents gain these capabilities? What protocols let them speak to a file server or a REST endpoint? How do they handle authentication, retries, failures, or long-running operations? And what architectural patterns support these integrations securely and scalably?
This guide answers those questions. We'll walk through the technical strategies, protocols, and integration models that allow AI agents to plug into modern software ecosystems. Whether you're building an internal agent for operations or developing a customer-facing AI product, this deep dive will help you architect connections that are both powerful and reliable.
Model Context Protocol (MCP): From chaos to clarity
Connecting AI agents to real-world systems is a messy business.
Every new agent spins up its own spaghetti of connectors—one for the database, another for the file store, a bespoke API wrapper, each with its own quirks, security model, and error-handling logic. The result? Fragile codebases, duplicated effort, and constant integration headaches.
The Model Context Protocol (MCP) changes that.
Rather than treating every integration as a one-off, MCP introduces a standardized, client-server architecture where AI agents act as clients and connect to MCP servers that expose system capabilities through a common interface. It’s the difference between writing a custom driver for every hardware device and simply using USB.
By abstracting system access behind a shared protocol, MCP unlocks a new level of reusability, consistency, and security across the agent ecosystem.

Instead of writing and maintaining a custom PostgreSQL connector in every agent, you deploy a single MCP database server—and all your agents can use it.
This is a paradigm shift in how AI systems connect to the world—moving from brittle custom glue to composable, reliable, and auditable interfaces.
An MCP server defines tools (with @mcp.tool()), passively waits for a tool call to arrive over a transport (e.g., stdio or HTTP), and, when a call comes in, processes the input and returns a result.
Here's what a basic MCP server might look like in code, exposing a tool for controlled SQL execution. This is a minimal sketch using the official MCP Python SDK's FastMCP helper; the connection string and the read-only guard are illustrative:
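```python
# Minimal sketch using the official MCP Python SDK (FastMCP).
# The DSN and the read-only guard are illustrative, not production-grade.
import asyncpg
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("database-server")

@mcp.tool()
async def query_database(sql: str) -> str:
    """Execute a read-only SQL query and return the rows as text."""
    # Naive safety check: allow only SELECT statements.
    if not sql.strip().lower().startswith("select"):
        return "Error: only SELECT statements are allowed."
    conn = await asyncpg.connect("postgresql://agent@localhost/appdb")
    try:
        rows = await conn.fetch(sql)
        return "\n".join(str(dict(row)) for row in rows)
    finally:
        await conn.close()

if __name__ == "__main__":
    # Wait for tool calls over stdio (HTTP transports are also supported).
    mcp.run(transport="stdio")
```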
This interface becomes the single point of truth for safe, controlled access to a database—no need to duplicate logic across multiple agents.
Without MCP, agents must deal with the wild west of integration: Salesforce's auth model, PostgreSQL’s connection pool limits, S3’s object hierarchy, and every third-party’s oddball rate limits. Each system requires bespoke logic and constant upkeep.
With MCP, all of these capabilities—whether querying a database, uploading a file, or calling a REST API—are unified under one protocol with standardized schemas, security models, and error-handling patterns.
The agent’s job? Just speak MCP.
Let the server deal with the gritty details.
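On the client side, speaking MCP looks the same no matter what the server wraps. Here's a minimal sketch using the MCP Python SDK's stdio client; the server script name and tool name are assumptions carried over from the example above:

```python
import asyncio
from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

async def main():
    # Launch the MCP server above as a subprocess and talk to it over stdio.
    server = StdioServerParameters(command="python", args=["db_server.py"])
    async with stdio_client(server) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            tools = await session.list_tools()  # discover capabilities
            print([t.name for t in tools.tools])
            result = await session.call_tool(
                "query_database", {"sql": "SELECT 1"}
            )
            print(result.content)

asyncio.run(main())
```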
Beyond MCP: APIs, databases, and message queues
While the Model Context Protocol (MCP) offers a standardized and scalable approach to system integration, it's not always available—or necessary. In many cases, developers need lighter-weight or more direct ways for agents to connect to existing systems.
Here are the three most common alternatives to MCP for integrating AI agents into real-world software environments.
1. REST APIs and web services
REST APIs remain the backbone of modern integrations, and most systems—internal or third-party—expose functionality through HTTP endpoints. AI agents can use these APIs to retrieve data, trigger workflows, or update records.
Agents use function calling (a.k.a. tool use) to translate natural language inputs into structured API calls. Function calling allows an AI agent (like a language model) to recognize that it needs help to answer a query, decide which external function or tool to call, make the call, and use the function’s response to produce a final answer.
Developers register available API endpoints (along with their parameters and expected responses), and when the model encounters a relevant user request, it emits a structured function call.
Here's a simple example where GPT decides to call get_weather. The sketch below uses the OpenAI Python SDK's chat completions tools interface; the model name and parameter schema are illustrative:
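```python
from openai import OpenAI

client = OpenAI()

# Register the available function with its parameter schema.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
    tools=tools,
)

# If the model decides it needs the tool, it emits a structured call
# instead of a plain-text answer.
tool_call = response.choices[0].message.tool_calls[0]
print(tool_call.function.name)       # get_weather
print(tool_call.function.arguments)  # {"city": "Paris"}
```

The agent then runs get_weather itself and feeds the result back to the model as a tool message so it can produce the final answer.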
2. Direct database access
Databases are where most of a company’s structured knowledge lives—whether it’s customer records, transactions, logs, analytics data, or business metrics. By accessing databases directly, agents can:
- Run complex queries to retrieve or summarize data
- Monitor for changes in state (e.g., “find new unassigned tickets”)
- Update or insert records in real time
- Maintain memory or persistent state across conversations
This enables use cases like:
- Generating dynamic reports
- Automating admin workflows (e.g., updating a status field)
- Powering interactive agents that remember prior sessions or context
Agents typically connect to relational databases (PostgreSQL, MySQL, SQLite) using an async connection pool to avoid blocking operations and to scale across concurrent tasks.
Example with asyncpg (a minimal sketch; the connection string and tickets table are illustrative):
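```python
import asyncio
import asyncpg

async def main():
    # A shared pool keeps connections open and reuses them across tasks.
    pool = await asyncpg.create_pool(
        "postgresql://agent@localhost/appdb",  # illustrative DSN
        min_size=2,
        max_size=10,
    )
    async with pool.acquire() as conn:
        # e.g., "find new unassigned tickets"
        rows = await conn.fetch(
            "SELECT id, subject FROM tickets WHERE assignee IS NULL"
        )
        for row in rows:
            print(row["id"], row["subject"])
    await pool.close()

asyncio.run(main())
```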
3. Message queues and event-driven systems
Message queues decouple agents from direct system calls and enable asynchronous, event-driven workflows. This is ideal when:
- Agents need to react to external events (e.g., “new file uploaded”)
- Long-running tasks shouldn't block a real-time chat
- Agents work in distributed teams (multi-agent collaboration)
- You want to retry failed tasks or buffer load spikes
Think of queues as a task inbox for your agents.
Agents subscribe to queues to listen for incoming messages (events) and publish messages to initiate actions. This decouples the agent from direct system calls and allows for scalable, reactive architectures.
Example using aio_pika with RabbitMQ (a minimal sketch; the queue name and message payload are illustrative):
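```python
import asyncio
import aio_pika

async def main():
    connection = await aio_pika.connect_robust("amqp://guest:guest@localhost/")
    async with connection:
        channel = await connection.channel()
        queue = await channel.declare_queue("agent_tasks", durable=True)

        # Publish a task for an agent to pick up.
        await channel.default_exchange.publish(
            aio_pika.Message(body=b'{"task": "summarize_report"}'),
            routing_key="agent_tasks",
        )

        # Subscribe and process incoming messages.
        async with queue.iterator() as messages:
            async for message in messages:
                async with message.process():  # acknowledge on success
                    print("Received task:", message.body.decode())
                    break  # stop after one message in this demo

asyncio.run(main())
```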
Agent orchestration frameworks
As agents become more powerful and interconnected, managing their workflows, memory, tool usage, and system integrations becomes increasingly complex. This is where agent orchestration frameworks shine.
Popular tools like LangChain, LlamaIndex, AutoGen, and CrewAI provide high-level abstractions that make it easier to build production-ready agents. These frameworks offer:
- Prebuilt integrations with common services (Slack, Google Drive, SQL databases, etc.)
- Standardized tool interfaces for calling APIs or databases
- Memory management to track conversation history and context
- Multi-agent coordination features for complex, parallel workflows
- Error recovery and retries for robust long-running operations
They dramatically accelerate development by handling repetitive engineering patterns so you can focus on your agent's logic and UX.
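To make the "standardized tool interfaces" point concrete, here's what a custom tool looks like in LangChain (a sketch assuming langchain_core; the function body is a placeholder):

```python
from langchain_core.tools import tool

@tool
def lookup_order(order_id: str) -> str:
    """Look up the status of an order by its ID."""
    # Placeholder body; a real tool would query your order system here.
    return f"Order {order_id}: shipped"

# The decorator turns the function into a schema-described tool that
# an agent can discover and call by name.
print(lookup_order.name, lookup_order.args)
```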
While orchestration frameworks offer powerful abstractions, they come with some limitations:
- You may encounter black-box behaviors that are hard to customize or debug.
- Certain edge-case workflows may require dropping down to lower-level APIs.
- Framework upgrades can introduce breaking changes or performance shifts.
In practice, many teams adopt a hybrid approach: using frameworks for 80% of common tasks while implementing custom tools and connectors for specialized needs.
Security and reliability for production agents
AI agents are autonomous actors—and that means they can cause real damage if misconfigured, compromised, or overly trusted. Securing and hardening your agent infrastructure is non-negotiable in production.
- Use proper authentication and access control:
  - Use dedicated service accounts with time-bound, scoped credentials.
  - Implement token-based auth (e.g., JWT) for all service-to-service interactions.
  - Apply the principle of least privilege: agents should only have access to what they absolutely need, and no more.
- Use circuit breakers. When an external system fails, you don’t want your agent to keep hammering it. Circuit breakers prevent cascading failures by halting requests after repeated errors, then testing for recovery later (see the sketch after this list).
- Implement rate limiting and retry logic:
  - Use exponential backoff when retrying failed calls to external APIs.
  - Enforce rate limits proactively based on API quotas.
  - Implement idempotency and deduplication where applicable to avoid reprocessing.
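Here's a minimal sketch of the circuit breaker and backoff patterns described above; the class name, thresholds, and delays are illustrative rather than tuned values:

```python
import random
import time

class CircuitBreaker:
    """Stops calling a failing dependency, then probes for recovery."""

    def __init__(self, max_failures: int = 5, reset_timeout: float = 30.0):
        self.max_failures = max_failures
        self.reset_timeout = reset_timeout
        self.failures = 0
        self.opened_at = None

    def call(self, fn, *args, **kwargs):
        if self.opened_at is not None:
            if time.monotonic() - self.opened_at < self.reset_timeout:
                raise RuntimeError("Circuit open: skipping call")
            self.opened_at = None  # cooldown elapsed; allow a trial call
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = time.monotonic()  # trip the breaker
            raise
        self.failures = 0  # success resets the count
        return result

def retry_with_backoff(fn, max_attempts: int = 4, base_delay: float = 0.5):
    """Retry a flaky call with exponential backoff plus jitter."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))
```

An agent would route outbound calls through breaker.call (often combined with retry_with_backoff), so repeated failures trip the breaker instead of cascading.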
For more on this, see Securing AI agents: A guide to authentication, authorization, and defense.
Performance optimization strategies
Agent performance is about more than LLM response time. Bottlenecks often arise from external system interactions, not the AI itself. Here’s how to keep things fast and efficient:
- Connection pooling: Avoid repeatedly opening and closing connections to external systems (like databases or APIs). Use connection pools to keep these alive and reuse them efficiently.
- Smart caching: For frequently accessed data (user profiles, settings, lookup tables), implement intelligent caching with appropriate TTLs. Use distributed caches like Redis for multi-agent or multi-node environments.
- Parallel execution: When an agent needs to fetch data from multiple sources, use concurrent async tasks rather than running everything sequentially. This improves throughput and user responsiveness, especially during multi-step workflows.
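As an illustration of the parallel-execution point, here's a sketch using asyncio.gather; the data sources are hypothetical stand-ins for real I/O:

```python
import asyncio

# Hypothetical data sources; the sleeps stand in for real I/O round trips.
async def fetch_profile(user_id: int) -> dict:
    await asyncio.sleep(0.2)
    return {"id": user_id, "name": "Ada"}

async def fetch_tickets(user_id: int) -> list:
    await asyncio.sleep(0.3)
    return [{"id": 101, "status": "open"}]

async def gather_context(user_id: int) -> dict:
    # Concurrent fetches: total latency tracks the slowest call,
    # not the sum of all calls.
    profile, tickets = await asyncio.gather(
        fetch_profile(user_id),
        fetch_tickets(user_id),
    )
    return {"profile": profile, "tickets": tickets}

print(asyncio.run(gather_context(42)))
```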
Conclusion
Connecting AI agents to systems is the key to unlocking their full potential. Whether you're using a standardized protocol like MCP, wiring in REST APIs, querying databases, or coordinating via message queues, these integration strategies define what your agents can actually do.
While frameworks simplify orchestration, and tools like circuit breakers and caching improve reliability and performance, building production-grade agents still requires careful architectural choices.
As the ecosystem matures, expect:
- Greater standardization around protocols like MCP
- More powerful and extensible agent frameworks
- Better observability, debugging, and testing tools
In the meantime, design your systems for flexibility, security, and modularity—so your agents can evolve with the ecosystem and continue delivering value without compromising trust or control.