top of page

Global Telecom Architecture: How Model Context Protocol (MCP) and Agent-to-Agent (A2A) Are Redefining AI Customer Support for CSPs

  • 1 day ago
  • 5 min read
A group of four people in casual attire gather around a computer, smiling and engaged in discussion in a bright office setting learning about MCP and Agent to Agent framework for AI support in telecom

Global communications service providers (CSPs) collectively spend approximately $90 billion annually on labor costs tied to customer experience (CX) operations. Yet, according to the groundbreaking TM Forum Agents of Change Report, while over half of all operators have rushed to put generative AI chatbots into production, these early standalone solutions consistently fail to deliver real business value. They find themselves trapped in "disconnected intelligence" – trained on the public internet but completely blind to the live customer experience. The result? Chatbot deployments that are either confidently wrong (hallucinating scripts) or overly sandboxed into glorified FAQ finders.

  

The industry is rapidly shifting toward agentic AI – autonomous systems capable of advanced reasoning, iterative planning, and executing multi-step network diagnostics independently. In fact, the TM Forum research highlights that 44% of CSP executives are actively implementing agentic AI within their customer-facing architectures.  


To scale this autonomous transformation without compromising security, CSPs require a standardized integration framework. This is exactly why two breakthrough protocols have taken center stage in global telecom innovation: Model Context Protocol (MCP) and Agent-to-Agent (A2A) Protocol



Implementing Model Context Protocol (MCP) as a universal telecom interface


Model Context Protocol (MCP), launched as an open standard by Anthropic, provides large language models with a secure, unified way to read data and utilize tools regardless of underlying infrastructure.  


In a traditional CSP architecture, vital subscriber logs, network topology data, and billing statuses remain heavily siloed across legacy OSS/BSS infrastructure. This is where the Subtonomy Unified AI Agent Gateway steps in as a lightweight, production-grade translation layer. Sitting directly between the AI host and the production environment, the gateway exposes your existing, standard REST endpoints into a clean, searchable catalog of self-describing "Tools" that the AI engine can browse. By serving pre-processed KPIs and deterministic facts, MCP completely eliminates the primary driver of AI hallucinations: forcing an LLM to guess its way through raw telecom signaling.  



Flowchart depicting AI tools and gateways for digital channels with text on integration, access issues, subscriptions, and subscriber info.



Driving First-Contact Resolution: Eliminating 5G FWA congestion with live KPI metrics

Fixed Wireless Access (5G FWA) is expanding at a phenomenal rate, with 89% of operators globally now maintaining an FWA offering to lower high-speed ownership costs. However, as highlighted in TM Forum's Telco Revenue Growth Benchmark Report, diagnosing an unstable FWA connection is notoriously difficult for customer care because it requires immediate visibility into highly technical cellular metrics and radio frequency counters.  


When an FWA customer reports an abrupt speed degradation via an automated app chat, an MCP-connected AI agent doesn't rely on generic scripts. Backed by Subtonomy's APIs, it instantly calls on clean real-time metrics – pulling data on subscriber status, network technology counters, and physical cell location context.  


The gateway dynamically retrieves targeted signal metrics – specifically pulling key performance indicators like reference signal received power (RSRP), signal quality buckets (RSRQ), and signal-to-interference-plus-noise ratios (SINR). Recognizing that the subscriber's hardware is functional but locked onto a cell site suffering from temporary capacity limits, the agent's reasoning engine determines the exact Next-Best-Action and generates a grounded self-service resolution:



AI Support Agent: 

"Hi Mark, I have run a technical diagnostic on your local 5G network performance. Your hardware is completely healthy, but your outdoor router is currently communicating with a cellular sector experiencing temporary capacity congestion. Let's optimize your connection. Please open your smartphone compass app and try rotating your external receiver unit towards the southwest (around 225 degrees). Let me know when you are ready, and I will execute a remote active restart of your device to lock your connection onto a clearer frequency path."



Orchestrating multi-agent ecosystems via the Agent-to-Agent (A2A) Protocol


While MCP standardizes how a model pulls from your data systems, Google’s Agent-to-Agent (A2A) protocol standardizes how separate AI models talk to each other across multi-vendor networks.  


Real-world deployment benchmarking proves that smaller, tightly scoped micro-agents are significantly more efficient and easier to keep on track than single monolithic models. A2A establishes a secure mesh architecture. It enables specialized AI entities – such as a billing specialist, an internal network operations copilot, or a client-facing assistant – to smoothly pass along context, validate session permissions, and collaborate on multi-tier workflows without sharing internal memories.  


Smiling man looks at a phone near the Eiffel Tower. Blue sky, European flag, and greenery in the background. Vibrant, cheerful mood.

Cross-border workflows: Automating complex outbound roaming diagnostics

International roaming troubleshooting is notoriously complex, requiring live, multi-party visibility into foreign host networks, location update protocols, and hardware firmware variables. Whether a carrier is actively aligning with architectural frameworks like TM Forum’s Open Digital Architecture (ODA) or simply running standard, proven RESTful layers, Subtonomy provides an AI-ready Expert Insight Engine that is natively A2A-compliant.  


When a customer experiences connectivity issues abroad, the entire troubleshooting process can be fully automated across domain boundaries in the background: 


  1. A consumer traveling in France messages customer care via the self-service portal because their cellular data has completely failed. The primary chatbot catches the intent and instantly targets the task to a specialized Roaming Analytics Agent via A2A.  

  2. The Roaming Agent leverages an MCP sidecar adapter to inspect cross-border fail rates. It instantly isolates an active, highly localized outbound roaming alarm where a French partner network (Operator Y) is experiencing a 75% location update protocol failure rate.  

  3. Instead of initiating a slow human transfer, the Roaming Agent securely hands this technical diagnosis back to the primary chatbot via the structured A2A mesh interface.  

  4. The customer-facing chatbot translates the network state into an immediate, localized walkthrough tailored to the consumer's device:  



AI Support Agent: 

"Hello William, our automated systems have located a temporary network synchronization error with our roaming partner, Operator Y, here in France. While our engineering groups resolve this issue, we can restore your connection immediately by moving you onto an alternate local network. To change your roaming operator to Operator X on your iPhone 15, please open your Settings app, tap Mobile Data, turn off automatic network selection, and choose Operator X manually.



Avoiding the "escalation tax": Transforming data visibility into quantifiable ROI


To understand why shifting technical care down the support ladder is a financial imperative, carriers must look at the strict mathematics of support desk economics. Industry benchmarks from MetricNet and the Help Desk Institute (HDI) reveal a staggering cost disparity across traditional support tiers:


  • First-line (L1) resolution: Costs an average of $22 to $25 per ticket, acting as your baseline "efficiency zone."

  • Second-line (L2) escalation: Jumps to an average of $60 to $70 as costs triple once specialized support staff take over.

  • Third-line (L3) engineering: Rockets to between $180 and $210+ per ticket—a 9x expansion compared to L1.


As analyzed deeply in our recent guide on the true cost of support escalation, this drain is drastically magnified in highly complex telecom environments. Furthermore, when frontline teams are data-blind, they fall victim to the "swivel-chair effect", manually bouncing between 8 to 20 legacy platforms to track down one network answer, prolonging the time-to-resolution by 3.5x. Together, this creates a massive 14x cost multiplier for highly complex telecom escalations.


By implementing Subtonomy's a standardized MCP gateway, operators eliminate this data blindness. By pulling real-time network diagnostics into a single source of fact, standard REST subsystems become AI-ready. Exposing these underlying facts via secure self-service APIs drops the resolution cost down from a $25 L1 interaction to nearly $0 per digital resolution. For large-scale mobile and FWA carriers, bridging this visibility gap completely flattens the escalation ladder, unlocking documented savings of €2 to €3 per subscriber every single year.



bottom of page