Benchmarking Envoy AI Gateway Control Plane Scaling

February 23, 2026 · 8 min read

Member of Technical staff - Nutanix

How many AIGatewayRoute resources can Envoy AI Gateway handle?

In this post, I'll walk through how I benchmarked the control plane scaling of Envoy AI Gateway, the architecture of the test, and the results from scaling to 2,000 routes.

The Reality and Performance of MCP Traffic Routing with Envoy AI Gateway

December 8, 2025 · 9 min read

Ignasi Barrera

Founding Engineer - Tetrate

Erica Hughberg

Envoy AI Gateway Maintainer - Tetrate

Envoy AI Gateway (AIGW) provides a production-ready bridge between AI agents and their tools by handling Model Context Protocol (MCP) traffic. As teams adopt MCP, questions about scale, performance, and architecture naturally arise.

This post addresses those questions by first clearing up common misunderstandings, then diving into the actual architecture, the design choices we made (and why), and how you can test and evaluate whether it is the right solution for your system.

This post will give you the context you need to:

Evaluate MCP routing in Envoy AI Gateway with realistic expectations
Understand the design decisions, their impact, and how they impact you
Learn about how you can configure and tune MCP routing in Envoy AI Gateway to meet your needs

Announcing Model Context Protocol Support in Envoy AI Gateway

October 2, 2025 · 8 min read

Ignasi Barrera

Founding Engineer - Tetrate

Takeshi Yoneda

Envoy AI Gateway Maintainer - Netflix

Hero feature image of title.

We’re excited to announce that the next release of Envoy AI Gateway will introduce first-class support for Model Context Protocol (MCP), cementing Envoy AI Gateway (EAIGW) as the universal gateway for modern production AI workloads.

Envoy AI Gateway started in close collaboration with Bloomberg and Tetrate to meet production-scale AI workload demands, combining real-world expertise and innovation from some of the industry’s largest adopters. Built upon the battle-tested Envoy Proxy data plane as the AI extension of Envoy Gateway, it is trusted for critical workloads by thousands of enterprises worldwide. EAIGW already provides unified LLM access, cost and quota enforcement, credential management, intelligent routing, resiliency, and robust observability for mission-critical AI traffic.

With the addition of MCP, we have brought these features to the communication between Agents and external tools, making EAIGW even more versatile for enterprise-scale AI deployments. For a deeper look at the collaborative story and technical vision, see the Bloomberg partnership announcement, their official release coverage, and previous project announcements.

Enhancing AI Gateway Observability - OpenTelemetry Tracing Arrives in Envoy AI Gateway

August 25, 2025 · 6 min read

Erica Hughberg

Envoy AI Gateway Maintainer - Tetrate

Adrian Cole

Principal Engineer - Tetrate

Hero feature image of title.

Aggregated metrics like latency, error rates, and throughput on their own won't reveal the source of why a system's output was wrong, slow, or expensive.

The v0.3 release of Envoy AI Gateway brings comprehensive OpenTelemetry tracing support with OpenInference semantic conventions, extending the existing metrics foundation to provide complete visibility into LLM application behavior.

This enables you to improve the quality and safety of your AI-integrated applications by allowing you to understand the full context of a request journey, as your LLM traces will inform application improvements and guardrail needs.

Announcing the Envoy AI Gateway v0.3 Release

August 22, 2025 · 8 min read

Erica Hughberg

Envoy AI Gateway Maintainer - Tetrate

Xunzhuo (Bit) Liu

Envoy AI Gateway Maintainer - Tencent

Envoy AI Gateway v0.3.0 Release

The Envoy AI Gateway v0.3 release introduces intelligent inference routing through Endpoint Picker (EPP) integration, expands our provider ecosystem with Google Vertex AI Production Support as well as Native Anthropic API, and delivers Enterprise-Grade Observability with OpenInference tracing.

The Big Shifts in v0.3

Envoy AI Gateway v0.3 isn't just another feature release; it's a fundamental shift toward intelligent, production-ready AI infrastructure. This release addresses three critical challenges that have been holding back AI adoption in enterprise environments:

Envoy AI Gateway Introduces Endpoint Picker Support

July 30, 2025 · 7 min read

Erica Hughberg

Envoy AI Gateway Maintainer - Tetrate

Xunzhuo (Bit) Liu

Envoy AI Gateway Maintainer - Tencent

Reference Architecture for Envoy AI Gateway

Introduction

Envoy AI Gateway now supports Endpoint Picker Provider (EPP) integration as per the Gateway API Inference Extension.

This feature enables you to leverage intelligent, dynamic routing for AI inference workloads through intelligent endpoint selection based on real-time metrics, including KV-cache usage, queued requests, and LoRA adapter information.

When running AI inference at scale, this means your system can automatically select the optimal inference endpoint for each request, thereby optimizing resource utilization.

An overview of Endpoint Picker together with Envoy AI Gateway

A Reference Architecture for Adopters of Envoy AI Gateway

July 15, 2025 · 7 min read

Erica Hughberg

Envoy AI Gateway Maintainer - Tetrate

Alexa Griffith

Senior Software Engineer - Bloomberg

Reference Architecture for Envoy AI Gateway

Building a Scalable, Flexible, Cloud-Native GenAI Platform with Open Source Solutions

AI workloads are complex, and unmanaged complexity kills velocity. Your architecture is the key to mastering it.

As generative AI (GenAI) becomes foundational to modern software products, developers face a chaotic new reality, juggling different APIs from various providers while also attempting to deploy self-hosted open-source models. This leads to credential sprawl, inconsistent security policies, runaway costs, and an infrastructure that is difficult to scale and govern.

Your architecture doesn’t have to be this complex.

Announcing the first Envoy AI Gateway Release – A Community Milestone!

February 25, 2025 · 3 min read

Erica Hughberg

Envoy AI Gateway Maintainer - Tetrate

Dan Sun

Envoy AI Gateway Maintainer - Bloomberg

Takeshi Yoneda

Envoy AI Gateway Maintainer - Netflix

Aaron Choo

Envoy AI Gateway Maintainer - Bloomberg

Yao Weng

Envoy AI Gateway Maintainer - Bloomberg

Announcing the first Envoy AI Gateway Release

Today, we're excited to announce the 0.1 release of the Envoy AI Gateway, the first AI gateway built on CNCF's Envoy Gateway and backed by a thriving, growing community.

The journey to the Envoy AI Gateway started with a simple but powerful vision: make it easier for enterprises to integrate and scale AI in their applications.

Where We Are Now

The Envoy AI Gateway is now available on GitHub and ready for developers to deploy and explore. It enables enterprises to integrate AI services through a unified API while managing authorization, cost control, and scalability with built-in features:

End User Keynote at KubeCon 2024

November 14, 2024 · One min read

Erica Hughberg

Envoy AI Gateway Maintainer - Tetrate

Alexa Griffith

Senior Software Engineer - Bloomberg

At KubeCon North America 2024, Alexa Griffith had the opportunity to present the End User Keynote on Centralizing & Simplifying Enterprise AI Workflows with Envoy AI Gateway.

Introducing Envoy AI Gateway

October 18, 2024 · 3 min read

Erica Hughberg

Envoy AI Gateway Maintainer - Tetrate

The industry is embracing Generative AI functionality, and we need to evolve how we handle traffic on an industry-wide scale. Keeping AI traffic handling features exclusive to enterprise licenses is counterproductive to the industry’s needs. This approach limits incentives to a single commercial entity and its customers. Even single-company open-source initiatives do not promote open multi-company collaboration.

The Big Shifts in v0.3​

Introduction​

Building a Scalable, Flexible, Cloud-Native GenAI Platform with Open Source Solutions

Where We Are Now​

The Big Shifts in v0.3

Introduction

Where We Are Now