First Release

Envoy AI Gateway v0.1.x

The first official release of Envoy AI Gateway! This milestone includes unified API for LLM providers, token-based rate limiting, traffic management, and Kubernetes native integration.

🎉 The First Official Release!

v0.1.0

February 25, 2025

OpenAIAWS BedrockRate LimitingUnified APIUpstream AuthenticationToken-Based Rate Limiting

🎉 The inaugural release of Envoy AI Gateway! This milestone release establishes the foundation for unified AI service management with support for major LLM providers, comprehensive rate limiting, and Kubernetes-native integration.

This foundational release establishes Envoy AI Gateway as a production-ready solution for managing AI services in Kubernetes environments, providing the essential building blocks for enterprise AI infrastructure.

Installation

For installation instructions, see our Getting Started Guide.

✨ Core Features

Unified LLM Provider API

OpenAI Integration

Complete support for OpenAI's GPT models with chat completions and embeddings

AWS Bedrock Support

Native integration with AWS Bedrock for Claude, Titan, and other foundation models

Request/Response Transformation

Automatic translation between different provider APIs for consistent experience

Traffic Management & Rate Limiting

Token-Based Rate Limiting

Intelligent rate limiting based on actual token consumption rather than request count

Backend Load Balancing

Weighted routing and load balancing across multiple AI service backends

Route-Level Controls

Fine-grained traffic management at the route level with flexible matching

Kubernetes Native Integration

Custom Resource Definitions

Native Kubernetes CRDs for AIGateway, AIServiceBackend, and routing configuration

Operator Pattern

Kubernetes operator for automated lifecycle management and configuration

RBAC Integration

Full integration with Kubernetes RBAC for secure access control

API Design

The v0.1.x series establishes the core API design for Envoy AI Gateway:

Initial API Design: Core CRDs including AIGateway, AIServiceBackend, and AIGatewayRoute
Gateway API Integration: Built on top of Kubernetes Gateway API specifications
OpenAPI Schema: Complete OpenAPI v3 schema for all custom resources

📦 Dependencies Versions

Go 1.22

Built with Go 1.22 for optimal performance and security

Envoy Gateway v1.2

Built on Envoy Gateway for proven data plane capabilities

Envoy v1.30

Leveraging Envoy Proxy's battle-tested networking capabilities

Gateway API v1.1

Implementing latest Gateway API standards

⏩ Patch Releases

v0.1.5

April 3, 2025

Bug FixExtProc

Final patch in the v0.1.x series addressing ExtProc image synchronization.

Bug Fixes

ExtProc Image Syncing: Fixed external processor image synchronization issues
Container Registry: Improved container image handling and registry integration

v0.1.4

March 20, 2025

Bug FixAWS

Additional AWS validation and error handling improvements.

Bug Fixes

AWS Validation: Fixed validation error handling for AWS Bedrock requests
Error Response Format: Improved error response formatting for better debugging

v0.1.3

March 14, 2025

Bug FixStreamingMetrics

Streaming improvements and enhanced observability with GenAI metrics.

New Features

Observability Enhancements

GenAI Metrics: Added comprehensive metrics for AI service performance and usage tracking

Bug Fixes

Chat Completion Streaming: Fixed streaming response handling for chat completions
Metrics Collection: Resolved issues with metrics collection and reporting

v0.1.2

March 5, 2025

Bug FixImages

Container image and deployment fixes for improved reliability.

Bug Fixes

Image Tag Fixes: Fixed rotator and ExtProc container image tag synchronization
Deployment Stability: Improved deployment reliability and image pulling

v0.1.1

February 28, 2025

Bug FixAWS

Quick bug fix release addressing AWS tooling issues discovered in the initial release.

Bug Fixes

AWS CLI Tools: Fixed authentication issues with AWS Bedrock integration
Configuration Validation: Improved validation for AWS-specific configuration parameters

🙏 Acknowledgements

The v0.1.0 release represents months of collaborative effort from our founding community. Special thanks to early contributors and adopters who helped shape the initial architecture and provided invaluable feedback during the alpha and beta phases.

This milestone release sets the foundation for Envoy AI Gateway's future growth and establishes our commitment to providing enterprise-grade AI service management capabilities.

🔮 What Came Next

The success of the v0.1.x series paved the way for v0.2.0, which introduced:

Azure OpenAI Integration
Sidecar Architecture for Better Performance
Cross-Backend Failover and Retry Logic
Enhanced CLI Tools

v0.1 is now complete

The v0.1.x release is now complete. For the latest features and improvements, check out v0.2.

Next: v0.2.x Series →All Releases

Envoy AI Gateway v0.1.x

v0.1.0

Installation

✨ Core Features​

Unified LLM Provider API

Traffic Management & Rate Limiting

Kubernetes Native Integration

API Design​

📦 Dependencies Versions​

⏩ Patch Releases​

v0.1.5

Bug Fixes

v0.1.4

Bug Fixes

v0.1.3

New Features

Observability Enhancements

Bug Fixes

v0.1.2

Bug Fixes

v0.1.1

Bug Fixes

🙏 Acknowledgements​

🔮 What Came Next​

v0.1 is now complete

✨ Core Features

API Design

📦 Dependencies Versions

⏩ Patch Releases

🙏 Acknowledgements

🔮 What Came Next