
Envoy AI Gateway v0.1.x

The first official release of Envoy AI Gateway! This milestone includes a unified API for LLM providers, token-based rate limiting, traffic management, and Kubernetes-native integration.

🎉 The First Official Release!

v0.1.0

February 25, 2025
Tags: OpenAI, AWS Bedrock, Rate Limiting, Unified API, Upstream Authentication, Token-Based Rate Limiting
The inaugural release of Envoy AI Gateway lays the foundation for unified AI service management, with support for major LLM providers, token-based rate limiting, and Kubernetes-native integration.

It positions Envoy AI Gateway as a production-ready solution for managing AI services in Kubernetes environments and provides the essential building blocks for enterprise AI infrastructure.

Installation

For installation instructions, see our Getting Started Guide.

✨ Core Features

Unified LLM Provider API

OpenAI Integration

Complete support for OpenAI's GPT models with chat completions and embeddings

AWS Bedrock Support

Native integration with AWS Bedrock for Claude, Titan, and other foundation models

Request/Response Transformation

Automatic translation between different provider APIs for a consistent experience
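
As a rough illustration of this kind of translation, the Python sketch below maps an OpenAI-style chat completion request onto a request body shaped like AWS Bedrock's Converse API. The function name `openai_chat_to_converse` is hypothetical, only plain-string message content and three sampling options are handled, and this is not the gateway's actual translation code.

```python
def openai_chat_to_converse(request: dict) -> dict:
    """Translate an OpenAI-style chat request into a Converse-style body.

    Illustrative only: assumes every message's content is a plain string.
    """
    system, messages = [], []
    for msg in request["messages"]:
        block = {"text": msg["content"]}
        if msg["role"] == "system":
            # Converse carries system prompts in a separate top-level list.
            system.append(block)
        else:
            messages.append({"role": msg["role"], "content": [block]})

    body = {"messages": messages}
    if system:
        body["system"] = system

    # Copy over only the sampling options that the caller actually set.
    option_names = {
        "max_tokens": "maxTokens",
        "temperature": "temperature",
        "top_p": "topP",
    }
    inference = {new: request[old] for old, new in option_names.items() if old in request}
    if inference:
        body["inferenceConfig"] = inference
    return body
```

The model name is deliberately left out of the translated body, since the Converse API takes the model identifier in the request path rather than in the payload.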

Traffic Management & Rate Limiting

Token-Based Rate Limiting

Rate limiting based on actual token consumption rather than request count
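
To make the difference from request counting concrete, here is a minimal Python sketch of a fixed-window limiter that charges each client for the LLM tokens a response reports. The class `TokenBudgetLimiter` and its methods are hypothetical names for illustration. The gateway's real mechanism is not described in these notes and may work differently.

```python
import time


class TokenBudgetLimiter:
    """Fixed-window limiter that charges LLM tokens instead of requests."""

    def __init__(self, budget: int, window_seconds: float, clock=time.monotonic):
        self.budget = budget                  # tokens allowed per window
        self.window_seconds = window_seconds
        self.clock = clock                    # injectable for testing
        self.window_start = clock()
        self.used = 0

    def _roll_window(self) -> None:
        # Start a fresh window once the current one has expired.
        now = self.clock()
        if now - self.window_start >= self.window_seconds:
            self.window_start = now
            self.used = 0

    def allow(self) -> bool:
        """Admit a request only while budget remains in the current window."""
        self._roll_window()
        return self.used < self.budget

    def record_usage(self, total_tokens: int) -> None:
        """Charge the token count reported by the provider for a response."""
        self._roll_window()
        self.used += total_tokens
```

Because the cost of a request is only known once the response reports its usage, this sketch admits a request first and charges afterwards. One large completion can therefore use up a window's budget on its own, while many small ones fit within it.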

Backend Load Balancing

Weighted routing and load balancing across multiple AI service backends
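
Weighted routing can be pictured as a weighted random choice among backends. The Python sketch below assumes a plain mapping from backend name to integer weight. The function `pick_backend` and the backend labels in the usage note are hypothetical, and this is not how the gateway's data plane is implemented.

```python
import random


def pick_backend(weights: dict[str, int], rng: random.Random) -> str:
    """Pick one backend name with probability proportional to its weight."""
    names = list(weights)
    return rng.choices(names, weights=[weights[n] for n in names], k=1)[0]
```

With weights such as `{"openai": 80, "bedrock": 20}`, roughly four out of five requests would go to the first backend over a large number of calls, and a backend with weight 0 is never selected.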

Route-Level Controls

Fine-grained traffic management at the route level with flexible matching

Kubernetes Native Integration

Custom Resource Definitions

Native Kubernetes CRDs for AIGateway, AIServiceBackend, and routing configuration

Operator Pattern

Kubernetes operator for automated lifecycle management and configuration

RBAC Integration

Full integration with Kubernetes RBAC for secure access control

API Design

The v0.1.x series establishes the core API design for Envoy AI Gateway:

  • Initial API Design: Core CRDs including AIGateway, AIServiceBackend, and AIGatewayRoute
  • Gateway API Integration: Built on top of Kubernetes Gateway API specifications
  • OpenAPI Schema: Complete OpenAPI v3 schema for all custom resources

📦 Dependency Versions

Go 1.22

Built with Go 1.22 for optimal performance and security

Envoy Gateway v1.2

Built on Envoy Gateway for proven data plane capabilities

Envoy v1.30

Leveraging Envoy Proxy's battle-tested networking capabilities

Gateway API v1.1

Implementing the latest Gateway API standards

⏩ Patch Releases

v0.1.5

April 3, 2025
Tags: Bug Fix, ExtProc
Final patch in the v0.1.x series addressing ExtProc image synchronization.

Bug Fixes

  • ExtProc Image Syncing: Fixed external processor image synchronization issues
  • Container Registry: Improved container image handling and registry integration

v0.1.4

March 20, 2025
Tags: Bug Fix, AWS
Additional AWS validation and error handling improvements.

Bug Fixes

  • AWS Validation: Fixed validation error handling for AWS Bedrock requests
  • Error Response Format: Improved error response formatting for better debugging

v0.1.3

March 14, 2025
Tags: Bug Fix, Streaming, Metrics
Streaming improvements and enhanced observability with GenAI metrics.

New Features

Observability Enhancements

  • GenAI Metrics: Added comprehensive metrics for AI service performance and usage tracking

Bug Fixes

  • Chat Completion Streaming: Fixed streaming response handling for chat completions
  • Metrics Collection: Resolved issues with metrics collection and reporting

v0.1.2

March 5, 2025
Tags: Bug Fix, Images
Container image and deployment fixes for improved reliability.

Bug Fixes

  • Image Tag Fixes: Fixed rotator and ExtProc container image tag synchronization
  • Deployment Stability: Improved deployment reliability and image pulling

v0.1.1

February 28, 2025
Tags: Bug Fix, AWS
Quick bug fix release addressing AWS tooling issues discovered in the initial release.

Bug Fixes

  • AWS CLI Tools: Fixed authentication issues with AWS Bedrock integration
  • Configuration Validation: Improved validation for AWS-specific configuration parameters

🙏 Acknowledgements

The v0.1.0 release represents months of collaborative effort from our founding community. Special thanks to early contributors and adopters who helped shape the initial architecture and provided invaluable feedback during the alpha and beta phases.

This milestone release sets the foundation for Envoy AI Gateway's future growth and establishes our commitment to providing enterprise-grade AI service management capabilities.

🔮 What Came Next

The success of the v0.1.x series paved the way for v0.2.0, which introduced:

  • Azure OpenAI Integration
  • Sidecar Architecture for Better Performance
  • Cross-Backend Failover and Retry Logic
  • Enhanced CLI Tools

v0.1 is now complete

The v0.1.x release series is complete and will receive no further patches. For the latest features and improvements, see v0.2.