---
title: "Cloudflare Workers AI - Edge AI Inference Platform"
description: "Run AI inference globally with one API call. 50+ models, serverless pricing, OpenAI-compatible API, and inference in 200+ cities worldwide."
url: "https://www.cloudflare.com/products/workers-ai"
---

# Workers AI

> Workers AI lets you run AI inference globally with one API call. No GPUs to manage, no capacity planning. Just intelligent machine learning models  running where they're needed, on Cloudflare's global network.

## Key Features

- 100+ AI models available
- LLMs: Llama 3, Mistral, Gemma
- Image: Stable Diffusion, FLUX
- Audio: Whisper, TTS
- Embeddings: BGE, multilingual models
- LoRA fine-tuning support
- Streaming responses

## Benefits

### Serverless pricing

Pay-per-inference pricing with no idle costs. No guessing what.

### Rich model catalog

50+ models running close to users in 200+ cities

### Widely compatible

One API call, works with any OpenAI SDK or task type

## Use Cases

### Image generation

Execute image generation, manipulation, and creative workflows without managing GPU infrastructure. Perfect for content platforms, social apps, and creative tools.

### Speech-to-text, in real-time

Transcribe, analyze, and generate audio content without specialized infrastructure. Built for voice agents, note-taking apps, and media processing.

### Embeddings

Create intelligent search, recommendations, and context-aware features using vector embeddings. Seamlessly integrates with Vectorize AI Search for complete AI workflows.

### LLMs

Perform a wide range of natural language tasks. Use large language models for text generation, classification, question answering, and other complex language-based operations through a simple API.

## Resources

- [Full Documentation](https://developers.cloudflare.com/workers-ai/): Complete technical documentation
- [Get Started](https://dash.cloudflare.com/sign-up): Sign up and start building
- [Pricing](/plans.md): See pricing details

## Related Products

- [Agents](/products/agents.md): Build stateful AI agents
- [AI Gateway](/products/ai-gateway.md): AI observability
- [AI Search](/products/ai-search.md): Instant retrieval
- [Vectorize](/products/vectorize.md): Vector database

---

*This is a markdown version of [https://www.cloudflare.com/products/workers-ai](https://www.cloudflare.com/products/workers-ai) for AI/LLM consumption.*
