These docs track the main branch and may describe unreleased features. The stable documentation lives at docs.docker.com.

Cloudflare Workers AI

Use Cloudflare Workers AI models with docker-agent.

Overview

Cloudflare Workers AI runs open-weight models (Llama, Mistral, Qwen, Gemma, and more) on Cloudflare’s global edge network through an OpenAI-compatible endpoint. No separate provider accounts are needed for the supported models. docker-agent includes built-in support for Workers AI as an alias provider.

Setup

Workers AI is account-scoped, so its base URL is resolved from your account ID. Two environment variables are required:

export CLOUDFLARE_ACCOUNT_ID=your-account-id
export CLOUDFLARE_API_TOKEN=your-api-token

Create an API token with the Workers AI permission from the Cloudflare dashboard. Your account ID is shown on the Workers AI page.

Usage

Workers AI model IDs use the @cf/... form (for example @cf/meta/llama-3.1-8b-instruct).

Inline Syntax

agents:
  root:
    model: cloudflare-workers-ai/@cf/meta/llama-3.1-8b-instruct
    description: Assistant using Cloudflare Workers AI
    instruction: You are a helpful assistant.

Named Model

For more control over parameters:

models:
  cloudflare_model:
    provider: cloudflare-workers-ai
    model: "@cf/meta/llama-3.1-8b-instruct"
    temperature: 0.7
    max_tokens: 8192

agents:
  root:
    model: cloudflare_model
    description: Assistant using Cloudflare Workers AI
    instruction: You are a helpful assistant.

Available Models

Check the Workers AI models catalog for the current list, IDs, and pricing.

Model Description
@cf/meta/llama-3.1-8b-instruct Meta Llama 3.1 8B Instruct
@cf/mistralai/mistral-small-3.1-24b-instruct Mistral Small 3.1 24B Instruct
@cf/qwen/qwen2.5-coder-32b-instruct Qwen 2.5 Coder 32B Instruct

How It Works

Cloudflare Workers AI is implemented as a built-in alias in docker-agent:

The base URL is templated: ${CLOUDFLARE_ACCOUNT_ID} is substituted from the environment when the provider is built, so CLOUDFLARE_ACCOUNT_ID must be set in addition to CLOUDFLARE_API_TOKEN. Because Workers AI serves open-weight models with strict chat templates, docker-agent coalesces consecutive system messages into a single leading one for this provider.