These docs track the main branch and may describe unreleased features. The stable documentation lives at docs.docker.com.

Cerebras

Use Cerebras models with docker-agent.

Overview

Cerebras serves open-weight models such as GPT-OSS and GLM through an OpenAI-compatible API on its wafer-scale hardware, delivering some of the highest tokens/sec available. That speed makes it a strong fit for latency-sensitive coding workflows. docker-agent includes built-in support for Cerebras as an alias provider.

Setup

  1. Create an API key from the Cerebras Cloud console.
  2. Set the environment variable:

    export CEREBRAS_API_KEY=your-api-key
    

Usage

Inline Syntax

The simplest way to use Cerebras:

agents:
  root:
    model: cerebras/gpt-oss-120b
    description: Assistant using Cerebras
    instruction: You are a helpful assistant.

Named Model

For more control over parameters:

models:
  cerebras_model:
    provider: cerebras
    model: gpt-oss-120b
    temperature: 0.7
    max_tokens: 8192

agents:
  root:
    model: cerebras_model
    description: Assistant using Cerebras
    instruction: You are a helpful assistant.

Available Models

Cerebras hosts a curated set of open-weight models. Check the Cerebras models documentation for current model IDs, context limits, and pricing.

Model Description
gpt-oss-120b Open-weight GPT-OSS reasoning model with tool calling
zai-glm-4.7 Z.AI GLM-4.7 reasoning model with tool calling

Model IDs are case-sensitive and must be passed exactly as the catalogue lists them. Cerebras may serve additional models not in the built-in catalog; those still work but resolve to default capability metadata locally.

How It Works

Cerebras is implemented as a built-in alias in docker-agent:

Because Cerebras fronts open-weight models whose chat templates may only accept a single leading system message, docker-agent coalesces its per-source system messages (agent instruction plus each toolset’s instructions) into one before sending the request.

Example: Code Assistant

agents:
  coder:
    model: cerebras/gpt-oss-120b
    description: Fast code assistant using Cerebras
    instruction: |
      You are an expert programmer.
      Write clean, well-documented code and follow language best practices.
    toolsets:
      - type: filesystem
      - type: shell
      - type: think