POST /v1/messages) is compatible with the Anthropic Messages API standard. Existing Anthropic SDK clients can target SambaNova by changing only the base URL, API key, and model identifier. This endpoint is designed for conversational, tool-capable, and reasoning-oriented integrations.
The Messages API complements the existing Chat Completions and Responses API endpoints; it does not replace them.
Supported models
In the initial release, the Messages API is available forgpt-oss-120b. Additional models may be added in later releases — see the supported models page for the current list.
How it works
The Messages API structures model output as typed content blocks —text, tool_use, and thinking — rather than a single assistant text field. Each request returns a message object containing one or more of these blocks, depending on the model’s behavior.
Key characteristics:
- Client-executed tools only. When a tool is needed, the model returns a
tool_usecontent block. Your application executes the function and returns the result in a follow-up request via atool_resultcontent block. Server-side tools are not supported. - Thinking passthrough. Reasoning-capable models expose thinking content via a
thinkingcontent block alongside thetextblock, with no extra request parameters. - System prompt as a top-level field. Unlike Chat Completions, Anthropic-style requests pass the system prompt via the top-level
systemfield rather than as a message withrole: "system". - Structured streaming. Streaming responses use typed Server-Sent Events (SSE) following the event sequence:
message_start→content_block_start→content_block_delta→content_block_stop→message_delta→message_stop.
Limitations
Read these before migrating an existing Anthropic-based application:- Server-side tools (
web_search,code_execution,bash,text_editor) are not supported and return a400error. Only client-executed function tools are available. documentcontent blocks (PDF input) are not supported and return a400error.- URL image sources are not supported. Use base64-encoded images instead.
Usage
All examples below use the Anthropic Python SDK pointed at SambaNova. Install withpip install anthropic and configure as shown in the Anthropic compatibility page.
Simple generation
Streaming response
Useclient.messages.stream(...) to receive typed SSE events as the response is generated.
Thinking
Reasoning-capable models expose thinking content via athinking content block alongside the text block. No additional parameters are required; thinking content is surfaced automatically when the model produces it.
Tool calling
When tools are provided, the model may return atool_use content block. Your application is responsible for executing the function and returning the result.
Step 1: Send a request with tools defined.
client, response, and tools are reused.

