Structured outputs let you get JSON responses that conform to a predefined schema. This guide covers how to configure and use structured outputs.

What Are Structured Outputs?

Instead of free-form text:
The sentiment is positive with high confidence.
Get structured JSON:
{
  "sentiment": "positive",
  "confidence": 0.95,
  "keywords": ["happy", "satisfied", "great"]
}

Setting Up Structured Outputs

In the Web App

1. Create a Schema: Define the output structure in the Schemas section.

2. Set as Structured Output: Mark the schema as “Structured Output” (not “Tool”).

3. Attach to Prompt: Attach the schema to your prompt’s structured output setting.

Schema Example

{
  "type": "object",
  "properties": {
    "sentiment": {
      "type": "string",
      "enum": ["positive", "negative", "neutral"]
    },
    "confidence": {
      "type": "number",
      "minimum": 0,
      "maximum": 1
    },
    "keywords": {
      "type": "array",
      "items": {"type": "string"}
    },
    "reasoning": {
      "type": "string"
    }
  },
  "required": ["sentiment", "confidence"]
}
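
The same structure can also be authored in code and exported with Pydantic's `model_json_schema()`. A minimal sketch, using a stricter variant of the model used later in this guide (Pydantic adds metadata such as `title`, so the output is equivalent rather than byte-identical):

from typing import Literal
from pydantic import BaseModel, Field

class SentimentResult(BaseModel):
    # Literal values become a JSON Schema enum
    sentiment: Literal["positive", "negative", "neutral"]
    # ge/le become minimum/maximum
    confidence: float = Field(ge=0, le=1)
    keywords: list[str] = []
    reasoning: str | None = None

# Fields without defaults are marked required, matching the example above
schema = SentimentResult.model_json_schema()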

Provider Support

Provider     Structured Output Support
Anthropic    Yes (beta)
OpenAI       Yes
Google       Yes

Using in Code

Basic Usage

# `client` is an open MoxnClient; AnalysisInput is your session-data model
session = await client.create_prompt_session(
    prompt_id="sentiment-analysis-prompt",
    session_data=AnalysisInput(text="Great product!")
)

# Structured output schema is included automatically
response = anthropic.messages.create(
    **session.to_anthropic_invocation(),
    extra_headers={"anthropic-beta": "structured-outputs-2025-11-13"}
)

Parsing the Response

from pydantic import BaseModel

class SentimentResult(BaseModel):
    sentiment: str
    confidence: float
    keywords: list[str] = []
    reasoning: str | None = None

# Parse the response (uses stored provider from completion_config)
parsed = session.parse_response(response)
text_content = parsed.candidates[0].content[0].text

# Validate with Pydantic
result = SentimentResult.model_validate_json(text_content)
print(f"Sentiment: {result.sentiment} ({result.confidence:.0%})")

Provider-Specific Handling

Anthropic

Requires the beta header:
response = anthropic.messages.create(
    **session.to_anthropic_invocation(),
    extra_headers={"anthropic-beta": "structured-outputs-2025-11-13"}
)
Generated payload includes:
{
    "output_format": {
        "type": "json_schema",
        "json_schema": {
            "name": "SentimentResult",
            "schema": {...}
        }
    }
}

OpenAI

Works out of the box:
response = openai.chat.completions.create(
    **session.to_openai_chat_invocation()
)
Generated payload includes:
{
    "response_format": {
        "type": "json_schema",
        "json_schema": {
            "name": "SentimentResult",
            "schema": {...},
            "strict": True
        }
    }
}
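
With the OpenAI Chat Completions API, the JSON string arrives as the message content, so you can also validate it without `session.parse_response`. A minimal sketch:

result = SentimentResult.model_validate_json(
    response.choices[0].message.content
)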

Google

response = client.models.generate_content(
    **session.to_google_gemini_invocation()
)
Generated config includes:
{
    "config": {
        "response_schema": {...},
        "response_mime_type": "application/json"
    }
}
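
For Gemini, the `response.text` convenience property returns the JSON string (`session.parse_response` also works here, as with the other providers). A minimal sketch:

result = SentimentResult.model_validate_json(response.text)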

Validation Patterns

Basic Validation

from pydantic import ValidationError

try:
    result = SentimentResult.model_validate_json(response_text)
except ValidationError as e:
    print(f"Invalid response: {e}")

With Telemetry

validation_errors = []
try:
    result = SentimentResult.model_validate_json(response_text)
except ValidationError as e:
    validation_errors = [str(err) for err in e.errors()]
    result = None

# Log with validation status
event = session.create_llm_event_from_parsed_response(
    parsed_response=parsed,
    validation_errors=validation_errors or None
)
await client.log_telemetry_event(event)

Retry on Failure

async def analyze_with_retry(text: str, max_retries: int = 2):
    error_feedback = None
    for attempt in range(max_retries + 1):
        response = await call_llm(error_feedback)
        parsed = session.parse_response(response)

        try:
            return SentimentResult.model_validate_json(
                parsed.candidates[0].content[0].text
            )
        except ValidationError as e:
            if attempt == max_retries:
                raise
            # Add the error to the context for the next attempt
            # (a `call_llm` sketch follows below)
            error_feedback = str(e)
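
To make the feedback loop concrete, here is a minimal sketch of a hypothetical `call_llm` built on the Anthropic invocation shown earlier. The appended user message is illustrative, not part of the Moxn API:

async def call_llm(error_feedback: str | None = None):
    # Hypothetical helper: reuse the session's stored invocation and, on
    # retries, append the previous validation error as a user message
    invocation = session.to_anthropic_invocation()
    if error_feedback:
        invocation["messages"] = list(invocation["messages"]) + [{
            "role": "user",
            "content": (
                "The previous response failed schema validation:\n"
                f"{error_feedback}\n"
                "Return JSON that matches the schema exactly."
            ),
        }]
    return anthropic.messages.create(
        **invocation,
        extra_headers={"anthropic-beta": "structured-outputs-2025-11-13"},
    )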

Schema Design Tips

Use Enums for Categories

{
  "category": {
    "type": "string",
    "enum": ["bug", "feature", "question", "other"]
  }
}

Set Bounds for Numbers

{
  "score": {
    "type": "number",
    "minimum": 0,
    "maximum": 10
  }
}

Use Required Fields

{
  "required": ["category", "confidence"]
}

Allow Additional Context

Include a reasoning field for explainability:
{
  "reasoning": {
    "type": "string",
    "description": "Explanation for the classification"
  }
}
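
These tips translate directly to Pydantic if you author schemas in code. A sketch combining all four (field names follow the snippets above):

from typing import Literal
from pydantic import BaseModel, Field

class TicketClassification(BaseModel):
    # Enum for categories; no default, so it is required
    category: Literal["bug", "feature", "question", "other"]
    # Bounded number, also required
    confidence: float = Field(ge=0, le=1)
    # Bounded score, optional
    score: float | None = Field(default=None, ge=0, le=10)
    # Additional context for explainability
    reasoning: str | None = Field(
        default=None, description="Explanation for the classification"
    )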

Tools vs Structured Outputs

Aspect             Tools (Function Calling)      Structured Outputs
Use when           LLM decides to use a tool     Always want JSON
Control            LLM chooses when              Guaranteed
Response           May include tool calls        Pure JSON
Multiple schemas   Yes (multiple tools)          One schema

Use tools when: the LLM should decide whether to call a function.
Use structured output when: you always want a specific JSON format.

Combining with Tools

You can have both tools and structured output:
  • Tools for actions (search, calculate, etc.)
  • Structured output for the final response format
# Prompt has both tools and structured output configured
response = anthropic.messages.create(
    **session.to_anthropic_invocation(),
    extra_headers={"anthropic-beta": "structured-outputs-2025-11-13"}
)

# Response may include tool calls followed by structured output
for block in response.content:
    if block.type == "tool_use":
        # Handle tool call
        result = execute_tool(block.name, block.input)
    elif block.type == "text":
        # Parse structured output
        output = SentimentResult.model_validate_json(block.text)

Complete Example

from moxn import MoxnClient
from anthropic import Anthropic
from pydantic import BaseModel, ValidationError

class ExtractionInput(BaseModel):
    # Session-data model for the prompt's input variables
    document: str

class ExtractionResult(BaseModel):
    entities: list[str]
    summary: str
    category: str
    confidence: float

async def extract_from_document(text: str):
    async with MoxnClient() as client:
        session = await client.create_prompt_session(
            prompt_id="extraction-prompt",
            session_data=ExtractionInput(document=text)
        )

        async with client.span(
            session,
            name="extract_entities",
            metadata={"doc_length": len(text)}
        ) as span:
            anthropic = Anthropic()
            response = anthropic.messages.create(
                **session.to_anthropic_invocation(),
                extra_headers={"anthropic-beta": "structured-outputs-2025-11-13"}
            )

            parsed = session.parse_response(response)

            # Validate structured output
            validation_errors = []
            try:
                result = ExtractionResult.model_validate_json(
                    parsed.candidates[0].content[0].text
                )
            except ValidationError as e:
                validation_errors = [str(err) for err in e.errors()]
                result = None

            # Log with validation status in event attributes
            event = session.create_llm_event_from_parsed_response(
                parsed_response=parsed,
                validation_errors=validation_errors or None,
                attributes={
                    "extraction_success": result is not None,
                    "entity_count": len(result.entities) if result else 0
                }
            )
            await client.log_telemetry_event(event)

            return result

Next Steps