Intermediate ⏱️ 6 min

🎓 What is Structured Output?

Techniques for generating reliable JSON, code, and formatted data from LLMs

What is Structured Output?

Structured Output refers to techniques for making language models generate reliable, parseable data formats like JSON, XML, or code that follows specific schemas. This is essential for building reliable AI applications.

The Challenge

LLMs are trained on free-form text and may:

Output invalid JSON (missing brackets, trailing commas)
Include explanatory text mixed with data
Deviate from required schemas
Hallucinate field names

Solution Approaches

1. JSON Mode

API-level guarantee of valid JSON output.

response = client.chat.completions.create(
    model="gpt-4",
    response_format={"type": "json_object"},
    messages=[...]
)

2. Function Calling / Tool Use

Define schemas, get structured responses.

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "parameters": {
            "type": "object",
            "properties": {
                "location": {"type": "string"},
                "unit": {"enum": ["celsius", "fahrenheit"]}
            }
        }
    }
}]

3. Grammar-Constrained Decoding

Enforce output grammar at the token level.

Outlines: Python library for constrained generation
llama.cpp: GBNF grammar support
vLLM: Guided decoding

4. Pydantic / JSON Schema

Define schemas that validate outputs.

from pydantic import BaseModel

class Person(BaseModel):
    name: str
    age: int
    email: str

Comparison

Method	Reliability	Flexibility	Speed
JSON Mode	High	Low	Fast
Function Calling	Very High	Medium	Fast
Grammar Constrained	Perfect	High	Slower
Post-processing	Variable	High	Fast

Best Practices

Be explicit: Include schema in prompt
Use examples: Show expected format
Validate: Always parse and validate output
Retry logic: Handle malformed responses
Temperature: Lower values for more reliable output

Tools & Libraries

Tool	Use Case
Instructor	Pydantic + LLM integration
Outlines	Grammar-constrained generation
Marvin	AI functions with types
LangChain	Output parsers

Prompt Engineering - Crafting effective prompts
Agents - Structured tool use
RAG - Structured retrieval

🎓 What is Structured Output?

What is Structured Output?

The Challenge

Solution Approaches

1. JSON Mode

2. Function Calling / Tool Use

3. Grammar-Constrained Decoding

4. Pydantic / JSON Schema

Comparison

Best Practices

Tools & Libraries

🕸️ Knowledge Mesh

🕸️ Knowledge Graph

What is Structured Output?

The Challenge

Solution Approaches

1. JSON Mode

2. Function Calling / Tool Use

3. Grammar-Constrained Decoding

4. Pydantic / JSON Schema

Comparison

Best Practices

Tools & Libraries

Related Concepts

🕸️ Knowledge Mesh

🕸️ Knowledge Graph