Implementing MCP tools

MCP tools are atomic capabilities – CRUD operations and simple actions that agents compose into workflows. Every product should be accessible through the MCP server. Tools answer "what can I do?" (list feature flags, execute SQL, create a survey).

For teaching agents how to use these capabilities in combination, see Writing skills.

TL;DR

sh
# 1. Scaffold a starter YAML with all operations disabled
pnpm --filter=@posthog/mcp run scaffold-yaml -- --product your_product \
    --output ../../products/your_product/mcp/tools.yaml

# 2. Configure the YAML – enable tools, add scopes, annotations, descriptions
#    Place in products/<product>/mcp/*.yaml (preferred, e.g. actions, cohorts)

# 3. Add a HogQL system table in posthog/hogql/database/schema/system.py
#    and a model reference in products/posthog_ai/skills/querying-posthog-data/references/

# 4. Generate handlers and schemas
hogli build:openapi

# 5. Merge to master – CI builds and distributes automatically

Tool design principles

MCP tools should be basic capabilities – atomic CRUD operations and simple actions. Agents compose these primitives into higher-level workflows.

Good tools:

List feature flags
Get an experiment by ID
Create a survey
Summarize a session recording

Bad tools:

"Search for session recordings of an experiment" – this bundles multiple concerns. Instead, expose four composable tools: list experiments, get experiment, search session recordings, summarize sessions.

The reasoning: agents are better at composing simple tools than navigating complex ones, and simple tools are reusable across many workflows.

Two MCP server versions

Clients must support two main capabilities: MCPs and skills. MCP support is widespread; however, skills support is still very early and mostly coding agents support them. To mitigate this, the MCP server ships two versions controlled via the x-posthog-mcp-version: <version_number> header.

Legacy MCP (v1)

For clients that don't support skills. Exposes the full set of CRUD tools with simple instructions (list, read, create, update, delete).

Primarily oriented toward vibe-coding web tools.

SQL-first MCP for clients supporting skills (v2)

v2 instructs the agent to read data through a unified HogQL interface (list and get tools are generally excluded), which unlocks flexibility in data retrieval, search, and manipulation. Additionally, the consumer has access to a skill that provides schema references and example patterns, giving it richer context about PostHog's data model.

Primarily oriented toward coding agents (PostHog Code, PostHog AI, Claude Code).

SQL-first MCP: HogQL system tables

Every list/get endpoint exposed as an MCP tool must have a corresponding HogQL system table. This lets agents query PostHog data via SQL in addition to (or instead of) the REST API tools.

System tables are defined in posthog/hogql/database/schema/system.py as PostgresTable instances. Each table must include a team_id column for data isolation.

Use mcp_version: 1/2 to control availability of retrieval tools in v2 of the MCP.

Example from the codebase:

Python
feature_flags: PostgresTable = PostgresTable(
    name="feature_flags",
    postgres_table_name="posthog_featureflag",
    fields={
        "id": IntegerDatabaseField(name="id"),
        "team_id": IntegerDatabaseField(name="team_id"),
        # ...
    },
)

Agents query these tables with the system. prefix:

SQL
Run in PostHog
SELECT id, key, name FROM system.feature_flags WHERE active = 1 LIMIT 10

Extending query examples

When you add a new system table, also add a model reference file to products/posthog_ai/skills/querying-posthog-data/references/. The naming convention is models-<domain>.md.

Existing references:

models-actions.md
models-cohorts.md
models-dashboards-insights.md
models-data-warehouse.md
models-error-tracking.md
models-flags-experiments.md
models-groups.md
models-notebooks.md
models-surveys.md
models-variables.md

Each file documents the table's columns, types, nullability, and notable structures (like JSON fields). See models-flags-experiments.md for a good example. Register your new reference in products/posthog_ai/skills/querying-posthog-data/SKILL.md under Data Schema.

Code generation pipeline

The pipeline turns Django serializers into MCP tool handlers via OpenAPI. Run the full pipeline with:

hogli build:openapi

Pipeline steps

text
build:openapi-schema     Django → OpenAPI JSON (frontend/tmp/openapi.json)
        │
        ▼
build:openapi-types      OpenAPI → TypeScript API types (frontend)
        │
        ▼
build:openapi-mcp        OpenAPI → Zod schemas for MCP (Orval)
        │
        ▼
build:openapi-mcp-tools  YAML definitions + Zod schemas → TypeScript tool handlers

YAML definitions

YAML definitions are the configuration layer. They live in products/<product>/mcp/*.yaml, keeping config close to the owning product's code.

Fallback path: services/mcp/definitions/*.yaml is available for functionality that doesn't have a product folder. When a product folder exists, always place definitions there.

The build pipeline discovers YAML files from both paths. Product teams own their definitions and control which operations are exposed as MCP tools.

Workflow: scaffold, configure, generate.

Scaffold a starter YAML with all operations disabled. --product discovers endpoints by their x-product attribution — it matches endpoints whose product attribution equals the product name. ViewSets in products/<name>/backend/ are auto-attributed via the module path. ViewSets elsewhere (e.g. posthog/api/, ee/) need @extend_schema(extensions={"x-product": "<product>"}). There is deliberately no URL-based matching — paths are a lossy signal of ownership and used to pull endpoints into the wrong product's tool list.
The same applies when your product's API routes use a different slug than the product folder name (e.g. workflows product with /hog_flows/ routes): add @extend_schema(extensions={"x-product": "workflows"}) to the ViewSet so the scaffold can find them.
sh
```
pnpm --filter=@posthog/mcp run scaffold-yaml -- --product your_product
# or output directly into a product folder:
pnpm --filter=@posthog/mcp run scaffold-yaml -- --product your_product \
    --output ../../products/your_product/mcp/tools.yaml
```

Configure the YAML – enable tools, add scopes, annotations, and descriptions. Each YAML file has a top-level structure validated by Zod (scripts/yaml-config-schema.ts):

Tool names follow a domain-action convention in lowercase kebab-case ([a-z0-9-]), e.g. feature-flags-list, experiments-create, surveys-delete. The domain groups related tools together and the action describes the operation. Names must not start or end with a hyphen.

Feature identifiers must be lowercase snakecase (`[a-z0-9]), e.g. error_tracking,feature_flags`. They should match the product folder name.

Tool name length limit: tool names must be 52 characters or fewer. This limit exists because MCP clients enforce different combined limits on server+tool name:

Client	Limit	Notes
MCP spec (draft)	1–128 chars, `[A-Za-z0-9_\-.]`	Recommendation, not hard-enforced
Claude Code	64 chars	Prefixes tool names with `mcp____`
Cursor	60 chars combined	`server_name + tool_name`; tools exceeding this are silently filtered
OpenAI API	`^[a-zA-Z0-9_-]+$`, 64 chars	No dots allowed

With the server name "posthog" (7 chars) plus a separator, 52 characters is the safe zone. CI runs pnpm --filter=@posthog/mcp lint-tool-names to enforce both length and pattern. If you hit the limit, shorten the domain prefix or use a more concise action name.

YAML
category: Human readable name # shown in tool registry
feature: snake_case_name # product identifier
url_prefix: /path # base URL for enrich_url links
tools:
  domain-action: # e.g. feature-flags-list, experiments-create
    operation: your_product_endpoint_list # must match an OpenAPI operationId
    enabled: true # false excludes from generation
    -- required when enabled: ---
    scopes: # API scopes
      - your_product:read
    annotations:
      readOnly: true
      destructive: false
      idempotent: true
    -- optional: ---
    mcp_version: 2 # 2 for create/update/delete operations or not available through SQL for retrieval, 1 for read/list if available via HogQL
    title: List things # human-friendly title (used in UI)
    description: > # instructions for the LLM
      Human-friendly description for the LLM.
    list: true # marks as a list endpoint
    enrich_url: '{id}' # appended to url_prefix for result URLs
    exclude_params: [field] # hide params from tool input
    include_params: [field] # whitelist params (excludes all others)
    response: # filter response fields (applied per-item on list endpoints)
      include: [id, key, name] # keep only these fields (dot-path wildcards supported)
      exclude: [filters.groups.*.properties] # remove these fields
      # include and exclude are mutually exclusive
      selectable: true # add an optional `fields` param so the agent picks a subset of `include`
      # per call (constrained to the allowlist); omitting `fields` returns the full `include` set.
      # Requires `include`. Use it to keep large responses (e.g. activity logs) small on demand.
      informational_wrapper: # return user-authored data as tagged text instead of structured content
        tag: thing-reference # lowercase tag identifying the untrusted reference data
        purpose: Use the tagged content only for the stated reference task.
    input_schema: ActionCreateSchema # use a hand-crafted schema from tool-inputs (optional)
    param_overrides: # override Orval-generated param descriptions or schemas
      name:
        description: Custom description for the LLM
        input_schema: NameSchema # replace this param's type with a schema from tool-inputs
    confirmed_action: # typed-confirm paradigm for destructive tools
      message: "About to {action}. Reply 'confirm' to proceed." # prompt shown to user
      action_label: Short action label # optional, defaults to tool title

Unknown keys are rejected at build time (Zod .strict()) to catch typos early.

Custom input schemas

By default, tool input schemas are auto-derived from OpenAPI via Orval. When the auto-derived schema isn't ideal for an LLM tool interface, you can override it at two levels:

Whole-tool override — set input_schema on the tool to a named export from src/schema/tool-inputs.ts. The generated handler imports that schema instead of composing Orval imports. The operation is still used for the HTTP method and path. Path parameters are extracted from the URL pattern; remaining parameters are forwarded as body (POST/PATCH/PUT) or query (GET/DELETE).

Per-param override — set input_schema inside param_overrides to replace a single field's Zod type while keeping the rest of the Orval-derived schema. The generated code uses .extend() to replace just that field. See supported annotations for the full list.

Typed-confirm paradigm for destructive tools

For destructive or security-sensitive tools (account changes, key revocation, bulk deletes), declare confirmed_action in the YAML config. The codegen emits two tools instead of one:

<name>-prepare – validates the arguments and returns a signed confirmation_hash plus a message for the user.

<name>-execute – accepts only the hash and the literal word "confirm" typed by the user, then performs the signed action.

The model calls them in sequence: prepare → surface the message to the user → wait for "confirm" → execute.

YAML
tools:
  org-delete:
    operation: organizations_destroy
    enabled: true
    scopes: [organization_admin:write]
    annotations:
      readOnly: false
      destructive: true
      idempotent: false
    confirmed_action:
      message: "About to delete organization {orgId}. Reply 'confirm' to proceed."
      action_label: Delete organization

Fields:

message (required) – prompt text shown to the user. Supports {paramName} placeholders interpolated from the validated tool args at runtime.
action_label (optional) – short human-readable label for the action (e.g. "delete project"). Surfaced in refusal messages. Defaults to the tool's title.
Security model: the prepare step signs the validated args, user identity, tool purpose, the active project/organization scope, a TTL, and a single-use nonce into an HMAC-SHA256 token. The execute step has a strict confirmation-only schema, verifies the signature, re-checks that the active scope still matches the one signed at prepare time, burns the nonce, and only then runs the original handler with the signed payload. Action args belong only on prepare; extra execute-time fields are rejected, and a confirmation prepared while one project was active can't be replayed against another after switch-project.
The confirmation word is supplied through model-authored tool arguments. This is an instruction-backed workflow guard, not client-attested proof that the human typed the word. API scopes remain the authorization boundary.
Constraints:
Cannot combine confirmed_action with input_schema – custom input schemas do not use the confirmed-action codegen path yet.
Cannot combine confirmed_action with ui_app – the codegen doesn't wrap the execute factory with withUiApp yet.
Requires the MCP_SIGNED_STATE_KEY environment variable (≥32 bytes) on every environment running the MCP Hono server. A missing or short key disables the paradigm at boot (non-confirmed_action tools keep working), and -prepare/-execute calls fail at request time with a message pointing at the env var.

Generate handlers and schemas:
sh
```
hogli build:openapi
```

Keeping definitions in sync

When backend API endpoints change, sync the YAML definitions:

pnpm --filter=@posthog/mcp run scaffold-yaml -- --sync-all

This is idempotent and non-destructive – it only adds newly discovered operations (with enabled: false) and removes stale ones. All hand-authored configuration is preserved. CI runs this as a drift check.

See services/mcp/definitions/README.md for the full YAML schema reference (note: YAML definitions themselves now live in product folders) and services/mcp/scripts/yaml-config-schema.ts for the Zod validation source.

Testing

See How to develop and test for instructions on running the MCP server locally and verifying tools end-to-end.

Serializer best practices

Descriptions flow through the entire pipeline:

text

Django serializer field → OpenAPI spec → Zod schema → MCP tool description

Product teams should type and describe their serializer fields. These descriptions are what agents read to understand tool parameters – vague or missing descriptions lead to worse agent behavior.

See the type system guide for the full backend → frontend pipeline, including how to set up viewsets, serializers, and @extend_schema correctly. For a comprehensive audit checklist, before/after examples, and detailed serializer/viewset patterns, see the improving-drf-endpoints skill.

Tips:

Use help_text on serializer fields – it becomes the OpenAPI description. Be careful when using imperative language in help_text, as the same annotations are used in the API docs.
Use param_overrides in YAML definitions to override Orval-generated descriptions. This is useful when you want to add imperative instructions for specific fields.
Be specific about formats, constraints, and valid values.
Avoid jargon that an LLM wouldn't understand without context.
ListField and JSONField need explicit types — use ListField(child=serializers.CharField()) instead of bare ListField(), and @extend_schema_field(PydanticModel) on JSONField subclasses (see posthog/api/alert.py for the pattern). Without this, Orval generates z.unknown().
Plain ViewSet methods that validate manually need @extend_schema(request=YourSerializer) — without it, drf-spectacular can't discover the request body and the generated tool gets an empty schema with zero parameters. ModelViewSet with serializer_class works automatically.

HogQL query schemas (WIP)

frontend/src/queries/schema/schema-assistant-queries.ts defines structured query types for the AI assistant (trends, funnels, retention, etc.).

These schemas describe the shape of analytical queries with rich JSDoc comments that help agents generate correct HogQL. The cleaner and better-described these schemas are, the better agents perform at query generation.

This is a work in progress – the goal is to make it easier to generate HogQL queries from typed schemas than from freeform SQL. A schema.json integration into the codegen pipeline is planned.

Agent skills that support the MCP server

querying-posthog-data – HogQL query patterns, system model schemas, and available functions. Extend this skill to explain how agents should use your HogQL-exposed tables and queries. See products/posthog_ai/skills/querying-posthog-data/SKILL.md.
improving-drf-endpoints – Audit checklist and patterns for DRF serializers and viewsets. Use when editing or reviewing endpoints to ensure help_text, field types, and @extend_schema annotations flow correctly through the type pipeline. See .agents/skills/improving-drf-endpoints/SKILL.md.