How we use PostHog's built-in data warehouse

Ian Vanagas

Jul 30, 2025

PostHog's data warehouse is our most powerful feature. It lets you sync data from the tools you already use like Stripe, Salesforce, and Hubspot, query it alongside your existing product data using SQL, and visualize it natively.

We built it because the modern data stack sucks. What starts as a handful of business critical tools devolves into dozens of tools, many specifically built to capture, clean, format, load, query, and visualize data.

We knew it didn't have to be this way, so we built the data warehouse to get rid of all this complexity and give you a single source of truth for all your business data.

We sync hundreds of millions of rows from Postgres, Stripe, Temporal, and more on top of the tens of millions of events we capture each day.

We've created over 1,600 SQL insights and visualizations using our data warehouse so far. It's our second most-used insight type behind trends, which was around long before we had the data warehouse.

To help you get started, we're sharing how teams at PostHog use our built-in data warehouse and custom SQL insights to answer critical business questions, like:

Which customers churned and how did it impact revenue?
Which customers are increasing their spend?
Who are our biggest customers and what products are they using?
Are we achieving our customer support goals?
What are the biggest sales opportunities in our pipeline?

We're also sharing the actual SQL queries we use to answer these questions as well as the insights and dashboards we use to visualize the data (we've faked the data for the screenshots though).

Modifying queries to your data

Because our data structure is unique, the SQL queries included here likely won't work out-of-the-box for you. Luckily, PostHog AI makes modifying SQL easy. Just paste these queries in as context and ask it to use your data instead.

Problem #1: Understanding growth and churn

Sources:
- Postgres for billing data
- PostHog for usage and activation data
- Salesforce for sales context (e.g. ICP scores)
Tables: postgres_invoice_with_annual_view, postgres_billing_customer, events, salesforce.contact, postgres.posthog_team, postgres.posthog_organization, postgres_billing_usagereport

Growth and churn

Engineers make the product decisions at PostHog, but they'd be lost without the context product managers provide.

One of the ways PMs give them this context is through monthly growth reviews, where they explore:

Who churned, how much was their churn, and why (so we can prevent future churn)
Product-specific activation and retention rates, using our custom definitions.
Metrics for their growth reviews, like revenue expansion and contraction.

This analysis is only possible when you combine product and revenue data, so the data warehouse and SQL insights are their weapon of choice.

Most companies would use our Stripe source to import revenue data, but we don't. Since we have multiple different products with different usage-based pricing, we need a custom billing service to handle everything. The data from this service goes into Postgres, which then gets synced and used in PostHog.

We import millions of rows from our billing tables in our Postgres database into PostHog.

Product managers combine this billing data from Postgres with product analytics event and property data collected by PostHog, and some additional supplementary customer info imported from Salesforce.

From here they create specific SQL insights and combine them into a single dashboard like this one for error tracking:

It includes insights like:

Top 50 error tracking customers by volume

SQL query for top 50 error tracking customers by volume

SQL

How we use PostHog's built-in data warehouse

Contents

Contents

Problem #1: Understanding growth and churn

Top 50 error tracking customers by volume

Organizations regularly capturing exceptions (monthly)

Error tracking churn for June

Using views to create reusable queries

Problem #2: Tracking revenue

US/EU revenue split

Revenue lifecycle

Revenue per product

Problem #3: Creating quarterly sales and customer success reports

Salesforce opportunities by quarter

Salesforce open pipeline by quarter (annual only)

Sales and CS managed accounts start of quarter ARR

Using variables

Problem #4: Creating support reports (SLA, first response time, and more)

Breached non-escalated tickets in the last 7 days

Escalated SLA % last 7 days by team

Time UTC when tickets are created (last 6 months)

Problem #5: Tracking startup and YC plan growth, costs, and ROI

Startup plan customer count (not YC)

Previous startup & YC plan customers revenue per customer

Startup & YC plan customers cohorts by starting month

How should you get started?

Community questions