How CloudPeek cut debugging time by 10x with PostHog Logs and Error Tracking
Mar 10, 2026
Contents
Agentic AI platform for cybersecurity
- Cybersecurity
- Engineering
CloudPeek is an agentic platform for next-generation cybersecurity. It sits above an organisation's existing security stack as a reasoning and orchestration layer – connecting to any tool, understanding the specific environment, and letting operators automate complex workflows with full control and auditability.
Craig Hollington is the CTO & co-founder at CloudPeek – and when PostHog launched Logs, his team jumped on it immediately.
The debugging gap
Before PostHog launched Logs, CloudPeek's team didn't have a dedicated logging tool. They were already using PostHog for error tracking, so they'd see exceptions fire – but tracing back to a root cause meant SSHing into servers and searching local log files with grep.

We used to get exception logs come through on PostHog, and then I'd have to log onto the server manually and search our log files which were written locally.
For a fast-moving team shipping AI-driven security tooling across diverse environments, this created a real bottleneck. Issues on a server were manageable. But when something happened on a less accessible device, the trail often went cold.
"If it was on something a bit more niche like a laptop, it became harder or almost impossible to actually diagnose a root cause."
Craig had used dedicated logging tools in previous roles but both were too costly and cumbersome to set up for what CloudPeek needed at this stage. They also looked at Amazon CloudWatch, given their heavy AWS usage, but the setup overhead and custom logging requirements ruled it out.
How CloudPeek debugs now
Today, the team uses two primary debugging workflows. The first starts from a specific exception: they take the URL or ID from the fired alert, narrow the time window down to about a minute either side, and trace through exactly what happened to cause the issue.
They try to really narrow down the time window to about a minute either side of when the alarm fired, so they can trace through very granularly what happened.
The second approach uses PostHog's MCP integration to search more broadly when they don't have a precise starting point.
The team also uses AI-generated test scripts to simulate edge cases and user behavior. They fire these scripts, then watch the logs live to see what breaks and what doesn't – which lets them ship hot fixes quickly.

We automatically get AI to write triage scripts and fire them to see what works, and then from there we review the log files to see what it did and what it didn't do.
Why OpenTelemetry was the deciding factor
CloudPeek works with customers who have strict audit requirements, particularly when working with high consequence organizations. The fact that PostHog Logs uses OpenTelemetry under the hood turned out to be the single biggest factor in their adoption.
Because it's built on an open standard, CloudPeek can tell customers who want their own logs to simply set a few environment variables and point to their own collector. No custom integrations, no bespoke pipelines.
It was a drop-in change for them. Instead of dealing with anything custom, they can tell customers who want their own logs to simply set a few environment variables and they're good to go.
This was also the reason they passed on CloudWatch – OpenTelemetry meant they didn't need to support multiple custom logging systems or rebuild anything when a new customer had different requirements.
The results: 5–10x faster debugging and better reliability

We're probably five to ten times faster at debugging now, and it's enabled us to collaboratively work on issues as a team in a way we just couldn't before.
That collaboration piece has been just as important as the speed. When two or three developers are working on the same issue, they can share links to specific log entries, add comments, and split up the investigation – one person tracing the root cause while another reviews the affected service. Their overall service reliability has improved significantly as a result, with the team now able to identify and resolve issues before they compound.
One of the most impactful changes for CloudPeek is how error tracking and logs work together.

We spot an exception in Error Tracking and pivot straight into Logs to understand the full context of what went wrong. That link between the two is huge for us.
This workflow replaced what used to be a completely disconnected process: seeing an error in one place, then manually hunting through server files in another. Now the investigation stays inside one tool, and the whole team can follow the same trail.
The team has also found that watching Logs live during testing has changed how they ship. When AI-generated test scripts run edge cases against the platform, developers monitor the log output in real time, spot unhandled cases immediately, and push hot fixes without waiting for a bug report.
Why it all works: one tool, one place
CloudPeek's philosophy is that observability shouldn't require jumping between a dozen different dashboards. Having Logs, Error tracking, and LLM analytics in a single platform means the team spends their time investigating problems rather than context-switching between tools.
"We don't want to pivot between 10 different windows, 10 different tools to find a problem. We want to be in one place only."
PostHog is an all-in-one developer platform for building successful products. We provide product analytics, web analytics, session replay, error tracking, feature flags, experiments, surveys, LLM analytics, logs, workflows, endpoints, data warehouse, CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.