Every kind of production traffic,
from medicine to entertainment.
We take production telemetry from every domain: clinical reasoning, legal drafting, financial analysis, code review, customer support, marketing copy, research workflows, entertainment, and creative work. If a real team is running a real model against it in production, it has a place in the research corpus.
Production traces,
by vertical.
Full agent traces: system prompt, attached exports, tool calls, subagents, and completion. Samples are redacted; the corpus contains millions of traces per vertical.
Samples shown are redacted excerpts from contributor traces. All PII is removed at ingest via three-pass redaction; tenant identifiers are masked. GOAT labs does not provide medical, legal, or financial advice. Model and vendor names are trademarks of their respective owners.
Get cashback on your tokens,
advance the research.
We pay 5–15% of the original model's output-token price. The rate scales with trace complexity — function calls, external API lookups, multi-turn depth, and multi-agent activity all push the rate higher.
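As a rough illustration of the arithmetic, here is a minimal sketch in Python. It assumes the rate has already been chosen somewhere in the 5–15% band and that the model's output price is quoted per million tokens. The function name `estimate_cashback`, its parameters, and the example price are hypothetical, and how trace complexity maps to a specific rate is not specified here, so the rate is simply passed in.

```python
from decimal import Decimal

# Rate band stated above: 5-15% of the original output-token price.
MIN_RATE = Decimal("0.05")
MAX_RATE = Decimal("0.15")


def estimate_cashback(output_tokens: int, price_per_million_usd: str, rate: str) -> Decimal:
    """Estimate cashback in USD for a batch of output tokens.

    Hypothetical helper: the per-million pricing unit and the way the
    rate is supplied are assumptions made for this example.
    """
    rate_d = Decimal(rate)
    if not (MIN_RATE <= rate_d <= MAX_RATE):
        raise ValueError("rate must be between 0.05 and 0.15")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")

    # What the output tokens originally cost at the model's list price.
    original_cost = Decimal(output_tokens) / Decimal(1_000_000) * Decimal(price_per_million_usd)

    # Cashback is a fraction of that cost, rounded to whole cents.
    return (original_cost * rate_d).quantize(Decimal("0.01"))


# Example: 2M output tokens at an assumed $15.00 per million, at a 10% rate.
print(estimate_cashback(2_000_000, "15.00", "0.10"))  # 3.00
```

With the same assumed volume and price, the low end of the band (5%) would give 1.50 and the high end (15%) would give 4.50.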
- 01
Observability platform
Bring a read-only API key from Braintrust, Langfuse, Datadog, Laminar, Arize, Helicone, LangSmith, or one of 10+ other platforms. No code to write: we pull from your existing pipeline.
- 02
Editor or CLI tool
Connect Cursor, Claude Code, or OpenAI Codex in one click. We tail your usage endpoint and calculate cashback on every token.