Feature - Yesterday Audit

Every brief opens with an audit of the previous brief.

Most research products in this space don’t look back. When yesterday’s call was wrong, it quietly disappears and tomorrow’s gets fresh attention. The reader has no structured way to know whether the service has been right or wrong over time.

We do the opposite. Every brief opens with a Yesterday Audit: per-bucket hit rate, calibration error per prediction type, drift flags, and a retrospective-vs-live disclosure.

What a populated audit section looks like

Excerpt - Yesterday Audit, live predictions

YESTERDAY AUDIT - brief=BRIEF_2026_06_04 predictions_made=12 resolved=11
hit rate by confidence bucket:
- touch watch / very_high: n=2, hit_rate=100%, mean_p=88%, calibration_error=-0.12
- touch watch / high: n=5, hit_rate=80%, mean_p=72%, calibration_error=-0.08
- touch watch / moderate: n=3, hit_rate=33%, mean_p=58%, calibration_error=+0.25
- avoidance / high: n=1, hit_rate=100%, mean_p=100%, calibration_error=0.00

Each row pairs the bucket’s mean predicted probability with its actual hit rate. The calibration error is the mean predicted probability minus the hit rate, expressed as a fraction. A small positive error means we were a touch over-confident; a small negative error means we under-priced the move. Errors greater than 0.08 in absolute value trigger a drift flag on the public dashboard and a note in the next brief’s confidence section.
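The arithmetic behind each row can be sketched in a few lines of Python. This is an illustration only, not the service's actual code: the class and field names are hypothetical, the hit counts are inferred from the n and hit_rate values in the excerpt above, and the sign convention (mean predicted probability minus hit rate) is the one implied by the excerpt's numbers. Errors are rounded to two decimals before the threshold check, as in the excerpt, so a bucket sitting exactly at 0.08 is not flagged by floating-point noise.

```python
from dataclasses import dataclass

# From the text: errors larger than 0.08 in absolute value raise a drift flag.
DRIFT_THRESHOLD = 0.08


@dataclass
class BucketAudit:
    """One row of the audit (hypothetical structure, for illustration)."""
    label: str
    n: int         # resolved predictions in the bucket
    hits: int      # predictions that came true
    mean_p: float  # mean predicted probability, as a fraction in [0, 1]

    @property
    def hit_rate(self) -> float:
        return self.hits / self.n

    @property
    def calibration_error(self) -> float:
        # Assumed convention: positive = over-confident, negative = under-confident.
        return round(self.mean_p - self.hit_rate, 2)

    @property
    def drift_flag(self) -> bool:
        return abs(self.calibration_error) > DRIFT_THRESHOLD


# Rows from the excerpt; hit counts inferred from n and hit_rate.
buckets = [
    BucketAudit("touch watch / very_high", n=2, hits=2, mean_p=0.88),
    BucketAudit("touch watch / high", n=5, hits=4, mean_p=0.72),
    BucketAudit("touch watch / moderate", n=3, hits=1, mean_p=0.58),
    BucketAudit("avoidance / high", n=1, hits=1, mean_p=1.00),
]

for b in buckets:
    flag = "DRIFT" if b.drift_flag else "ok"
    print(f"{b.label}: calibration_error={b.calibration_error:+.2f} [{flag}]")
```

Under these assumptions the very_high and moderate buckets are flagged, while the high bucket, at exactly -0.08, is not.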

When the audit window includes retrospective replay

New customers see briefs days after we’ve started publishing them, which means the live-collected outcome log is thin at first. To fill the gap, we replay historical bundles through the same brief generator to produce retrospective predictions, then resolve them against actual bar data.

Retrospective replay is useful as a directional check on the engine’s calibration. It is not the same as a live track record - the bundle was fit on data that overlaps the replayed dates, which can encode hindsight in subtle ways.

When more than half of the resolved predictions in yesterday’s audit window came from retrospective replay, our renderer prepends an explicit disclosure line:

Excerpt - Yesterday Audit, retrospective-replay disclosure

YESTERDAY AUDIT - Note: calibration estimated on retrospective replay (62% of resolved predictions were backfilled from historical bundles, not collected live). Treat the numbers below as a directional read, not a live track record.
brief=BRIEF_2026_05_28 predictions_made=11 resolved=9
hit rate by confidence bucket:
- touch watch / high: n=4, hit_rate=75%, mean_p=71%, calibration_error=-0.04
- touch watch / moderate: n=3, hit_rate=33%, mean_p=58%, calibration_error=+0.25
- avoidance / moderate: n=2, hit_rate=100%, mean_p=100%, calibration_error=0.00
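The disclosure rule described above (more than half of the resolved predictions coming from retrospective replay) could be expressed roughly as follows. This is a minimal sketch, not the renderer itself: the function names are hypothetical, and the counts in the usage line are made-up inputs chosen only to exercise the rule.

```python
from typing import Optional


def retrospective_share(n_retrospective: int, n_live: int) -> float:
    """Fraction of resolved predictions that came from retrospective replay."""
    resolved = n_retrospective + n_live
    return n_retrospective / resolved if resolved else 0.0


def disclosure_line(n_retrospective: int, n_live: int) -> Optional[str]:
    """Return the disclosure text, or None when replay is not the majority."""
    share = retrospective_share(n_retrospective, n_live)
    if share <= 0.5:
        # Half or more of the window was collected live: no disclosure.
        return None
    return (
        "Note: calibration estimated on retrospective replay "
        f"({share:.0%} of resolved predictions were backfilled "
        "from historical bundles, not collected live)."
    )


# Hypothetical window: 31 replayed and 19 live resolved predictions.
print(disclosure_line(31, 19))
```

With an even split, or with no resolved predictions at all, the function returns None and the audit would render without the note.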

As live outcomes accumulate, the retrospective share drops. You can see the current share on the track record page, and the disclosure line drops out of your briefs once retrospective replay no longer accounts for more than half of the resolved predictions.
