Decision-Quality Logging: The Calibration Loop

What to log in the decision journal

What to log, at minimum: (1) the date and the action (initiate, scale, exit); (2) the thesis statement in one sentence; (3) the price and the position size; (4) the conviction level (expressed as a percentage); (5) the explicit bear case; (6) the catalyst and the timeline; (7) the three operational signals you are watching; (8) what would falsify the thesis. Then, after every exit, log the outcome: was the thesis correct, partially correct, or wrong? Was the P&L good, neutral, or bad? Where did the two agree or disagree?

The skill-versus-luck quadrant matrix

Quadrant	Thesis correct	Thesis wrong
P&L good	Skill — the framework worked as designed. Document what made the read correct.	Luck — the position made money for reasons unrelated to your edge. Do not generalize from these.
P&L bad	Bad luck (or a timing mismatch with the catalyst window). The framework worked; the outcome is noise.	Skill failure — the framework or the execution broke. The single most valuable category to study.

Grading decisions on process, not P&L

The outcome-bias trap. Most retail investors implicitly grade their decisions on P&L alone, which conflates skill and luck. The practitioner discipline is to grade decisions on the four-quadrant matrix above. A profitable trade where the thesis was wrong is a luck-driven outcome that should NOT inflate confidence; an unprofitable trade where the thesis was correct should NOT shake conviction. The journal is the only reliable mechanism to keep skill and luck separated.

Building a calibration curve from logged decisions

Worked example — calibration curve construction. After 60 logged decisions, bucket them by initiation conviction: 50-59%, 60-69%, 70-79%, 80%+. For each bucket, compute the actual hit rate (thesis correct OR P&L good — pick one definition and stay consistent). Compare to the midpoint of the bucket. A well-calibrated analyst lands within five points of the bucket midpoint in each tier. A 15-point or larger miss is a systematic miscalibration the analyst can identify and work on. The curve is the most honest scorecard an investor can produce.

Grading a clean loss on process, not outcome

You exit a position at a 22% loss after the bear case played out exactly as you wrote it in the journal at initiation. The catalyst materialized; the operational signal failed; the exit was triggered by the falsification trigger you pre-named. How should you grade this decision?

Weekly, quarterly, and annual review cadence

Review cadence matters. A weekly review keeps positions fresh; a quarterly review surfaces patterns across positions; an annual review reveals the calibration curve. The annual review is the highest-leverage of the three — most investors never do it, and the ones who do compound their edge faster than the ones who do not.

Where to deepen the calibration discipline

Sit with the ideas.

You log every initiation, sizing decision, and exit in a structured journal. Twelve months in, you review the journal and notice a pattern: positions where you initiated at conviction levels of 70% or higher have an actual win rate of 48%, while positions where you initiated at 50% conviction have a win rate of 51%. What does this pattern indicate?

Why:

Continue this lesson in the appAdaptive practice · saves your progress · remediation when you miss

§ 01What to log in the decision journal

§ 02The skill-versus-luck quadrant matrix

§ 03Grading decisions on process, not P&L

§ 04Building a calibration curve from logged decisions

§ 05Grading a clean loss on process, not outcome

§ 06Weekly, quarterly, and annual review cadence

§ 07Where to deepen the calibration discipline