The Sharpe ratio is the most widely cited number in quantitative trading — and the most widely misread. This article unpacks what it measures, what it does not measure, and how to use it without fooling yourself.
The Definition
For a strategy with mean excess return μ and standard deviation σ, the Sharpe ratio is:
It is a unit-free reward-to-risk number. A Sharpe of 1.0 means you earn one unit of excess return per unit of volatility per year. Higher is better — until it isn't.
What It Actually Measures
Sharpe is fundamentally a signal-to-noise ratio for a return stream. If you observe a strategy with a true Sharpe of 1.0 and you have N years of data, the standard error on your estimate is roughly √((1 + ½ × SR²) / N). With 2 years of daily data, the 95% confidence interval on a measured Sharpe of 2.0 is roughly [0.6, 3.4] — wide enough to span "real edge" to "lottery ticket."
A backtest reporting Sharpe 2.6 over 2 years is reporting a number with the same statistical resolution as a coin flip with 25 tosses. The point estimate is precise; the underlying truth is not.
What It Does Not Measure
- Tail risk. Sharpe assumes returns are roughly Gaussian. A strategy that earns 1% per month for 24 months and then loses 30% in one month can have a beautiful Sharpe right up until ruin.
- Asymmetry. Two strategies with identical Sharpe can have wildly different drawdown profiles. Sharpe penalizes upside volatility just as much as downside.
- Capacity. Sharpe says nothing about how much capital the strategy can absorb before slippage destroys the edge.
- Selection bias. The Sharpe of "the best of 100 random strategies tried" is much higher than 0 even when no strategy has any edge. This is the deflated-Sharpe problem.
The Three Sharpe Ratios You Will Encounter
In-Sample Sharpe
Computed on the data you optimized over. Always optimistic. In QuanterLab, this is the Sharpe shown immediately after a backtest with default parameters — useful as a first read, not as a verdict.
Out-of-Sample Sharpe
Computed on data the strategy never saw during parameter selection. Closer to the truth, but usable only once per dataset before it itself becomes in-sample.
Walk-Forward Sharpe
Computed by re-optimizing parameters in many sequential windows and stitching together the genuine OOS slices. The most honest single number, and the one to trust when it disagrees with the headline Sharpe.
How to Read a Sharpe Number Honestly
- IS Sharpe > 3: almost certainly curve-fit. Treat with skepticism, validate aggressively.
- IS Sharpe 1.5–2.5: plausible. Validate via walk-forward; expect some decay.
- IS Sharpe < 1: probably not worth trading after costs.
- Walk-forward Sharpe ≥ 60% of IS: healthy decay ratio.
- DSR > 0.95: strategy survives multiple-testing correction.
The Bottom Line
Sharpe is a useful first filter, not a verdict. A high Sharpe earns a strategy the right to be investigated, not the right to be deployed. The work between the headline Sharpe and a deployable strategy is what QuanterLab's walk-forward, robustness, and DSR tooling exists to do.
Further Reading
Foundational papers
- Sharpe, W. F. (1994). The Sharpe Ratio. Journal of Portfolio Management, 21(1), 49–58.
- Lo, A. W. (2002). The Statistics of Sharpe Ratios. Financial Analysts Journal, 58(4), 36–52.
- Bailey, D. H. & López de Prado, M. (2014). The Deflated Sharpe Ratio: Correcting for Selection Bias, Backtest Overfitting, and Non-Normality. Journal of Portfolio Management, 40(5), 94–107.
Textbook references
- López de Prado, M. (2018). Advances in Financial Machine Learning. Wiley.
Related QuanterLab articles
- Deflated Sharpe Ratio (DSR) and Multiple-Testing Correction
- Bootstrap Confidence Intervals for Backtests
- Walk-Forward Validation: Anchored vs Rolling
Try it in QuanterLab
Open any backtest result in SC001STCB, SB099MRBD, or UB001UNIV. The Sharpe number you see is the raw IS Sharpe — useful, but not the full story. Compare it to the DSR shown in the robustness panel.