What is factor decay?

Factor decay is the tendency of a documented investment factor — such as the value, size, momentum, or quality premium — to deliver weaker returns after it has been discovered, published, and crowded with capital. It differs from the general decay of a proprietary trading signal in that it is driven specifically by public disclosure: once a premium is named in a widely read study, the whole market can replicate it, and the conditions that produced it begin to change.

How much do factors decay after publication?

The landmark study by McLean and Pontiff (2016) reconstructed 97 documented return predictors and found that their returns were on average about 26% lower out-of-sample (after the original study’s sample period) and about 58% lower after the research was published. The out-of-sample drop reflects statistical over-fitting in the original work, while the additional post-publication drop is attributed to informed traders learning about the predictor and arbitraging it away.

What causes factor decay?

Three mechanisms drive it. First, arbitrage and crowding: when a real premium is published, capital floods in, pushes up the prices of the favored securities, and competes the future return away. Second, data mining: some factors never had a real premium and were artifacts of testing too many candidates, so they fade the moment they leave their original sample. Third, structural or regime change: the economic reason for the premium genuinely shifts, for example as market composition or transaction costs change.

How do you tell genuine factor decay from data mining?

Compare two windows. If a premium collapses immediately out-of-sample, before the research was even published, it was most likely a data-mined artifact that was never robust. If it survives out-of-sample but fades after publication, that pattern is consistent with arbitrage crowding a real effect. If it fades alongside a clear economic or regime shift, the cause is structural. The out-of-sample versus post-publication split — together with a deflated, multiple-testing-aware significance test and a prior economic rationale — is the cleanest diagnostic.

What is the factor zoo?

The "factor zoo" is the term John Cochrane used in his 2011 American Finance Association presidential address for the explosion of published return factors — now numbering in the hundreds, many with weak economic justification. The concern is that with so many candidates tested, some will appear significant by pure chance (the multiple-testing problem). Replication work by Hou, Xue and Zhang found that most documented anomalies fail to replicate under careful uniform testing, and Harvey, Liu and Zhu argued a new factor should clear a t-statistic of roughly 3.0, not the usual 2.0, to be credible.

Can you time factors to avoid decay?

It is difficult and easy to get wrong. Timing factors on recent returns tends to buy them right after they have become expensive and crowded, which is when forward returns are weakest. Relative valuation spreads between a factor’s long and short legs are a better, if noisy, guide to whether a premium is cheap or has been borrowed forward — a central point in Arnott, Beck, Kalesnik and West’s work on how smart-beta strategies can go wrong. For most teams, diversifying across weakly correlated, economically grounded factors and continuously researching new ones is more robust than aggressive factor timing.

Factor Decay: Why Published Factor Premia Fade Over Time

Factor decay is the tendency of a documented investment factor — a value, size, momentum, or quality premium — to deliver weaker returns after it has been discovered, published, and crowded with arbitrage capital. It is the specific, public-market cousin of the broader signal decay that erodes any edge: where a proprietary signal decays quietly inside one firm, a factor decays in plain sight, because the very act of publishing it tells the whole market what to trade.

The distinction matters because factor decay carries a sharper diagnostic question than ordinary alpha decay: when a published premium shrinks, was it a real effect that capital has now arbitraged away, or was it never real — a data-mined artifact that out-of-sample reality has simply exposed? This guide sets out the canonical evidence on how much factors decay, the three mechanisms that drive it, how to separate genuine decay from a data-mining mirage, how to measure it, and what a research team can actually do about it.

Key Takeaways

Factor decay is the post-discovery fade of publicly documented factor premia — distinct from the private erosion of a proprietary signal, though the detection toolkit overlaps with general signal decay.
The landmark study by McLean and Pontiff (2016) found that documented anomaly returns were on average about 26% lower out-of-sample and about 58% lower after publication — strong evidence that publication itself accelerates decay.
Three mechanisms drive it: arbitrage and crowding (capital floods in and competes the premium away), data mining (the factor was a multiple-testing artifact that was never robust), and structural change (the economic reason for the premium genuinely shifted).
The cleanest test of which mechanism is at work is the gap between out-of-sample decay (suggests data mining) and post-publication decay beyond that (suggests crowding of a real effect).
With hundreds of published factors — Cochrane's "factor zoo" — most fail to replicate; Harvey, Liu and Zhu argue a new factor needs a t-statistic above roughly 3.0, not the usual 2.0, to survive the multiple-testing problem.

What Factor Decay Is — and What It Is Not

A factor is a systematic, broadly known source of return: the value premium (cheap stocks beating expensive ones), the size premium (small beating large), momentum, low-volatility, quality. These are documented in academic journals, packaged into index products, and tracked by the whole industry. Factor decay is the observed weakening of those premia over time, and especially after the research describing them enters the public domain.

It is worth being precise about what factor decay is not. It is not the same as a single fund's proprietary signal losing its edge — that is the general alpha lifecycle, where the cause is usually local competition and capacity. Factor decay is a market-wide phenomenon driven by public disclosure: the moment a premium is named in a widely read paper, every quant desk can replicate it, and the conditions that produced it begin to change. The two share detection tools — rolling performance, regime analysis, the Information Coefficient — but they have different causes and call for different responses.

Why "documented" is the operative word

A premium that no one knows about cannot be crowded. The defining feature of factor decay is that the information is public: it is the price of fame. This is why the strongest evidence for factor decay comes not from comparing recent returns to old returns in general, but from comparing returns before and after the publication date of the paper that first documented each factor — the natural experiment that the next section describes.

The Evidence: How Much Do Factors Decay?

The canonical study is R. David McLean and Jeffrey Pontiff's 2016 Journal of Finance paper, "Does Academic Research Destroy Stock Return Predictability?" They reconstructed 97 characteristics that academic studies had shown to predict cross-sectional stock returns, then asked what happened to each predictor's return after the sample period of the original study, and again after the study was published.

Their two headline findings are now the reference points for the whole topic:

Returns to the documented predictors were on average about 26% lower out-of-sample — that is, in the period after the original study's sample ended but using the same method. This portion is consistent with statistical bias and over-fitting in the original work.
Returns were on average about 58% lower in the post-publication period — a further, larger drop after the research was actually published. McLean and Pontiff attribute this extra decline to informed traders learning about the predictor and arbitraging it.

The structure of that result is the key to the whole subject. The out-of-sample drop says some of the original premium was an illusion of in-sample fitting; the additional post-publication drop says the rest was real but is being competed away now that it is public. Decay, in other words, is partly the unwinding of data mining and partly the arbitrage of genuine effects — and the McLean–Pontiff design lets you see roughly how much of each.

Three Mechanisms Behind Factor Decay

1. Arbitrage and crowding

When a real premium is published, capital chases it. Funds launch products, allocators rebalance toward the factor, and the buying pressure pushes up the price of the favored securities — which is precisely what lowers their future expected return. The premium is, in effect, paid forward into current valuations and then competed away. This is the mechanism behind the extra post-publication decay McLean and Pontiff measured, and it is the most benign in one sense: the effect was real. But a crowded factor also becomes more fragile, because the same capital can exit together. The unwinds of crowded factor trades — the 2007 "quant quake" being the textbook case — show how a crowded but real factor can produce sudden, correlated drawdowns that a long backtest never hinted at.

2. Data mining and the factor zoo

Some "factors" never had a premium to decay; they were artifacts of searching too many candidates. John Cochrane named this the "zoo of new factors" in his 2011 American Finance Association presidential address, and the menagerie has only grown — hundreds of published factors, most with thin economic justification. Hou, Xue and Zhang's large-scale replication study found that the majority of documented anomalies fail to replicate under careful, uniform testing. The statistical root is the multiple-testing problem: test enough characteristics and some will clear the usual t ≈ 2 bar by chance alone. Harvey, Liu and Zhu argued that, given how many factors have been tried, a newly proposed factor should clear a t-statistic of roughly 3.0 before it is taken seriously. A factor that "decays" the instant it leaves its original sample was most likely this kind of mirage — see backtest overfitting and the probability of backtest overfitting.

3. Structural and regime change

The third mechanism is the most fundamental: sometimes the economic reason for a premium genuinely changes. A factor justified by a risk story (value as compensation for distress risk) or a behavioral story (under-reaction driving momentum) can fade if the underlying structure shifts — changes in market composition, the rise of intangible-heavy firms that strain book-value-based value measures, lower transaction costs, or new participants. This kind of decay is not arbitrage and not data mining; it is the world moving on. It is also the hardest to detect from returns alone, which is why a factor should always come with an economic rationale you can re-examine when its attribution shifts.

Real Decay vs. a Data-Mining Mirage

The single most useful discipline in this area is to ask which mechanism a given decline reflects, because the response differs completely. A genuine but crowded premium may still earn a (smaller) return and deserve a place in a diversified book; a data-mined artifact deserves to be dropped entirely. The diagnostic rests on separating two windows:

Observation	Most likely mechanism	Implication
Premium collapses immediately out-of-sample, before publication	Data mining / over-fitting	Treat as never-real; drop it
Premium survives out-of-sample but fades after publication	Crowding of a real effect	Keep at reduced weight; monitor capacity
Premium fades alongside a clear economic or regime shift	Structural change	Re-examine the rationale; may be permanent

This is why the methods for validating trading signals lean so heavily on out-of-sample and post-publication testing, on the Deflated Sharpe Ratio that corrects for the number of trials, and on demanding a prior economic rationale rather than a pattern alone. A factor that arrives with a credible story and clears a multiple-testing-aware significance bar is far more likely to be decaying because it is crowded than because it never existed.

How to Measure Factor Decay

The measurement toolkit is shared with general signal decay analysis, applied to the factor's long-short return or its forecasting power:

Rolling performance and rolling t-statistic. Track the factor's return and its statistical significance over a moving window rather than over the full history. A lifetime Sharpe can hide a premium that has been flat or negative for years.
Rolling Information Coefficient. The rolling IC — the moving correlation between the factor score and subsequent returns — is a direct read on whether the factor still forecasts anything; a steadily declining rolling IC is the cleanest early-warning of decay.
Pre- vs. post-publication split. Where a publication date exists, compare returns before and after it — the McLean–Pontiff design applied to your own factor set.
Valuation spread / crowdedness. The relative valuation of the long versus the short leg signals how much of the recent return came from the factor getting more expensive (a borrowed return that reverses) rather than from the premium itself — the central warning in Arnott, Beck, Kalesnik and West's analysis of how smart-beta strategies can go wrong.

A Worked Illustration

The figures below are invented to show the shape of post-publication decay that the McLean–Pontiff structure implies; they are not the returns of any real factor.

Window	Mean annual long-short return	Relative to original
Original in-sample study period	6.0%	100%
Out-of-sample, pre-publication	4.4%	~74% (−26%)
Post-publication	2.5%	~42% (−58% vs original)

Read against the evidence section, the pattern tells a two-part story: the step down from 6.0% to 4.4% out-of-sample is the over-fitting unwinding, and the further step to 2.5% after publication is arbitrage capital competing the surviving, real premium. A factor that instead fell straight to near zero out-of-sample — before anyone could trade on the paper — would point to data mining rather than decay, and would not belong in the book at all.

What to Do About Factor Decay

Decay is not a reason to abandon factor investing; it is a reason to practise it with humility and maintenance.

Demand an economic rationale, not just a backtest. A factor with a credible risk or behavioral story is more likely to persist (in weakened form) than a pattern found by search. The rationale is also what you re-examine when returns fade — it tells you whether you are seeing crowding or structural death.
Discount the backtest for multiple testing. Apply a higher significance hurdle, the Deflated Sharpe Ratio, and honest out-of-sample validation before trusting any documented premium — most of the "decay" you would otherwise suffer is really over-fitting you can avoid up front.
Watch valuation spreads, not just past returns. Naively timing factors on recent performance tends to buy them after they have become expensive. Relative valuation is a better, if noisy, guide to whether a premium is cheap or has been borrowed forward.
Diversify across weakly correlated factors. Because individual factors decay and crowd at different times, a diversified multi-factor book is more robust than a bet on any single premium — and reduces exposure to the correlated unwinds that hit crowded factors.
Keep researching. The honest conclusion of the decay literature is that edges are perishable. A research process that continuously sources new, economically grounded factors — and retires faded ones — is the only durable response. This is the same renewal discipline that fighting proprietary signal decay demands.

Factor Decay in Practice

For a working quant team, factor decay reframes the job. The goal is not to find the factor and harvest it forever — the evidence says that premium will shrink, and publication will accelerate the shrinking. The goal is to maintain a diversified portfolio of economically justified, multiple-testing-survivor factors; to monitor each one's rolling IC, rolling significance, and valuation spread for the first signs of fade; to distinguish a crowded-but-real premium worth keeping at lower weight from a data-mined mirage worth dropping; and to feed a research pipeline that replaces decayed factors faster than they erode. Done that way, factor decay stops being a threat and becomes simply the metabolism of an honest factor process — closely related to the broader practice of protecting signals from decay and validating them before they ever reach the book.