This article first appeared in the Morningstar ETFInvestor – July 2013.
Life would be a lot easier (and more interesting) if everyone had a number floating above their heads quantifying something important about them—it could be kindness, trustworthiness, or whatever. Because they don’t, we rely on signals that may or may not be correlated to the qualities we care to learn about. Fortunately, personality traits are highly persistent. Five years from now, the odds are good a kind person will still be kind. Cause and effect is clean and reliable.
Investors mistakenly infer that a manager with good past performance will have good future performance. Anyone who knows anything about investing pays lip service to the notion that they know this isn’t true, but their behavior indicates otherwise. Every manager has a performance record, and most investors cannot resist treating it as if it were a number floating above the manager’s head indicating his true merit.
Sadly, we will never see that number. Past performance clearly isn’t it, because as a whole managers with the best records don’t continue to outperform in a reliable way. A manager’s true skill, or alpha, expressed as an annualized return value, is his expected outperformance if he were to manage a portfolio for a very long time. In other words, if a manager’s true alpha is 2%, then he will beat the market by 2 percentage points annualized over a long enough horizon. This number exists only in the mind of God.
But what if we could figure out the distribution of that true number in the population? It would be like taking a small peek behind the veil. While knowing the distribution won’t help you pinpoint any particular managers, it could help you figure out how much evidence you need to conclude that any particular manager is truly skilled.
Eugene Fama and Kenneth French took a stab at this question in a 2010 study, ”Luck Versus Skill in the Cross-Section of Mutual Fund Returns.”1 In setting up the study, they restricted themselves to equity mutual funds that existed for at least eight months during the period from 1984 to 2006 with at least $5 million (in 2006 dollars) in assets.
They then set out to answer an interesting question: What would that sample of funds have looked like if none of their managers had a whit of skill? To do this, they stripped out the excess of return of each fund. Of course, doing this doesn’t really tell you anything by itself. Remember, they were trying to figure out the distribution of true skill, and this can’t be found by looking at any particular manager’s realized performance, which is the product of a tiny amount of skill and a huge amount of luck.
To generate their own luck, they created a simulation of the sample by randomly selecting the monthly returns of each fund. The simulation was like an alternate universe in which no mutual fund manager is skilled. They repeated the simulation 10,000 times.
Within each simulation, there inevitably were outperformers and stinkers. But recall that no one is actually skilled in them; any over- or underperformance is purely by chance. Fama and French sorted the funds in each simulation by their outperformances. They then averaged the outperformances across every simulation by percentiles. So for the 95th percentile bucket, they averaged 10,000 returns from each simulation’s 95th percentile fund. (I’m simplifying here. My description isn’t exactly correct and skips over some details.)
This allowed Fama and French to look at the probability of outliers if no one was skilled, compared with the number of outlier mutual funds in the real world. If we lived in Bill Bernstein’s “Randomovia,” where markets are efficient and no one can beat them, then the actual historical distribution of excess returns wouldn’t look much different from the distribution generated from 10,000 alternate universes where no one is skilled. They found that mutual funds at the extreme tails of performance appeared more often than predicted by simulations that assumed no skill.
Fama and French, in other words, found that skilled and unskilled managers almost certainly exist.
So what does the distribution of true skill look like? To answer that question, they “injected” different distributions of skill into their simulations. They found that an injected standard deviation of 1.25% annual returns from skill made their simulations look like the historical distribution. A fund 1 standard deviation above the average is in the top 16%. A fund 2 standard deviations above the mean is in the top 2%. In other words, if you had a crystal ball that could sort every equity mutual fund manager by true skill level, the top 16% would be skilled enough to generate 1.25% or greater outperformance over the long run, and the top 2% would be able to generate 2.5% outperformance—before fees.
Remember that I’m talking about true skill, not historical outperformance, which is the product of mostly luck. If you picked managers who are in the top 2% relative to their peers, they are almost certainly not going to earn 2.5% annualized outperformance before fees going forward because many of the top 2% are lucky.
Unfortunately, they found that only a few fund managers had enough skill to overcome their fees.
However, Fama and French assume excess returns arising from exposure to value and momentum stocks are attributable to risk, not skill. Under a more inclusive rubric that doesn’t debit a manager’s performance for exploiting such strategies, the top 40% of managers show evidence of skill before fees. After fees, the top 10% of managers add value.
I don’t think this yardstick is applicable today. Value and momentum strategy index funds are cheap and widely available. The hurdle for true alpha is now higher than it was 20 years ago, when value and momentum weren’t widely known.
A mitigating factor to equity managers’ dismal collective performance may be closet indexing. Evidence of skill is more persuasive prior to the mid-1990s and declines thereafter, coinciding with the rise of closet indexing. According to Antti Petajisto, prior to 1996, closet indexers had less than 6% of equity fund assets. From 1999 on, closet indexers made up about 25% of all equity fund assets, or about 30% of actively managed equity fund assets. Closet indexing obscures the true relationship between skill and luck, especially when looking at after-fee performance. It’s reasonable to believe that skilled managers generate less outperformance on their lower-conviction ideas—perhaps even none. If this is the case, then closet indexing dilutes their ability to outperform.
When managers index half of their portfolios, they effectively double the fees they charge on their active portfolios. Warren Buffett’s stock portfolio, estimated from Berkshire Hathaway’s BRK.B 13F filings, beat the market by about 2.4% annualized from 1991 to 2011. He ran a concentrated portfolio to produce those results. If Buffett indexed half his holdings, maintained proportional weights for his active picks, and charged a 1% fee on the whole pie, his stock portfolio’s outperformance would’ve shrunk to 0.2% (2.4%/2 – 1%) — a statistically and economically insignificant level.
So all hope is not lost for the investor looking for skilled managers. The data, however, suggest that strong past performance is not sufficient to identify one, unless perhaps if his record is exceptionally long. A manager that can be reasonably expected to beat the market after fees is rarer than rare and requires a heck of a lot more information to identify than historical performance.
1 Eugene F. Fama and Kenneth R. French. “Luck Versus Skill in the Cross-Section of Mutual Fund Returns.” Journal of Finance, 2010.
Samuel Lee is an ETF strategist with Morningstar and editor of Morningstar ETFInvestor.