Allocation models that know their unknowns

Quants say probabilistic programming beats machine learning in balancing strategies

Quant firms are buzzing with ideas about algorithms that can ‘learn’ how best to adjust the mix of assets and strategies they trade. But another technology – probabilistic programming – may yet prove equally valuable in answering that question.

The approach allows firms to build portfolio allocation models that incorporate the kind of market savvy traders take for granted, and to construct a probabilistic view of future returns, according to Thomas Wiecki, head of research at hedge fund Quantopian. Both are areas where the voguish machine learning models now being employed in allocation can struggle.

Thomas Wiecki

Portfolio allocation decisions hinge on how markets are primed to move and how linkages between strategies or assets are likely to evolve. Allocation calls often shape fund performance more than security-level choices, particularly for a fund such as Quantopian that crowdsources strategies, including from amateur quants.

Quant funds see the area as ripe for the application of new technology because the linkages in question often are complex and non-linear.

Firms have built machine learning allocation models that learn from historical data how assets and strategies might behave in different markets and how relationships between them could change. These models start, though, with no base understanding of how markets work and can latch onto spurious patterns in the data, ascribing them unfounded meaning.

Models that use probabilistic programming, by contrast, start out with rules about how markets function but include an element of uncertainty about how far those rules will hold. The models simulate millions of future paths for a given portfolio within those boundaries.

“Compared to machine learning, probabilistic programming allows you to build a model that is far more targeted and incorporates much more information about the domain of the problem,” says Wiecki. “We might know that the correlations often have a certain structure or that volatility tends to change over time,” he says. “If we can prime the model to say ‘this is the direction to look in’, that is really helpful.”

Probabilistic programming essentially automates Bayesian inference, the statistical technique whereby the probability of a given event is updated step by step as fresh evidence arrives. Only recently have these tools become powerful enough to use in models at the scale needed for quant finance.
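To make the idea of step-by-step updating concrete, here is a minimal sketch in Python. It is not Quantopian's model: it assumes a textbook Beta-Bernoulli setup in which the unknown quantity is the chance that a strategy has a winning day, and the function names and the sequence of outcomes are hypothetical.

```python
def update_beta(alpha, beta, won):
    """Return the Beta parameters after observing one winning or losing day."""
    return (alpha + 1, beta) if won else (alpha, beta + 1)


def posterior_mean(alpha, beta):
    """Current best estimate of the probability of a winning day."""
    return alpha / (alpha + beta)


# Beta(1, 1) is a flat prior: before any evidence, every win rate is equally plausible.
alpha, beta = 1.0, 1.0

# Hypothetical evidence, one day at a time (True = winning day).
outcomes = [True, True, False, True]

means = []
for won in outcomes:
    alpha, beta = update_beta(alpha, beta, won)
    means.append(posterior_mean(alpha, beta))

print(means)
```

Each new observation nudges the estimate rather than replacing it, and the prior is where a modeller could encode what is already believed about the strategy.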

Quantopian’s models generate a probability distribution of future daily returns for the dozens of different strategies the hedge fund runs, based on past performance data. “Sometimes, due to luck, even a bad strategy will have a good run and a good strategy will have a bad run. Those would be scenarios we consider in the millions of forecasts that we simulate,” Wiecki says.
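A rough sketch of how luck and skill can both enter a simulated forecast is shown below. It rests on loud simplifications that are not from the article: daily returns are treated as normally distributed, volatility is treated as known, the prior on the mean is flat, and the return history is made up. The function name `simulate_total_returns` is hypothetical.

```python
import random
import statistics


def simulate_total_returns(history, horizon, n_paths, seed=0):
    """Simulate summed daily returns over `horizon` days, `n_paths` times.

    Each path first draws a plausible 'true' mean daily return from its
    posterior, then draws daily returns around that mean, so uncertainty
    about the strategy's skill and day-to-day luck both show up.
    """
    rng = random.Random(seed)
    n = len(history)
    sample_mean = statistics.mean(history)
    vol = statistics.stdev(history)  # treated as known: a simplification
    totals = []
    for _ in range(n_paths):
        # Posterior for the mean under a flat prior: Normal(sample_mean, vol / sqrt(n)).
        true_mean = rng.gauss(sample_mean, vol / n ** 0.5)
        totals.append(sum(rng.gauss(true_mean, vol) for _ in range(horizon)))
    return totals


# Hypothetical daily returns for one strategy (illustrative numbers only).
history = [0.004, -0.002, 0.003, 0.001, -0.005, 0.006, 0.002, -0.001, 0.003, -0.003]

totals = simulate_total_returns(history, horizon=20, n_paths=2000)
prob_loss = sum(t < 0 for t in totals) / len(totals)
print(f"Share of simulated 20-day paths that lose money: {prob_loss:.1%}")
```

Even though the made-up history has a positive average, a meaningful share of simulated paths end in a loss, which is the kind of scenario Wiecki describes.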

The approach allows the firm to find the portfolio balance that works best across the full range of simulated scenarios, rather than just the most likely one, and lets the firm be more “principled” about the uncertainty in its forecasts.
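The difference between optimising for one expected outcome and optimising across many scenarios can be sketched as follows. The article does not say what objective Quantopian uses, so the scoring rule here (average return minus a variance penalty), the two strategies, their return assumptions and the `risk_aversion` parameter are all hypothetical.

```python
import random
import statistics


def simulate_scenarios(n, seed=1):
    """Draw n hypothetical (return_a, return_b) pairs for two strategies."""
    rng = random.Random(seed)
    # Assumed for illustration: A has the higher expected return but is far more volatile.
    return [(rng.gauss(0.10, 0.30), rng.gauss(0.08, 0.10)) for _ in range(n)]


def score(weight_a, scenarios, risk_aversion=3.0):
    """Mean portfolio return across scenarios minus a variance penalty."""
    outcomes = [weight_a * a + (1 - weight_a) * b for a, b in scenarios]
    return statistics.mean(outcomes) - risk_aversion * statistics.pvariance(outcomes)


scenarios = simulate_scenarios(5000)

# Try weights on strategy A from 0% to 100% in 5% steps.
grid = [i / 20 for i in range(21)]
best_weight = max(grid, key=lambda w: score(w, scenarios))
print(f"Weight on strategy A that scores best across scenarios: {best_weight:.2f}")
```

An allocator looking only at expected returns would favour strategy A outright. Scoring each candidate mix against every simulated scenario penalises the mixes that do badly when A's luck runs out.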

“A strategy that has a good track record could be good, or it could have got lucky, or it could be overfitted. We can never truly know which,” Wiecki says. 
