CLICK HERE TO VIEW THE PDF
In multiperiod trading with realistic market impact, determining the dynamic trading strategy that optimises the expected utility of final wealth can be difficult. Gordon Ritter shows that, with an appropriate choice of reward function, reinforcement learning techniques (specifically Q-learning) can successfully handle the risk-averse case
In this article, we show how machine learning can be applied to the problem of discovering and implementing dynamic trading
- People moves: SocGen adds in prime services, Deutsche fills new rates hole, HSBC makes model move, and more
- Quant Finance Master’s Guide 2019
- Credit risk quants are hitting the tech gap
- Princeton tops inaugural Risk.net quant master’s ranking
- Does credit risk need an expected shortfall-style revamp?