メインコンテンツに移動

Pricing options with temporal difference backpropagation

Miquel Noguer i Alonso, Daniel Bloch and David Pacheco Aznar

10.1 INTRODUCTION

Quantitative trading and risk management require the computation of expectations. However, we need to recompute the risk–return profile as it evolves over time. Hence, we must dynamically modify our positions based on the surrounding environment and hedge against adverse moves. The problem is that conditional expectations are very difficult to estimate from experiments directly (see Figure 10.1).

Figure 10.1

Existing solutions include the following.

  •  
    • Parametric models: conditional expectations of path-dependent events are tied to their ability to capture the dynamics of the underlying stochastic processes.

  •  
    • Neural networks: these use data to estimate the implicit stochastic processes driving the dynamics of the future events, although they cannot solve multi-step prediction problems.

  •  
    • Reinforcement learning: this allows for computations of path dependent payouts from past data; the temporal difference backpropagation (TDBP) model is a special case (see Section 5.6).

There are wide range of numerical techniques for computing conditional expectations (eg, integration, partial differential equations (PDEs), Monte Carlo simulation and Fourier transform)

Sorry, our subscription options are not loading right now

Please try again later. Get in touch with our customer services team if this issue persists.

New to Risk.net? View our subscription options

無料メンバーシップの内容をお知りになりたいですか?ここをクリック

パスワードを表示
パスワードを非表示にする

You need to sign in to use this feature. If you don’t have a Risk.net account, please register for a trial.

ログイン
You are currently on corporate access.

To use this feature you will need an individual account. If you have one already please sign in.

Sign in.

Alternatively you can request an individual account here