Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Optimal Execution: A Review

Optimal Execution: A Review APPLIED MATHEMATICAL FINANCE 2022, VOL. 29, NO. 3, 181–212 https://doi.org/10.1080/1350486X.2022.2161588 Ryan Donnelly Department of Mathematics, King’s College London, London, UK ABSTRACT ARTICLE HISTORY Received 16 December 2022 This review article is intended to collect and summarize many of the Accepted 16 December 2022 results in the field of optimal execution over the last twenty years. In doing so, we describe the general workings of the limit order KEYWORDS book so that the sources of costs and risks which need to be opti- Algorithmic trading; price mized are understood. The initial models considered propose simple impact; stochastic dynamics for prices which allow easily computable strategies which optimization maximize risk-adjusted profits. Subsequently, the review is divided into two major parts. The first explores several works which investi- gate how optimal liquidation strategies are modified to account for more complex dynamics, namely other stochastic or non-linear fac- tors. The second presents optimal trading strategies when the agent utilizes benchmarks in addition to risk-adjusted wealth, or when she has objectives beyond optimal liquidation. 1. Introduction In the last few decades, the actions taken by participants in n fi ancial markets have become increasingly based on quantitative analysis and algorithms rather than any human deci- sion making process. The motivation for this shift comes from attempting to minimize costs associated with trading, or to minimize risks involved in holding financial assets, or both. Establishing a mathematical framework that models the setting of trading in a na fi n- cial market allows one to quantify the trade-off between minimizing costs and minimizing risks, thereby tailoring a trading strategy to one’s preferences optimally. In this review arti- cle, we provide an overview of many developments and results related to optimal execution of trades. We begin with a description of the limit order book (LOB), and then proceed with a summary of some preliminary models in discrete time, in particular those of Bertsimas and Lo (1998) and R. Almgren and Chriss (2001). These pieces will provide a groundwork for thetypeofproblemsconsideredinthisreview, butmostofthefocuswillbeonanalogous models and results in continuous time. After this groundwork is established, we review developments which take more realistic or generalized underlying dynamics, different or more general performance criteria, and other aspects of financial trading and optimization. The primary mathematical tools that are used to solve many of the problems contained herein are the Dynamic Programming Principle (DPP) and the Hamilton–Jacobi–Bellman CONTACT Ryan Donnelly ryan.f.donnelly@kcl.ac.uk Department of Mathematics, King’s College London, Strand,LondonWC2R2LS,UK © 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons. org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. 182 R. DONNELLY (HJB) equation, although sometimes other approaches such as variational techniques are necessary. The full solutions to optimizations are omitted, but the original sources of the results or standard references on stochastic optimal control should provide any details whichareskipped(seeforexampleYongand Zhou 1999; Fleming and Soner 2006). We focus on modelling and optimization results of a theoretical nature, but many of the primary sources that are cited contain empirical analysis to motivate modelling choices. Some of these motivational empirical results will be provided when warranted, but the discussion about these results and their interpretation will be limited. 2. The Limit Order Book This section describes the general mechanisms of the LOB. An understanding of how trans- actions take place within LOBs motivates some of the modelling choices in the next section. Many of thesemodelling choicesmay stillbeappropriatein othermarketsettingsthatdo not employ a LOB, nevertheless a specific context may help with interpretation of results that we present later. Most electronic exchanges allow participants to submit several different types of orders, but these can be categorized broadly as either passive limit orders (LOs) or aggressive market orders (MOs), so for clarity we describe a simplified form of LOB mechanisms containing only the most basic of these two types of orders. LOs consist of a price, a volume, and an indicator to either buy or sell the asset. After an LO issubmitted,it sitsin theLOBuntilitiseithercancelled orfilled againstanMOata later time. The cancellation or filling of an LO can be either full or partial, meaning either the entire volume of the order is removed from the LOB or some amount of the original volume remains in the LOB. If a subsequent LO is placed with the same price, then both of these orders will exist simultaneously in the LOB at this price, and the order of execution of the orders depends on the matching rules of the exchange. The matching rules we describe below correspond to a price-time priority of orders, meaning the LOs form a queue at each price and the LOs to be filled are determined according to their position in this queue, known as the First In First Out (FIFO) rule. When several LOs of different types (buys and sells) are sitting in the LOB, there are two price levels which correspond to the highest price across all buy LOs and the lowest price across all sell LOs, referred to as the best bid andbestask,respectively.Theaverage of thebestbid andbestaskis referred toasthe midprice of the asset. Figure 1 shows a graphical representation of the state of the LOB at a single point in time, followed by the state of the LOB after a single LO is placed. An MO consists of a volume and indicator to either buy or sell the asset. When an MO is submitted it is matched against resting LOs of the opposite buy/sell type, and the price of the transaction is determined by the price level of all filled LOs up to the volume specified by the MO. In Figure 2 we show a graphical representation of an MO filling several LOs to illustrate how the volume of the MO and the volumes and prices of the LOs determine the overall transaction price of this trade. With this structure of the LOB in mind, it is clear that the midprice of the asset may change throughtimeasLOsareplacedand cancellednearthebest bidand ask.Addition- ally, the eeff ct that MOs have on prices can be interpreted in multiple ways. We generally divide the impact of MOs into temporary and permanent, although they are not fully dis- tinctand theremay beotherimpactphenomena whichdon’tfall into either category. APPLIED MATHEMATICAL FINANCE 183 Figure 1. Blue bars represent buy LOs, red bars represent sell LOs. Left panel: the highest price of all buy LOs and the lowest price of all sell LOs are the best bid and ask, respectively. Middle panel: a new LO is placed at a price where orders already exist at the back of the queue. Right panel: the volume at this price is increased. Analogous changes would result from an order cancellation, and thus the state of the LOB may change through time, even if no trades are taking place. Figure 2. Left panel: the LOB immediately before an MO is submitted. Middle panel: a sell MO is sub- mitted with volume greater than that of the best bid, so it must ‘walk the book’ into the next best price. Right panel: the MO has filled LOs in the book, so there is now less volume available at the two relevant prices. Temporary price impact refers to the relationship between the volume of the MO and the average execution price of the trade. For larger MOs, multiple LOs at different levels are exe- cuted, thustheaveragepriceofthetradeisafunction ofvolume.Permanent priceimpact refers to the relationship between the volume of the MO and the midprice process at future times after the execution. For larger MOs, more LOs are removed from the LOB resulting in a greater change of the midprice which will persist into the future, aeff cting the price of subsequent trades. The LOB structure discussed above is an idealization of real markets where there are several types of orders that can be submitted. Usually, order types can be loosely grouped by whether they add or remove liquidity from the market, and we think of them as LOs or MOs, respectively, in our idealized framework. These other order types include stop-loss, stop-limit, immediate-or-cancel, fill-or-kill, and all-or-none. For extensive analysis on LOB dynamics, including and beyond price-impact effects, seeBiais,Hillion,andSpatt(1995), Ranaldo (2004), R. Almgren et al. (2005), Cont (2011), Cont, Kukanov, and Stoikov (2014), Cartea,Jaimungal,and Penalva(2015), Lehalle and Mounjid (2017), Bechler and Ludkovski (2017), Cartea, Donnelly, and Jaimungal (2018a), and Sirignano (2019). For more in depth outline of the operation of LOBs in n fi ancial markets and of advanced order types, see Cartea, Jaimungal, and Penalva (2015)and Guéant (2016). 184 R. DONNELLY Figure 3. Left panel: the LOB as volume density. Middle panel: a sell MO is submitted with volume equal to the area of the yellow region. Right panel: the MO has filled LOs in the book, and both buy and sell LOs are shortly thereafter placed to establish a new impacted midprice. 3. Modeling Trades, Price Impact, and Objectives 3.1. Discrete-Time Model The discrete time model that we present here is adapted from the models rfi st established in Bertsimas and Lo (1998) and R. Almgren and Chriss (2001), and which have consistently formed the basis for most models in discrete time optimal execution since then. While Bertsimas and Lo (1998) takes into account additional considerations such as informed price changes and more realistic dynamics, the initial model they propose to get intuition aboutoptimal executionisequivalenttowhatwegivebelow. We make a conceptual change to the description of the LOB in the previous section to motivate many of the assumptions made in this model. The main change we make is that we consider the LOB to be based on a continuum of prices rather than a discrete set, andtheLOsformavolumedensity,rather than acollectionofindividualqueuesateach price. Thus, our graphical representation including the arrival of an MO changes to that of Figure 3. Note that in the right-hand panel of Figure 3, the new midprice is located between (not necessarilyatthe midpoint)theoldmidpriceand thelowestexecutedpriceoftheMOfrom the middle panel. We assume that immediately after the MO, there is a u fl rry of activity in the LOB which includes many new LO placements to establish this new midprice. This activity could potentially also include LO cancellations resulting in a new midprice beyond thelowestexecutedprice.Inaddition,thereisnogapleftinthe LOBand indeedtheoverall shape is merely a translation of the original shape. This will be reflected in the model below through the fact that our impact functions depend only on the size of the MO. We work on a ni fi te time horizon equal to N and index time by n ∈{0, ... , N}.Let Q ={Q } be the agent’s inventory process at each point in time. Denote by V the size n n n=0 of theagent’ssubmittedMOattime n,sowehavetherelation Q = Q + V.(1) n+1 n n Let S ={S } be the midprice observable in the LOB at each time. In specifying the n=0 dynamics of this process, we wish to include a component contributed by the MOs sub- mitted by the agent, as well as a component which comprises the actions of all other agents participating in the market. To this end, let  ={ } be a sequence of independent ran- n=1 dom variables with mean zero, and let f be an increasing function with f (0) = 0. Then we APPLIED MATHEMATICAL FINANCE 185 set S = S + f (V ) + .(2) n+1 n n n+1 Thus, if the agent does not trade (V = 0), the midprice changes randomly by  from n n+1 the perspective of the agent, where this change is due to the actions of other agents. If the agent submits a buy order (V > 0)thenthenextmidpriceisstillrandom,butwilltend to be greater than if there were no trade, and larger buy orders will have a larger effect on increasing this future midprice. Similar eect ff s hold as a result of the agent submitting a sell order, and so the function f captures the permanent price impact eeff cts of the agent’s trades. Let X ={X } betheamountofcashheldbytheagentthrough time.Inspecifying n=0 the dynamics of X we include a term which represents a temporary price impact eeff ct as discussed in Figure 2.Let g be an increasing function with g(0) = 0and set X = X − S V,(3) n+1 n n n S = S + g(V ).(4) n n n The quantity S represents the average price per share of the agent’s executed MO, given that the volume of the MO is V . A larger MO volume will result in executions farther from the midprice with buy orders executing at higher prices and sell orders at lower prices, and so the function g captures temporary price impact as desired. With an average execution price of S and a total volume of V , the change in the agent’s cash account is −S V ,hence n n n n the relation in (3). It is straightforward to show that if the shape of the LOB is given by a curve as in Figure 3, then the function g is uniquely determined by this shape. 3.2. Continuous-Time Model Although many of the problems we summarize in this survey can be formulated and solved in discrete-time, most recent work is conducted in continuous-time. Here we replace Equa- tions (1) to (4) with analogous continuous time dynamics. Equivalently, this could be done by considering the limiting behaviour of the discrete time model as the number of time steps N →∞ (the functions f and g,and thedistributionsof  wouldalsohavetobe scaled appropriately, see R. Almgren and Chriss 2001). We work on a complete filtered probability space (,F, {F } ,P) over the t 0≤t≤T finite time interval [0, T]. The inventory, midprice, and cash processes are now repre- ν ν ν ν ν sented by their continuous-time counterparts Q ={Q } , S ={S } ,and X = 0≤t≤T 0≤t≤T t t {X } , and we replace their discrete time dynamics with 0≤t≤T dQ = ν dt,(5) dS = f (ν )dt + dM,(6) t t ν ν dX =−S ν dt,(7) t t ν ν S = S + g(ν ),(8) t t where M ={M } isamartingalewithrespect tothefiltration {F } .Withthese t 0≤t≤T t 0≤t≤T dynamics the trading strategy ν ={ν } represents the speed of trading, not the t 0≤t≤T volume ofasequenceoftradesasinSection 3.1. 186 R. DONNELLY The form of Equations (5) to (8) does not constrain the midprice to be necessarily pos- itive. While this is usually a desirable feature in a model, the time scales for which typical optimal executions are implemented are short enough that the probability of observing large price deviations into negative values is very small. If the time scale is long enough to warrant a positive price restriction, then geometric dynamics may be more appropriate and realistic. See Gatheral and Schied (2011), Becherer, Bilarev, and Frentrup (2018), and Colaneri et al. (2020) for analysis of optimal execution problems with geometric dynamics for the price process. In addition, neither the midprice nor inventory will be constrained to lie on a grid as they would be in reality. This assumption will have little consequence if the typicalinventory andmidpricethatweconsiderarelarge.If therelevantinventory levels are typically close to zero for a particular execution, a properly discretized model would be more appropriate over the continuous rate of trading implied by (5). 3.3. Performance Criterion Nowthatwehavespeciefi dthedynamics ofpricechangesandtheagent’sinventoryand wealth, we must propose a goal for the agent so that she may seek an optimal trading strat- egy. One of the most common types of goals we have in mind is an agent that holds a block of shares and wishes to liquidate them before time T. In doing so, she will have to consider theeeff ct that hertradeswillhaveontheimmediatetransaction priceduetotemporary price impact, as well as future prices due to permanent price impact. Additionally, we may impose a penalty on the agent if she does not fulfil her goal of full liquidation by time T. Thus, we specify that the agent attempts to maximize her expected terminal wealth at time T subject to a liquidation penalty. In Bertsimas and Lo (1998), the authors frame the liqui- dation problem in terms of minimizing execution costs. While the objective function in (9) below is framed in terms of maximizing wealth, it is equivalent to minimizing costs. Let the initial cash holdings, inventory, and midprice be x , q ,and S respectively. 0 0 0 Given a trading strategy ν ={ν } which is sufficiently integrable and adapted to the t 0≤t≤T filtration {F } , and an increasing function  with (0) = 0, we define the agent’s t 0≤t≤T performance criterion as ν ν ν ν ν H (x , q , S ) = E X + Q (S − (Q )).(9) 0 0 0 T T T T The terminal wealth in (9) consists of the agent’s accumulated cash, X ,aswellasthevalue of the remaining Q sharesoftheassetwhichmustbeliquidatedatthe prevailingmidprice ν ν S with an associated penalty (Q ).Thisliquidationpenaltyisakintothetemporaryprice T T impact effect which arises from walking the book as discussed previously. Although we are taking the perspective of an agent liquidating her holdings, this frame- workcanalsobeapplied to an agentwishing to acquireablockofshares. This canbe done by setting her initial inventory holdings equal to Q =−N,where N is the number of shares she is trying to acquire by time T. This is based on the realization that liquidating anegativepositionrequirestheagenttopurchasethesharessothatherterminalposition is zero. With this performance criterion in mind, den fi e the agent’s value function H(x , q , S ) = sup H (x , q , S ), (10) 0 0 0 0 0 0 ν∈A APPLIED MATHEMATICAL FINANCE 187 wherethesupremum istakenoveranadmissibleset A of sufficiently integrable {F }- adapted processes. We would like to determine a specicfi control ν such that the agent’s performance function is maximized over all possible controls. That is, we desire a trading speed ν such that H(x , q , S ) = H (x , q , S ). (11) 0 0 0 0 0 0 One common method to solve such an optimization problem is to introduce the dynamic value function (with a slight abuse of notation) ν ν ν ν H(t, X , Q , S ) = sup E X + Q (S − (Q )) F , (12) t t t t T T T T ν∈A and then resort to methods of dynamic programming to solve for both H and ν .For more detailsonthesemethods, seePham(2009), Björk (2009), Touzi and Tourin (2013), and Cartea, Jaimungal, and Penalva (2015). In order to make these methods more tractable and allow for easier analysis, we assume for now that all of the relevant price impact functions are linear in the speed of trading. That is f (ν) = bν, g(ν) = kν, (ν) = αν, (13) with b > 0, k > 0, andα> 0. Regarding the visualization in Figure 3,thismakestheimplicit assumption that the shape of the LOB is a flat line. Proposition 3.1: The dynamic value function in (12) is given by H(t, x, q, S) = x + qS + h (t)q , (14) −k(2α − b) b h (t) = − , (15) 2k + (2α − b)(T − t) 2 and the optimal trading strategy in feedback form is −(2α − b) ν (t, q) = q. (16) 2k + (2α − b)(T − t) We should assume 2α> b for otherwise there will be values of T and t which make the expression in (16) undefined. Henceforth, we always assume parameter values are chosen so that the resulting expressions are well den fi ed. If we substitute the liquidation strategy from Proposition 3.1 into Equation (5), we arrive atanordinarydieff rentialequation(ODE)governingthedynamicsoftheagent’sinventory, which can be solved to give 2k + (2α − b)(T − t) Q = q , (17) 2k + (2α − b)T and we see that the resulting inventory process is deterministic, decreasing linearly through time towards a terminal value of 2k Q = q . (18) 2k + (2α − b)T We canensurethatthe agentfully liquidatestheinitial position of sharesby placingan infinitely large penalty on remaining inventory represented by the limit α →∞.Inthis 188 R. DONNELLY case the optimal strategy in (16) and the resulting inventory path in (17) become lim ν (t, q) =−q (19) α→∞ ∗ T − t lim Q = q . (20) α→∞ The trading strategy and resulting inventory in (19) and (20) can be thought of as a bench- mark called time-weighted average price (TWAP). If the trades of this strategy, which occur at a constant speed over the trading interval, are marked to the midprice, S, rather than the execution price, S,thentheaveragepriceofalltrades bytheagentistheaverageof themid- 1 ν price process given by S dt. Many of the trading strategies which appear later can be T 0 thoughtofasincorporatingacorrectiontothe TWAP strategy. While the strategy in (16) is the strategy which minimizes expected costs, there is no consideration of risk involved in the execution. In R. Almgren and Chriss (2001), the authors allow the agent to also account for the variance of her performance in addition to the expectation. This allows the agent to tailor her performance by managing the trade-off between maximizing expected wealth and minimizing variance of wealth. This mean- variance approach is incompatible with dynamic programming methods, but the authors only consider deterministic trading strategies allowing solutions to be found more easily. We incorporate risk considerations with a different approach to the mean-variance analysis, but with the additional assumption that price innovations are Gaussian. Under this assumption the class of optimal trading strategies coincide with R. Almgren and Chriss (2001). To this end, we assume M = σ W in Equation (6) where W isaBrown- ian motion. We also suppose the agent adopts a constant absolute risk-aversion (CARA) utility function so that her dynamic value function is ν ν ν ν H(t, X , Q , S ) = sup E − exp −γ X + Q (S − αQ ) | F , (21) t t t t T T T T ν∈A forsomevalueof risk-aversionparameter γ> 0. Proposition 3.2: The dynamic value function in (21) is given by H(t, x, q, S) =− exp −γ x + qS + h (t)q , (22) − −ω(T−t) + ω(T−t) β ψ e − ψ e b h (t) = − , (23) − −ω(T−t) + ω(T−t) 2 ψ e + ψ e 2 with γσ β = 2kγσ , ω = , ψ = β ± 2α ∓ b, 2k and the optimal trading strategy in feedback form is (2h (t) + b) ν (t, q) = q. (24) 2k APPLIED MATHEMATICAL FINANCE 189 The optimal trading strategy in Proposition 3.2 is equivalent to one which maximizes an alternative performance criteria given by ν ν ν ν 2 ν 2 E X + Q (S − αQ ) − σ (Q ) dt . (25) T T T T t The running quadratic inventory penalty in (25) is a popular choice to incorporate in opti- malexecutionproblemsratherthanusingautilityfunctionasin(21)(seeforexampleGuil- baud and Pham (2013), Barger and Lorig (2019), and Cartea, Gan, and Jaimungal (2019)). However, these two approaches would not be equivalent if the random innovations in price were notgiven byaBrownianmotionorifthere areotherstochastic factorsinthemodel. When there are more factors, this running penalty term can be considered to contribute inventory risk, while the agent remains risk-neutral towards other dynamics (see Cartea, Donnelly, and Jaimungal 2017). Substituting the optimal strategy from Proposition 3.2 into Equation (5) yields again an ODE which can be solved, resulting in the agent’s inventory process through time − −ω(T−t) + ω(T−t) ∗ ψ e + ψ e Q = q . (26) − −ωT + ωT ψ e + ψ e If we wish to consider an agent who must finish the full liquidation by time T,wecan enforce this by taking the limit α →∞ which results in lim ν (t, q) =−ω coth(ω(T − t))q, (27) α→∞ ∗ sinh(ω(T − t)) lim Q = q . (28) α→∞ sinh(ωT) In Figure 4 we show the optimal inventory process for several values of the risk-aversion parameter γ . We note that for larger values of γ , the agent uses a trading strategy which liquidates faster towards the beginning of the time interval. This is because the agent’s riskislargestwhenherinventoryislargest,and to lowerthatriskshedesirestolower her inventory quickly. This comes at an increased cost due to the temporary price impact. When the agent is risk-neutral (and has a resulting inventory path from (17)), she trades at a constant rate through time which is determined only by the trade-osff between tem- porary price impact, permanent price impact, and the liquidation penalty. The optimal strategies of Propositions 3.1 and 3.2 are the result of many simplifying assumptions regard- ing the dynamics of trading in a market as well as the agent’s goals, but they still oer ff a benchmark to which many other strategies may be compared. In the following sections, we explore work that expands upon the previous results by considering more realistic market dynamics, alternate goals of market participants, or the inclusion of other phenomena. 4. Alternative Dynamics We begin our further exploration by considering works which make alternative assump- tions about the dynamics proposed in Equations (5) to (8) in order to model more general or more realistic market settings. 190 R. DONNELLY Figure 4. Optimal inventory process for various values of risk-aversion parameter γ.The curvecor- responding to γ = 0 is from Equation (17), the remaining curves are from Equation (26). Left panel: α = 0.05. Right panel: α = 1. Other parameter values are T = 1, σ = 1, k = 0.01, and b = 0.01. 4.1. Stochastic Temporary and Permanent Price Impact Firstwerevisit oneofour assumptionsonthe natureof priceimpact,both permanentand temporary. In Equations (6) and (8), the relation between the agent’s trading and price dynamics are homogeneous in time – identical actions and different points in time will have identical effects on the asset price. In terms of the graphical representation of the (idealized continuous) LOB, this means that its general shape is always the same, albeit translated horizontally as the midprice changes. In reality, market liquidity varies through time meaning the shape of our graphical representation should also u fl ctuate. We consider in some detail the work Barger and Lorig (2019) which allows price impact effects to be stochastic, replacing price and wealth dynamics with (1) dS = f (b )ν dt + σ dW,db = μ (b )dt + η (b )dB , (29) t t t t b t b t t t ν ν dX =−S ν dt, (30) t t ν ν (2) S = S + g ˆ(k )ν,dk = μ (k )dt + η (k )dB , (31) t t t t t k k t t t (1) (2) where B and B are Brownian motions independent of W,buthave instantaneouscor- relation with each other equal to ρ ∈ [−1, 1]. Furthermore, the functions g, μ ,and η k k should be chosen so that g ˆ(k )> 0 for all t (otherwise temporary price impact would work in the agent’s favour and she could make unbounded prots). fi In this model, price impact depends linearly on trading speed (as was assumed in Propositions 3.1 and 3.2), but the magnitudeofthe eeff ctisdrivenbythe stochastic processes b and k.The graphicalLOB representation is therefore given by a flat line which uc fl tuates through time in the vertical direction. When f (b ) and g ˆ(k ) decrease, indicating a low impact state, the flat line level t t of the LOB moves up and there is high liquidity in the market. The agent’s performance criterion is taken as stated in (25) with a running penalty on inventory, and so as discussed earlier this can be interpreted as the agent being sensitive to the risk associated with midprice changes, but when considering the stochastic nature of price impact she only accounts for the potential benefit and detriment in terms of expected APPLIED MATHEMATICAL FINANCE 191 wealth. These changes to the assumed dynamics complicate the HJB analysis which was used to arrive at the conclusions of our previous propositions, so the authors resort to asymptotic approximations of the dynamic value function and optimal trading speed. The authors show that a zeroth order approximation in this setting can be obtained by the same expression as in (24) by making the substitutions b → f (b) and k → g ˆ(k). This means the agent acts according to the previously examined optimal strategy, but replaces the price impact parameters with the corresponding value of what is observed at any particular time. The rfi st order correction to this strategy does not have a simple expression in general. But if the agent is forced to liquidate her position by time T (α →∞) and if she does not impose the running inventory penalty (γ = 0), then a first order approximation is given by 1 μ (k)g ˆ (k) 1 μ (b)f (b) k b ∗ ∗ ν ˆ (t, q, b, k) =ˆ ν (t, q, b, k) − q − (T − t)q, (32) 1 0 ˆ ˆ 2 g(k) 6 g(k) where the term ν ˆ is understood to be the strategy in (24) along with the substitutions ˆ ˆ b → f (b) and k → g ˆ(k).Notethatatanyfixedtime t with given impact parameters f (b ) and g ˆ(k ), this rst fi order strategy still liquidates the asset at a speed proportional to the remaininginventory.The twocorrectiontermsstem fromthetwodieff renttypesofimpact and can be interpreted in terms of what the agent expects the impact to be in the future, thereby acting accordingly. If we suppose μ (k) and g ˆ (k) are both positive, then in the shortterm,thetemporary priceimpactisexpectedtoincrease, so theagent is incentivized to trade faster now to avoid higher transaction costs. Similarly, if we suppose μ (b) and f (b) are both positive, then permanent price impact is expected to increase, and the agent should trade faster now so that her remaining pool of shares does not have their value negatively aect ff ed. This term depends on the remaining trading time T−t because the effects of permanent price impact as modelled here accumulate over time, and so become insignica fi nt at the end of the trading horizon. Similar arguments hold accordingly if there are changes in sign of μ (k) and g ˆ (k). Both of these terms are scaled by the instanta- neous temporary impact 1/g ˆ(k) because any expected benefits from trading faster must be weighed against the instantaneous costs of acquiring those benetfi s. Numerous other works have also investigated the effects of stochastic liquidity on opti- mal execution. A similar model is proposed in R. Almgren (2012)wherethereisno permanent price impact, but the midprice volatility is allowed to be stochastic rather than constant. This work also recognizes that substituting appropriate parameters into a strat- egy such as (24) with the observed stochastic counterparts at time t (k → k and σ → σ ) t t is a plausible strategy. In general, it is found that the agent trades quickly when price impact is low, volatility is high, or the end of the trading period approaches. Both Cherid- ito and Sepin (2014) and Ackermann, Kruse, and Urusov (2021) take a fully discrete-time approach to this problem, each considering different aspects of the dynamics to be stochas- tic and classifying the optimal trading strategies in terms of recursion equations. Other workswithelementsofstochasticprice impact areIshitaniand Kato(2014), Becherer, Bilarev, and Frentrup (2018), Fruth, Schöneborn, and Urusov (2019), Fouque, Jaimungal, andSaporito(2022), and Cartea, Drissi, and Monga (2022), some of which will be discussed in a subsequent section on resilience and transient price impact. 192 R. DONNELLY 4.2. Stochastic Volatility of Midprice In the models described in Section 3, the only exogenous source of randomness comes from the midprice innovations which were assumed to have constant volatility. In reality, the volatility will change through time due to a variety of factors in the LOB. There is generally increased activity at the beginning and end of the trading day, both in terms of thenumberofLOsbeingplacedand cancelled, andinthenumberofMOssubmitted. Each of these events can change the midprice, so we end up with seasonality of volatility in theLOBoverthecourseoftheday. Additionally,many events in theLOBareself-exciting, in particular there is a large spike in the intensity of events for a short time after MOs, and so spikes in volatility may also be observed around these times. The changing volatility results in different levels of risk for any given inventory level, and accounting for this can help a risk-averse agent improve her performance. We have already made brief mention of R. Almgren (2012) and Cheridito and Sepin (2014) which consider models with stochastic volatility of the midprice. Here we pro- vide some discussion of Chan and Sircar (2015) which gives approximations to the optimal trading strategy in fast or slow variation volatility regimes, which correspond to the season- ality and spike eeff cts described previously. To this end, we replace the midprice dynamics in Equation (6) with dS = σ(Z )dW , (33) t t where σ is a function of the stochastic process Z , which determines the instantaneous volatility of the midprice. Permanent price impact is not considered in Chan and Sir- car (2015), but an additional drift term representing an observable signal is included which we have suppressed in our present discussion. We return to the modelling of observable signals in Section 5.1. When analysing a slow volatility factor, the dynamics of Z are taken as dZ = μ (Z )dt + η (Z )dB , (34) Z Z t t t t where B isastandardBrownianmotionindependentof W, whereas the fast volatility factor uses dynamics 1 1 dZ = μ (Z )dt + η (Z )dB . (35) Z Z t t t t In each case  will act as a parameter of asymptotic expansion which allows for approx- imations of the optimal strategy when the value of the parameter is small. Chan and Sircar (2015) consider multiscale stochastic volatility with slow and fast factors simultane- ously, but we only discuss the resulting effects individually. Additionally, they have assumed an inn fi itely long trading horizon with a performance criterion of the form 2  ν 2 2 E − σ (Z )(Q ) − kν dt , (36) t t t which results in a trading strategy that does not explicitly depend on time and brings more focus to the effects of stochastic volatility. Otherwise, the general behaviour of the optimal strategy is largely unchanged from what would arise in a ni fi te horizon. As may be expected APPLIED MATHEMATICAL FINANCE 193 after the discussion in Section 4.1, the approximations are largely based on the strategy which maximizes (36) when volatility is assumed constant. This can be shown to be γσ ∗ 2 ν (q; σ ) =− q, 2k which is equivalent to (24) in the limit T →∞. Under slow scale volatility, the approximation of the optimal strategy is found through perturbation analysis of the HJB equation. The resulting strategy in feedback form is γσ (z) ∗ ∗ 2 ν ˆ (q, z) = ν (q; σ (z)) =− q, 2k which we note involves a similar substitution to previously discussed approximations, where parameters are replaced with their currently observed values through time. We also seethatthisapproximationdoesnotincludeanexplicitdependenceonthe parameter . The smallest order corrections considered in Chan and Sircar (2015)involvesastochas- tic drift correlated to the slow scale volatility. In this case the appropriate correction is proportional to  as well as the correlation between the drift and volatility. Thefastscale volatilityis treatedwithsingularperturbationanalysistoarriveatan approximate optimal control, with resulting approximation given by γ q ∗ ∗ 2 2  2 ν ˆ (q, z) = ν (q; σ ) − E σ (Z ) − σ | Z = z du, (37) 0 u t 2k where σ is the long-term expected variance. This expression has contributions from two terms which represent a zeroth order approximation and first order correction. The 2 2 zeroth order approximation makes a familiar parameter substitution σ → σ ,but the rfi st order correction involves future expected deviations of the variance from its long-term expected value. If future variance is expected to be larger than the long-term expected vari- ance, then the agent desires faster liquidation of her shares to avoid the associated inventory risk. Other work of interest is Colaneri et al. (2020) where the authors assume that innova- tions in asset price are driven by a regime switching pure jump process of the form dS = S −dR , (38) t t t where R has compensating random measure η(t, Y −, ν −;dz)dt with Y afinite state t t Markov chain. Whilethisisnotstrictlyastochasticvolatilitymodel becauseitlacksadiffu- sive term, the price innovations dR being driven by a Markov chain means that the price process will share some qualitative features of a stochastic volatility model. The depen- dence of the compensating measure on the trading strategy ν also captures permanent price impact eeff cts. In this case, the more complicated nature of the model leads to numerical methods for computing the optimal strategy. 4.3. Resilience and Transient Impact Thusfarwehaveonlydescribed twotypes of priceimpacteeff cts(permanentandtem- porary) which are included in the models of many algorithmic trading works. But when 194 R. DONNELLY Figure 5. Ask side of the LOB showing resilience and transient price impact after a trade. The left panel shows minimal spread (best ask price coincides with reference price). In the middle panel, an MO walks the book and exhibits temporary price impact. Immediately after the MO in the right panel, the orders are filled, a new reference price is established due to permanent impact, and the new best ask price will gradually decay back to the new reference price. A subsequent order submitted immediately will be subjected to higher prices than an order submitted after waiting some time. discussing the nature of price changes after the occurrence of a transaction, there are other dynamicaleeff ctsatplay.In particular,whenmultipleLOs arefilled in theLOB dueto a transaction, the resulting gap in the LOB is not replenished instantaneously as Figure 3 suggests.Instead ittakestimeforLOstobesubmittedin placeofthose thatwere filled.The rate at which these orders are replenished is called resilience, and thus Figure 3 suggests a market with infinite resilience. In Figure 5 we show (from the ask side of the book) how resilience might be interpreted in an idealized continuous LOB. After a transaction takes place (filling sell orders in the book), if no further transactions were to occur then these sell orders may gradually be replenished until a new stable best ask price is established. However, if a subsequent trans- action were performed after only a short time, then it would be subjected to higher prices thanifit hadoccurredafteralongtime. This dieff renceintransaction pricewhich depends on the amount of time between trades is called transient price impact. We focus on describing Chen, Horst, and Tran (2019)inmoredetail, as theframework is most similar to what has been presented here so far. A general formulation of transient price impact can be incorporated into the dynamics by introducing an impact function and decay kernel, denoted by a strictly increasing function h and a non-increasing function G, respectively, and defining the inventory, midprice, and cash processes by Q = Q + ν du, (39) 0 u t t S = S + bν du + h(ν )G(|t − u|)du + σ W , (40) 0 u u t 0 0 ν ν X = X − S ν du, (41) 0 u t u ν ν S = S + kν . (42) t t Here we have taken the permanent and temporary price impacts to be linear with propor- tionality constants b and k.Thefunction h determines the magnitude of transient price impact and the decay kernel G controls how quickly the impact of trading decays through time.InChen,Horst, andTran(2019) the transient impact is linear with h(ν) = λν and APPLIED MATHEMATICAL FINANCE 195 the decay is exponential with G(|t − u|) = exp(−ρ|t − u|). Thus, the parameter λ deter- mines the distance from the new reference midprice to the best ask in the right panel of Figure 5, and the distance between the best ask and the new reference midprice decays exponentially. See Gatheral (2010) for some detailed discussion of other impact functions and decay kernels, and consequences of different models with respect to statistical arbitrage opportunities. The agent is risk-neutral, but with the additional constraint that the inventory process must satisfy Q = 0. Thus, the performance criterion reduces to ν ν H (x , q , S ) = E[X ], (43) 0 0 0 andtheagentattemptstomaximizethisovertradingspeeds {ν } which fully liquidate t t∈[0,T] the inventory by time T. Recall that this type of constraint can be applied formally to the strategy given in Proposition 3.1 by taking α →∞, and it is to the trading strategies in this limit that we shall compare the optimal strategy with transient impact. Other works mentioned at the end of this section allow for strategies that do not fully liquidate the position by incorporating a liquidation penalty. Proposition 4.1: The trading strategy which maximizes (43) is given by φT φT T 2 2 φ (φ k − λρ) ρ cosh + φ sinh + λρ cosh φ t − 2 2 2 ν =− q , φT φT 2 2 4 φρT cosh (φ k − λρ) + sinh (λρ(2ρ − φ T) + φ kT) 2 2 (44) which results in the inventory process A + Bt + C sinh φ t − ν 2 Q = q − q , (45) 0 0 2A + BT where φ = ρ ρ + , (46) λ φT A = sinh , (47) k 2 φT φT B = φρ cosh + φ sinh , (48) 2 2 C = . (49) In Figure 6 we show the optimal trading speed and resulting inventory for various values of the transient impact parameter λ (top row) and decay rate ρ (bottom row). When λ = 0 the optimal rate of trading is constant at q /T which is the TWAP strategy in (19). This is expected because λ = 0 corresponds to the same dynamics given in Equations (5) to (8). When ρ = 0wealsoobtainthe TWAP strategy,which isconsistentwith intuitionbecause if there is no resilience, the transient price impact becomes a permanent price impact, so 196 R. DONNELLY Figure 6. Optimal trading speed and inventory process with transient price impact for various values of transient impact parameter λ and decay coefficient ρ. Top row panels: other parameter values are k = 0.01, b = 0.01, ρ = 1. Bottom row panels: other parameter values are k = 0.01, b = 0.01, λ = 0.3. the dynamics are equivalent to Equations (5) to (8) with the substitution b → b + λ.The last case in which the optimal trading speed is constant is in the limit ρ →∞ which again corresponds to the dynamics of (5) to (8). For parameter values which do not give rise to the TWAP strategy, the typical optimal behaviour is to trade at faster speeds closer to the beginning and end of the trading period, and at slower speeds closer to the middle of the trading period (compared to the constant rate of q /T).Thereasonforthisisbecausetheagentwantstominimizethetimewhenthey are trading under significant effect of transient price impact. A single large trade will suer ff from temporary price impact, but it is not until subsequent trades are submitted that the effects of transient impact are encountered. By trading quickly early on, there will be more time spent in a state where the exponential resilience has brought the price back towards its unaeff cted level. Then, when the transient price impact has decayed significantly, the agent can again place large trades to complete the liquidation. For other descriptions and works which model resilience and transient price impact, see Gatheral (2010), Alfonsi, Fruth, and Schied (2010), Predoiu, Shaikhet, and Shreve (2011), Obizhaeva and Wang (2013), Guo and Zervos (2015), Graewe and Horst (2017), Becherer, Bilarev, and Frentrup (2018), Bank and Voß (2019), Fruth, Schöneborn, and Urusov (2019), APPLIED MATHEMATICAL FINANCE 197 Ekren and Muhle-Karbe (2019), Horst and Xia (2019), Forde, Sánchez-Betancourt, and Smith (2021), and Neuman and Voß (2022). 4.4. Non-linear Price Impact Most of the results in previous sections rely on assuming that price impact functions are linear with respect to the rate of trading, even though the original models proposed in Sec- tions 3.1 and 3.2 have general impact functions. In this section we explore an optimization in which impact functions may be non-linear with respect to the rate of trading, with a focus on R. F. Almgren (2003). This work begins with a model in discrete time which is based on that of Section 3.1, where the nature of temporary price impact is modified so that Equations (1) to (4) become = Q + V , (50) n+1 n n S = S + f (V ) +  , (51) n+1 n n n+1 X = X − S V , (52) n+1 n n n S = S + g(V ) + h(V )ξ , (53) n n n n n+1 where {ξ } is a sequence of independent random variables with mean zero, unit vari- n=1 ance, and which are also independent from { } . Thus, the temporary price impact is n=1 composed of the usual term, g(V ), which depends only on the trade submitted at time n,butalso hasastochasticcomponent givenby h(V )ξ . This type of stochastic depen- n n+1 dence of temporary impact could be the result of several factors, for example the latency between the time that a trade decision is made and the time at which it is matched against passive orders during which the depth of available orders may have changed. See Cartea and Sánchez-Betancourt (2021a) which considers this stochastic delay directly. The agent attempts to maximize a mean-variance style criterion with respect to her N−1 terminal wealth subject to the constraints Q = 0 and that the sequence {V } is N n n=0 deterministic. The agent’s terminal wealth is given by N−1 N−1 N−1 N−1 X = X + S Q + Q f (V ) + Q  − V g(V ) − V h(V )ξ , N 0 0 0 n n n n+1 n n n n n+1 n=0 n=0 n=0 n=0 (54) which, under deterministic trading strategies, has expectation and variance N−1 N−1 E[X ] = X + S Q + Q f (V ) − V g(V ), (55) N 0 0 0 n n n n n=0 n=0 N−1 N−1 2 2 2 2 V[X ] = σ Q + V h (V ), (56) N n n n n=0 n=0 2 2 where σ = E[ ]. Before optimizing the trade-off between these two terms, they are both replaced by their continuous-time analogues E[X ] = X + S Q + Q f (ν ) − ν g(ν ) dt, (57) T 0 0 0 t t t t 0 198 R. DONNELLY 2 2 2 2 V[X ] = σ Q + ν h (ν ) dt. (58) T t t t The objective of the agent is to maximize the mean-variance criterion H = E[X ] − V[X ]. (59) T T Recognizing that ν = Q frames this computation into a standard problem of the calculus of variations. After classifying optimality in R. F. Almgren (2003), closed-form expres- sions for the strategy are found by allowing the horizon T to approach infinity. First, we investigate the effect of non-linear temporary impact with zero stochastic impact such that f (ν) = bν, g(ν) = kν , h(ν) = 0, (60) with m > 0. In this case the optimal inventory path is given by 1+m ⎪ 1−m 1 − m t ⎪ q 1 + ,if0 < m < 1, ⎪ ∗ ⎪ 1 + m T ∗ t Q = q exp − ,if m = 1, (61) m+1 ⎪ m−1 m − 1 t m+1 ⎪ ∗ ⎩ q 1 − ,if m > 1and t ≤ T , m−1 m + 1 T where m+1 m−1 2mkq ∗ 0 T = . (62) γσ A comparison with (26) in the limit T →∞ shows that it coincides with the optimal strat- egy above when m = 1. This is expected because the price impact functions are the same and the optimal strategy which results from (21) is deterministic. The eeff cts of the non- linear impact function g on the optimal strategy are shown in Figure 7. The other case which yields a closed-form expression for the inventory path is f (ν) = bν, g(ν) = kν, h(ν) = a, (63) with a > 0. The corresponding optimal inventory path is given by q exp − , (64) where 2k + γ a T = . (65) γσ The effect of the stochastic component of price impact can be seen in Figure 8,which is toslowtherateoftrading compared tothebenchmark of(26).Thisisbecause trading quickly magnifies the risks associated with uncertainties in temporary price impact caused APPLIED MATHEMATICAL FINANCE 199 Figure 7. Optimal inventory paths from (61) for various values of parameter m in temporary impact function g(ν) = kν . Other parameter values are σ = 1, k = 0.01, b = 0.01, γ = 1. Figure 8. Optimal inventory paths from (64) for various values of parameter a in stochastic temporary impact function h(ν) = a. Other parameter values are σ = 1, k = 0.01, b = 0.01, γ = 1. by the term aV ξ in (53). The agent is willing to accept more risk associated with the n n+1 uncertainty in future price in order to lower the certain costs and riskiness associated with temporary price impact by trading slower. Additional work related to optimal trading with non-linear impact functions can be found in Bouchard, Dang, and Lehalle (2011) and Colaneri et al. (2020). Various discus- sionsare giveninGatheraland Schied(2013)relatedto theformofimpactfunctionand midprice dynamics. 5. Alternative Objectives In each of the previous sections, the perspective of the optimization problem was that of an agent wishing to liquidate a position subject to various eect ff s related to market dynamics. Within this section, we consider agents who are not necessarily attempting to liquidate some initial holdings, but instead behave according to an alternative objective. Inspection 200 R. DONNELLY of the optimal trading strategies in previous sections shows that if the agent’s inventory begins at zero, then she will do nothing over the course of the trading interval (as her position is already liquidated, any further trading will only be subjected to costs without benetfi ).Duetothealternativeobjectiveswenow consider,thisresultnolongerholds. 5.1. Speculation, Trade Signals, and Statistical Arbitrage From a modelling perspective, a straightforward method of introducing statistical arbitrage and the ability of an agent to speculate is to assume that the unaect ff ed midprice of the asset is no longer a martingale. In each of the models considered in previous sections, if the agent doesn’t trade then the midprice process is a martingale. If instead the unaeff cted price has non-zero expected change in value over some time interval, then any non-zero inventory position will also be expected to change in value. The essence of a protfi able strategy in this situationistoholdinventory positionswhichtake advantageofthesenon-zero expected price changes, but to do so in an efficient manner so as to not suffer from price impact effects or a future change in the asset’s expected drift. We focusonCarteaandJaimungal(2016b) which introduces a stochastic drift into the midprice process. The interpretation of the drift in this work is that it arises from the per- manent price impact of total order flow of all market participants, but it is statistically equivalent to having a short term price predictor. Let μ ={μ } be a stochastic process t 0≤t≤T which satisefi s the Markov property, and assume it is adapted to the filtration {F } t 0≤t≤T weareworking with.Wespecifythedynamicsofthe relevant processestobe dQ = ν dt, (66) dS = (μ + bν )dt + σ dW , (67) t t t ν ν dX =−S ν dt, (68) t t ν ν S = S + kν , (69) t t where b >0and k > 0 are constants. Other than the inclusion of the drift μ in (67), these equations are equivalent to (5) to (8) (with f and g replaced by linear functions). The value function of our agent will correspond to the performance criterion given in (25), that is ν ν ν ν 2 ν 2 H(t, X , Q , S , μ ) = sup E X + Q (S − αQ ) − σ (Q ) du | F , (70) t t t t t T T T T u ν∈A t with α> 0and γ> 0 constant. As discussed shortly after (25), this performance criterion can be interpreted as the agent expressing risk-aversion with respect to the random mid- price changes, but risk-neutrality with respect to randomness associated with the stochastic drift process μ. At this point, no specific dynamics for the process μ have been specified. As long as this stochastic drift term satisfies some technical integrability conditions, the value function and optimal trading strategy can be expressed in terms of expected future drift, as shown in the following Proposition. APPLIED MATHEMATICAL FINANCE 201 Proposition 5.1: The value function in (70) is given by H(t, x, q, S, μ) = x + qS + h (t, μ) + h (t, μ)q + h (t)q , (71) 0 1 2 − −ω(T−t) + ω(T−t) β ψ e − ψ e b h (t) = − , (72) − −ω(T−t) + ω(T−t) 2 ψ e + ψ e 2 T − −ω(T−u) + ω(T−u) ψ e + ψ e h (t, μ) = E[μ |F ]du, (73) 1 u t − −ω(T−t) + ω(T−t) ψ e + ψ e h (t, μ) = E[h (t, μ )|F ]du (74) 0 u t 4k with γσ β = 2kγσ , ω = , ψ = β ± 2α ∓ b, 2k and the optimal strategy in feedback form is h (t, μ) (2h (t) + b) 1 2 ν (t, q, μ) = + q. (75) 2k 2k A comparison of the optimal strategy (75) to the liquidation strategy without signals given in (24) shows that the inventory component of the feedback form is identical, but the strategy with signals has an additional component which comes from the agent’s forecast of the expected future asset price built into the term h (t, μ). This term could naturally be compared with the more straightforward quantity E[μ |F ]du, which represents u t the expected change in the unaec ff ted midprice from time t to T. While this itself yields a forecast of how much each unit of the agent’s inventory is expected to change in value, thereareotherconsiderationsthattheagentmusttakeintoaccount(namely priceimpact effects, liquidation penalties, and risk) when taking advantage of this signal. This gives rise to the complicated discount factor within the integral which balances the trade-off between taking advantage of the signal while acknowledging that this position will be partially liq- uidated close to time T and that any inventory holdings have inherent risk of price changes. If we assume 2α> b, then inspection of this discount factor shows that it is decreasing from 1 as u ranges from t to T and remains positive. This indicates that the expected price changesintheimmediatefutureasassignedmoreweighttowardsthetradingstrategycom- paredtoexpectedpricechangesinthe distantfuture. This arisesbecausetheagentwould like to enter a position as early as possible to take advantage price changes over as long a time horizon as possible, but also because there is less risk associated with acting on immediate forecasts compared to distant ones. There are numerous other works in which some form of trade signal or price pre- dictor are incorporated. See for example Chan and Sircar (2015), Cartea, Donnelly, and Jaimungal (2018a), Casgrain and Jaimungal (2019), Lehalle and Neuman (2019), Don- nelly and Lorig (2020), Forde, Sánchez-Betancourt, and Smith (2021), Bergault, Drissi, and Guéant (2022), and Neuman and Voß (2022). 202 R. DONNELLY 5.2. Hedging The main focus of the previous section was not about optimally liquidating an asset posi- tion, but rather attempting to maximize prots fi that could be extracted from knowledge about expected price changes. Nevertheless, both optimal liquidation and speculative trad- ing have the common theme that they try to minimize costs, or equivalently maximize profits. Here we consider an agent who has an additional objective which is not directly related to cost minimization, although that will still be a consideration which will influence her trading strategy. InBank,Soner,and Voß(2017), the authors consider an agent who is tasked with opti- mally tracking a hedge ratio which arises from attempting to minimize the risk associated with holding another security. Mathematically, they assume that there is a stochastic pro- cess ξ ={ξ } which indicates an inventory level which the agent intends to track t 0≤t≤T ecffi iently. The dynamics of speciefi d processes and the performance criterion used by the agent are given by dQ = ν dt, (76) dX =−(S + kν )ν dt, (77) t t t ν ν ν ν 2 H (x , q , S ) = E X + Q S − (Q − ξ ) dt , (78) 0 0 0 T t T T where the midprice process Sisamartingalewhichisnotsubjecttotheeeff ctsofpermanent price impact. Within the performance criterion, the rfi st two terms reward the agent for trading optimally subject to temporary price impact. The second term gives incentive for the agent to closely track the indicated inventory level ξ. With an optimization criterion in hand, the authors characterize the optimal trading strategy and value function. For this it is convenient to introduce the notation T − t τ(t) = √ , (79) 2k T τ(u) −τ(u) 1 e + e ξ = √ E ξ du | F . (80) t u t τ(t) −τ(t) e − e 2k t Note the similarity between Equations (80) and (73). In Equation (73), the relevant quan- tity at any point in time is not the trade signal μ itself, but instead the optimal strategy dependsonthe expectedvalueofthe signal atall timesinthe future,modiefi daccord- ing to an integration kernel. A similar theme occurs in (80) where the relevant quantity to the optimal efficient hedging strategy is not the intended hedging target ξ ,but aweighted average of the expected future value of the target. The precise way that the target hedge ratio enters the optimal trading strategy is summarized by the following proposition. APPLIED MATHEMATICAL FINANCE 203 Proposition 5.2: The value function corresponding to the performance criterion in (78) is given by H(x , q , S ) = x + q S − tanh(τ (0))(q − ξ ) 0 0 0 0 0 0 0 T T 1 k − E (ξ − ξ ) dt − E tanh(τ (t))d ξ , (81) t t 2 2 0 0 and the trading strategy which achieves this performance yields an inventory process that satisefi s tanh(τ (t)) dQ = √ (ξ − Q )dt, Q = q . (82) t t t 0 0 2k InEquation(82)we seethatatany time,the agentwilltradesuch thatherinventory moves towards the weighted target ξ with speed being proportional to the current hedging error, with the coefficient being a function depending on the remaining time T−t and the temporary price impact parameter k. This function increases with respect to the remaining time, indicating that the agent puts more eoff rt into tracking the hedge ratio when there is a long amount of time remaining in the trading interval. Indeed, when close to the end of the trading horizon, the performance criterion has relatively small penalty associated with theremainingtracking error,sotheagent deemsitmoreecffi ientto minimize thelosses due to trading rather than holding an accurate hedge. In Bank,Soner,andVoß(2017) the authors also solve this hedge tracking problem when a terminal inventory constraint is imposed of the form Q =  ,for agiven ∈ L (,F ,P). Other works related to efficient hedging include R. Almgren and T T Li (2016), Guéant and Pu (2017), and Ekren and Nadtochiy (2022)inwhichanagent holds a European option on the traded asset and attempts to maximize the expected util- ity or mean-variance performance of her terminal wealth. The work Cartea, Donnelly, and Jaimungal (2020) is similar except the agent holds a contingent claim on a non-traded risky quantity which is hedged by a correlated asset, and provide closed form trading strategies which approximate the optimal hedge. 5.3. Targeting VWAP Each of the previous sections considers an agent that measures her performance using a criterion which is internal. That is, there is no sense of measuring performance relative to other market participants or market state. In this section, we consider an agent who wishes to employ a trading strategy with performance measured by the average execution price of her trades relative to the volume weighted average price (VWAP). The time weighted average price (TWAP) can be captured by any trading strategy which trades at a constant speed over the time interval of interest, and such a strategy has already been seen in Equa- tions (19) and (20). Unlike TWAP, VWAP depends on the actions of all market participants and measures the average transaction price of all trades over a time period, and the uncer- tainty in the actions of all other market participants makes it a more difficult quantity to target. 204 R. DONNELLY Suppose that all trades over our time period of interest occur at a sequence of times denoted τ ={τ } , and that the trade at time τ has volume V which is executed when n n n n=1 the midprice is S . Then the VWAP for this time period is V S n τ n=1 . (83) n=1 Working in continuous time, if we denote total order-flow in the market by μ ={μ } , t 0≤t≤T then the appropriate expression for VWAP is μ S dt t t . (84) μ dt VWAP is a frequently used benchmark for a trading strategy because it represents the trade performance across the entire market, and additionally is easy to compute. When an agent is trying to optimally liquidate a block of shares with performance measured relative to VWAP, she must time her trades throughout the trading interval so that her average trade price is close to this target. The main difficulty is that VWAP is uncertain because there is randomness in future market order-flow. This difficulty is present in addition to try- ing to maintaining trade efficiency regarding price impact considerations. One method of attempting to target VWAP is to instead target the percentage of volume (POV) at each point in time. This is the approach taken in Cartea and Jaimungal (2016a)which wenow focus on. Let μ ={μ } denote the order-flow of buy and sell orders in the market, both 0≤t≤T assumed to be positive. Then the total order-flow at time t including the agent’s trading is + − given by μ + μ +|ν |. The midprice is assumed to be linearly impacted by all order- flow, and so its dynamics are ν + − dS = b(μ − μ + ν )dt + dM , (85) t t t t t ± ± where M is a martingale independent of μ . Similar to Section 5.1 we assume that μ have the Markov property with respect to our filtration. Instead of directly trying to target VWAP, the agent will attempt to track POV through time. Let ρ ∈ (0, 1) be xfi ed and den fi e ν + − χ = ρ(μ + μ +|ν |), and specify the agent’s dynamic value function to be ± ν ν ν ν ν 2 H(t, X , Q , S , μ ) = sup E X + Q (S − αQ ) − ϕ (|ν |− χ ) dt | F , (86) t t t u t t T T T T u ν∈A where ϕ> 0isapenalty parameterfor deviatingfromPOV.Tosee whytheinclusionof the POV target assists in targeting VWAP, suppose that the agent is only selling, meaning ν ≤ 0, and that her trading speed exactly matches the percentage of total order-flow, giving + − ν =−ρ(μ + μ +|ν |). Then the agent’s trades, marked to the midprice as in (84), are t t t t executed at an average value of T T + − ν ν ν S dt (μ + μ +|ν |)S dt t t t t t t 0 0 = , (87) T T + − ν dt (μ + μ +|ν |)dt t t t t 0 0 and thus she has exactly targeted VWAP. Unfortunately, by following this strategy she will likely fail to liquidate her initial inventory position, and so inclusion of the penalty APPLIED MATHEMATICAL FINANCE 205 term in (86) initiates a trade-off between targeting VWAP and efficiently liquidating her inventory as represented by the other terms in the performance criterion. For the remainder of this analysis we assume that the agent’s optimal trading strategy will consist only of selling, meaning we may replace occurrences of |ν | with −ν ,resulting t t in simplified computations. This assumption will likely be the case when the agent’s initial inventory is large and positive and when the need for liquidation outweighs the potential benetfi s from buying shares of the asset due to price forecasts similar to the treatment in Section 5.1. Under this assumption, the value function and optimal strategy are given by the following Proposition. Proposition 5.3: The value function in (86) is given by ± ± ± 2 H(t, x, Q, S, μ ) = h (t, μ ) + h (t, μ )q + h (t)q , (88) 0 1 2 −1 h (t) =−(ϕ ˆ + k) ((T − t) + ζ ) − (89) (T − s) + ζ ± + − h (t, μ ) = b E[μ − μ |F ]ds 1 t s s (T − t) + ζ 2ϕ ˆρˆ + − + E[μ + μ |F ]ds (90) s s (T − t) + ζ T ± + − 2 (h (s, μ ) − 2ϕ ˆρ( ˆ μ + μ )) ± s s s 2 + − 2 h (t, μ ) = E −ˆ ϕρˆ (μ + μ ) | F 0 t s s 4(ϕ ˆ + k) (91) with 2(ϕ ˆ + k) ζ = , ϕ ˆ = ϕ(1 − ρ) , ρˆ = ρ/(1 − ρ), 2α − b and the optimal trading strategy in feedback form is ± + − h (t, μ ) − 2ϕ ˆρ( ˆ μ + μ ) ∗ ± −1 ν (t, q, μ ) =− ((T − t) + ζ ) q + . (92) 2(ϕ ˆ + k) In (92) the first term which depends on inventory resembles a strategy that targets TWAP, and the remaining term is an adjustment that attempts to protfi from expected + − future price forecasts (as seen from the term which includes E[μ − μ |F ]) and the s s + − desire to closely target POV (as seen in all terms including μ + μ ). Further intuition about the optimal strategy can be gathered when considering rfi st the limit α →∞,then the limit ϕ →∞. The rfi st of these limits ensures that the agent fully liquidates her posi- tion by time T, and the second limit drives her to match POV as closely as possible under the full liquidation constraint. Evaluating these limits gives 1 1 ∗ ± + − + − lim lim ν (t, q, μ ) =− q −ˆ ρ (μ + μ ) − E[μ + μ |F ]ds . s s ϕ→∞ α→∞ T − t T − t (93) The dependence on expected future price has disappeared because of the infinite weight associated with full liquidation and volume tracking. The form of makes it clear that the 206 R. DONNELLY agent’s trading strategy has a component which directly depends on the current total order + − flow μ + μ ,buttheinufl enceisadjusteddownwardduetotheinufl enceoffuturetrades. In Cartea and Jaimungal (2016a)itisalsoshown howanagentmaytargetVWAPby tracking percentage of cumulative volume (POCV) rather than POV. Other works which consider an agent attempting to match her trade executions to market VWAP are Kon- ishi (2002)and Frei andWestray (2015). Both derive expressions for strategies which minimizetheVWAPtargeting error,butrelyonfutureknowledgeofthetotalmarket vol- ume. Astudyof guaranteed VWAPcontractsisconducted inGuéant andRoyer(2014), in which an agent guarantees to provide an execution service at VWAP prices, but then mitigates the risk of having entered this contract. See Białkowski, Darolles, and Fol (2008) and Mitchell, Białkowski, and Tompaidis (2020) for empirical studies of strategies which target VWAP. 5.4. Other Objectives There are numerous other criteria that an agent could consider for the purposes of evaluat- ing the performance of her trading strategy. Here we briefly mention some lines of research that do not explicitly fall into any of the categories of the previous few sections. In Neuman et al. (2020)the agentisconsideredtobeacentralbank,andshe istasked with maintaining a currency peg or currency target zone, the goal of which is to make sure that a particular foreign exchange rate remains within a specified domain. This is done bytradingintheunderlyingcurrenciesandutilizing thepermanent priceimpactfrom the trades in such a way as to move the exchange rate to its desired value. This task has significant inventory risk associated with it which the central bank wishes to manage. In Neuman and Schied (2022), a similar currency peg context is considered, but in addition to a central bank maintaining the peg there is a strategic investor attempting to disrupt thebank’sobjective.Thisismodelledasastochasticdieff rentialgame andaStackelberg equilibrium is established. The operation of modern markets is strongly dependent on technology, and the fre- quency at which events occur mean that latency can have an affect on the performance of an execution strategy. From the time that an agent submits her intention to trade until that signal arrives at the matching engine of the exchange, the state of the limit order bookmayhavechangedin suchawaythattheagent’s ordercannotbefilled.InCartea and Sánchez-Betancourt (2021b) the agent acknowledges that latency can aeff ct whether her submitted orders are executed as expected, and adjusts her performance criterion to account for the fraction of her trades that are filled. See also Moallemi and Sağlam ( 2013) for some empirical analysis of the effects of latency on optimal execution strategies. 6. Other Phenomena Below is a small number of other styles of model which do not explicitly fall into any of the previous categories. Many of the ideas explored in the following sections could be adapted in conjunction with the topics explored already. The referenced material should be consulted for the mathematical details used to approach these phenomena. APPLIED MATHEMATICAL FINANCE 207 6.1. Multiple Agents Each setting considered so far has incorporated the strategic actions of only one agent. Some of the previous models have incorporated total order flow of the market within the dynamics, but any such model does not arise as a result of an optimization procedure car- ried out by other agents. In reality, no single agent acts in isolation, and their actions will have an effect on the state of the market, thereby altering the actions of other agents. When modelling several agents simultaneously, we move from a stochastic optimization into a stochastic game setting. In Schied and Zhang (2019) a competitive game is considered between a finite number of agents where they are restricted to trading at discrete times. Continuous-time counterparts ares studied in Strehle (2017) and Neuman and Voß (2021), and in Voß (2022)atwoagent setting is considered in which they must both efficiently track an inventory target. Finite player stochastic differential games generally result in a set of coupled differential equations which can prove to be intractable. Modeling the problem as a mean-field game, appropriate for a large number of agents, often reduces the complexity of the computations. The optimal liquidation problem in a mean eld fi setting is detailed in Huang, Jaimungal, and Nourian (2019)and Fuetal.(2021). In Casgrain and Jaimungal (2020)andDonnelly and Lorig (2020) a mean-field game is used to consider agents who have different methods of speculation about the future price of the asset. 6.2. Robustness Each of the optimal trading strategies from previous sections depend on a specification of adynamic modelforthepriceprocessandotherrelevantquantities.Butthe performance of any trading strategy is subject to risks arising from using a misspecified model, thus it is desirable to account for these risks and attempt to formulate a trading strategy which is robust to model error. In Schied (2013) the optimality of a trading strategy with respect to the underlying modelisstudied.InNyström,Aly,andZhang(2014), Cartea, Donnelly, and Jaimungal (2018b), andHorst,Xia,and Zhou (2022), the agent’s performance criterion accounts for the possibility that the dynamics evolve according to a dieff rent model, and she thereby adjusts her trading strategy to compensate. 6.3. Market Making and Liquidation with Limit Orders The results which have been outlined in previous sections frame a trading strategy in terms of the speed or volume of trades. Keeping in mind the discussion about the LOB structure, an agent can only guarantee that a particular volume of trades be executed if she employs MOs. Here we give references which study the strategic placement of LOs instead of or in conjunction with MOs. An agent who simultaneously places buy and sell LOs and attempts to maximize protfi s is considered in Avellaneda and Stoikov (2008). Further work on optimal market making is considered in Guilbaud and Pham (2013), Guéant, Lehalle, and Fernandez-Tapia (2013), Cartea,Jaimungal,andRicci (2014), Cartea and Jaimungal (2015b), and Cartea, Donnelly, and Jaimungal (2018a). 208 R. DONNELLY The incorporation of LOs on only one side of the LOB for use in optimal liqui- dation is considered in Guéant, Lehalle, and Fernandez-Tapia (2012), Bayraktar and Ludkovski (2014), Guéant and Lehalle (2015), and Cartea and Jaimungal (2015a). Notes 1. Other matching rules such as Pro Rata are also used, but the intricate details of how they dieff r from FIFO will not be necessary for the types of models considered here. 2. The opposite inequality means the original optimization problem in (12) may lack concavity, and the agent could make infinite prots fi through price manipulations. 3. See Cartea, Donnelly, and Jaimungal (2018a) for some empirical investigation of LOB event frequency seasonality and self-excitation. 4. Refer to the discussion after Proposition 3.1. 5. We take all of these volumes to be positive whether they are buys or sells. One could alternatively consider the VWAP of only buy or sell trades depending on whether the agent is buying or sell- ing, but typically all trades are considered in practice. Additionally, VWAP is typically measured with respect to the midprice without taking transaction or price impact eeff cts into account. Acknowledgments The author would like to thank Fayçal Drissi, Matthew Lorig, and Álvaro Cartea for their comments and suggestions which have improved this manuscript. Disclosure statement No potential conflict of interest was reported by the author(s). References Ackermann J., Kruse T., and Urusov M. 2021. “Optimal Trade Execution in an Order Book Model with Stochastic Liquidity Parameters.” SIAM Journal on Financial Mathematics 12 (2): 788–822. doi:10.1137/20M135409X. Alfonsi A., Fruth A., and Schied A. 2010. “Optimal Execution Strategies in Limit Order Books with General Shape Functions.” Quantitative Finance 10 (2): 143–157. doi:10.1080/1469768080259 Almgren R. 2012. “Optimal Trading with Stochastic Liquidity and Volatility.” SIAM Journal on Financial Mathematics 3 (1): 163–181. doi:10.1137/090763470. AlmgrenR., andChrissN. 2001. “Optimal Execution of Portfolio Transactions.” The Journal of Risk 3: 5–39. doi:10.21314/JOR.2001.041. Almgren R., and Li T. M. 2016.“Option HedgingwithSmoothMarketImpact.”MarketMicrostruc- ture and Liquidity 2 (1): Article ID 1650002. doi:10.1142/S2382626616500027. AlmgrenR.,Thum C.,Hauptmann E.,and LiH. 2005. “Direct Estimation of Equity Market Impact.” Risk 18 (7): 58–62. Almgren R. F. 2003. “Optimal Execution with Nonlinear Impact Functions and Trading-Enhanced Risk.” Applied Mathematical Finance 10 (1): 1–18. doi:10.1080/135048602100056. Avellaneda M., and Stoikov S. 2008. “High-Frequency Trading in a Limit Order Book.” Quantitative Finance 8 (3): 217–224. doi:10.1080/14697680701381228. Bank P., Soner H. M., and Voß M. 2017. “Hedging with Temporary Price Impact.” Mathematics and Financial Economics 11 (2): 215–239. doi:10.1007/s11579-016-0178-4. Bank P., and Voß M. 2019. “Optimal Investment with Transient Price Impact.” SIAM Journal on Financial Mathematics 10 (3): 723–768. doi:10.1137/18M1182267. Barger W., and Lorig M. 2019. “Optimal Liquidation Under Stochastic Price Impact.” International Journal of Theoretical and Applied Finance 22 (2): Article ID 1850059. doi:10.1142/S0219024918500590. APPLIED MATHEMATICAL FINANCE 209 Bayraktar E., and Ludkovski M. 2014. “Liquidation in Limit Order Books with Controlled Intensity.” Mathematical Finance 24 (4): 627–650. doi:10.1111/ma.2014.24.is fi sue-4 . Becherer D., Bilarev T., and Frentrup P. 2018. “Optimal Liquidation Under Stochastic Liquidity.” Finance and Stochastics 22 (1): 39–68. doi:10.1007/s00780-017-0346-2. Bechler K., and Ludkovski M. 2017.“OrderFlowsandLimitOrderBookResiliencyon the Meso-Scale.” Market Microstructure and Liquidity 3 (03n04): Article ID 1850006. doi:10.1142/S2382626618500065. Bergault P., Drissi F., and Guéant O. 2022. “Multi-Asset Optimal Execution and Statistical Arbitrage Strategies Under Ornstein–Uhlenbeck Dynamics.” SIAM Journal on Financial Mathematics 13 (1): 353–390. doi:10.1137/21M1407756. BertsimasD.,andLoA.W. 1998. “Optimal Control of Execution Costs.” Journal of Financial Markets 1 (1): 1–50. doi:10.1016/S1386-4181(97)00012-8. Biais B., Hillion P., and Spatt C. 1995. “An Empirical Analysis of the Limit Order Book and the Order Flow in the Paris Bourse.” The Journal of Finance 50 (5): 1655–1689. doi:10.1111/j.1540-6261.1995.tb05192.x. Białkowski J., Darolles S., and Fol G. Le. 2008. “Improving Vwap Strategies: A Dynamic Volume Approach.” Journal of Banking & Finance 32 (9): 1709–1722. doi:10.1016/j.jbankfin.2007.09.023 . Björk T. 2009. Arbitrage Theory in Continuous Time. Oxford: Oxford University Press. Bouchard B., Dang N.-M., and Lehalle C.-A. 2011. “Optimal Control of Trading Algorithms: A General Impulse Control Approach.” SIAM Journal on Financial Mathematics 2 (1): 404–438. doi:10.1137/090777293. Cartea Á., Donnelly R., and Jaimungal S. 2017. “Algorithmic Trading with Model Uncertainty.” SIAM Journal on Financial Mathematics 8 (1): 635–671. doi:10.1137/16M106282X. Cartea Á., Donnelly R., and Jaimungal S. 2018. “Enhancing Trading Strategies with Order Book Signals.” Applied Mathematical Finance 25 (1): 1–35. doi:10.1080/1350486X.2018.1434009. Cartea Á., Donnelly R., and Jaimungal S. 2018. “Portfolio Liquidation and Ambiguity Aversion.” In High-Performance Computing in Finance, edited by M. Dempster, J. Kanniainen, J. Keane, and E. Vynckier, 77–114. New York: Chapman and Hall/CRC. Cartea Á., Donnelly R., and Jaimungal S. 2020. “Hedging Nontradable Risks with Transaction Costs and Price Impact.” Mathematical Finance 30 (3): 833–868. doi:10.1111/ma.v30.3 fi . Cartea Á., Drissi F., and Monga M. 2022. “Decentralised Finance and Automated Market Making: Predictable Loss and Optimal Liquidity Provision.” Available at SSRN 4273989. Cartea Á.,GanL.,andJaimungalS. 2019. “Trading Co-Integrated Assets with Price Impact.” Mathematical Finance 29 (2): 542–567. doi:10.1111/ma.2019.29.is fi sue-2 . Cartea Á., and Jaimungal S. 2015. “Optimal Execution with Limit and Market Orders.” Quantitative Finance 15 (8): 1279–1291. doi:10.1080/14697688.2015.1032543. Cartea Á., and Jaimungal S. 2015. “Risk Metrics and Fine Tuning of High-Frequency Trading Strategies.” Mathematical Finance 25 (3): 576–611. doi:10.1111/ma.2015.25.is fi sue-3 . Cartea Á., and Jaimungal S. 2016. “A Closed-Form Execution Strategy to Target Vol- ume Weighted Average Price.” SIAM Journal on Financial Mathematics 7 (1): 760–785. doi:10.1137/16M1058406. Cartea Á., and Jaimungal S. 2016. “Incorporating Order-Flow into Optimal Execution.” Mathematics and Financial Economics 10 (3): 339–364. doi:10.1007/s11579-016-0162-z. Cartea Á., Jaimungal S., and Penalva J. 2015. Algorithmic and High-Frequency Trading. Cambridge: Cambridge University Press. Cartea Á.,Jaimungal S.,andRicci J. 2014. “Buy Low, Sell High: A High Frequency Trading Perspective.” SIAM Journal on Financial Mathematics 5 (1): 415–444. doi:10.1137/130911196. Cartea A., and Sánchez-Betancourt L. 2021. “Optimal Execution with Stochastic Delay.” Available at SSRN 3812324. Cartea Á., and Sánchez-Betancourt L. 2021. “The Shadow Price of Latency: Improving Intraday Fill Ratios in Foreign Exchange Markets.” SIAM Journal on Financial Mathematics 12 (1): 254–294. doi:10.1137/19M1258888. Casgrain P., and Jaimungal S. 2019. “Trading Algorithms with Learning in Latent Alpha Models.” Mathematical Finance 29 (3): 735–772. doi:10.1111/ma.2019.29.is fi sue-3 . 210 R. DONNELLY Casgrain P., and Jaimungal S. 2020. “Mean-field Games with Differing Beliefs for Algorithmic Trading.” Mathematical Finance 30 (3): 995–1034. doi:10.1111/ma.v30.3 fi . Chan P., and Sircar R. 2015. “Optimal Trading with Predictable Return and Stochastic Volatility.” Available at SSRN 2623747. Chen Y., Horst U., and Tran H. H. 2019. “Portfolio Liquidation Under Transient Price Impact–Theoretical Solution and Implementation with 100 Nasdaq Stocks.” Available at SSRN Cheridito P., and Sepin T. 2014. “Optimal Trade Execution Under Stochastic Volatility and Liquid- ity.” Applied Mathematical Finance 21 (4): 342–362. doi:10.1080/1350486X.2014.881005. Colaneri K.,EksiZ.,FreyR.,and Szölgyenyi M. 2020. “Optimal Liquidation Under Partial Infor- mation with Price Impact.” Stochastic Processes and Their Applications 130 (4): 1913–1946. doi:10.1016/j.spa.2019.06.004. Cont R. 2011. “Statistical Modeling of High-Frequency Financial Data.” IEEE Signal Processing Magazine 28 (5): 16–25. doi:10.1109/MSP.2011.941548. ContR.,Kukanov A.,andStoikovS. 2014.“ThePriceImpact of OrderBookEvents.”Journal of Financial Econometrics 12 (1): 47–88. doi:10.1093/jjfinec/nbt003 . Donnelly R., and Lorig M. 2020. “Optimal Trading with Differing Trade Signals.” Applied Mathe- matical Finance 27 (4): 317–344. doi:10.1080/1350486X.2020.1847672. Ekren I., and Muhle-Karbe J. 2019. “Portfolio Choice with Small Temporary and Transient Price Impact.” Mathematical Finance 29 (4): 1066–1115. doi:10.1111/ma.v29.4 fi . Ekren I., and Nadtochiy S. 2022. “Utility-Based Pricing and Hedging of Contingent Claims in Almgren-Chriss Model with Temporary Price Impact.” Mathematical Finance 32 (1): 172–225. doi:10.1111/ma.v32.1 fi . Fleming W. H., and Soner H. M. 2006. Controlled Markov Processes and Viscosity Solutions. Vol. 25. New York: Springer Science & Business Media. Forde M., Sánchez-Betancourt L., and Smith B. 2021. “Optimal Trade Execution for Gaussian Signals with Power-Law Resilience.” Quantitative Finance 22 (3): 585–596. doi:10.1080/14697688.2021. Fouque J.-P., Jaimungal S., and Saporito Y. F. 2022.“OptimalTrading withSignalsandStochastic Price Impact.” SIAM Journal on Financial Mathematics 13 (3): 944–968. doi:10.1137/21M139 Frei C., and Westray N. 2015. “Optimal Execution of a VWAP Order: A Stochastic Control Approach.” Mathematical Finance 25 (3): 612–639. doi:10.1111/ma.2015.25.is fi sue-3 . Fruth A., Schöneborn T., and Urusov M. 2019. “Optimal Trade Execution in Order Books with Stochastic Liquidity.” Mathematical Finance 29 (2): 507–541. doi:10.1111/ma.2019.29.is fi sue-2 . FuG.,GraeweP.,HorstU., andPopierA. 2021. “A Mean Field Game of Optimal Portfolio Liquida- tion.” Mathematics of Operations Research 46 (4): 1250–1281. doi:10.1287/moor.2020.1094. Gatheral J. 2010. “No-Dynamic-Arbitrage and Market Impact.” Quantitative Finance 10 (7): 749–759. doi:10.1080/14697680903373692. Gatheral J., and Schied A. 2011. “Optimal Trade Execution Under Geometric Brownian Motion in the Almgren and Chriss Framework.” International Journal of Theoretical and Applied Finance 14 (3): 353–368. doi:10.1142/S0219024911006577. Gatheral J., and Schied A. 2013. “Dynamical Models of Market Impact and Algorithms for Order Execution.” In Handbook on Systemic Risk, edited by Fouque, Jean-Pierre, and Joseph A. Langsam, 579–599. Cambridge: Cambridge University Press. Graewe P., and Horst U. 2017. “Optimal Trade Execution with Instantaneous Price Impact and Stochastic Resilience.” SIAM Journal on Control and Optimization 55 (6): 3707–3725. doi:10.1137/16M1105463. Guéant O. 2016. The Financial Mathematics of Market Liquidity: From Optimal Execution to Market Making. Vol. 33. New York: CRC Press. Guéant O.,and LehalleC.-A. 2015. “General Intensity Shapes in Optimal Liquidation.” Mathemat- ical Finance 25 (3): 457–495. doi:10.1111/ma.2015.25.is fi sue-3 . Guéant O., Lehalle C.-A., and Fernandez-Tapia J. 2012. “Optimal Portfolio Liquidation with Limit Orders.” SIAM Journal on Financial Mathematics 3 (1): 740–764. doi:10.1137/110850475. APPLIED MATHEMATICAL FINANCE 211 Guéant O., Lehalle C.-A., and Fernandez-Tapia J. 2013.“DealingwiththeInventoryRisk: ASolu- tion to the Market Making Problem.” Mathematics and Financial Economics 7 (4): 477–507. doi:10.1007/s11579-012-0087-0. Guéant O., and Pu J. 2017. “Option Pricing and Hedging with Execution Costs and Market Impact.” Mathematical Finance 27 (3): 803–831. doi:10.1111/ma.2017.27.is fi sue-3 . Guéant O., and Royer G. 2014. “VWAP Execution and Guaranteed VWAP.” SIAM Journal on Financial Mathematics 5 (1): 445–471. doi:10.1137/130924676. Guilbaud F., and Pham H. 2013. “Optimal High-Frequency Trading with Limit and Market Orders.” Quantitative Finance 13 (1): 79–94. doi:10.1080/14697688.2012.708779. Guo X., and Zervos M. 2015. “Optimal Execution with Multiplicative Price Impact.” SIAM Journal on Financial Mathematics 6 (1): 281–306. doi:10.1137/120894622. Horst U., and Xia X. 2019. “Multi-Dimensional Optimal Trade Execution Under Stochastic Resilience.” Finance and Stochastics 23 (4): 889–923. doi:10.1007/s00780-019-00394-3. Horst U., Xia X., and Zhou C. 2022. “Portfolio Liquidation Under Factor Uncertainty.” The Annals of Applied Probability 32 (1): 80–123. doi:10.1214/21-AAP1672. Huang X., Jaimungal S., and Nourian M. 2019. “Mean-Field Game Strategies for Optimal Execution.” Applied Mathematical Finance 26 (2): 153–185. doi:10.1080/1350486X.2019.1603183. Ishitani K., and Kato T. 2014. “Optimal Execution for Uncertain Market Impact: Derivation and Characterization of a Continuous-Time Value Function.” In 2012 Recent Advances in Financial Engineering: Proceedings of the International Workshop on Finance 2012, edited by A. Takahashi, Y. Muromachi, and T. Shibata, 93–116. Tokyo: World Scientific. Konishi H. 2002. “Optimal Slice of a VWAP Trade.” Journal of Financial Markets 5 (2): 197–221. doi:10.1016/S1386-4181(01)00023-4. LehalleC.-A.,andMounjidO. 2017. “Limit Order Strategic Placement with Adverse Selec- tion Risk and the Role of Latency.” Market Microstructure and Liquidity 3 (1): 1750009. doi:10.1142/S2382626617500095. Lehalle C.-A., and Neuman E. 2019. “Incorporating Signals into Optimal Trading.” Finance and Stochastics 23 (2): 275–311. doi:10.1007/s00780-019-00382-7. Mitchell D., Białkowski J., and Tompaidis S. 2020. “Volume-Weighted Average Price Tracking: A Theoretical and Empirical Study.” IISE Transactions 52 (8): 864–889. doi:10.1080/24725854.2019. Moallemi C. C., and Sağlam M. 2013. “Or Forum–The Cost of Latency in High-Frequency Trading.” Operations Research 61 (5): 1070–1086. doi:10.1287/opre.2013.1165. Neuman E., and Schied A. 2022. “Protecting Pegged Currency Markets from Speculative Investors.” Mathematical Finance 32 (1): 405–420. doi:10.1111/ma.v32.1 fi . Neuman E.,SchiedA.,Weng C.,and XueX. 2020. “A Central Bank Strategy for Defending a Currency Peg.” Systems & Control Letters 144: Article ID 104761. doi:10.1016/j.sysconle.2020. Neuman E., and Voß M. 2021. Trading with the Crowd. arXiv preprint arXiv:2106.09267. Neuman E., and Voß M. 2022. “Optimal Signal-Adaptive Trading with Temporary and Transient Price Impact.” SIAM Journal on Financial Mathematics 13 (2): 551–575. doi:10.1137/20M137 NyströmK., AlyS.M.Ould, andZhangC. 2014. “Market Making and Portfolio Liquidation Under Uncertainty.” International Journal of Theoretical and Applied Finance 17 (5): Article ID 1450034. doi:10.1142/S0219024914500344. Obizhaeva A. A., and Wang J. 2013. “Optimal Trading Strategy and Supply/Demand Dynamics.” Journal of Financial Markets 16 (1): 1–32. doi:10.1016/j.finmar.2012.09.001 . Pham H. 2009. Continuous-Time Stochastic Control and Optimization with Financial Applications. Vol. 61. Berlin: Springer Science & Business Media. Predoiu S., Shaikhet G., and Shreve S. 2011. “Optimal Execution in a General One-Sided Limit- Order Book.” SIAM Journal on Financial Mathematics 2 (1): 183–212. doi:10.1137/10078534X. Ranaldo A. 2004. “Order Aggressiveness in Limit Order Book Markets.” Journal of Financial Markets 7 (1): 53–74. doi:10.1016/S1386-4181(02)00069-1. 212 R. DONNELLY Schied A. 2013. “Robust Strategies for Optimal Order Execution in the Almgren–Chriss Frame- work.” Applied Mathematical Finance 20 (3): 264–286. doi:10.1080/1350486X.2012.683963. Schied A., and Zhang T. 2019. “A Market Impact Game Under Transient Price Impact.” Mathematics of Operations Research 44 (1): 102–121. doi:10.1287/moor.2017.0916. Sirignano J. A. 2019. “Deep Learning for Limit Order Books.” Quantitative Finance 19 (4): 549–570. doi:10.1080/14697688.2018.1546053. Strehle E. 2017. “Optimal Execution in a Multiplayer Model of Transient Price Impact.” Market Microstructure and Liquidity 3 (03n04): Article ID 1850007. doi:10.1142/S2382626618500077. Touzi N., and Tourin A. 2013. Optimal Stochastic Control, Stochastic Target Problems, and Back- ward SDE. Vol. 29. New York: Springer. Voß M. 2022. “A Two-Player Portfolio Tracking Game.” Mathematics and Financial Economics 16 (4): 779–809. doi:10.1007/s11579-022-00324-6. Yong J., and Zhou X. Y. 1999. Stochastic Controls: Hamiltonian Systems and HJB Equations. Vol. 43. New York: Springer Science & Business Media. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Applied Mathematical Finance Taylor & Francis

Optimal Execution: A Review

Applied Mathematical Finance , Volume 29 (3): 32 – May 4, 2022

Optimal Execution: A Review

Abstract

This review article is intended to collect and summarize many of the results in the field of optimal execution over the last twenty years. In doing so, we describe the general workings of the limit order book so that the sources of costs and risks which need to be optimized are understood. The initial models considered propose simple dynamics for prices which allow easily computable strategies which maximize risk-adjusted profits. Subsequently, the review is divided into two major parts. The...
Loading next page...
 
/lp/taylor-francis/optimal-execution-a-review-VRdmwzF6x0
Publisher
Taylor & Francis
Copyright
© 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group
ISSN
1466-4313
eISSN
1350-486X
DOI
10.1080/1350486X.2022.2161588
Publisher site
See Article on Publisher Site

Abstract

APPLIED MATHEMATICAL FINANCE 2022, VOL. 29, NO. 3, 181–212 https://doi.org/10.1080/1350486X.2022.2161588 Ryan Donnelly Department of Mathematics, King’s College London, London, UK ABSTRACT ARTICLE HISTORY Received 16 December 2022 This review article is intended to collect and summarize many of the Accepted 16 December 2022 results in the field of optimal execution over the last twenty years. In doing so, we describe the general workings of the limit order KEYWORDS book so that the sources of costs and risks which need to be opti- Algorithmic trading; price mized are understood. The initial models considered propose simple impact; stochastic dynamics for prices which allow easily computable strategies which optimization maximize risk-adjusted profits. Subsequently, the review is divided into two major parts. The first explores several works which investi- gate how optimal liquidation strategies are modified to account for more complex dynamics, namely other stochastic or non-linear fac- tors. The second presents optimal trading strategies when the agent utilizes benchmarks in addition to risk-adjusted wealth, or when she has objectives beyond optimal liquidation. 1. Introduction In the last few decades, the actions taken by participants in n fi ancial markets have become increasingly based on quantitative analysis and algorithms rather than any human deci- sion making process. The motivation for this shift comes from attempting to minimize costs associated with trading, or to minimize risks involved in holding financial assets, or both. Establishing a mathematical framework that models the setting of trading in a na fi n- cial market allows one to quantify the trade-off between minimizing costs and minimizing risks, thereby tailoring a trading strategy to one’s preferences optimally. In this review arti- cle, we provide an overview of many developments and results related to optimal execution of trades. We begin with a description of the limit order book (LOB), and then proceed with a summary of some preliminary models in discrete time, in particular those of Bertsimas and Lo (1998) and R. Almgren and Chriss (2001). These pieces will provide a groundwork for thetypeofproblemsconsideredinthisreview, butmostofthefocuswillbeonanalogous models and results in continuous time. After this groundwork is established, we review developments which take more realistic or generalized underlying dynamics, different or more general performance criteria, and other aspects of financial trading and optimization. The primary mathematical tools that are used to solve many of the problems contained herein are the Dynamic Programming Principle (DPP) and the Hamilton–Jacobi–Bellman CONTACT Ryan Donnelly ryan.f.donnelly@kcl.ac.uk Department of Mathematics, King’s College London, Strand,LondonWC2R2LS,UK © 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons. org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. 182 R. DONNELLY (HJB) equation, although sometimes other approaches such as variational techniques are necessary. The full solutions to optimizations are omitted, but the original sources of the results or standard references on stochastic optimal control should provide any details whichareskipped(seeforexampleYongand Zhou 1999; Fleming and Soner 2006). We focus on modelling and optimization results of a theoretical nature, but many of the primary sources that are cited contain empirical analysis to motivate modelling choices. Some of these motivational empirical results will be provided when warranted, but the discussion about these results and their interpretation will be limited. 2. The Limit Order Book This section describes the general mechanisms of the LOB. An understanding of how trans- actions take place within LOBs motivates some of the modelling choices in the next section. Many of thesemodelling choicesmay stillbeappropriatein othermarketsettingsthatdo not employ a LOB, nevertheless a specific context may help with interpretation of results that we present later. Most electronic exchanges allow participants to submit several different types of orders, but these can be categorized broadly as either passive limit orders (LOs) or aggressive market orders (MOs), so for clarity we describe a simplified form of LOB mechanisms containing only the most basic of these two types of orders. LOs consist of a price, a volume, and an indicator to either buy or sell the asset. After an LO issubmitted,it sitsin theLOBuntilitiseithercancelled orfilled againstanMOata later time. The cancellation or filling of an LO can be either full or partial, meaning either the entire volume of the order is removed from the LOB or some amount of the original volume remains in the LOB. If a subsequent LO is placed with the same price, then both of these orders will exist simultaneously in the LOB at this price, and the order of execution of the orders depends on the matching rules of the exchange. The matching rules we describe below correspond to a price-time priority of orders, meaning the LOs form a queue at each price and the LOs to be filled are determined according to their position in this queue, known as the First In First Out (FIFO) rule. When several LOs of different types (buys and sells) are sitting in the LOB, there are two price levels which correspond to the highest price across all buy LOs and the lowest price across all sell LOs, referred to as the best bid andbestask,respectively.Theaverage of thebestbid andbestaskis referred toasthe midprice of the asset. Figure 1 shows a graphical representation of the state of the LOB at a single point in time, followed by the state of the LOB after a single LO is placed. An MO consists of a volume and indicator to either buy or sell the asset. When an MO is submitted it is matched against resting LOs of the opposite buy/sell type, and the price of the transaction is determined by the price level of all filled LOs up to the volume specified by the MO. In Figure 2 we show a graphical representation of an MO filling several LOs to illustrate how the volume of the MO and the volumes and prices of the LOs determine the overall transaction price of this trade. With this structure of the LOB in mind, it is clear that the midprice of the asset may change throughtimeasLOsareplacedand cancellednearthebest bidand ask.Addition- ally, the eeff ct that MOs have on prices can be interpreted in multiple ways. We generally divide the impact of MOs into temporary and permanent, although they are not fully dis- tinctand theremay beotherimpactphenomena whichdon’tfall into either category. APPLIED MATHEMATICAL FINANCE 183 Figure 1. Blue bars represent buy LOs, red bars represent sell LOs. Left panel: the highest price of all buy LOs and the lowest price of all sell LOs are the best bid and ask, respectively. Middle panel: a new LO is placed at a price where orders already exist at the back of the queue. Right panel: the volume at this price is increased. Analogous changes would result from an order cancellation, and thus the state of the LOB may change through time, even if no trades are taking place. Figure 2. Left panel: the LOB immediately before an MO is submitted. Middle panel: a sell MO is sub- mitted with volume greater than that of the best bid, so it must ‘walk the book’ into the next best price. Right panel: the MO has filled LOs in the book, so there is now less volume available at the two relevant prices. Temporary price impact refers to the relationship between the volume of the MO and the average execution price of the trade. For larger MOs, multiple LOs at different levels are exe- cuted, thustheaveragepriceofthetradeisafunction ofvolume.Permanent priceimpact refers to the relationship between the volume of the MO and the midprice process at future times after the execution. For larger MOs, more LOs are removed from the LOB resulting in a greater change of the midprice which will persist into the future, aeff cting the price of subsequent trades. The LOB structure discussed above is an idealization of real markets where there are several types of orders that can be submitted. Usually, order types can be loosely grouped by whether they add or remove liquidity from the market, and we think of them as LOs or MOs, respectively, in our idealized framework. These other order types include stop-loss, stop-limit, immediate-or-cancel, fill-or-kill, and all-or-none. For extensive analysis on LOB dynamics, including and beyond price-impact effects, seeBiais,Hillion,andSpatt(1995), Ranaldo (2004), R. Almgren et al. (2005), Cont (2011), Cont, Kukanov, and Stoikov (2014), Cartea,Jaimungal,and Penalva(2015), Lehalle and Mounjid (2017), Bechler and Ludkovski (2017), Cartea, Donnelly, and Jaimungal (2018a), and Sirignano (2019). For more in depth outline of the operation of LOBs in n fi ancial markets and of advanced order types, see Cartea, Jaimungal, and Penalva (2015)and Guéant (2016). 184 R. DONNELLY Figure 3. Left panel: the LOB as volume density. Middle panel: a sell MO is submitted with volume equal to the area of the yellow region. Right panel: the MO has filled LOs in the book, and both buy and sell LOs are shortly thereafter placed to establish a new impacted midprice. 3. Modeling Trades, Price Impact, and Objectives 3.1. Discrete-Time Model The discrete time model that we present here is adapted from the models rfi st established in Bertsimas and Lo (1998) and R. Almgren and Chriss (2001), and which have consistently formed the basis for most models in discrete time optimal execution since then. While Bertsimas and Lo (1998) takes into account additional considerations such as informed price changes and more realistic dynamics, the initial model they propose to get intuition aboutoptimal executionisequivalenttowhatwegivebelow. We make a conceptual change to the description of the LOB in the previous section to motivate many of the assumptions made in this model. The main change we make is that we consider the LOB to be based on a continuum of prices rather than a discrete set, andtheLOsformavolumedensity,rather than acollectionofindividualqueuesateach price. Thus, our graphical representation including the arrival of an MO changes to that of Figure 3. Note that in the right-hand panel of Figure 3, the new midprice is located between (not necessarilyatthe midpoint)theoldmidpriceand thelowestexecutedpriceoftheMOfrom the middle panel. We assume that immediately after the MO, there is a u fl rry of activity in the LOB which includes many new LO placements to establish this new midprice. This activity could potentially also include LO cancellations resulting in a new midprice beyond thelowestexecutedprice.Inaddition,thereisnogapleftinthe LOBand indeedtheoverall shape is merely a translation of the original shape. This will be reflected in the model below through the fact that our impact functions depend only on the size of the MO. We work on a ni fi te time horizon equal to N and index time by n ∈{0, ... , N}.Let Q ={Q } be the agent’s inventory process at each point in time. Denote by V the size n n n=0 of theagent’ssubmittedMOattime n,sowehavetherelation Q = Q + V.(1) n+1 n n Let S ={S } be the midprice observable in the LOB at each time. In specifying the n=0 dynamics of this process, we wish to include a component contributed by the MOs sub- mitted by the agent, as well as a component which comprises the actions of all other agents participating in the market. To this end, let  ={ } be a sequence of independent ran- n=1 dom variables with mean zero, and let f be an increasing function with f (0) = 0. Then we APPLIED MATHEMATICAL FINANCE 185 set S = S + f (V ) + .(2) n+1 n n n+1 Thus, if the agent does not trade (V = 0), the midprice changes randomly by  from n n+1 the perspective of the agent, where this change is due to the actions of other agents. If the agent submits a buy order (V > 0)thenthenextmidpriceisstillrandom,butwilltend to be greater than if there were no trade, and larger buy orders will have a larger effect on increasing this future midprice. Similar eect ff s hold as a result of the agent submitting a sell order, and so the function f captures the permanent price impact eeff cts of the agent’s trades. Let X ={X } betheamountofcashheldbytheagentthrough time.Inspecifying n=0 the dynamics of X we include a term which represents a temporary price impact eeff ct as discussed in Figure 2.Let g be an increasing function with g(0) = 0and set X = X − S V,(3) n+1 n n n S = S + g(V ).(4) n n n The quantity S represents the average price per share of the agent’s executed MO, given that the volume of the MO is V . A larger MO volume will result in executions farther from the midprice with buy orders executing at higher prices and sell orders at lower prices, and so the function g captures temporary price impact as desired. With an average execution price of S and a total volume of V , the change in the agent’s cash account is −S V ,hence n n n n the relation in (3). It is straightforward to show that if the shape of the LOB is given by a curve as in Figure 3, then the function g is uniquely determined by this shape. 3.2. Continuous-Time Model Although many of the problems we summarize in this survey can be formulated and solved in discrete-time, most recent work is conducted in continuous-time. Here we replace Equa- tions (1) to (4) with analogous continuous time dynamics. Equivalently, this could be done by considering the limiting behaviour of the discrete time model as the number of time steps N →∞ (the functions f and g,and thedistributionsof  wouldalsohavetobe scaled appropriately, see R. Almgren and Chriss 2001). We work on a complete filtered probability space (,F, {F } ,P) over the t 0≤t≤T finite time interval [0, T]. The inventory, midprice, and cash processes are now repre- ν ν ν ν ν sented by their continuous-time counterparts Q ={Q } , S ={S } ,and X = 0≤t≤T 0≤t≤T t t {X } , and we replace their discrete time dynamics with 0≤t≤T dQ = ν dt,(5) dS = f (ν )dt + dM,(6) t t ν ν dX =−S ν dt,(7) t t ν ν S = S + g(ν ),(8) t t where M ={M } isamartingalewithrespect tothefiltration {F } .Withthese t 0≤t≤T t 0≤t≤T dynamics the trading strategy ν ={ν } represents the speed of trading, not the t 0≤t≤T volume ofasequenceoftradesasinSection 3.1. 186 R. DONNELLY The form of Equations (5) to (8) does not constrain the midprice to be necessarily pos- itive. While this is usually a desirable feature in a model, the time scales for which typical optimal executions are implemented are short enough that the probability of observing large price deviations into negative values is very small. If the time scale is long enough to warrant a positive price restriction, then geometric dynamics may be more appropriate and realistic. See Gatheral and Schied (2011), Becherer, Bilarev, and Frentrup (2018), and Colaneri et al. (2020) for analysis of optimal execution problems with geometric dynamics for the price process. In addition, neither the midprice nor inventory will be constrained to lie on a grid as they would be in reality. This assumption will have little consequence if the typicalinventory andmidpricethatweconsiderarelarge.If therelevantinventory levels are typically close to zero for a particular execution, a properly discretized model would be more appropriate over the continuous rate of trading implied by (5). 3.3. Performance Criterion Nowthatwehavespeciefi dthedynamics ofpricechangesandtheagent’sinventoryand wealth, we must propose a goal for the agent so that she may seek an optimal trading strat- egy. One of the most common types of goals we have in mind is an agent that holds a block of shares and wishes to liquidate them before time T. In doing so, she will have to consider theeeff ct that hertradeswillhaveontheimmediatetransaction priceduetotemporary price impact, as well as future prices due to permanent price impact. Additionally, we may impose a penalty on the agent if she does not fulfil her goal of full liquidation by time T. Thus, we specify that the agent attempts to maximize her expected terminal wealth at time T subject to a liquidation penalty. In Bertsimas and Lo (1998), the authors frame the liqui- dation problem in terms of minimizing execution costs. While the objective function in (9) below is framed in terms of maximizing wealth, it is equivalent to minimizing costs. Let the initial cash holdings, inventory, and midprice be x , q ,and S respectively. 0 0 0 Given a trading strategy ν ={ν } which is sufficiently integrable and adapted to the t 0≤t≤T filtration {F } , and an increasing function  with (0) = 0, we define the agent’s t 0≤t≤T performance criterion as ν ν ν ν ν H (x , q , S ) = E X + Q (S − (Q )).(9) 0 0 0 T T T T The terminal wealth in (9) consists of the agent’s accumulated cash, X ,aswellasthevalue of the remaining Q sharesoftheassetwhichmustbeliquidatedatthe prevailingmidprice ν ν S with an associated penalty (Q ).Thisliquidationpenaltyisakintothetemporaryprice T T impact effect which arises from walking the book as discussed previously. Although we are taking the perspective of an agent liquidating her holdings, this frame- workcanalsobeapplied to an agentwishing to acquireablockofshares. This canbe done by setting her initial inventory holdings equal to Q =−N,where N is the number of shares she is trying to acquire by time T. This is based on the realization that liquidating anegativepositionrequirestheagenttopurchasethesharessothatherterminalposition is zero. With this performance criterion in mind, den fi e the agent’s value function H(x , q , S ) = sup H (x , q , S ), (10) 0 0 0 0 0 0 ν∈A APPLIED MATHEMATICAL FINANCE 187 wherethesupremum istakenoveranadmissibleset A of sufficiently integrable {F }- adapted processes. We would like to determine a specicfi control ν such that the agent’s performance function is maximized over all possible controls. That is, we desire a trading speed ν such that H(x , q , S ) = H (x , q , S ). (11) 0 0 0 0 0 0 One common method to solve such an optimization problem is to introduce the dynamic value function (with a slight abuse of notation) ν ν ν ν H(t, X , Q , S ) = sup E X + Q (S − (Q )) F , (12) t t t t T T T T ν∈A and then resort to methods of dynamic programming to solve for both H and ν .For more detailsonthesemethods, seePham(2009), Björk (2009), Touzi and Tourin (2013), and Cartea, Jaimungal, and Penalva (2015). In order to make these methods more tractable and allow for easier analysis, we assume for now that all of the relevant price impact functions are linear in the speed of trading. That is f (ν) = bν, g(ν) = kν, (ν) = αν, (13) with b > 0, k > 0, andα> 0. Regarding the visualization in Figure 3,thismakestheimplicit assumption that the shape of the LOB is a flat line. Proposition 3.1: The dynamic value function in (12) is given by H(t, x, q, S) = x + qS + h (t)q , (14) −k(2α − b) b h (t) = − , (15) 2k + (2α − b)(T − t) 2 and the optimal trading strategy in feedback form is −(2α − b) ν (t, q) = q. (16) 2k + (2α − b)(T − t) We should assume 2α> b for otherwise there will be values of T and t which make the expression in (16) undefined. Henceforth, we always assume parameter values are chosen so that the resulting expressions are well den fi ed. If we substitute the liquidation strategy from Proposition 3.1 into Equation (5), we arrive atanordinarydieff rentialequation(ODE)governingthedynamicsoftheagent’sinventory, which can be solved to give 2k + (2α − b)(T − t) Q = q , (17) 2k + (2α − b)T and we see that the resulting inventory process is deterministic, decreasing linearly through time towards a terminal value of 2k Q = q . (18) 2k + (2α − b)T We canensurethatthe agentfully liquidatestheinitial position of sharesby placingan infinitely large penalty on remaining inventory represented by the limit α →∞.Inthis 188 R. DONNELLY case the optimal strategy in (16) and the resulting inventory path in (17) become lim ν (t, q) =−q (19) α→∞ ∗ T − t lim Q = q . (20) α→∞ The trading strategy and resulting inventory in (19) and (20) can be thought of as a bench- mark called time-weighted average price (TWAP). If the trades of this strategy, which occur at a constant speed over the trading interval, are marked to the midprice, S, rather than the execution price, S,thentheaveragepriceofalltrades bytheagentistheaverageof themid- 1 ν price process given by S dt. Many of the trading strategies which appear later can be T 0 thoughtofasincorporatingacorrectiontothe TWAP strategy. While the strategy in (16) is the strategy which minimizes expected costs, there is no consideration of risk involved in the execution. In R. Almgren and Chriss (2001), the authors allow the agent to also account for the variance of her performance in addition to the expectation. This allows the agent to tailor her performance by managing the trade-off between maximizing expected wealth and minimizing variance of wealth. This mean- variance approach is incompatible with dynamic programming methods, but the authors only consider deterministic trading strategies allowing solutions to be found more easily. We incorporate risk considerations with a different approach to the mean-variance analysis, but with the additional assumption that price innovations are Gaussian. Under this assumption the class of optimal trading strategies coincide with R. Almgren and Chriss (2001). To this end, we assume M = σ W in Equation (6) where W isaBrown- ian motion. We also suppose the agent adopts a constant absolute risk-aversion (CARA) utility function so that her dynamic value function is ν ν ν ν H(t, X , Q , S ) = sup E − exp −γ X + Q (S − αQ ) | F , (21) t t t t T T T T ν∈A forsomevalueof risk-aversionparameter γ> 0. Proposition 3.2: The dynamic value function in (21) is given by H(t, x, q, S) =− exp −γ x + qS + h (t)q , (22) − −ω(T−t) + ω(T−t) β ψ e − ψ e b h (t) = − , (23) − −ω(T−t) + ω(T−t) 2 ψ e + ψ e 2 with γσ β = 2kγσ , ω = , ψ = β ± 2α ∓ b, 2k and the optimal trading strategy in feedback form is (2h (t) + b) ν (t, q) = q. (24) 2k APPLIED MATHEMATICAL FINANCE 189 The optimal trading strategy in Proposition 3.2 is equivalent to one which maximizes an alternative performance criteria given by ν ν ν ν 2 ν 2 E X + Q (S − αQ ) − σ (Q ) dt . (25) T T T T t The running quadratic inventory penalty in (25) is a popular choice to incorporate in opti- malexecutionproblemsratherthanusingautilityfunctionasin(21)(seeforexampleGuil- baud and Pham (2013), Barger and Lorig (2019), and Cartea, Gan, and Jaimungal (2019)). However, these two approaches would not be equivalent if the random innovations in price were notgiven byaBrownianmotionorifthere areotherstochastic factorsinthemodel. When there are more factors, this running penalty term can be considered to contribute inventory risk, while the agent remains risk-neutral towards other dynamics (see Cartea, Donnelly, and Jaimungal 2017). Substituting the optimal strategy from Proposition 3.2 into Equation (5) yields again an ODE which can be solved, resulting in the agent’s inventory process through time − −ω(T−t) + ω(T−t) ∗ ψ e + ψ e Q = q . (26) − −ωT + ωT ψ e + ψ e If we wish to consider an agent who must finish the full liquidation by time T,wecan enforce this by taking the limit α →∞ which results in lim ν (t, q) =−ω coth(ω(T − t))q, (27) α→∞ ∗ sinh(ω(T − t)) lim Q = q . (28) α→∞ sinh(ωT) In Figure 4 we show the optimal inventory process for several values of the risk-aversion parameter γ . We note that for larger values of γ , the agent uses a trading strategy which liquidates faster towards the beginning of the time interval. This is because the agent’s riskislargestwhenherinventoryislargest,and to lowerthatriskshedesirestolower her inventory quickly. This comes at an increased cost due to the temporary price impact. When the agent is risk-neutral (and has a resulting inventory path from (17)), she trades at a constant rate through time which is determined only by the trade-osff between tem- porary price impact, permanent price impact, and the liquidation penalty. The optimal strategies of Propositions 3.1 and 3.2 are the result of many simplifying assumptions regard- ing the dynamics of trading in a market as well as the agent’s goals, but they still oer ff a benchmark to which many other strategies may be compared. In the following sections, we explore work that expands upon the previous results by considering more realistic market dynamics, alternate goals of market participants, or the inclusion of other phenomena. 4. Alternative Dynamics We begin our further exploration by considering works which make alternative assump- tions about the dynamics proposed in Equations (5) to (8) in order to model more general or more realistic market settings. 190 R. DONNELLY Figure 4. Optimal inventory process for various values of risk-aversion parameter γ.The curvecor- responding to γ = 0 is from Equation (17), the remaining curves are from Equation (26). Left panel: α = 0.05. Right panel: α = 1. Other parameter values are T = 1, σ = 1, k = 0.01, and b = 0.01. 4.1. Stochastic Temporary and Permanent Price Impact Firstwerevisit oneofour assumptionsonthe natureof priceimpact,both permanentand temporary. In Equations (6) and (8), the relation between the agent’s trading and price dynamics are homogeneous in time – identical actions and different points in time will have identical effects on the asset price. In terms of the graphical representation of the (idealized continuous) LOB, this means that its general shape is always the same, albeit translated horizontally as the midprice changes. In reality, market liquidity varies through time meaning the shape of our graphical representation should also u fl ctuate. We consider in some detail the work Barger and Lorig (2019) which allows price impact effects to be stochastic, replacing price and wealth dynamics with (1) dS = f (b )ν dt + σ dW,db = μ (b )dt + η (b )dB , (29) t t t t b t b t t t ν ν dX =−S ν dt, (30) t t ν ν (2) S = S + g ˆ(k )ν,dk = μ (k )dt + η (k )dB , (31) t t t t t k k t t t (1) (2) where B and B are Brownian motions independent of W,buthave instantaneouscor- relation with each other equal to ρ ∈ [−1, 1]. Furthermore, the functions g, μ ,and η k k should be chosen so that g ˆ(k )> 0 for all t (otherwise temporary price impact would work in the agent’s favour and she could make unbounded prots). fi In this model, price impact depends linearly on trading speed (as was assumed in Propositions 3.1 and 3.2), but the magnitudeofthe eeff ctisdrivenbythe stochastic processes b and k.The graphicalLOB representation is therefore given by a flat line which uc fl tuates through time in the vertical direction. When f (b ) and g ˆ(k ) decrease, indicating a low impact state, the flat line level t t of the LOB moves up and there is high liquidity in the market. The agent’s performance criterion is taken as stated in (25) with a running penalty on inventory, and so as discussed earlier this can be interpreted as the agent being sensitive to the risk associated with midprice changes, but when considering the stochastic nature of price impact she only accounts for the potential benefit and detriment in terms of expected APPLIED MATHEMATICAL FINANCE 191 wealth. These changes to the assumed dynamics complicate the HJB analysis which was used to arrive at the conclusions of our previous propositions, so the authors resort to asymptotic approximations of the dynamic value function and optimal trading speed. The authors show that a zeroth order approximation in this setting can be obtained by the same expression as in (24) by making the substitutions b → f (b) and k → g ˆ(k). This means the agent acts according to the previously examined optimal strategy, but replaces the price impact parameters with the corresponding value of what is observed at any particular time. The rfi st order correction to this strategy does not have a simple expression in general. But if the agent is forced to liquidate her position by time T (α →∞) and if she does not impose the running inventory penalty (γ = 0), then a first order approximation is given by 1 μ (k)g ˆ (k) 1 μ (b)f (b) k b ∗ ∗ ν ˆ (t, q, b, k) =ˆ ν (t, q, b, k) − q − (T − t)q, (32) 1 0 ˆ ˆ 2 g(k) 6 g(k) where the term ν ˆ is understood to be the strategy in (24) along with the substitutions ˆ ˆ b → f (b) and k → g ˆ(k).Notethatatanyfixedtime t with given impact parameters f (b ) and g ˆ(k ), this rst fi order strategy still liquidates the asset at a speed proportional to the remaininginventory.The twocorrectiontermsstem fromthetwodieff renttypesofimpact and can be interpreted in terms of what the agent expects the impact to be in the future, thereby acting accordingly. If we suppose μ (k) and g ˆ (k) are both positive, then in the shortterm,thetemporary priceimpactisexpectedtoincrease, so theagent is incentivized to trade faster now to avoid higher transaction costs. Similarly, if we suppose μ (b) and f (b) are both positive, then permanent price impact is expected to increase, and the agent should trade faster now so that her remaining pool of shares does not have their value negatively aect ff ed. This term depends on the remaining trading time T−t because the effects of permanent price impact as modelled here accumulate over time, and so become insignica fi nt at the end of the trading horizon. Similar arguments hold accordingly if there are changes in sign of μ (k) and g ˆ (k). Both of these terms are scaled by the instanta- neous temporary impact 1/g ˆ(k) because any expected benefits from trading faster must be weighed against the instantaneous costs of acquiring those benetfi s. Numerous other works have also investigated the effects of stochastic liquidity on opti- mal execution. A similar model is proposed in R. Almgren (2012)wherethereisno permanent price impact, but the midprice volatility is allowed to be stochastic rather than constant. This work also recognizes that substituting appropriate parameters into a strat- egy such as (24) with the observed stochastic counterparts at time t (k → k and σ → σ ) t t is a plausible strategy. In general, it is found that the agent trades quickly when price impact is low, volatility is high, or the end of the trading period approaches. Both Cherid- ito and Sepin (2014) and Ackermann, Kruse, and Urusov (2021) take a fully discrete-time approach to this problem, each considering different aspects of the dynamics to be stochas- tic and classifying the optimal trading strategies in terms of recursion equations. Other workswithelementsofstochasticprice impact areIshitaniand Kato(2014), Becherer, Bilarev, and Frentrup (2018), Fruth, Schöneborn, and Urusov (2019), Fouque, Jaimungal, andSaporito(2022), and Cartea, Drissi, and Monga (2022), some of which will be discussed in a subsequent section on resilience and transient price impact. 192 R. DONNELLY 4.2. Stochastic Volatility of Midprice In the models described in Section 3, the only exogenous source of randomness comes from the midprice innovations which were assumed to have constant volatility. In reality, the volatility will change through time due to a variety of factors in the LOB. There is generally increased activity at the beginning and end of the trading day, both in terms of thenumberofLOsbeingplacedand cancelled, andinthenumberofMOssubmitted. Each of these events can change the midprice, so we end up with seasonality of volatility in theLOBoverthecourseoftheday. Additionally,many events in theLOBareself-exciting, in particular there is a large spike in the intensity of events for a short time after MOs, and so spikes in volatility may also be observed around these times. The changing volatility results in different levels of risk for any given inventory level, and accounting for this can help a risk-averse agent improve her performance. We have already made brief mention of R. Almgren (2012) and Cheridito and Sepin (2014) which consider models with stochastic volatility of the midprice. Here we pro- vide some discussion of Chan and Sircar (2015) which gives approximations to the optimal trading strategy in fast or slow variation volatility regimes, which correspond to the season- ality and spike eeff cts described previously. To this end, we replace the midprice dynamics in Equation (6) with dS = σ(Z )dW , (33) t t where σ is a function of the stochastic process Z , which determines the instantaneous volatility of the midprice. Permanent price impact is not considered in Chan and Sir- car (2015), but an additional drift term representing an observable signal is included which we have suppressed in our present discussion. We return to the modelling of observable signals in Section 5.1. When analysing a slow volatility factor, the dynamics of Z are taken as dZ = μ (Z )dt + η (Z )dB , (34) Z Z t t t t where B isastandardBrownianmotionindependentof W, whereas the fast volatility factor uses dynamics 1 1 dZ = μ (Z )dt + η (Z )dB . (35) Z Z t t t t In each case  will act as a parameter of asymptotic expansion which allows for approx- imations of the optimal strategy when the value of the parameter is small. Chan and Sircar (2015) consider multiscale stochastic volatility with slow and fast factors simultane- ously, but we only discuss the resulting effects individually. Additionally, they have assumed an inn fi itely long trading horizon with a performance criterion of the form 2  ν 2 2 E − σ (Z )(Q ) − kν dt , (36) t t t which results in a trading strategy that does not explicitly depend on time and brings more focus to the effects of stochastic volatility. Otherwise, the general behaviour of the optimal strategy is largely unchanged from what would arise in a ni fi te horizon. As may be expected APPLIED MATHEMATICAL FINANCE 193 after the discussion in Section 4.1, the approximations are largely based on the strategy which maximizes (36) when volatility is assumed constant. This can be shown to be γσ ∗ 2 ν (q; σ ) =− q, 2k which is equivalent to (24) in the limit T →∞. Under slow scale volatility, the approximation of the optimal strategy is found through perturbation analysis of the HJB equation. The resulting strategy in feedback form is γσ (z) ∗ ∗ 2 ν ˆ (q, z) = ν (q; σ (z)) =− q, 2k which we note involves a similar substitution to previously discussed approximations, where parameters are replaced with their currently observed values through time. We also seethatthisapproximationdoesnotincludeanexplicitdependenceonthe parameter . The smallest order corrections considered in Chan and Sircar (2015)involvesastochas- tic drift correlated to the slow scale volatility. In this case the appropriate correction is proportional to  as well as the correlation between the drift and volatility. Thefastscale volatilityis treatedwithsingularperturbationanalysistoarriveatan approximate optimal control, with resulting approximation given by γ q ∗ ∗ 2 2  2 ν ˆ (q, z) = ν (q; σ ) − E σ (Z ) − σ | Z = z du, (37) 0 u t 2k where σ is the long-term expected variance. This expression has contributions from two terms which represent a zeroth order approximation and first order correction. The 2 2 zeroth order approximation makes a familiar parameter substitution σ → σ ,but the rfi st order correction involves future expected deviations of the variance from its long-term expected value. If future variance is expected to be larger than the long-term expected vari- ance, then the agent desires faster liquidation of her shares to avoid the associated inventory risk. Other work of interest is Colaneri et al. (2020) where the authors assume that innova- tions in asset price are driven by a regime switching pure jump process of the form dS = S −dR , (38) t t t where R has compensating random measure η(t, Y −, ν −;dz)dt with Y afinite state t t Markov chain. Whilethisisnotstrictlyastochasticvolatilitymodel becauseitlacksadiffu- sive term, the price innovations dR being driven by a Markov chain means that the price process will share some qualitative features of a stochastic volatility model. The depen- dence of the compensating measure on the trading strategy ν also captures permanent price impact eeff cts. In this case, the more complicated nature of the model leads to numerical methods for computing the optimal strategy. 4.3. Resilience and Transient Impact Thusfarwehaveonlydescribed twotypes of priceimpacteeff cts(permanentandtem- porary) which are included in the models of many algorithmic trading works. But when 194 R. DONNELLY Figure 5. Ask side of the LOB showing resilience and transient price impact after a trade. The left panel shows minimal spread (best ask price coincides with reference price). In the middle panel, an MO walks the book and exhibits temporary price impact. Immediately after the MO in the right panel, the orders are filled, a new reference price is established due to permanent impact, and the new best ask price will gradually decay back to the new reference price. A subsequent order submitted immediately will be subjected to higher prices than an order submitted after waiting some time. discussing the nature of price changes after the occurrence of a transaction, there are other dynamicaleeff ctsatplay.In particular,whenmultipleLOs arefilled in theLOB dueto a transaction, the resulting gap in the LOB is not replenished instantaneously as Figure 3 suggests.Instead ittakestimeforLOstobesubmittedin placeofthose thatwere filled.The rate at which these orders are replenished is called resilience, and thus Figure 3 suggests a market with infinite resilience. In Figure 5 we show (from the ask side of the book) how resilience might be interpreted in an idealized continuous LOB. After a transaction takes place (filling sell orders in the book), if no further transactions were to occur then these sell orders may gradually be replenished until a new stable best ask price is established. However, if a subsequent trans- action were performed after only a short time, then it would be subjected to higher prices thanifit hadoccurredafteralongtime. This dieff renceintransaction pricewhich depends on the amount of time between trades is called transient price impact. We focus on describing Chen, Horst, and Tran (2019)inmoredetail, as theframework is most similar to what has been presented here so far. A general formulation of transient price impact can be incorporated into the dynamics by introducing an impact function and decay kernel, denoted by a strictly increasing function h and a non-increasing function G, respectively, and defining the inventory, midprice, and cash processes by Q = Q + ν du, (39) 0 u t t S = S + bν du + h(ν )G(|t − u|)du + σ W , (40) 0 u u t 0 0 ν ν X = X − S ν du, (41) 0 u t u ν ν S = S + kν . (42) t t Here we have taken the permanent and temporary price impacts to be linear with propor- tionality constants b and k.Thefunction h determines the magnitude of transient price impact and the decay kernel G controls how quickly the impact of trading decays through time.InChen,Horst, andTran(2019) the transient impact is linear with h(ν) = λν and APPLIED MATHEMATICAL FINANCE 195 the decay is exponential with G(|t − u|) = exp(−ρ|t − u|). Thus, the parameter λ deter- mines the distance from the new reference midprice to the best ask in the right panel of Figure 5, and the distance between the best ask and the new reference midprice decays exponentially. See Gatheral (2010) for some detailed discussion of other impact functions and decay kernels, and consequences of different models with respect to statistical arbitrage opportunities. The agent is risk-neutral, but with the additional constraint that the inventory process must satisfy Q = 0. Thus, the performance criterion reduces to ν ν H (x , q , S ) = E[X ], (43) 0 0 0 andtheagentattemptstomaximizethisovertradingspeeds {ν } which fully liquidate t t∈[0,T] the inventory by time T. Recall that this type of constraint can be applied formally to the strategy given in Proposition 3.1 by taking α →∞, and it is to the trading strategies in this limit that we shall compare the optimal strategy with transient impact. Other works mentioned at the end of this section allow for strategies that do not fully liquidate the position by incorporating a liquidation penalty. Proposition 4.1: The trading strategy which maximizes (43) is given by φT φT T 2 2 φ (φ k − λρ) ρ cosh + φ sinh + λρ cosh φ t − 2 2 2 ν =− q , φT φT 2 2 4 φρT cosh (φ k − λρ) + sinh (λρ(2ρ − φ T) + φ kT) 2 2 (44) which results in the inventory process A + Bt + C sinh φ t − ν 2 Q = q − q , (45) 0 0 2A + BT where φ = ρ ρ + , (46) λ φT A = sinh , (47) k 2 φT φT B = φρ cosh + φ sinh , (48) 2 2 C = . (49) In Figure 6 we show the optimal trading speed and resulting inventory for various values of the transient impact parameter λ (top row) and decay rate ρ (bottom row). When λ = 0 the optimal rate of trading is constant at q /T which is the TWAP strategy in (19). This is expected because λ = 0 corresponds to the same dynamics given in Equations (5) to (8). When ρ = 0wealsoobtainthe TWAP strategy,which isconsistentwith intuitionbecause if there is no resilience, the transient price impact becomes a permanent price impact, so 196 R. DONNELLY Figure 6. Optimal trading speed and inventory process with transient price impact for various values of transient impact parameter λ and decay coefficient ρ. Top row panels: other parameter values are k = 0.01, b = 0.01, ρ = 1. Bottom row panels: other parameter values are k = 0.01, b = 0.01, λ = 0.3. the dynamics are equivalent to Equations (5) to (8) with the substitution b → b + λ.The last case in which the optimal trading speed is constant is in the limit ρ →∞ which again corresponds to the dynamics of (5) to (8). For parameter values which do not give rise to the TWAP strategy, the typical optimal behaviour is to trade at faster speeds closer to the beginning and end of the trading period, and at slower speeds closer to the middle of the trading period (compared to the constant rate of q /T).Thereasonforthisisbecausetheagentwantstominimizethetimewhenthey are trading under significant effect of transient price impact. A single large trade will suer ff from temporary price impact, but it is not until subsequent trades are submitted that the effects of transient impact are encountered. By trading quickly early on, there will be more time spent in a state where the exponential resilience has brought the price back towards its unaeff cted level. Then, when the transient price impact has decayed significantly, the agent can again place large trades to complete the liquidation. For other descriptions and works which model resilience and transient price impact, see Gatheral (2010), Alfonsi, Fruth, and Schied (2010), Predoiu, Shaikhet, and Shreve (2011), Obizhaeva and Wang (2013), Guo and Zervos (2015), Graewe and Horst (2017), Becherer, Bilarev, and Frentrup (2018), Bank and Voß (2019), Fruth, Schöneborn, and Urusov (2019), APPLIED MATHEMATICAL FINANCE 197 Ekren and Muhle-Karbe (2019), Horst and Xia (2019), Forde, Sánchez-Betancourt, and Smith (2021), and Neuman and Voß (2022). 4.4. Non-linear Price Impact Most of the results in previous sections rely on assuming that price impact functions are linear with respect to the rate of trading, even though the original models proposed in Sec- tions 3.1 and 3.2 have general impact functions. In this section we explore an optimization in which impact functions may be non-linear with respect to the rate of trading, with a focus on R. F. Almgren (2003). This work begins with a model in discrete time which is based on that of Section 3.1, where the nature of temporary price impact is modified so that Equations (1) to (4) become = Q + V , (50) n+1 n n S = S + f (V ) +  , (51) n+1 n n n+1 X = X − S V , (52) n+1 n n n S = S + g(V ) + h(V )ξ , (53) n n n n n+1 where {ξ } is a sequence of independent random variables with mean zero, unit vari- n=1 ance, and which are also independent from { } . Thus, the temporary price impact is n=1 composed of the usual term, g(V ), which depends only on the trade submitted at time n,butalso hasastochasticcomponent givenby h(V )ξ . This type of stochastic depen- n n+1 dence of temporary impact could be the result of several factors, for example the latency between the time that a trade decision is made and the time at which it is matched against passive orders during which the depth of available orders may have changed. See Cartea and Sánchez-Betancourt (2021a) which considers this stochastic delay directly. The agent attempts to maximize a mean-variance style criterion with respect to her N−1 terminal wealth subject to the constraints Q = 0 and that the sequence {V } is N n n=0 deterministic. The agent’s terminal wealth is given by N−1 N−1 N−1 N−1 X = X + S Q + Q f (V ) + Q  − V g(V ) − V h(V )ξ , N 0 0 0 n n n n+1 n n n n n+1 n=0 n=0 n=0 n=0 (54) which, under deterministic trading strategies, has expectation and variance N−1 N−1 E[X ] = X + S Q + Q f (V ) − V g(V ), (55) N 0 0 0 n n n n n=0 n=0 N−1 N−1 2 2 2 2 V[X ] = σ Q + V h (V ), (56) N n n n n=0 n=0 2 2 where σ = E[ ]. Before optimizing the trade-off between these two terms, they are both replaced by their continuous-time analogues E[X ] = X + S Q + Q f (ν ) − ν g(ν ) dt, (57) T 0 0 0 t t t t 0 198 R. DONNELLY 2 2 2 2 V[X ] = σ Q + ν h (ν ) dt. (58) T t t t The objective of the agent is to maximize the mean-variance criterion H = E[X ] − V[X ]. (59) T T Recognizing that ν = Q frames this computation into a standard problem of the calculus of variations. After classifying optimality in R. F. Almgren (2003), closed-form expres- sions for the strategy are found by allowing the horizon T to approach infinity. First, we investigate the effect of non-linear temporary impact with zero stochastic impact such that f (ν) = bν, g(ν) = kν , h(ν) = 0, (60) with m > 0. In this case the optimal inventory path is given by 1+m ⎪ 1−m 1 − m t ⎪ q 1 + ,if0 < m < 1, ⎪ ∗ ⎪ 1 + m T ∗ t Q = q exp − ,if m = 1, (61) m+1 ⎪ m−1 m − 1 t m+1 ⎪ ∗ ⎩ q 1 − ,if m > 1and t ≤ T , m−1 m + 1 T where m+1 m−1 2mkq ∗ 0 T = . (62) γσ A comparison with (26) in the limit T →∞ shows that it coincides with the optimal strat- egy above when m = 1. This is expected because the price impact functions are the same and the optimal strategy which results from (21) is deterministic. The eeff cts of the non- linear impact function g on the optimal strategy are shown in Figure 7. The other case which yields a closed-form expression for the inventory path is f (ν) = bν, g(ν) = kν, h(ν) = a, (63) with a > 0. The corresponding optimal inventory path is given by q exp − , (64) where 2k + γ a T = . (65) γσ The effect of the stochastic component of price impact can be seen in Figure 8,which is toslowtherateoftrading compared tothebenchmark of(26).Thisisbecause trading quickly magnifies the risks associated with uncertainties in temporary price impact caused APPLIED MATHEMATICAL FINANCE 199 Figure 7. Optimal inventory paths from (61) for various values of parameter m in temporary impact function g(ν) = kν . Other parameter values are σ = 1, k = 0.01, b = 0.01, γ = 1. Figure 8. Optimal inventory paths from (64) for various values of parameter a in stochastic temporary impact function h(ν) = a. Other parameter values are σ = 1, k = 0.01, b = 0.01, γ = 1. by the term aV ξ in (53). The agent is willing to accept more risk associated with the n n+1 uncertainty in future price in order to lower the certain costs and riskiness associated with temporary price impact by trading slower. Additional work related to optimal trading with non-linear impact functions can be found in Bouchard, Dang, and Lehalle (2011) and Colaneri et al. (2020). Various discus- sionsare giveninGatheraland Schied(2013)relatedto theformofimpactfunctionand midprice dynamics. 5. Alternative Objectives In each of the previous sections, the perspective of the optimization problem was that of an agent wishing to liquidate a position subject to various eect ff s related to market dynamics. Within this section, we consider agents who are not necessarily attempting to liquidate some initial holdings, but instead behave according to an alternative objective. Inspection 200 R. DONNELLY of the optimal trading strategies in previous sections shows that if the agent’s inventory begins at zero, then she will do nothing over the course of the trading interval (as her position is already liquidated, any further trading will only be subjected to costs without benetfi ).Duetothealternativeobjectiveswenow consider,thisresultnolongerholds. 5.1. Speculation, Trade Signals, and Statistical Arbitrage From a modelling perspective, a straightforward method of introducing statistical arbitrage and the ability of an agent to speculate is to assume that the unaect ff ed midprice of the asset is no longer a martingale. In each of the models considered in previous sections, if the agent doesn’t trade then the midprice process is a martingale. If instead the unaeff cted price has non-zero expected change in value over some time interval, then any non-zero inventory position will also be expected to change in value. The essence of a protfi able strategy in this situationistoholdinventory positionswhichtake advantageofthesenon-zero expected price changes, but to do so in an efficient manner so as to not suffer from price impact effects or a future change in the asset’s expected drift. We focusonCarteaandJaimungal(2016b) which introduces a stochastic drift into the midprice process. The interpretation of the drift in this work is that it arises from the per- manent price impact of total order flow of all market participants, but it is statistically equivalent to having a short term price predictor. Let μ ={μ } be a stochastic process t 0≤t≤T which satisefi s the Markov property, and assume it is adapted to the filtration {F } t 0≤t≤T weareworking with.Wespecifythedynamicsofthe relevant processestobe dQ = ν dt, (66) dS = (μ + bν )dt + σ dW , (67) t t t ν ν dX =−S ν dt, (68) t t ν ν S = S + kν , (69) t t where b >0and k > 0 are constants. Other than the inclusion of the drift μ in (67), these equations are equivalent to (5) to (8) (with f and g replaced by linear functions). The value function of our agent will correspond to the performance criterion given in (25), that is ν ν ν ν 2 ν 2 H(t, X , Q , S , μ ) = sup E X + Q (S − αQ ) − σ (Q ) du | F , (70) t t t t t T T T T u ν∈A t with α> 0and γ> 0 constant. As discussed shortly after (25), this performance criterion can be interpreted as the agent expressing risk-aversion with respect to the random mid- price changes, but risk-neutrality with respect to randomness associated with the stochastic drift process μ. At this point, no specific dynamics for the process μ have been specified. As long as this stochastic drift term satisfies some technical integrability conditions, the value function and optimal trading strategy can be expressed in terms of expected future drift, as shown in the following Proposition. APPLIED MATHEMATICAL FINANCE 201 Proposition 5.1: The value function in (70) is given by H(t, x, q, S, μ) = x + qS + h (t, μ) + h (t, μ)q + h (t)q , (71) 0 1 2 − −ω(T−t) + ω(T−t) β ψ e − ψ e b h (t) = − , (72) − −ω(T−t) + ω(T−t) 2 ψ e + ψ e 2 T − −ω(T−u) + ω(T−u) ψ e + ψ e h (t, μ) = E[μ |F ]du, (73) 1 u t − −ω(T−t) + ω(T−t) ψ e + ψ e h (t, μ) = E[h (t, μ )|F ]du (74) 0 u t 4k with γσ β = 2kγσ , ω = , ψ = β ± 2α ∓ b, 2k and the optimal strategy in feedback form is h (t, μ) (2h (t) + b) 1 2 ν (t, q, μ) = + q. (75) 2k 2k A comparison of the optimal strategy (75) to the liquidation strategy without signals given in (24) shows that the inventory component of the feedback form is identical, but the strategy with signals has an additional component which comes from the agent’s forecast of the expected future asset price built into the term h (t, μ). This term could naturally be compared with the more straightforward quantity E[μ |F ]du, which represents u t the expected change in the unaec ff ted midprice from time t to T. While this itself yields a forecast of how much each unit of the agent’s inventory is expected to change in value, thereareotherconsiderationsthattheagentmusttakeintoaccount(namely priceimpact effects, liquidation penalties, and risk) when taking advantage of this signal. This gives rise to the complicated discount factor within the integral which balances the trade-off between taking advantage of the signal while acknowledging that this position will be partially liq- uidated close to time T and that any inventory holdings have inherent risk of price changes. If we assume 2α> b, then inspection of this discount factor shows that it is decreasing from 1 as u ranges from t to T and remains positive. This indicates that the expected price changesintheimmediatefutureasassignedmoreweighttowardsthetradingstrategycom- paredtoexpectedpricechangesinthe distantfuture. This arisesbecausetheagentwould like to enter a position as early as possible to take advantage price changes over as long a time horizon as possible, but also because there is less risk associated with acting on immediate forecasts compared to distant ones. There are numerous other works in which some form of trade signal or price pre- dictor are incorporated. See for example Chan and Sircar (2015), Cartea, Donnelly, and Jaimungal (2018a), Casgrain and Jaimungal (2019), Lehalle and Neuman (2019), Don- nelly and Lorig (2020), Forde, Sánchez-Betancourt, and Smith (2021), Bergault, Drissi, and Guéant (2022), and Neuman and Voß (2022). 202 R. DONNELLY 5.2. Hedging The main focus of the previous section was not about optimally liquidating an asset posi- tion, but rather attempting to maximize prots fi that could be extracted from knowledge about expected price changes. Nevertheless, both optimal liquidation and speculative trad- ing have the common theme that they try to minimize costs, or equivalently maximize profits. Here we consider an agent who has an additional objective which is not directly related to cost minimization, although that will still be a consideration which will influence her trading strategy. InBank,Soner,and Voß(2017), the authors consider an agent who is tasked with opti- mally tracking a hedge ratio which arises from attempting to minimize the risk associated with holding another security. Mathematically, they assume that there is a stochastic pro- cess ξ ={ξ } which indicates an inventory level which the agent intends to track t 0≤t≤T ecffi iently. The dynamics of speciefi d processes and the performance criterion used by the agent are given by dQ = ν dt, (76) dX =−(S + kν )ν dt, (77) t t t ν ν ν ν 2 H (x , q , S ) = E X + Q S − (Q − ξ ) dt , (78) 0 0 0 T t T T where the midprice process Sisamartingalewhichisnotsubjecttotheeeff ctsofpermanent price impact. Within the performance criterion, the rfi st two terms reward the agent for trading optimally subject to temporary price impact. The second term gives incentive for the agent to closely track the indicated inventory level ξ. With an optimization criterion in hand, the authors characterize the optimal trading strategy and value function. For this it is convenient to introduce the notation T − t τ(t) = √ , (79) 2k T τ(u) −τ(u) 1 e + e ξ = √ E ξ du | F . (80) t u t τ(t) −τ(t) e − e 2k t Note the similarity between Equations (80) and (73). In Equation (73), the relevant quan- tity at any point in time is not the trade signal μ itself, but instead the optimal strategy dependsonthe expectedvalueofthe signal atall timesinthe future,modiefi daccord- ing to an integration kernel. A similar theme occurs in (80) where the relevant quantity to the optimal efficient hedging strategy is not the intended hedging target ξ ,but aweighted average of the expected future value of the target. The precise way that the target hedge ratio enters the optimal trading strategy is summarized by the following proposition. APPLIED MATHEMATICAL FINANCE 203 Proposition 5.2: The value function corresponding to the performance criterion in (78) is given by H(x , q , S ) = x + q S − tanh(τ (0))(q − ξ ) 0 0 0 0 0 0 0 T T 1 k − E (ξ − ξ ) dt − E tanh(τ (t))d ξ , (81) t t 2 2 0 0 and the trading strategy which achieves this performance yields an inventory process that satisefi s tanh(τ (t)) dQ = √ (ξ − Q )dt, Q = q . (82) t t t 0 0 2k InEquation(82)we seethatatany time,the agentwilltradesuch thatherinventory moves towards the weighted target ξ with speed being proportional to the current hedging error, with the coefficient being a function depending on the remaining time T−t and the temporary price impact parameter k. This function increases with respect to the remaining time, indicating that the agent puts more eoff rt into tracking the hedge ratio when there is a long amount of time remaining in the trading interval. Indeed, when close to the end of the trading horizon, the performance criterion has relatively small penalty associated with theremainingtracking error,sotheagent deemsitmoreecffi ientto minimize thelosses due to trading rather than holding an accurate hedge. In Bank,Soner,andVoß(2017) the authors also solve this hedge tracking problem when a terminal inventory constraint is imposed of the form Q =  ,for agiven ∈ L (,F ,P). Other works related to efficient hedging include R. Almgren and T T Li (2016), Guéant and Pu (2017), and Ekren and Nadtochiy (2022)inwhichanagent holds a European option on the traded asset and attempts to maximize the expected util- ity or mean-variance performance of her terminal wealth. The work Cartea, Donnelly, and Jaimungal (2020) is similar except the agent holds a contingent claim on a non-traded risky quantity which is hedged by a correlated asset, and provide closed form trading strategies which approximate the optimal hedge. 5.3. Targeting VWAP Each of the previous sections considers an agent that measures her performance using a criterion which is internal. That is, there is no sense of measuring performance relative to other market participants or market state. In this section, we consider an agent who wishes to employ a trading strategy with performance measured by the average execution price of her trades relative to the volume weighted average price (VWAP). The time weighted average price (TWAP) can be captured by any trading strategy which trades at a constant speed over the time interval of interest, and such a strategy has already been seen in Equa- tions (19) and (20). Unlike TWAP, VWAP depends on the actions of all market participants and measures the average transaction price of all trades over a time period, and the uncer- tainty in the actions of all other market participants makes it a more difficult quantity to target. 204 R. DONNELLY Suppose that all trades over our time period of interest occur at a sequence of times denoted τ ={τ } , and that the trade at time τ has volume V which is executed when n n n n=1 the midprice is S . Then the VWAP for this time period is V S n τ n=1 . (83) n=1 Working in continuous time, if we denote total order-flow in the market by μ ={μ } , t 0≤t≤T then the appropriate expression for VWAP is μ S dt t t . (84) μ dt VWAP is a frequently used benchmark for a trading strategy because it represents the trade performance across the entire market, and additionally is easy to compute. When an agent is trying to optimally liquidate a block of shares with performance measured relative to VWAP, she must time her trades throughout the trading interval so that her average trade price is close to this target. The main difficulty is that VWAP is uncertain because there is randomness in future market order-flow. This difficulty is present in addition to try- ing to maintaining trade efficiency regarding price impact considerations. One method of attempting to target VWAP is to instead target the percentage of volume (POV) at each point in time. This is the approach taken in Cartea and Jaimungal (2016a)which wenow focus on. Let μ ={μ } denote the order-flow of buy and sell orders in the market, both 0≤t≤T assumed to be positive. Then the total order-flow at time t including the agent’s trading is + − given by μ + μ +|ν |. The midprice is assumed to be linearly impacted by all order- flow, and so its dynamics are ν + − dS = b(μ − μ + ν )dt + dM , (85) t t t t t ± ± where M is a martingale independent of μ . Similar to Section 5.1 we assume that μ have the Markov property with respect to our filtration. Instead of directly trying to target VWAP, the agent will attempt to track POV through time. Let ρ ∈ (0, 1) be xfi ed and den fi e ν + − χ = ρ(μ + μ +|ν |), and specify the agent’s dynamic value function to be ± ν ν ν ν ν 2 H(t, X , Q , S , μ ) = sup E X + Q (S − αQ ) − ϕ (|ν |− χ ) dt | F , (86) t t t u t t T T T T u ν∈A where ϕ> 0isapenalty parameterfor deviatingfromPOV.Tosee whytheinclusionof the POV target assists in targeting VWAP, suppose that the agent is only selling, meaning ν ≤ 0, and that her trading speed exactly matches the percentage of total order-flow, giving + − ν =−ρ(μ + μ +|ν |). Then the agent’s trades, marked to the midprice as in (84), are t t t t executed at an average value of T T + − ν ν ν S dt (μ + μ +|ν |)S dt t t t t t t 0 0 = , (87) T T + − ν dt (μ + μ +|ν |)dt t t t t 0 0 and thus she has exactly targeted VWAP. Unfortunately, by following this strategy she will likely fail to liquidate her initial inventory position, and so inclusion of the penalty APPLIED MATHEMATICAL FINANCE 205 term in (86) initiates a trade-off between targeting VWAP and efficiently liquidating her inventory as represented by the other terms in the performance criterion. For the remainder of this analysis we assume that the agent’s optimal trading strategy will consist only of selling, meaning we may replace occurrences of |ν | with −ν ,resulting t t in simplified computations. This assumption will likely be the case when the agent’s initial inventory is large and positive and when the need for liquidation outweighs the potential benetfi s from buying shares of the asset due to price forecasts similar to the treatment in Section 5.1. Under this assumption, the value function and optimal strategy are given by the following Proposition. Proposition 5.3: The value function in (86) is given by ± ± ± 2 H(t, x, Q, S, μ ) = h (t, μ ) + h (t, μ )q + h (t)q , (88) 0 1 2 −1 h (t) =−(ϕ ˆ + k) ((T − t) + ζ ) − (89) (T − s) + ζ ± + − h (t, μ ) = b E[μ − μ |F ]ds 1 t s s (T − t) + ζ 2ϕ ˆρˆ + − + E[μ + μ |F ]ds (90) s s (T − t) + ζ T ± + − 2 (h (s, μ ) − 2ϕ ˆρ( ˆ μ + μ )) ± s s s 2 + − 2 h (t, μ ) = E −ˆ ϕρˆ (μ + μ ) | F 0 t s s 4(ϕ ˆ + k) (91) with 2(ϕ ˆ + k) ζ = , ϕ ˆ = ϕ(1 − ρ) , ρˆ = ρ/(1 − ρ), 2α − b and the optimal trading strategy in feedback form is ± + − h (t, μ ) − 2ϕ ˆρ( ˆ μ + μ ) ∗ ± −1 ν (t, q, μ ) =− ((T − t) + ζ ) q + . (92) 2(ϕ ˆ + k) In (92) the first term which depends on inventory resembles a strategy that targets TWAP, and the remaining term is an adjustment that attempts to protfi from expected + − future price forecasts (as seen from the term which includes E[μ − μ |F ]) and the s s + − desire to closely target POV (as seen in all terms including μ + μ ). Further intuition about the optimal strategy can be gathered when considering rfi st the limit α →∞,then the limit ϕ →∞. The rfi st of these limits ensures that the agent fully liquidates her posi- tion by time T, and the second limit drives her to match POV as closely as possible under the full liquidation constraint. Evaluating these limits gives 1 1 ∗ ± + − + − lim lim ν (t, q, μ ) =− q −ˆ ρ (μ + μ ) − E[μ + μ |F ]ds . s s ϕ→∞ α→∞ T − t T − t (93) The dependence on expected future price has disappeared because of the infinite weight associated with full liquidation and volume tracking. The form of makes it clear that the 206 R. DONNELLY agent’s trading strategy has a component which directly depends on the current total order + − flow μ + μ ,buttheinufl enceisadjusteddownwardduetotheinufl enceoffuturetrades. In Cartea and Jaimungal (2016a)itisalsoshown howanagentmaytargetVWAPby tracking percentage of cumulative volume (POCV) rather than POV. Other works which consider an agent attempting to match her trade executions to market VWAP are Kon- ishi (2002)and Frei andWestray (2015). Both derive expressions for strategies which minimizetheVWAPtargeting error,butrelyonfutureknowledgeofthetotalmarket vol- ume. Astudyof guaranteed VWAPcontractsisconducted inGuéant andRoyer(2014), in which an agent guarantees to provide an execution service at VWAP prices, but then mitigates the risk of having entered this contract. See Białkowski, Darolles, and Fol (2008) and Mitchell, Białkowski, and Tompaidis (2020) for empirical studies of strategies which target VWAP. 5.4. Other Objectives There are numerous other criteria that an agent could consider for the purposes of evaluat- ing the performance of her trading strategy. Here we briefly mention some lines of research that do not explicitly fall into any of the categories of the previous few sections. In Neuman et al. (2020)the agentisconsideredtobeacentralbank,andshe istasked with maintaining a currency peg or currency target zone, the goal of which is to make sure that a particular foreign exchange rate remains within a specified domain. This is done bytradingintheunderlyingcurrenciesandutilizing thepermanent priceimpactfrom the trades in such a way as to move the exchange rate to its desired value. This task has significant inventory risk associated with it which the central bank wishes to manage. In Neuman and Schied (2022), a similar currency peg context is considered, but in addition to a central bank maintaining the peg there is a strategic investor attempting to disrupt thebank’sobjective.Thisismodelledasastochasticdieff rentialgame andaStackelberg equilibrium is established. The operation of modern markets is strongly dependent on technology, and the fre- quency at which events occur mean that latency can have an affect on the performance of an execution strategy. From the time that an agent submits her intention to trade until that signal arrives at the matching engine of the exchange, the state of the limit order bookmayhavechangedin suchawaythattheagent’s ordercannotbefilled.InCartea and Sánchez-Betancourt (2021b) the agent acknowledges that latency can aeff ct whether her submitted orders are executed as expected, and adjusts her performance criterion to account for the fraction of her trades that are filled. See also Moallemi and Sağlam ( 2013) for some empirical analysis of the effects of latency on optimal execution strategies. 6. Other Phenomena Below is a small number of other styles of model which do not explicitly fall into any of the previous categories. Many of the ideas explored in the following sections could be adapted in conjunction with the topics explored already. The referenced material should be consulted for the mathematical details used to approach these phenomena. APPLIED MATHEMATICAL FINANCE 207 6.1. Multiple Agents Each setting considered so far has incorporated the strategic actions of only one agent. Some of the previous models have incorporated total order flow of the market within the dynamics, but any such model does not arise as a result of an optimization procedure car- ried out by other agents. In reality, no single agent acts in isolation, and their actions will have an effect on the state of the market, thereby altering the actions of other agents. When modelling several agents simultaneously, we move from a stochastic optimization into a stochastic game setting. In Schied and Zhang (2019) a competitive game is considered between a finite number of agents where they are restricted to trading at discrete times. Continuous-time counterparts ares studied in Strehle (2017) and Neuman and Voß (2021), and in Voß (2022)atwoagent setting is considered in which they must both efficiently track an inventory target. Finite player stochastic differential games generally result in a set of coupled differential equations which can prove to be intractable. Modeling the problem as a mean-field game, appropriate for a large number of agents, often reduces the complexity of the computations. The optimal liquidation problem in a mean eld fi setting is detailed in Huang, Jaimungal, and Nourian (2019)and Fuetal.(2021). In Casgrain and Jaimungal (2020)andDonnelly and Lorig (2020) a mean-field game is used to consider agents who have different methods of speculation about the future price of the asset. 6.2. Robustness Each of the optimal trading strategies from previous sections depend on a specification of adynamic modelforthepriceprocessandotherrelevantquantities.Butthe performance of any trading strategy is subject to risks arising from using a misspecified model, thus it is desirable to account for these risks and attempt to formulate a trading strategy which is robust to model error. In Schied (2013) the optimality of a trading strategy with respect to the underlying modelisstudied.InNyström,Aly,andZhang(2014), Cartea, Donnelly, and Jaimungal (2018b), andHorst,Xia,and Zhou (2022), the agent’s performance criterion accounts for the possibility that the dynamics evolve according to a dieff rent model, and she thereby adjusts her trading strategy to compensate. 6.3. Market Making and Liquidation with Limit Orders The results which have been outlined in previous sections frame a trading strategy in terms of the speed or volume of trades. Keeping in mind the discussion about the LOB structure, an agent can only guarantee that a particular volume of trades be executed if she employs MOs. Here we give references which study the strategic placement of LOs instead of or in conjunction with MOs. An agent who simultaneously places buy and sell LOs and attempts to maximize protfi s is considered in Avellaneda and Stoikov (2008). Further work on optimal market making is considered in Guilbaud and Pham (2013), Guéant, Lehalle, and Fernandez-Tapia (2013), Cartea,Jaimungal,andRicci (2014), Cartea and Jaimungal (2015b), and Cartea, Donnelly, and Jaimungal (2018a). 208 R. DONNELLY The incorporation of LOs on only one side of the LOB for use in optimal liqui- dation is considered in Guéant, Lehalle, and Fernandez-Tapia (2012), Bayraktar and Ludkovski (2014), Guéant and Lehalle (2015), and Cartea and Jaimungal (2015a). Notes 1. Other matching rules such as Pro Rata are also used, but the intricate details of how they dieff r from FIFO will not be necessary for the types of models considered here. 2. The opposite inequality means the original optimization problem in (12) may lack concavity, and the agent could make infinite prots fi through price manipulations. 3. See Cartea, Donnelly, and Jaimungal (2018a) for some empirical investigation of LOB event frequency seasonality and self-excitation. 4. Refer to the discussion after Proposition 3.1. 5. We take all of these volumes to be positive whether they are buys or sells. One could alternatively consider the VWAP of only buy or sell trades depending on whether the agent is buying or sell- ing, but typically all trades are considered in practice. Additionally, VWAP is typically measured with respect to the midprice without taking transaction or price impact eeff cts into account. Acknowledgments The author would like to thank Fayçal Drissi, Matthew Lorig, and Álvaro Cartea for their comments and suggestions which have improved this manuscript. Disclosure statement No potential conflict of interest was reported by the author(s). References Ackermann J., Kruse T., and Urusov M. 2021. “Optimal Trade Execution in an Order Book Model with Stochastic Liquidity Parameters.” SIAM Journal on Financial Mathematics 12 (2): 788–822. doi:10.1137/20M135409X. Alfonsi A., Fruth A., and Schied A. 2010. “Optimal Execution Strategies in Limit Order Books with General Shape Functions.” Quantitative Finance 10 (2): 143–157. doi:10.1080/1469768080259 Almgren R. 2012. “Optimal Trading with Stochastic Liquidity and Volatility.” SIAM Journal on Financial Mathematics 3 (1): 163–181. doi:10.1137/090763470. AlmgrenR., andChrissN. 2001. “Optimal Execution of Portfolio Transactions.” The Journal of Risk 3: 5–39. doi:10.21314/JOR.2001.041. Almgren R., and Li T. M. 2016.“Option HedgingwithSmoothMarketImpact.”MarketMicrostruc- ture and Liquidity 2 (1): Article ID 1650002. doi:10.1142/S2382626616500027. AlmgrenR.,Thum C.,Hauptmann E.,and LiH. 2005. “Direct Estimation of Equity Market Impact.” Risk 18 (7): 58–62. Almgren R. F. 2003. “Optimal Execution with Nonlinear Impact Functions and Trading-Enhanced Risk.” Applied Mathematical Finance 10 (1): 1–18. doi:10.1080/135048602100056. Avellaneda M., and Stoikov S. 2008. “High-Frequency Trading in a Limit Order Book.” Quantitative Finance 8 (3): 217–224. doi:10.1080/14697680701381228. Bank P., Soner H. M., and Voß M. 2017. “Hedging with Temporary Price Impact.” Mathematics and Financial Economics 11 (2): 215–239. doi:10.1007/s11579-016-0178-4. Bank P., and Voß M. 2019. “Optimal Investment with Transient Price Impact.” SIAM Journal on Financial Mathematics 10 (3): 723–768. doi:10.1137/18M1182267. Barger W., and Lorig M. 2019. “Optimal Liquidation Under Stochastic Price Impact.” International Journal of Theoretical and Applied Finance 22 (2): Article ID 1850059. doi:10.1142/S0219024918500590. APPLIED MATHEMATICAL FINANCE 209 Bayraktar E., and Ludkovski M. 2014. “Liquidation in Limit Order Books with Controlled Intensity.” Mathematical Finance 24 (4): 627–650. doi:10.1111/ma.2014.24.is fi sue-4 . Becherer D., Bilarev T., and Frentrup P. 2018. “Optimal Liquidation Under Stochastic Liquidity.” Finance and Stochastics 22 (1): 39–68. doi:10.1007/s00780-017-0346-2. Bechler K., and Ludkovski M. 2017.“OrderFlowsandLimitOrderBookResiliencyon the Meso-Scale.” Market Microstructure and Liquidity 3 (03n04): Article ID 1850006. doi:10.1142/S2382626618500065. Bergault P., Drissi F., and Guéant O. 2022. “Multi-Asset Optimal Execution and Statistical Arbitrage Strategies Under Ornstein–Uhlenbeck Dynamics.” SIAM Journal on Financial Mathematics 13 (1): 353–390. doi:10.1137/21M1407756. BertsimasD.,andLoA.W. 1998. “Optimal Control of Execution Costs.” Journal of Financial Markets 1 (1): 1–50. doi:10.1016/S1386-4181(97)00012-8. Biais B., Hillion P., and Spatt C. 1995. “An Empirical Analysis of the Limit Order Book and the Order Flow in the Paris Bourse.” The Journal of Finance 50 (5): 1655–1689. doi:10.1111/j.1540-6261.1995.tb05192.x. Białkowski J., Darolles S., and Fol G. Le. 2008. “Improving Vwap Strategies: A Dynamic Volume Approach.” Journal of Banking & Finance 32 (9): 1709–1722. doi:10.1016/j.jbankfin.2007.09.023 . Björk T. 2009. Arbitrage Theory in Continuous Time. Oxford: Oxford University Press. Bouchard B., Dang N.-M., and Lehalle C.-A. 2011. “Optimal Control of Trading Algorithms: A General Impulse Control Approach.” SIAM Journal on Financial Mathematics 2 (1): 404–438. doi:10.1137/090777293. Cartea Á., Donnelly R., and Jaimungal S. 2017. “Algorithmic Trading with Model Uncertainty.” SIAM Journal on Financial Mathematics 8 (1): 635–671. doi:10.1137/16M106282X. Cartea Á., Donnelly R., and Jaimungal S. 2018. “Enhancing Trading Strategies with Order Book Signals.” Applied Mathematical Finance 25 (1): 1–35. doi:10.1080/1350486X.2018.1434009. Cartea Á., Donnelly R., and Jaimungal S. 2018. “Portfolio Liquidation and Ambiguity Aversion.” In High-Performance Computing in Finance, edited by M. Dempster, J. Kanniainen, J. Keane, and E. Vynckier, 77–114. New York: Chapman and Hall/CRC. Cartea Á., Donnelly R., and Jaimungal S. 2020. “Hedging Nontradable Risks with Transaction Costs and Price Impact.” Mathematical Finance 30 (3): 833–868. doi:10.1111/ma.v30.3 fi . Cartea Á., Drissi F., and Monga M. 2022. “Decentralised Finance and Automated Market Making: Predictable Loss and Optimal Liquidity Provision.” Available at SSRN 4273989. Cartea Á.,GanL.,andJaimungalS. 2019. “Trading Co-Integrated Assets with Price Impact.” Mathematical Finance 29 (2): 542–567. doi:10.1111/ma.2019.29.is fi sue-2 . Cartea Á., and Jaimungal S. 2015. “Optimal Execution with Limit and Market Orders.” Quantitative Finance 15 (8): 1279–1291. doi:10.1080/14697688.2015.1032543. Cartea Á., and Jaimungal S. 2015. “Risk Metrics and Fine Tuning of High-Frequency Trading Strategies.” Mathematical Finance 25 (3): 576–611. doi:10.1111/ma.2015.25.is fi sue-3 . Cartea Á., and Jaimungal S. 2016. “A Closed-Form Execution Strategy to Target Vol- ume Weighted Average Price.” SIAM Journal on Financial Mathematics 7 (1): 760–785. doi:10.1137/16M1058406. Cartea Á., and Jaimungal S. 2016. “Incorporating Order-Flow into Optimal Execution.” Mathematics and Financial Economics 10 (3): 339–364. doi:10.1007/s11579-016-0162-z. Cartea Á., Jaimungal S., and Penalva J. 2015. Algorithmic and High-Frequency Trading. Cambridge: Cambridge University Press. Cartea Á.,Jaimungal S.,andRicci J. 2014. “Buy Low, Sell High: A High Frequency Trading Perspective.” SIAM Journal on Financial Mathematics 5 (1): 415–444. doi:10.1137/130911196. Cartea A., and Sánchez-Betancourt L. 2021. “Optimal Execution with Stochastic Delay.” Available at SSRN 3812324. Cartea Á., and Sánchez-Betancourt L. 2021. “The Shadow Price of Latency: Improving Intraday Fill Ratios in Foreign Exchange Markets.” SIAM Journal on Financial Mathematics 12 (1): 254–294. doi:10.1137/19M1258888. Casgrain P., and Jaimungal S. 2019. “Trading Algorithms with Learning in Latent Alpha Models.” Mathematical Finance 29 (3): 735–772. doi:10.1111/ma.2019.29.is fi sue-3 . 210 R. DONNELLY Casgrain P., and Jaimungal S. 2020. “Mean-field Games with Differing Beliefs for Algorithmic Trading.” Mathematical Finance 30 (3): 995–1034. doi:10.1111/ma.v30.3 fi . Chan P., and Sircar R. 2015. “Optimal Trading with Predictable Return and Stochastic Volatility.” Available at SSRN 2623747. Chen Y., Horst U., and Tran H. H. 2019. “Portfolio Liquidation Under Transient Price Impact–Theoretical Solution and Implementation with 100 Nasdaq Stocks.” Available at SSRN Cheridito P., and Sepin T. 2014. “Optimal Trade Execution Under Stochastic Volatility and Liquid- ity.” Applied Mathematical Finance 21 (4): 342–362. doi:10.1080/1350486X.2014.881005. Colaneri K.,EksiZ.,FreyR.,and Szölgyenyi M. 2020. “Optimal Liquidation Under Partial Infor- mation with Price Impact.” Stochastic Processes and Their Applications 130 (4): 1913–1946. doi:10.1016/j.spa.2019.06.004. Cont R. 2011. “Statistical Modeling of High-Frequency Financial Data.” IEEE Signal Processing Magazine 28 (5): 16–25. doi:10.1109/MSP.2011.941548. ContR.,Kukanov A.,andStoikovS. 2014.“ThePriceImpact of OrderBookEvents.”Journal of Financial Econometrics 12 (1): 47–88. doi:10.1093/jjfinec/nbt003 . Donnelly R., and Lorig M. 2020. “Optimal Trading with Differing Trade Signals.” Applied Mathe- matical Finance 27 (4): 317–344. doi:10.1080/1350486X.2020.1847672. Ekren I., and Muhle-Karbe J. 2019. “Portfolio Choice with Small Temporary and Transient Price Impact.” Mathematical Finance 29 (4): 1066–1115. doi:10.1111/ma.v29.4 fi . Ekren I., and Nadtochiy S. 2022. “Utility-Based Pricing and Hedging of Contingent Claims in Almgren-Chriss Model with Temporary Price Impact.” Mathematical Finance 32 (1): 172–225. doi:10.1111/ma.v32.1 fi . Fleming W. H., and Soner H. M. 2006. Controlled Markov Processes and Viscosity Solutions. Vol. 25. New York: Springer Science & Business Media. Forde M., Sánchez-Betancourt L., and Smith B. 2021. “Optimal Trade Execution for Gaussian Signals with Power-Law Resilience.” Quantitative Finance 22 (3): 585–596. doi:10.1080/14697688.2021. Fouque J.-P., Jaimungal S., and Saporito Y. F. 2022.“OptimalTrading withSignalsandStochastic Price Impact.” SIAM Journal on Financial Mathematics 13 (3): 944–968. doi:10.1137/21M139 Frei C., and Westray N. 2015. “Optimal Execution of a VWAP Order: A Stochastic Control Approach.” Mathematical Finance 25 (3): 612–639. doi:10.1111/ma.2015.25.is fi sue-3 . Fruth A., Schöneborn T., and Urusov M. 2019. “Optimal Trade Execution in Order Books with Stochastic Liquidity.” Mathematical Finance 29 (2): 507–541. doi:10.1111/ma.2019.29.is fi sue-2 . FuG.,GraeweP.,HorstU., andPopierA. 2021. “A Mean Field Game of Optimal Portfolio Liquida- tion.” Mathematics of Operations Research 46 (4): 1250–1281. doi:10.1287/moor.2020.1094. Gatheral J. 2010. “No-Dynamic-Arbitrage and Market Impact.” Quantitative Finance 10 (7): 749–759. doi:10.1080/14697680903373692. Gatheral J., and Schied A. 2011. “Optimal Trade Execution Under Geometric Brownian Motion in the Almgren and Chriss Framework.” International Journal of Theoretical and Applied Finance 14 (3): 353–368. doi:10.1142/S0219024911006577. Gatheral J., and Schied A. 2013. “Dynamical Models of Market Impact and Algorithms for Order Execution.” In Handbook on Systemic Risk, edited by Fouque, Jean-Pierre, and Joseph A. Langsam, 579–599. Cambridge: Cambridge University Press. Graewe P., and Horst U. 2017. “Optimal Trade Execution with Instantaneous Price Impact and Stochastic Resilience.” SIAM Journal on Control and Optimization 55 (6): 3707–3725. doi:10.1137/16M1105463. Guéant O. 2016. The Financial Mathematics of Market Liquidity: From Optimal Execution to Market Making. Vol. 33. New York: CRC Press. Guéant O.,and LehalleC.-A. 2015. “General Intensity Shapes in Optimal Liquidation.” Mathemat- ical Finance 25 (3): 457–495. doi:10.1111/ma.2015.25.is fi sue-3 . Guéant O., Lehalle C.-A., and Fernandez-Tapia J. 2012. “Optimal Portfolio Liquidation with Limit Orders.” SIAM Journal on Financial Mathematics 3 (1): 740–764. doi:10.1137/110850475. APPLIED MATHEMATICAL FINANCE 211 Guéant O., Lehalle C.-A., and Fernandez-Tapia J. 2013.“DealingwiththeInventoryRisk: ASolu- tion to the Market Making Problem.” Mathematics and Financial Economics 7 (4): 477–507. doi:10.1007/s11579-012-0087-0. Guéant O., and Pu J. 2017. “Option Pricing and Hedging with Execution Costs and Market Impact.” Mathematical Finance 27 (3): 803–831. doi:10.1111/ma.2017.27.is fi sue-3 . Guéant O., and Royer G. 2014. “VWAP Execution and Guaranteed VWAP.” SIAM Journal on Financial Mathematics 5 (1): 445–471. doi:10.1137/130924676. Guilbaud F., and Pham H. 2013. “Optimal High-Frequency Trading with Limit and Market Orders.” Quantitative Finance 13 (1): 79–94. doi:10.1080/14697688.2012.708779. Guo X., and Zervos M. 2015. “Optimal Execution with Multiplicative Price Impact.” SIAM Journal on Financial Mathematics 6 (1): 281–306. doi:10.1137/120894622. Horst U., and Xia X. 2019. “Multi-Dimensional Optimal Trade Execution Under Stochastic Resilience.” Finance and Stochastics 23 (4): 889–923. doi:10.1007/s00780-019-00394-3. Horst U., Xia X., and Zhou C. 2022. “Portfolio Liquidation Under Factor Uncertainty.” The Annals of Applied Probability 32 (1): 80–123. doi:10.1214/21-AAP1672. Huang X., Jaimungal S., and Nourian M. 2019. “Mean-Field Game Strategies for Optimal Execution.” Applied Mathematical Finance 26 (2): 153–185. doi:10.1080/1350486X.2019.1603183. Ishitani K., and Kato T. 2014. “Optimal Execution for Uncertain Market Impact: Derivation and Characterization of a Continuous-Time Value Function.” In 2012 Recent Advances in Financial Engineering: Proceedings of the International Workshop on Finance 2012, edited by A. Takahashi, Y. Muromachi, and T. Shibata, 93–116. Tokyo: World Scientific. Konishi H. 2002. “Optimal Slice of a VWAP Trade.” Journal of Financial Markets 5 (2): 197–221. doi:10.1016/S1386-4181(01)00023-4. LehalleC.-A.,andMounjidO. 2017. “Limit Order Strategic Placement with Adverse Selec- tion Risk and the Role of Latency.” Market Microstructure and Liquidity 3 (1): 1750009. doi:10.1142/S2382626617500095. Lehalle C.-A., and Neuman E. 2019. “Incorporating Signals into Optimal Trading.” Finance and Stochastics 23 (2): 275–311. doi:10.1007/s00780-019-00382-7. Mitchell D., Białkowski J., and Tompaidis S. 2020. “Volume-Weighted Average Price Tracking: A Theoretical and Empirical Study.” IISE Transactions 52 (8): 864–889. doi:10.1080/24725854.2019. Moallemi C. C., and Sağlam M. 2013. “Or Forum–The Cost of Latency in High-Frequency Trading.” Operations Research 61 (5): 1070–1086. doi:10.1287/opre.2013.1165. Neuman E., and Schied A. 2022. “Protecting Pegged Currency Markets from Speculative Investors.” Mathematical Finance 32 (1): 405–420. doi:10.1111/ma.v32.1 fi . Neuman E.,SchiedA.,Weng C.,and XueX. 2020. “A Central Bank Strategy for Defending a Currency Peg.” Systems & Control Letters 144: Article ID 104761. doi:10.1016/j.sysconle.2020. Neuman E., and Voß M. 2021. Trading with the Crowd. arXiv preprint arXiv:2106.09267. Neuman E., and Voß M. 2022. “Optimal Signal-Adaptive Trading with Temporary and Transient Price Impact.” SIAM Journal on Financial Mathematics 13 (2): 551–575. doi:10.1137/20M137 NyströmK., AlyS.M.Ould, andZhangC. 2014. “Market Making and Portfolio Liquidation Under Uncertainty.” International Journal of Theoretical and Applied Finance 17 (5): Article ID 1450034. doi:10.1142/S0219024914500344. Obizhaeva A. A., and Wang J. 2013. “Optimal Trading Strategy and Supply/Demand Dynamics.” Journal of Financial Markets 16 (1): 1–32. doi:10.1016/j.finmar.2012.09.001 . Pham H. 2009. Continuous-Time Stochastic Control and Optimization with Financial Applications. Vol. 61. Berlin: Springer Science & Business Media. Predoiu S., Shaikhet G., and Shreve S. 2011. “Optimal Execution in a General One-Sided Limit- Order Book.” SIAM Journal on Financial Mathematics 2 (1): 183–212. doi:10.1137/10078534X. Ranaldo A. 2004. “Order Aggressiveness in Limit Order Book Markets.” Journal of Financial Markets 7 (1): 53–74. doi:10.1016/S1386-4181(02)00069-1. 212 R. DONNELLY Schied A. 2013. “Robust Strategies for Optimal Order Execution in the Almgren–Chriss Frame- work.” Applied Mathematical Finance 20 (3): 264–286. doi:10.1080/1350486X.2012.683963. Schied A., and Zhang T. 2019. “A Market Impact Game Under Transient Price Impact.” Mathematics of Operations Research 44 (1): 102–121. doi:10.1287/moor.2017.0916. Sirignano J. A. 2019. “Deep Learning for Limit Order Books.” Quantitative Finance 19 (4): 549–570. doi:10.1080/14697688.2018.1546053. Strehle E. 2017. “Optimal Execution in a Multiplayer Model of Transient Price Impact.” Market Microstructure and Liquidity 3 (03n04): Article ID 1850007. doi:10.1142/S2382626618500077. Touzi N., and Tourin A. 2013. Optimal Stochastic Control, Stochastic Target Problems, and Back- ward SDE. Vol. 29. New York: Springer. Voß M. 2022. “A Two-Player Portfolio Tracking Game.” Mathematics and Financial Economics 16 (4): 779–809. doi:10.1007/s11579-022-00324-6. Yong J., and Zhou X. Y. 1999. Stochastic Controls: Hamiltonian Systems and HJB Equations. Vol. 43. New York: Springer Science & Business Media.

Journal

Applied Mathematical FinanceTaylor & Francis

Published: May 4, 2022

Keywords: Algorithmic trading; price impact; stochastic optimization

References