L02 Risk Models & Market Risk

Financial Risk Management Fundamental

Financial Risk

Definition

  • Financial risk: prospect of financial gain or loss due to unforeseen changes in risk factors

  • Market risks: arising from changes in market prices or rates
    Interest-rate risks, equity risks, exchange rate risks, commodity price risks, etc.

  • Market vs. credit vs. operational risks

    • Credit risks: risks arising from possible default on contract
    • Op risks: risks from failure of people or systems

Contributory factors: volatile environment

  • Stock market volatility
    • 23% fall in Dow Jones on Oct. 19, 1987
    • Dow Jones fell around 20% in July-Aug 1998
    • Massive falls in East Asian stock markets in 1998
    • 2015–16 Chinese stock market turbulence
  • Exchange rate volatility
    • Problems of ERM (Exchange Rate Mechanism), peso, rouble, East Asia, etc.
  • Interest rate volatility
    • US interest rates doubled over 1994
  • Commodity market volatility
    • Electricity prices, etc.

Contributory factors: growth in trading activity

  • Number of shares traded on NYSE grown from 3.5m in 1970 to 100m in 2000
  • Turnover in FX markets grown from $1bn a day in 1965 to $1,210bn in April 2001
  • Explosion in new financial instruments
  • Securitization
  • Growth of offshore trading
  • Growth of hedge funds



  • Derivatives
    • Few derivatives until early 1970s
    • FX futures (CME, 1972), equity calls (CBOT, 1973), interest-rate futures (1975)
    • Swaps and exotics (1980s)
    • Cat, credit, electricity, weather derivatives (1990s)
    • Daily notional principals negligible in early 1970s; up to nearly $2,800bn in April 2001

Contributory factors: advances in IT

  • Huge increases in computational power and speed

    • Costs falling at 25-30% a year for 30 years
  • Improvements in software

  • Improvements in user-friendliness

    • E.g., in simulation software
  • Risk measurers no longer constrained to simple back-of-the-envelope methods

Financial Risk Management before VaR

Risk measurement before VaR

  • Gap analysis

  • PV01 analysis

  • Duration and duration-convexity analysis

  • Scenario analysis

  • Portfolio theory

  • Derivatives risk measures

  • Statistical methods, ALM, etc.

Gap Analysis

  • Developed to get crude idea of interest-rate risk exposure
  • Choose horizon, e.g., 1 year
  • Determine how much of asset or liability portfolio re-prices in that period
    • Rate-sensitive assets / liabilities
  • GAP = RS assets – RS liabilities
  • Exposure is change in change in net interest income when interest rates change
  • Pros
    • Easy to carry out
    • Intuitive
  • Cons
    • Crude
    • Only applies to on-balance sheet IR risk
    • Looks at impact on income, not asset / liability values
    • Results sensitive to choice of horizon

PV01 Analysis

  • Also applies to fixed income positions

  • Addresses what will happen to bond price if interest rises by 1 basis point

  • Price bond at current interest rates

  • Price bond assuming rate rise by 1 bp

  • Calculate loss as current minus prospective bond prices

Duration Analysis

  • Another traditional approach to IR risk assessment
  • Duration is weighted average of maturities of a bond's cashflows,
    • weights are present values of each cashflow, divided by PV of all cashflows
  • Duration indicates sensitivity of bond price to change in yield:
  • Can approximate duration using simple algorithm as , where is current price, is price when , is price when
  • Duration is a linear function: duration of portfolio is sum of durations of bonds in portfolio
  • Duration assumes relationship is linear when it is typically convex
  • Duration ignores embedded options
  • Duration analysis supposes that yield curve shifts in parallel

Convexity

  • If duration takes a relationship as linear, convexity takes it as (approx) quadratic
  • Convexity is the second order term in a Taylor series approximation

  • Convexity is defined as
  • Convexity term gives a refinement to the basic duration approximation
  • In practice, convexity adjustment often small
  • Convexity a valuable property
  • Makes losses smaller, gains bigger
  • Can approximate convexity as

Pros and cons of duration-convexity

  • Pros

    • Easy to calculate
    • Intuitive
    • Looks at values, not income
  • Cons

    • Crude (only first-order approx, problems with non-parallel moves in spot rate curve, etc.)
    • Often inaccurate even with refinements (e.g., convexity)
    • Applies only to IR risks

Scenario Analysis

  • 'What if' analysis – set out scenarios and work out what we gain/lose
  • Select a set of scenarios, postulate cashflows under each scenario, use results to come to a view about exposure
  • SA not easy to carry out
    • Much hinges on good choice of scenario
    • Need to ensure that scenarios do not involve contradictory or implausible assumptions
    • Need to think about interrelationships involved
    • Want to ensure that all important scenarios covered
  • SA tells us nothing about probabilities
    • Need to use judgement to determine significance of results
  • SA very subjective
  • Much depends on skill and intuition of analyst

Portfolio Theory

  • Starts from premise that investors choose between expected return and risk
    • Risk measured by std of portfolio return
  • Wants high expected return and low risk
  • Investor chooses portfolio based on strength of preferences for expected return and risk
    • Investor determines efficient portfolios, and chooses the one that best fits preferences
  • Investor who is highly (slightly) risk averse will choose safe (risky) portfolio
  • Key insight is that risk of any position is not its std, but the extent to which it contributes to portfolio std
    • Asset might have a high std, but contribute small risk, and vice versa
  • Risk is measured by beta (why?) – which depends on correlation of asset return with portfolio return
  • High beta implies high correlation and high risk
  • Low beta implies low correlation and low risk
  • Zero beta implies no risk
  • Ideally, looking for positions with negative beta
    • These reduce portfolio risk
  • PT widely used by portfolio managers
  • But runs into implementation problems
    • Estimation of beta highly problematic
    • Each beta is specific to data and portfolio
    • Need a long data set to get reliable result
  • Practitioners often try to avoid some of these problems by working with `the' beta (as in CAPM)
    • But this often requires CAPM assumptions to hold
    • One market risk factor, etc.
  • CAPM discredited (Fama-French, etc.)

Derivatives risk measures

  • Can measure risks of derivatives positions by their Greeks
    • Delta gives change in derivatives price for small change in underlying price
    • Gamma gives change in derivatives delta for small change in underlying price
    • Rho gives change in derivatives price for small change in interest rate
    • Theta gives change in derivatives price for small change in time to maturity
    • Vega (which is not really a Greek letter) gives change in derivatives price for small change in volatility
  • Use of Greeks requires considerable skill
  • Need to handle different signals at the same time
  • Risks measures only incremental
    • Work against small changes in exogenous factors
    • Can be Greek-hedged, but still be very exposed if there are large exogenous shifts
    • Hedging strategies require liquid markets, and can fail if market liquidity dries up (as in Oct 1987)
  • Risk measures are dynamic
    • Can change considerably over time
    • E.g., gamma of ATM option goes to infinity

Value at Risk (VaR)

Value at Risk

  • In late 1970s and 1980s, major financial institutions started work on internal models to measure and aggregate risks across institution
  • As firms became more complex, it was becoming more difficult but also more important to get a view of firmwide risks
  • Firms lacked the methodology to do so
    • Can't simply aggregate risks from sub-firm level

RiskMetrics

  • Bestknown system is RiskMetrics developed by JP Morgan
  • Supposedly developed by JPM staff to provide a '4:15' report to CEO, Dennis Weatherstone.
  • What is maximum likely trading loss over next day, over whole firm?
  • To develop this system, JPM used portfolio theory, but implementation issues were very difficult
  • Staff had to

    • Choose measurement conventions
    • Construct data sets
    • Agree statistical assumptions
    • Agree procedures to estimate volatilities and correlations
    • Set up computer systems for estimation, etc.
  • Main elements of system working by around 1990

  • Then decided to use the '4:15' report

    • A one-day, one-page summary of the bank's market risk to be delivered to the CEO in the late afternoon (hence the ''4:15'')
  • Found that it worked well

  • Sensitised senior management to risk-expected return tradeoffs, etc.

  • New system publicly launched in 1993 and attracted a lot of interest

  • Other firms working on their systems

    • Some based on historical simulation, Monte Carlo simulation, etc.
  • JPM decided to make a lower-grade version of its system publicly available

  • This was RiskMetrics system launched in Oct 1994

    • Stimulated healthy debate on pros/cons of RiskMetrics, VaR, etc.
  • Subsequent development of other VaR systems, applications to credit, liquidity, op risks, etc.

Portfolio theory and VaR

  • PT interprets risk as std of portfolio return, VaR interprets it as maximum likely loss

    • VaR notion of risk more intuitive
  • PT assumes returns are normal or near normal, whilst VaR systems can accommodate wider range of distributions

  • VaR approaches can be applied to a wider range of problems

    • PT has difficulty applying to non-market risks, whereas VaR applies more easily to them
  • VaR systems not all based on portfolio theory

    • Variance covariance systems are; others are not

Attractions of VaR

  • VaR provides a single summary measure of possible portfolio losses

  • VaR provides a common consistent measure of risk across different positions and risk factors

  • VaR takes account of correlations between risk factors

Uses of VaR

  • Can be used to set overall firm risk target

  • Can use it to determine capital allocation

  • Can provide a more consistent, integrated treatment of different risks

  • Can be useful for reporting and disclosing

  • Can be used to guide investment, hedging, trading and risk management decisions

  • Can be used for remuneration purposes

  • Can be applied to credit, liquidity and op risks

Criticisms of VaR

  • VaR was warmly embraced by most practitioners, but not by all
  • Concern with the statistical and other assumptions underlying VaR models
    • Hoppe and Taleb
  • Concern with imprecision of VaR estimates
    • Beder
  • Concern with implementation risk
    • Marshall and Siegel
  • Concern with risk endogeneity
    • Traders might 'game' VaR systems
    • Write deep out-of-the-money options
  • Uses of VaR as a regulatory constraint might destabilise financial system or obstruct good practice
  • Concern that VaR might not be best risk measure
    • Limits of VaR, development of coherent risk measures

Advanced Risk Models: Univariate

Value-at-Risk (VaR)

  • VaR is the maximum loss over a target horizon such that there is a low, prespecified probability that actual loss will be larger.
  • Formula

    or,

VaR Parameters: The Confidence Level (cl)

  • VaR rises with confidence level (cl) at an increasing rate

VaR Parameters: The Holding Period (hp)

  • If mean is zero, VaR rises with square root of holding period
  • 'Square root rule':
    • Daily innovations must be i.i.d. standard normal (why?)
  • Empirical plausibility of SRR very doubtful (why?)

VaR Surface

  • VaR surface is much more revealing
  • A single VaR number is just a point on the surface – doesn't tell much
  • With zero mean, get spike at high cl and high hp

Determine the Confidence Level

  • High cl if we want to use VaR to set capital requirements

    • E.g., 99% under Basel
  • Lower if we want to use VaR

    • to set position limits
    • for reporting/disclosure
    • for backtesting (we will discuss it later)

Determine the Holding Period

  • Depends on investment/reporting horizons

  • Daily common for cap market institutions

  • 10 days (or 2 weeks) for banks under Basel

  • Can depend on liquidity of market – hp should be equal to liquidation period

  • Short hp makes it easier to justify assumption of unchanging portfolio

  • Short hp preferable for model validation/backtesting requirements

    • Short hp means more data to use

算例:Parametric VaR — Two-Asset Portfolio

Given: A portfolio consists of $600,000 in Stock A and $400,000 in Stock B.

Parameter Stock A Stock B
Daily expected return 0.05% 0.03%
Daily volatility 2.0% 1.0%
Correlation 0.40

Step 1: Portfolio daily expected return and volatility

Step 2: 1-day 99% VaR ()

Step 3: Diversification benefit

Undiversified VaR:

Diversification reduces VaR by 16.4%, driven by imperfect correlation ().

算例:Historical Simulation VaR

Given: 500 daily portfolio P&L observations (sorted, worst to best, in $thousands):

Rank -25 -24 -23 ... -6 -5
P&L ...

99% VaR: The 5th worst loss out of 500 observations (99% → ).

95% VaR: The 25th worst loss out of 500 ().

Interpretation: With 99% confidence, the portfolio will not lose more than $27,100 in one day.

Limitation of VaR

  • VaR estimates subject to error

  • VaR models subject to (considerable!) model risk

    • Beder
  • VaR systems subject to implementation risk

    • Marshal and Siegel
  • But these problems common to all risk measurement systems

VaR uninformative of tail losses

  • VaR tells us most we can lose at a certain probability, i.e., if tail event does not occur

  • VaR does not tell us anything about what might happen if tail event does occur

  • Trader can spike firm by selling out of the money options

    • Usually makes a small profit, occasionally a very large loss
    • If prob of loss low enough, such a position appears to have no risk, but can be very dangerous
  • Two positions with equal VaRs not necessarily equally risky, because tail events might be very different

  • Solution to use more VaR information – estimate VaR at higher cl

VaR creates perverse incentives

  • VaR-based decision calculus can be misleading, because it ignores low-prob, high-impact events

    • Events with probs less than VaR tail prob ignored
  • Additional problems if VaR is used in a decentralized system

  • VaR-constrained traders/managers have incentives to 'game' the VaR constraint

    • Sell out of the money options, etc.

VaR can discourage diversification

  • VaR of diversified portfolio can be larger than VaR of undiversified one

  • Example

    • 100 possible states, 100 assets, each making a profit in 99 (different) states, and a high loss in one
    • Diversified portfolio is certain to experience a high loss
    • Undiversified portfolio is not
    • Hence, VaR of diversified portfolio higher than VaR of undiversified one

VaR not subadditive

  • A risk measure is subadditive if risk of sum is not greater than sum of risks

  • Aggregating individual risks does not increase overall risk

  • Important because: Adding risks together gives conservative (over-) estimate of portfolio risk – want bias to be conservative

  • If risks not subadditive and VaR used to measure risk

    • Firms tempted to break themselves up to release capital
    • Traders on organized exchanges tempted to break up accounts
    • In both cases, problem of residual risk exposure
  • Subadditivity is highly desirable

  • But VaR is only subadditive if risks are normal or elliptical

  • VaR not subadditive for arbitrary distributions

案例:2008金融危机中的VaR失效

背景: 2008年金融危机期间,多家大型金融机构的VaR模型严重低估了实际风险暴露。

Lehman Brothers(2008年8月):

  • 报告的日VaR约为$7,300万(99%, 1天)
  • 但实际交易损失经常大幅超过VaR限额
  • 2008年Q1, Lehman报告的VaR甚至在下降,而同期其持有的MBS和CDO头寸亏损正在加速

根本原因分析:

VaR假设 实际情况
资产回报近似正态 尾部极厚,极端事件频率远高于正态假设
历史波动率/相关性稳定 波动率急剧上升,相关性趋近于1
市场流动性恒定 危机中流动性蒸发,买卖价差扩大10-100倍
分散化有效 所有风险资产同步下跌,分散化失效

关键教训:

  • VaR仅度量"正常市场条件下的最大损失",对极端尾部事件无能为力
  • 依赖历史数据估计参数,在结构性断裂(regime change)时失效
  • VaR必须与压力测试和CVaR配合使用,不能作为唯一风险度量工具
  • 这一认识直接推动了Basel III将资本计量基准从VaR转向Expected Shortfall

案例:Basel监管框架演进 — 从VaR到ES

Basel II.5(2009年修订):

  • 引入压力VaR(Stressed VaR):基于250天连续压力期计算
  • 新增增量风险资本(IRC):覆盖违约和迁移风险,99.9%置信,1年期

Basel III / FRTB(Fundamental Review of the Trading Book, 2016/2019最终版):

维度 旧框架(Basel II) 新框架(FRTB)
风险度量 VaR (99%, 10-day) ES (97.5%, 12-month stressed)
资本计算 标准法或内部模型法 标准法(SA)或内部模型法(IMA)
交易台层级 全行汇总 必须在交易台层面验证和计算
非流动性风险 未显式处理 引入流动性期限调整(LR)
尾部风险 VaR不关注尾部 ES关注尾部平均损失
回测 基于VaR例外 基于ES的"超越比率"(exceedance)检验

为什么从VaR转向ES?

  • ES是相容风险度量(满足次可加性),VaR不是
  • ES更全面地捕捉尾部风险
  • 2008年危机表明VaR系统性低估尾部损失

Coherent Measure of Risk

Coherent Risk Measures

  • Let and be future values of two risky positions. A coherent measure of risk should satisfy the following axioms

    • Monotonicity: if ,
    • Translation invariance:
    • Homogeneity:
    • Subadditivity:
  • Homogeneity and Monotonicity imply convexity, which is important

  • Translation invariance means that adding a sure amount to our end-period portfolio will reduce loss by amount added

Implications of coherence

  • Any coherent risk measure is the maximum loss on a set of generalized scenarios
    • GS is set of loss values and associated probs
  • Maximum loss from a subset of scenarios is coherent
  • Outcomes of stress tests are coherent
    • Coherence theory provides a theoretical justification for stress testing!
  • Coherence risk measures can be mapped to user's risk preferences
  • Each coherent measure has a weighting function that weights loss values
  • can be linked to utility function (risk preferences)
  • Can choose a coherent measure to suit risk preferences

Alternative Measures of Risk: CVaR

  • The Conditional VaR (CVaR) is the expected loss, given a loss exceeding VaR

  • it is also called expected shortfall, tail conditional expectation, conditional loss, or expected tail loss

  • VaR tells us the most we can lose if a tail event does not occur, CVaR tells us the amount we expect to lose if a tail event does occur

  • CVaR is coherent

算例:CVaR vs. VaR

Given: A portfolio with normally distributed returns, daily, daily.

99% VaR (1-day):

99% CVaR (for normal distribution):

where and .

Measure Value (in % of portfolio)
99% VaR 3.409
99% CVaR 3.925
CVaR/VaR ratio 1.15

CVaR exceeds VaR by approximately 15%, reflecting the expected severity of tail losses.

CVaR is better than VaR

  • Tell us what to expect in bad states
    • VaR tells us nothing
  • CVaR-based decision rule valid under more general conditions than a VaR-based one
  • CVaR coherent, and therefore always subadditive
  • CVaR does not discourage risk diversification, VaR sometimes does
  • CVaR-based risk surface always convex
    • Convexity ensures that portfolio optimization has a unique well-behaved optimum
    • Convexity also enables problems to be solved easily using linear programming methods

Alternative Measures of Risk: Worst-case scenario analysis

  • This is the outcome of a worst-case scenario analysis (Boudoukh et al)
  • Can consider as high percentile of distribution of losses exceeding VaR
    • Whereas CVaR is expected value of this distribution
  • WCSA is also coherent, produces risk measures bigger than CVaR

Alternative Measures of Risk: SPAN

  • Standard-Portfolio Analysis Risk (SPAN, CME)

  • Considers 14 scenarios (moderate/large changes in vol, changes in price) + 2 extreme scenarios

  • Positions revalued under each scenario, and the risk measure is the maximum loss under the first 14 scenarios plus 35% of the loss under the two extreme scenarios

  • SPAN risk measure can be interpreted as maximum of expected loss under each of 16 probability measures, and is therefore coherent

Alternative Measures of Risk: Other Methods

  • The Semistandard Deviation

  • The Drawdown

  • Try to verify whether the following popular risk measures are coherent measures of risk or not

    • Standard deviation
    • VaR
    • CVaR
    • WCSA
    • SPAN

Stress-Testing

What are stress tests?

  • STs are procedures that gauge vulnerability to 'what if' events

    • Used for a long time, especially to gauge institutions IR exposure
  • Early stress tests 'back of envelope' exercises

  • Major improvements in recent years, helped by developments in computer power

  • Modern ST much more sophisticated than predecessors

Modern stress testing

  • ST received a major boost after 1998
  • Realisation that it would have helped firms to protect themselves better in '98 crisis'
  • ST very good for quantifying possible losses in non-normal situations
    • Breakdowns in normal correlation relationships
    • Sudden decreases in market liquidity
    • Handling concentration risks
    • Handling macroeconomic risks
  • STs versus probabilistic approaches
    • STs are a natural complement to probabilistic approaches
    • VaR and CVaR good on prob side, but poor on 'what if'
    • STs poor on prob side, but good on 'what if'
    • STs not designed to answer prob questions

Categories of STs

  • Different types of event (normal, extreme, etc.)
  • Different types of risk (market, credit, etc.) and risk factors (equity risks, yield curve, etc.)
  • Different country/region factors
  • Different methodologies (scenario analysis, factor push, etc.)
  • Different assumptions (relating to yields, stock markets, etc.)
  • Different books (trading, banking)
  • Differences in level of test (firmwide, business unit, etc.)
  • Different instruments (equities, futures, options, etc.)
  • Differences in complexity of portfolio

Main types of stress test

  • Scenario ('what if') analysis

    • Analyse impact of specific scenario on our financial position
  • Mechanical stress tests

    • Evaluate a number of math/stat defined scenarios (e.g., defined in terms of movements in underlying risk factors)
    • Work through these in a mechanical way

Uses of stress tests

  • Can provide good risk information

    • Good for communicating risk information
    • Results from STs readily understandable
    • But must avoid swamping recipients with too much data
  • Can guide decision making

    • Allocation of capital, setting of limits, etc.
  • ST can help firms to design systems to deal with bad events

    • Can check on modelling assumptions, contingency planning, etc.

Benefits of stress testing

  • Good for showing hidden vulnerability
  • Can improve on VaR type information in various ways
    • Since stress events unlikely, VaR systems unlikely to reveal much about them
    • Short VaR holding period will often be too short to reveal full impact of stress event
    • Stress tests better at handling non-linearities that arise in stress situations
    • Stress test can take account of unusual features in a stress scenario
  • Can more easily identify a firm's breaking point
    • Identify lethal conjunctions of events, etc.
  • Can give a clearer view about dangerous scenarios
    • This helps in deciding they might be handled
  • Stress tests particularly good for identifying and handling liquidity exposures
    • Loss of liquidity a key feature of crises
  • Stress tests also very good for handling large market moves
    • Market crashes, etc
  • Stress tests good for handling possible changes in market volatility
    • Help identify possible risks in overall risk exposure, gains/losses on derivatives positions, etc
  • Stress tests for good for handling exposure to correlation changes
    • These very significant in major crises
  • Stress tests good for highlighting hidden weaknesses in risk management system
    • Identify hidden 'hot spots' in portfolio

Difficulties with STs

  • Much less straightforward than it looks
  • Based on large numbers of decisions about
    • choice of scenarios and risk factors
    • how risk factors should be combined
    • Ranges of values to be considered
    • Choice of time horizon, etc.
  • ST results completely dependent on chosen scenarios and how they are modelled
    • When portfolios are complex, can be very difficult even to identify the risk factors to look at
  • Difficulty of working through scenarios in a consistent way

    • Need to follow through scenarios, and consequences can be very complex and can easily become unmanageable

    • Need to take account of interrelationships in a reasonable way

      • Need to account for correlations
    • Need to take account of zero arbitrage relationships

      • Eliminate 'impossible' events?

ST & Prob Analysis

  • Interpretation of ST results

    • STs do not address prob issues as such

    • Hence, always an issue of how to interpret results

    • This implies some informal notion of likelihood

      • If prob is negligible, result of ST might have little bearing
      • If prob is significant, result might be important
  • Integrating ST and prob analysis

    • Can integrate ST and prob analysis using Berkowitz's coherent framework

      • Do prob analysis – VaRs at different cls
      • Do ST analysis
      • Assign probs to ST scenarios
      • Integrate these outcomes into prob analysis
    • This approach is judgemental, but does give an integrated analysis of both probs and STs

Choosing scenarios

  • moving key variables one at a time

  • using historical scenarios

  • creating prospective scenarios

  • reverse stress tests

Stylised Scenarios

  • Simulated movement in major IRs, stock prices, etc.

    • Can be expressed in absolute changes or multiple-of-std changes
  • Long been used in ALM analysis

  • Problem is to keep scenarios down in number, without missing plausible important ones

Actual Historical Scenarios

  • Based on actual historical events

  • Advantages

    • They have plausibility because they have occurred
    • Readily understood
  • Can choose scenarios from a catalogue, which might include

    • Moderate market changes

    • More extreme events such as reruns of major crashes

      • A good guide is to choose scenarios that are of same order of magnitude as worst-case events in our data sets

Hypothetical One-Off Events

  • These might be natural, political, legal, major defaults, counterparty defaults, economic crises, etc.

  • Can obtain them by alternate history exercises

    • Rerun historical scenarios and ask what might have been
  • Can also look to historical record to guide us in working out what such scenarios might look like

Evaluating scenarios

  • Having specified our set of scenarios, we then need to evaluate their effects

  • Key is to get an understanding of the sensitivities of our positions to changes in the risk factors being changed

  • Easy for some positions

    • E.g., linear FX, stock, futures positions
  • Harder for options

    • Must account for (changing) sensitivities to underlying
    • Might need approximations, e.g., delta-gamma
  • Must pay particular attention to impact of stylised events on markets

  • Very unwise to assume that liquidity will remain strong in a crisis

  • If futures contracts are used hedges, must also take account of funding implications

  • Otherwise well-hedged positions can unravel because of interim funding

Mechanical ST

  • These approaches try to reduce subjectivity of SA and put ST on firmer foundation

    • Instead of using subjective scenarios, we use math/stat generated scenarios
    • Work through these in a systematic way
    • Work out most damaging conjunctions of events
  • Mechanical ST more systematic and thorough than SA, but also more intensive

Factor Push Analysis

  • Push each price or factor by a certain amount

    • Better to push factors than prices, due to varying sensitivities of prices to factors
  • Specify confidence level

  • Push factors up/down by

  • Revalue positions each time,

  • Work out most disadvantageous combination

  • This gives us worst-case maximum loss (ML)

  • FP is easy to program, at least for simple positions

  • Does not require very restrictive assumptions

  • Can be modified for correlations,

  • Results of FP are coherent

  • If we are prepared to make further assumptions, FP can also give us an indication of likelihoods

    • ML estimates tend to be conservative estimates of VaR
    • Can adjust the to obtain probs and VaRs
  • But FP rests on assumption that highest losses occur when factors move most, and this is not true for some positions

Maximum Loss Optimisation

  • Solution to this problem is to search over interim values of factor ranges

  • This is MLO

  • MLO is more intensive, but will pick up high losses that occur within factor ranges

    • E.g., as with losses on straddles
  • MLO is better for more complex positions, where it will uncover losses that FP might miss

Backtesting

Backtesting

  • Backtesting is the process to compare systematically the VaR forecasts with actual returns.

    • It should detect weaknesses in the models and point to areas for improvement
    • It is one of the reasons that bank regulators allow banks to use their own risk measures to determine the amount of regulatory capital
  • Backtesting compares the daily VaR forecast with the realized profit and loss (P&L) the next day.

    • It is recorded as an exception if the actual loss is worse than the VaR
    • The risk manager counts the number if exceptions over a window with observations.
  • Trading outcome

    • Actual portfolio
    • Hypothetical portfolio (no intraday trading, no fee income)
    • The Basel framework recommends using both hypothetical and actual trading outcomes in backtests

Preparing Data: Obtaining Data

  • Need to obtain suitable P/L data

  • Accounting (e.g., GAAP) data often inappropriate because of smoothing, prudence etc

  • Want P/L data that reflect market risks taken

    • Need to eliminate fee income, bid-ask profits, P/L from other risks taken (e.g., credit risks), unrealised P/L, provisions against future losses
  • Need to clean data or use hypothetical P/L data (obtained by revaluing periods from day to day)

Preparing Data: Draw up backtest chart

  • Good to plot P/L data against risk bounds over time
  • Chart good indicator of possible problems
    • over-estimation of risks
    • under-estimation of risks
    • bias
    • excessive smoothness in risk bounds, etc.

Preparing Data: Get to know data

  • Draw up summary statistics
    • Mean, std, skewness, kurtosis, min, max, etc.
  • Draw up QQ charts
    • Plots of predicted vs. empirical quantiles
  • Draw up charts of predicted vs. empirical probs
  • Shape of these curves indicates whether supposed pdf fits the data – very useful diagnostic

Preparing Data: Standardise Data

  • P/L data typically random

  • Portfolios and dfs often change from day to day

  • How to compare P/L data if underlying pdfs change?

  • Good practice to map P/L data to predicted percentile

    • If observation falls 90% percentile of predicted P/L distribution, then mapped value is 0.90
  • This standardizes data to make observations comparable given changes in pdf or portfolio

Measuring Exceptions

  • Binomial Distribution

    • Probability mass function

    • Mean: , Variance:
  • Example: For instance, we want to know what is the probability of observing exceptions out of a sample of observations when the true probability is 1%. We should expect to observe exceptions on average across many such samples. There will be, however, some samples with no exceptions at all simply due to luck. This probability is

So, we would expect to observe 8.1% of samples with zero exceptions under the null hypothesis. We can repeat this calculation with different values for . For example, the probability of observing eight exceptions is . Because this probability is so low, this outcome should raise questions as to whether the true probability is 1%.

  • Normal Approximation

  • Decision Rule for Backtests

    • Type 1 errors: kill the good guy
    • Type 2 errors: miss the bad guy
    • Power of a test is one minus the type 2 error rate
    • Most statistical tests fix the type 1 error rate, say at 5%, and structure the test so as to minimize the type 2 error rate, or maximize the test's power

Example

Consider a VaR measure over a daily horizon defined at the 99% level of confidence . The window for backtesting is days.

  • A higher cutoff point would lower the type 1 error rate
  • It is more likely to miss VaR models that are misspecified with higher cutoff points
Number of Exceptions Probability Cumulative Probability Type 1 Error Rate
0 0.0811 0.0811 100.00%
1 0.2047 0.2858 91.89%
2 0.2574 0.5432 71.42%
3 0.2150 0.7581 45.68%
4 0.1341 0.8922 24.19%
5 0.0666 0.9588 10.78%
6 0.0275 0.9863 04.12%
7 0.0097 0.9960 01.37%
8 0.0030 0.9989 00.40%
9 0.0008 0.9998 00.10%
10 0.0002 0.9999 00.03%

Basel Rule for Backtests (The Basel Penalty Zones)

Zone Number of Exceptions Potential Increase in
Green 0 to 4 0.00
Yellow 5 0.40
Yellow 6 0.50
Yellow 7 0.65
Yellow 8 0.75
Yellow 9 0.85
Red 1.00
  • The normal multiplier for capital charge is 3
  • After an incursion into the yellow zone is increased to according to the table
  • An incursion into the red zone generates an automatic, non-discretionary penalty

If the number of exceptions falls within the yellow zone, the supervisor has discretion to apply a penalty, depending on the causes for the exceptions. The Basel Committee uses these categories:

  • Basic integrity of the model: The deviation occurred because the positions were incorrectly reported or because of an error in the program code. This is a very serious flaw. In this case, a penalty should apply and corrective action should be taken.
  • Deficient model accuracy: The deviation occurred because the model does not measure risk with enough precision (e.g., does not have enough risk factors). This is a serious flaw, too. A penalty should apply and the model should be reviewed.
  • Intraday trading: Positions changed during the day. Here, a penalty “should be considered.” If the exception disappears with the hypothetical return, the problem is not in the bank's VAR model.
  • Bad luck: Markets were particularly volatile or correlations changed. These exceptions “should be expected to occur at least some of the time” and may not suggest a deficiency of the model but simply bad luck.

Evaluation of Backtesting

  • Exception tests focus only on the frequency of occurrences

  • It ignores the time pattern of losses

课堂练习

假设我们采用100天的数据来对VaR进行回溯测试,VaR所采用的置信水平为99%,在100天的数据中我们观察到5次例外。如果我们因此而拒绝该VaR模型,犯第一类错误的概率为多少?

课堂练习

A risk management team reports the following VaR backtesting results for a trading desk over 500 days at 99% confidence: 12 exceptions were observed.

(a) Under the null hypothesis that the VaR model is correct, what is the expected number of exceptions?
(b) Using the normal approximation, compute the test statistic. Would you reject the model at the 5% significance level? ()
(c) According to the Basel traffic-light system, which zone does this result fall in, and what is the regulatory consequence?

课堂练习

Two VaR models are being compared for a portfolio:

Model A Model B
VaR confidence 99% 99%
Backtest window 250 days 250 days
Exceptions observed 1 7
Average loss beyond VaR $2.5M $1.2M

(a) Using the binomial distribution, which model is more likely to be rejected at the 5% significance level?
(b) Despite having fewer exceptions, why might Model A actually be worse?
(c) What additional test could help distinguish the two models?

End of L02

Key takeaways:

  • VaR provides a single summary measure of portfolio risk but has fundamental limitations (not subadditive, uninformative of tail losses)
  • CVaR/Expected Shortfall is a coherent alternative that captures tail risk
  • Stress testing complements probabilistic risk measures with scenario-based analysis
  • Backtesting validates VaR models through exception counting, with the Basel traffic-light framework
  • The industry is shifting from VaR to ES as the primary regulatory risk measure (FRTB)

Next lecture: L03 — Volatility Modeling & Multivariate Risk Models (GARCH, copula, EVT, VaR methods)