Lecture 03

Deep Learning Algorithms in Finance

“Neural networks are universal function approximators, but their value depends on how we use that flexibility.”

Financial Machine Learning · Lecture 03

Outline

Internally, these 8 parts can be grouped as: Foundations (Parts 1–2), Architectures (Parts 3–4 & 7), Representation & Generative Models (Part 6), and Training & Applications (Part 8).
Financial Machine Learning · Lecture 03

Part 1 · Why Deep Learning in Finance?

Motivation

  • Shallow learning tools (L02) work well for tabular, low-to-moderate dimensional problems with limited nonlinearity.
  • However, modern finance often involves:
    • High-dimensional features (hundreds or thousands of signals, instruments, horizons).
    • Complex nonlinear interactions between variables.
    • Sequential dependence (time series, order flow, macro series).
    • Network structures (counterparty networks, ownership networks).
  • Deep learning provides a toolkit of architectures that can learn flexible functions and internal representations tailored to these structures.
  • Goal of this lecture: understand the core architectures (MLP, CNN, RNN/LSTM/GRU, pre-Transformer attention, VAE, GAN, GNN), their mathematical formulation, intuition, and when they might help in finance.
Financial Machine Learning · Lecture 03

Deep Learning as Representation Learning

  • A key idea behind deep learning is representation learning.
  • Instead of manually engineering features (e.g., pre-specified factors, ratios, technical indicators), deep networks learn multiple layers of representations from the raw or lightly processed inputs.
  • Each hidden layer transforms its input into a more abstract representation:

    $x \;\mapsto\; h^{(1)} \;\mapsto\; h^{(2)} \;\mapsto\; \cdots \;\mapsto\; \hat{y}$

  • For finance, this means that:
    • The network can discover nonlinear combinations of characteristics that are predictive for returns, default, or risk measures.
    • Intermediate layers can be interpreted as data-driven factors or market states.
  • In this lecture, MLPs, CNNs, RNNs, generative models and GNNs are all viewed as representation learners, each with its own inductive bias.
Financial Machine Learning · Lecture 03

Shallow vs Deep: A Finance-Oriented View

  • Shallow models approximate a relationship like $y \approx f(x)$ with a relatively simple $f$ (linear, low-order nonlinearity, piecewise splits, kernels).
  • Deep models construct $f$ as a composition of many layers:

    $f(x) = f^{(L)}\big(f^{(L-1)}(\cdots f^{(1)}(x))\big)$

  • Benefits in finance:
    • Representation learning: automatically extract latent factors or features (e.g., nonlinear risk factors) from raw inputs.

    • Scalability to high dimension: many parameters but trained with stochastic optimization and regularization.

    • Inductive biases:

      MLP | flexible for general tabular data
      CNN | local patterns and translation invariance (useful for structured signals like term structures or limit order books)
      RNN | sequential dependence in returns, volatility, or flows
      GNN | relationships on graphs (counterparty, supply-chain, ownership)
  • Deep learning is not a magic solution; it is another set of tools with trade-offs.
Financial Machine Learning · Lecture 03

When Might Deep Models Be Useful?

  • Problems that may benefit:
    • Highly nonlinear mapping from features to target (e.g., complex interaction among risk factors).
    • Large training datasets (long history, cross-section of many assets, rich microstructure data).
    • Sequential prediction with long memory (volatility clustering, order book dynamics).
    • Networked systems (contagion, systemic risk, linked entities).
  • Problems where shallow models are often enough:
    • Small samples, few predictors, strong theory-driven structure.
    • Tasks dominated by interpretability and regulatory transparency.
  • In empirical asset pricing, for example, deep networks can be seen as flexible nonparametric asset-pricing models, estimating

    $\mathbb{E}[r_{i,t+1} \mid x_{i,t}, z_t] = g(x_{i,t}, z_t),$

    where $x_{i,t}$ and $z_t$ may be high-dimensional characteristics and macro states.
  • We will treat architectures as a toolkit, not as competitors to theory.
Financial Machine Learning · Lecture 03

Part 2 · Multilayer Perceptrons (MLP)

Motivation

  • MLPs are the basic building block of deep learning.
  • They generalize linear regression and logistic regression by adding hidden layers and nonlinear activations.
  • Suitable for:
    • Tabular financial data: firm characteristics, macro variables, technical indicators.
    • Cross-sectional predictions: expected returns, default probabilities, recovery rates.
    • Approximating unknown pricing functions or utility functions.
  • Conceptually, MLPs implement $\hat{y} = f_\theta(x)$, where $f_\theta$ is a learned nonlinear mapping parameterized by weights and biases.
  • Before specialized architectures (CNN, RNN, GNN), many financial applications start with MLPs as a baseline deep model.
Financial Machine Learning · Lecture 03

Neural Network Foundations for This Lecture

  • All architectures in this lecture share a common set of foundations:
    • From perceptron to MLP: stacking linear units with nonlinear activations to model complex functions.
    • Activation functions: sigmoid, tanh, ReLU and its variants, controlling nonlinearity and gradient flow.
    • Loss functions: squared error for regression, cross-entropy for classification, plus regularization terms.
    • Backpropagation: efficient gradient computation via the chain rule on computation graphs.
    • Optimization and initialization: stochastic gradient methods and suitable initial weights to mitigate vanishing/exploding gradients.
  • Once these foundations are in place, different architectures mainly differ in how they connect units and what structure they impose on the data.
Financial Machine Learning · Lecture 03

A Brief History of Neural Networks

  • 1940s–1950s: Early neuron models (McCulloch–Pitts) and Hebbian learning provided the first abstraction of neurons and synapses.
  • 1958: Rosenblatt’s perceptron showed that simple networks can learn linear decision boundaries, raising optimism about artificial intelligence.
  • 1969: Minsky & Papert demonstrated that single-layer perceptrons cannot solve linearly inseparable problems such as XOR. This contributed to the first “AI winter”.
  • 1980s: The backpropagation algorithm for multilayer networks was popularized, showing that deeper networks can be trained by gradient descent.
  • 1990s–2000s: SVMs and other “shallow” methods dominated practical applications due to better theory and optimization.
  • 2010s: Deep learning resurged, powered by GPUs, large datasets, and improved architectures (CNN, RNN, residual networks), and is now mainstream in many domains including finance.
Financial Machine Learning · Lecture 03

The Perceptron and Its Limitation

  • A perceptron computes

    $\hat{y} = \mathbb{1}\{\, w^\top x + b > 0 \,\},$

    corresponding to a linear decision boundary in the input space.
  • Geometrically, it classifies points by which side of a hyperplane they lie on. If two classes are linearly separable, a perceptron can find such a hyperplane.
  • However, many relevant problems are not linearly separable. The classic example is the XOR function on two binary inputs:
    • XOR is 1 if exactly one of the inputs is 1, and 0 otherwise.
    • In 2D space, the positive and negative points cannot be separated by any straight line.
  • This limitation illustrates why single-layer networks are insufficient and motivates adding hidden layers with nonlinear activations.
Financial Machine Learning · Lecture 03

Solving XOR with a Small MLP

  • A two-layer MLP can implement XOR by introducing hidden units that form intermediate nonlinear features.
  • One construction:
    • Hidden unit 1 learns something similar to an OR pattern of the two inputs.
    • Hidden unit 2 learns something similar to an AND pattern.
    • The output neuron then combines these hidden activations to produce XOR.
  • Conceptually, the hidden layer performs a nonlinear transformation that makes the problem linearly separable in the hidden space.
  • This example illustrates the power of depth:
    • Shallow linear models cannot solve XOR.
    • A small network with one hidden layer can represent more complex decision regions, even with few parameters.
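
To make this construction concrete, here is a small sketch (not from the slides) with hand-set weights for a hypothetical 2-2-1 threshold network; one hidden unit acts roughly as OR, the other as AND, and their difference yields XOR:

```python
import numpy as np

def step(z):
    # Hard-threshold activation: 1 if z > 0, else 0.
    return (z > 0).astype(float)

# Hand-set weights for a 2-2-1 network (one of many possible constructions).
W1 = np.array([[1.0, 1.0],
               [1.0, 1.0]])      # each column feeds one hidden unit
b1 = np.array([-0.5, -1.5])      # unit 1 ~ OR(x1, x2), unit 2 ~ AND(x1, x2)
w2 = np.array([1.0, -1.0])       # output ~ OR minus AND = XOR
b2 = -0.5

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
h = step(X @ W1 + b1)            # hidden layer makes the problem linearly separable
y = step(h @ w2 + b2)
print(y)                         # [0. 1. 1. 0.] = XOR
```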
Financial Machine Learning · Lecture 03

Neuron, Layer, Network

  • Single neuron:

    $a = \sigma(w^\top x + b),$

    where $x$ is the input, $w$ the weights, $b$ the bias, and $\sigma$ the activation.
  • Common activations:
    • Sigmoid: $\sigma(z) = \dfrac{1}{1 + e^{-z}}$
    • Tanh: $\tanh(z) = \dfrac{e^{z} - e^{-z}}{e^{z} + e^{-z}}$
    • ReLU: $\max(0, z)$
  • Fully connected layer:

    $h = \sigma(Wx + b)$

  • MLP with $L$ layers:
    • Input $h^{(0)} = x$
    • Hidden layers $h^{(l)} = \sigma\big(W^{(l)} h^{(l-1)} + b^{(l)}\big), \; l = 1, \dots, L-1$
    • Output layer (e.g., regression): $\hat{y} = W^{(L)} h^{(L-1)} + b^{(L)}$
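
A minimal PyTorch sketch of such an MLP (the layer sizes and the dummy input below are illustrative, not from the lecture):

```python
import torch
import torch.nn as nn

class MLP(nn.Module):
    """Stack of affine + ReLU blocks followed by a linear output layer."""
    def __init__(self, in_dim, hidden_dims, out_dim=1):
        super().__init__()
        layers, prev = [], in_dim
        for h in hidden_dims:
            layers += [nn.Linear(prev, h), nn.ReLU()]
            prev = h
        layers.append(nn.Linear(prev, out_dim))   # output layer (regression)
        self.net = nn.Sequential(*layers)

    def forward(self, x):
        return self.net(x)

model = MLP(in_dim=94, hidden_dims=[64, 32, 16])   # e.g., 94 firm characteristics
x = torch.randn(8, 94)                             # dummy mini-batch
print(model(x).shape)                              # torch.Size([8, 1])
```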
Financial Machine Learning · Lecture 03

Application · Empirical Asset Pricing via ML (Gu, Kelly & Xiu, 2020, RFS)

  • Problem
    • Canonical question: how to predict the cross-section of next-period stock returns from a very large set of firm characteristics, and assess the economic value of such predictions.
  [Diagram] Tabular features $X_{i,t}$ → MLP / other ML → $\mathbb{E}[R_{i,t+1} \mid X_{i,t}]$ (expected returns) → portfolio sorting & performance.
  • Model / Algorithm
    • Compare linear OLS and regularized linear models (Ridge, Lasso, Elastic Net) with nonlinear ML, including random forests, gradient boosting, and deep feedforward neural networks (MLPs).
    • Deep MLPs: several fully connected layers with nonlinear activations, trained by stochastic gradient descent with regularization and early stopping.
Financial Machine Learning · Lecture 03
  • Key Results
    • Many ML methods—including deep nets—deliver substantially better out-of-sample performance than traditional linear factor models, both statistically and economically (Sharpe ratios, portfolio alphas).
    • Improvements are especially pronounced when using the full, large characteristic set instead of a few “hand-picked” factors.
  • Why suitable here
    • Data are high-dimensional tabular with complex, unknown nonlinear interactions; deep MLPs are universal function approximators designed for precisely this setting.
    • The paper provides a clean, large-sample benchmark for when deep and other ML methods pay off in empirical asset pricing.
Financial Machine Learning · Lecture 03

Sigmoid and Tanh: Saturating Activations

  • Two classical activation functions are:
    • Sigmoid: $\sigma(z) = \dfrac{1}{1 + e^{-z}}$, with output in $(0, 1)$.
    • Tanh: $\tanh(z) = \dfrac{e^{z} - e^{-z}}{e^{z} + e^{-z}}$, with output in $(-1, 1)$.
  • Advantages:
    • Smooth and differentiable everywhere.
    • Naturally map pre-activations to bounded ranges, which is useful when modeling probabilities or normalized signals.
  • However, for large $|z|$, both functions saturate:
    • Derivatives $\sigma'(z)$ and $\tanh'(z)$ become very small when $|z|$ is large.
    • During backpropagation, gradients multiplied through many such layers tend to vanish, making deep networks hard to train.
  • As a result, modern deep networks rarely use sigmoid/tanh in hidden layers, except in specific architectures such as LSTM gates.
Financial Machine Learning · Lecture 03

ReLU and Its Variants

  • The Rectified Linear Unit (ReLU) is defined as

    $\mathrm{ReLU}(z) = \max(0, z).$
  • Advantages:
    • Non-saturating for positive $z$, so gradients do not vanish as quickly as with sigmoid/tanh.
    • Computationally simple and works well in practice for many deep networks.
  • Drawbacks:
    • For $z < 0$, the derivative is zero, so neurons can become “dead” if they always receive negative inputs.
  • Common variants try to mitigate dead neurons:
    • Leaky ReLU: $\max(\alpha z, z)$ with a small slope $\alpha > 0$ (e.g., 0.01).
    • Parametric ReLU (PReLU): the negative slope is learned.
  • In deep MLPs and CNNs, ReLU and its variants are usually the default choice for hidden layers due to good optimization properties.
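
A quick numerical comparison of these activations, using PyTorch's built-in functions (input values chosen arbitrarily):

```python
import torch
import torch.nn.functional as F

z = torch.tensor([-2.0, -0.5, 0.0, 0.5, 2.0])
print(F.relu(z))                              # zeros out negative inputs
print(F.leaky_relu(z, negative_slope=0.01))   # keeps a small slope for z < 0
print(torch.sigmoid(z))                       # saturates toward 0 and 1 for large |z|
print(torch.tanh(z))                          # saturates toward -1 and 1
```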
Financial Machine Learning · Lecture 03

Beyond ReLU: Smooth and Self-Gated Activations

  • Several activation functions extend ReLU to improve smoothness or adaptivity:
    • ELU (Exponential Linear Unit): behaves like ReLU for positive $z$, but uses a smooth exponential form for negative values.
    • SELU (Scaled ELU): designed to encourage self-normalizing properties in certain architectures.
    • GELU (Gaussian Error Linear Unit): multiplies the input by a smooth gating function, $\mathrm{GELU}(z) = z\,\Phi(z)$ with $\Phi$ the standard normal CDF; often used in modern large models.
    • Swish / SiLU: $\mathrm{swish}(z) = z\,\sigma(z)$, where the gate is the sigmoid of the input itself.
  • These functions are smooth and often yield slightly better empirical performance than plain ReLU in some settings.
  • In finance applications, the exact choice among ReLU-like activations is usually less important than data quality, regularization, and model validation, but understanding their behavior helps diagnose training issues.
Financial Machine Learning · Lecture 03

Loss and Training Objective

  • For regression (e.g., return prediction):

    $\mathcal{L}(\theta) = \dfrac{1}{N}\sum_{i=1}^{N} \big(y_i - f_\theta(x_i)\big)^2$

  • For binary classification (e.g., default/no default):

    $\mathcal{L}(\theta) = -\dfrac{1}{N}\sum_{i=1}^{N} \big[y_i \log p_i + (1 - y_i)\log(1 - p_i)\big],$

    with $p_i = \sigma\big(f_\theta(x_i)\big)$.
  • Often add regularization:

    $\mathcal{L}_{\text{reg}}(\theta) = \mathcal{L}(\theta) + \lambda \lVert \theta \rVert_2^2$

  • Training problem:

    $\min_\theta \; \mathcal{L}_{\text{reg}}(\theta),$

    solved by (stochastic) gradient-based optimization (SGD, Adam, etc.).
  • This is conceptually similar to penalized regression (L02), but with a much richer model class.
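
A minimal sketch of this objective in PyTorch: a squared-error loss with the L2 penalty supplied through the optimizer's weight_decay argument (model size and data are placeholders):

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(20, 32), nn.ReLU(), nn.Linear(32, 1))
loss_fn = nn.MSELoss()                      # squared-error loss for regression
# weight_decay adds the L2 penalty lambda * ||theta||^2 to the update
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)

x, y = torch.randn(128, 20), torch.randn(128, 1)   # dummy batch
optimizer.zero_grad()
loss = loss_fn(model(x), y)
loss.backward()
optimizer.step()
```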

Financial Machine Learning · Lecture 03

Example models

MLPs can be used to perform classification and regression for many kinds of data. We give some examples below.

Try it for yourself via: https://playground.tensorflow.org

Financial Machine Learning · Lecture 03

Intuition and Universal Approximation

  • An MLP effectively performs learned feature transformations layer by layer:

    $x \;\mapsto\; h^{(1)} \;\mapsto\; h^{(2)} \;\mapsto\; \cdots \;\mapsto\; \hat{y}, \qquad h^{(l)} = \sigma\big(W^{(l)} h^{(l-1)} + b^{(l)}\big)$

  • Hidden units can be viewed as learned basis functions:
    • In linear regression with basis expansion we choose basis functions manually.
    • In MLP, basis functions are learned from data.
  • Universal Approximation Theorem (informal):
    • A feedforward network with one hidden layer and sufficient units, using a non-polynomial activation (e.g., sigmoid, ReLU), can approximate any continuous function on a compact set to arbitrary accuracy.
    • Deeper networks can approximate some functions more parameter-efficiently than shallow, very-wide ones.
  • For finance:
    • Think of MLPs as flexible nonparametric estimators of conditional expectations or pricing kernels, with internal structure that may align (or not) with economic theory.
Financial Machine Learning · Lecture 03

The "deep learning revolution"

  • Some success stories for DNNs:
    • Automatic speech recognition (ASR).
    • The ImageNet image classification benchmark: the error rate fell from 26% to 16% in a single year.
  • The “explosion” in the usage of DNNs was driven by:
    • The availability of cheap GPUs (graphics processing units).
    • The growth of large labeled datasets.
    • High-quality open-source software libraries for DNNs.
Financial Machine Learning · Lecture 03

Connections with biology

  • McCulloch–Pitts model of the neuron (1943): $y = \mathbb{1}\{\textstyle\sum_i w_i x_i \geq \theta\}$, where
    • $x_i$ are the inputs,
    • $w_i$ the strengths of the incoming connections,
    • $\sum_i w_i x_i$ the weighted sum of the inputs (dendrites),
    • $\theta$ the threshold (action potential).
  • We can combine multiple such neurons to form artificial neural networks (ANNs).

Financial Machine Learning · Lecture 03
  • ANNs differ from biological brains in many ways, including the following:
  • Most ANNs use backpropagation to modify the strength of their connections, while real brains do not use backprop:
    • There is no way to send information backwards along an axon.
    • Instead, brains use local update rules for adjusting synaptic strengths.
  • Most ANNs are strictly feedforward, but real brains have many feedback connections.
    • It is believed that this feedback acts like a prior.
  • Most ANNs use simplified neurons consisting of a weighted sum passed through a nonlinearity, but real biological neurons have complex dendritic tree structures with rich spatio-temporal dynamics.
  • Most ANNs are smaller in size and number of connections than biological brains.
  • Most ANNs are designed to model a single function, while biological brains are complex systems that implement many different kinds of functions and behaviors.
Financial Machine Learning · Lecture 03

Training Procedure (Backpropagation): High-Level Steps

  1. Initialization
    • Initialize weights (e.g., Xavier/He initialization).
  2. Forward pass
    • For each layer $l = 1, \dots, L$:
      • Compute pre-activation $z^{(l)} = W^{(l)} a^{(l-1)} + b^{(l)}$.
      • Compute activation $a^{(l)} = \sigma^{(l)}(z^{(l)})$.
    • Obtain prediction $\hat{y} = a^{(L)}$ and loss $\mathcal{L}(\hat{y}, y)$.
  3. Backward pass (backpropagation)
    • Compute the gradient of the loss at the output, $\delta^{(L)} = \nabla_{z^{(L)}} \mathcal{L}$.
    • For layers $l = L-1, \dots, 1$: $\delta^{(l)} = \big(W^{(l+1)\top} \delta^{(l+1)}\big) \odot \sigma'^{(l)}(z^{(l)})$.
    • Compute gradients $\nabla_{W^{(l)}} \mathcal{L} = \delta^{(l)}\, a^{(l-1)\top}$ and $\nabla_{b^{(l)}} \mathcal{L} = \delta^{(l)}$.
  4. Parameter update
    • Use optimizer (e.g., SGD, Adam) to update all parameters.
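
The four steps above map directly onto a standard PyTorch training loop; a sketch in which the model, data loader, and hyperparameters are placeholders:

```python
import torch

def train(model, loader, loss_fn, epochs=10, lr=1e-3):
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)   # step 1: weights initialized by the nn layers
    for _ in range(epochs):
        for x, y in loader:
            optimizer.zero_grad()      # clear gradients from the previous step
            y_hat = model(x)           # step 2: forward pass
            loss = loss_fn(y_hat, y)
            loss.backward()            # step 3: backward pass (backpropagation)
            optimizer.step()           # step 4: parameter update
    return model
```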
Financial Machine Learning · Lecture 03

Computation Graph View of Backprop

  • A neural network can be viewed as a directed acyclic graph (computation graph):
    • Nodes represent intermediate variables (e.g., pre-activations $z^{(l)}$, activations $a^{(l)}$).
    • Edges represent elementary operations such as addition, multiplication, or applying an activation function.
  • During the forward pass, we:
    • Traverse the graph from inputs to outputs.
    • Compute and store intermediate values at each node.
  • During the backward pass, we:
    • Traverse the graph in reverse topological order.
    • Use the chain rule to compute gradients of the loss with respect to each node.
    • For a node with multiple outgoing edges, its gradient is the sum of contributions from all downstream paths.
  • Modern deep learning frameworks (PyTorch, TensorFlow) implement this as automatic differentiation, building and traversing computation graphs automatically.
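
A tiny autodiff example in PyTorch: the forward pass builds the computation graph, and .backward() traverses it in reverse; because z feeds two downstream operations, its gradient sums the two contributions (values chosen arbitrarily):

```python
import torch

x = torch.tensor(2.0, requires_grad=True)
w = torch.tensor(3.0, requires_grad=True)

z = w * x            # multiplication node
y = z ** 2 + z       # z has two outgoing edges; their gradients are summed

y.backward()         # reverse traversal of the graph (chain rule)
print(x.grad)        # dy/dx = (2z + 1) * w = 13 * 3 = 39
print(w.grad)        # dy/dw = (2z + 1) * x = 13 * 2 = 26
```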
Financial Machine Learning · Lecture 03

Pros, Cons, and Finance Use Cases

Pros vs Cons

Aspect | Pros | Cons / Risks
Flexibility | Universal approximator, rich nonlinearities | Easy to overfit with small samples
Data format | Works well on tabular, cross-sectional data | No built-in inductive bias for sequence/graph
Optimization | SGD-based training scales to large datasets | Nonconvex; local minima, saddle points
Interpretability | Can embed economic constraints via architecture | Harder to explain than linear models / trees
  • Typical finance applications:
    • Cross-sectional asset pricing: predicting expected returns from many characteristics and macro signals.
    • Credit risk: nonlinear default probability models.
    • Risk management: mapping many risk factors into portfolio loss distributions.
  • In practice, MLPs are often the baseline deep model before trying more specialized architectures.
Financial Machine Learning · Lecture 03

Summary

  • MLPs are the canonical deep learning architecture for structured tabular data.
  • They generalize linear models by stacking multiple affine transformations and nonlinear activations.
  • The training objective combines a data-fitting loss (squared error, cross-entropy) with regularization, optimized by stochastic gradient methods.
  • Intuitively, hidden layers implement learned basis expansions and representation learning.
  • In finance, MLPs are useful for high-dimensional, nonlinear prediction problems, especially when we have relatively large datasets.
  • Limitations include potential overfitting, lower interpretability, and lack of inductive bias for sequences or networks, which motivates CNNs, RNNs, and GNNs.
  [Diagram] Firm characteristics + macro variables → MLP → predicted expected returns → portfolio sorting.
Financial Machine Learning · Lecture 03

Application · Autoencoder Asset Pricing Models (Gu, Kelly & Xiu, 2021, JoE)

  • Problem
    • Construct an asset pricing factor model without assuming linear factors or linear loadings: learn latent market states and their (possibly nonlinear) effects on a large cross-section of returns.
  [Diagram] Input returns $R_t$ → encoder (compression) → latent factors $z_t$ (nonlinear factors) → decoder (reconstruction) → reconstructed $\hat{R}_t$.
  • Model / Algorithm
    • Use a deep autoencoder on panels of returns:
      • Encoder: maps the high-dimensional asset returns $R_t$ into low-dimensional latent factors $z_t$.
      • Decoder: reconstructs $\hat{R}_t$ from $z_t$, allowing nonlinear, asset-specific loading functions.
    • Train by minimizing reconstruction error with regularization; compare to PCA and linear factor models.
Financial Machine Learning · Lecture 03
  • Key Results
    • Autoencoder factors explain a large share of return covariance and pricing errors, and can capture structures that linear PCA-based factors miss.
    • The learned latent factors can be linked a posteriori to economically interpretable patterns, but are not constrained to standard linear forms.
  • Why suitable here
    • The task is essentially nonlinear dimension reduction of returns → canonical use case for autoencoders.
    • Autoencoders generalize PCA: treat traditional linear factor models as a special case, and provide a natural bridge to VAEs and other generative latent-variable models.
Financial Machine Learning · Lecture 03

Application · Deep Learning in Asset Pricing (Chen, Pelger & Zhu, 2023, MS)

  • Problem
    • Develop a unified framework where deep learning models the stochastic discount factor (SDF) / expected returns from many characteristics, and study when/why deep architectures outperform classical models.
  [Diagram] Firm characteristics + macro factors → deep neural network (estimation) → SDF $M_{t+1}$ / expected returns → pricing errors / moment conditions.
  • Model / Algorithm
    • Parameterize the SDF or conditional mean as a deep network of firm characteristics and macro variables, mainly using multi-layer MLPs with carefully chosen depth/width and regularization.
    • Impose economically motivated constraints (e.g., no-arbitrage) and use cross-sectional moment conditions to train the network.
Financial Machine Learning · Lecture 03
  • Key Results
    • Deep SDFs substantially improve pricing accuracy and out-of-sample performance relative to traditional linear factor models and simpler ML.
    • Learned “factors” and exposures can be related back to known characteristics, providing economic interpretation despite the black-box architecture.
  • Why suitable here
    • Asset pricing with many characteristics and nonlinear risk-return tradeoffs is exactly the regime where deep MLPs provide flexible, high-capacity function classes.
    • The paper shows how to embed deep nets in a structural SDF / moment condition framework, complementing purely predictive uses of ML.
Financial Machine Learning · Lecture 03

Part 3 · Convolutional Neural Networks (CNN)

Motivation

  • Many financial inputs are ordered or grid-like: yield curves across maturities, limit order books across price levels, time series of features across short horizons.
  • A fully connected MLP ignores this structure and treats all input dimensions as exchangeable.
  • CNNs introduce an architectural inductive bias:
    • Local connectivity: each unit looks only at a local neighborhood.
    • Weight sharing: the same filter is applied across positions.
    • As a result, CNNs specialize in learning local patterns and translation-invariant features.
  • For finance, this makes CNNs natural for:
    • Detecting local shapes in term structures or volatility surfaces.
    • Extracting patterns from high-frequency signals or stacked microstructure features.
  • We will mainly focus on 1D convolutions suitable for time-series–like financial data.
Financial Machine Learning · Lecture 03

1D Convolution Formulation

  • Consider a 1D input sequence $x = (x_1, \dots, x_T)$ (e.g., returns or features over time).
  • A 1D convolution with filter $w \in \mathbb{R}^k$ and bias $b$ computes, for valid positions $t$:

    $h_t = \sum_{j=1}^{k} w_j\, x_{t+j-1} + b$

  • With $M$ filters, we get $M$ feature maps:

    $h_t^{(m)} = \sigma\Big(\sum_{j=1}^{k} w_j^{(m)} x_{t+j-1} + b^{(m)}\Big), \quad m = 1, \dots, M$

  • A CNN layer:
    • Convolution (or cross-correlation in implementations).
    • Nonlinearity (e.g., ReLU).
    • Optionally pooling (max/average over small windows) to aggregate and downsample.
  • Key difference vs MLP: the same filter is shared across all positions $t$.
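
A sketch of this layer using nn.Conv1d (the channel counts and kernel width are illustrative assumptions):

```python
import torch
import torch.nn as nn

T, C_in = 60, 4                       # 60 time steps, 4 input channels (features)
x = torch.randn(1, C_in, T)           # Conv1d expects (batch, channels, length)

conv = nn.Conv1d(in_channels=C_in, out_channels=8, kernel_size=5)   # 8 learned filters
h = torch.relu(conv(x))
print(h.shape)                        # torch.Size([1, 8, 56]): 8 feature maps of length T - k + 1
```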
Financial Machine Learning · Lecture 03

1D Convolution as Filtering

  • A 1D convolution layer can be interpreted as applying a digital filter to a time series or ordered sequence.
  • Example filters:
    • A moving average filter smooths local fluctuations and emphasizes low-frequency trends.
    • Difference filters (first or second differences) highlight changes and curvature, acting like discrete derivatives.
  • In a CNN, filter weights are learned from data rather than hand-designed:
    • Some filters may become similar to trend detectors.
    • Others may detect sharp jumps, volatility bursts, or local patterns in spreads.
  • This “learned filtering” perspective is useful when thinking about financial time series:
    • Convolutional layers can automatically learn to extract trend, seasonality, and anomaly patterns that are predictive for returns or risk.
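
To make the filtering interpretation concrete, a small sketch with hand-designed filters applied via np.convolve (a CNN would instead learn such weights from data; the price path is synthetic):

```python
import numpy as np

prices = 100.0 + np.cumsum(np.random.randn(250))   # synthetic price path

ma_filter = np.ones(5) / 5.0         # 5-period moving average: smoothing / trend
diff_filter = np.array([1.0, -1.0])  # first difference: change detector

trend = np.convolve(prices, ma_filter, mode="valid")
changes = np.convolve(prices, diff_filter, mode="valid")
print(trend.shape, changes.shape)    # (246,) (249,)
```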
Financial Machine Learning · Lecture 03

Stride, Padding, and Convolution Types

  • Stride controls how far we move the filter each step:
    • Stride 1: evaluate at every position.
    • Stride $s > 1$: downsample the sequence by evaluating only every $s$-th position, reducing output length.
  • Padding controls how we treat boundaries:
    • “Valid” convolution: no padding; output is shorter than input.
    • “Same” convolution: pad input so that output has roughly the same length as input.
    • “Full” convolution: pad more extensively so filters can fully overlap even at extremes.
  • In financial applications:
    • Using same padding with stride 1 preserves time alignment, convenient for forecasting tasks.
    • Larger strides can be used to coarsen the time resolution, but may discard fine-grained information.
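
The resulting output length follows a standard formula; a small sketch (the numbers are only examples):

```python
def conv1d_out_len(T, k, stride=1, pad=0):
    # Output length of a 1D convolution with kernel width k.
    return (T + 2 * pad - k) // stride + 1

print(conv1d_out_len(60, k=5))                    # "valid": 56
print(conv1d_out_len(60, k=5, pad=2))             # "same" (odd k, stride 1): 60
print(conv1d_out_len(60, k=5, pad=2, stride=2))   # larger stride coarsens the grid: 30
```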
Financial Machine Learning · Lecture 03

Intuition and Architecture

  • Intuition:
    • Filters act as pattern detectors that scan over the sequence.
    • Lower layers detect local patterns (e.g., short-term fluctuations, local term-structure shapes).
    • Higher layers combine them into more abstract patterns.
  [Diagram] Input (multi-channel time series) → Conv1 (1D convolution, $C_1$ channels) → Conv2 (1D convolution, $C_2$ channels) → flatten (time × channels → vector) → fully connected layer → output (prediction / score).
  • A simple 1D CNN architecture for financial time series:
    1. Input: a sequence of features $x_{1:T}$.
    2. One or more convolutional layers with small kernels (e.g., width 3–5).
    3. Nonlinearity and possibly pooling.
    4. Flatten and feed into an MLP for final prediction (regression/classification).
  • Compared with MLP:
    • Fewer parameters for the same input length (due to sharing).
    • Built-in assumption: nearby positions are more related than distant ones.
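
A sketch of this architecture in PyTorch; the channel counts, kernel sizes, and pooling choice are illustrative assumptions:

```python
import torch
import torch.nn as nn

class CNN1D(nn.Module):
    """Conv -> ReLU -> Conv -> ReLU -> pool -> flatten -> MLP head."""
    def __init__(self, in_channels=4, seq_len=60):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(in_channels, 16, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.Conv1d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool1d(kernel_size=2),               # halves the time dimension
        )
        self.head = nn.Sequential(
            nn.Flatten(),
            nn.Linear(32 * (seq_len // 2), 16),
            nn.ReLU(),
            nn.Linear(16, 1),                          # e.g., one-step-ahead forecast
        )

    def forward(self, x):                              # x: (batch, channels, time)
        return self.head(self.features(x))

model = CNN1D()
print(model(torch.randn(8, 4, 60)).shape)              # torch.Size([8, 1])
```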

Financial Machine Learning · Lecture 03

Pros, Cons, and Finance Use Cases

Pros vs Cons

Aspect | Pros | Cons / Risks
Inductive bias | Captures local patterns, translation invariance | Less suitable if there is no local structure
Efficiency | Fewer parameters than dense layers | Architecture choices can be ad hoc
Data types | Works well on sequences and grids | May need many filters/levels
Interpretability | Filters sometimes interpretable as “motifs” | Still less transparent than linear models
  • Finance-oriented uses:
    • Term structures (yields, implied volatilities) treated as 1D signals over maturities.
    • High-frequency data: price/volume/imbalance across short horizons.
    • Limit order books: 2D-like structure (price levels × time) that can be processed with 1D or 2D convolutions.
  • CNNs often serve as a front-end feature extractor, with an MLP on top.
Financial Machine Learning · Lecture 03

From Feature Maps to Feature Maps

  • CNNs operate on tensors of feature maps:
    • The input to a 1D CNN layer can be viewed as a tensor of shape $(T, C)$, where $T$ is the sequence length and $C$ is the number of channels (features per time step).
    • The layer contains filters of shape $(k, C)$, where $k$ is the kernel width.
  • If we have $M$ different filters, the output is a tensor of shape $(T', M)$:
    • Each filter produces one feature map.
    • $T'$ depends on the kernel width, stride, and padding.
  • Stacking convolutional layers:
    • Earlier layers detect simple local patterns.
    • Deeper layers combine them into more abstract representations.
  • This “feature maps to feature maps” view parallels factor models in finance, but factors here are nonlinear and learned, not linear combinations specified ex ante.
Financial Machine Learning · Lecture 03

Pooling Layers

  • Pooling layers aggregate features over local neighborhoods, typically by max or average operations:
    • Max pooling: takes the maximum value within a window.
    • Average pooling: takes the mean value within a window.
  • Purposes:
    • Introduce invariance to small shifts or noise in the input.
    • Reduce the spatial or temporal resolution, lowering the number of parameters and computations in deeper layers.
  • In financial time series:
    • Pooling can aggregate information over short horizons, focusing on the strongest local signals or average behavior.
    • However, excessive pooling can remove useful fine-grained patterns.
  • Pooling is often combined with convolutions and followed by fully connected layers for final prediction.
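
A small numeric sketch of max vs average pooling over non-overlapping windows of length 3 (the input values are arbitrary):

```python
import torch
import torch.nn as nn

x = torch.tensor([[[1.0, 3.0, 2.0, 5.0, 4.0, 0.0]]])   # (batch=1, channel=1, length=6)
print(nn.MaxPool1d(kernel_size=3)(x))   # [[[3., 5.]]]: strongest signal per window
print(nn.AvgPool1d(kernel_size=3)(x))   # [[[2., 3.]]]: average behavior per window
```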
Financial Machine Learning · Lecture 03

Historical CNN Architectures (Very Brief)

  • Several influential CNN architectures shaped modern deep learning:
    • LeNet-5: early CNN for digit recognition, demonstrating the power of convolutions and pooling.
    • AlexNet: deeper network trained on GPUs, popularized ReLU and large-scale image classification.
    • Inception (GoogLeNet): used parallel filters of different sizes, reducing parameters via 1×1 convolutions.
    • ResNet: introduced residual connections to train very deep networks.
  • For this course, we do not focus on image benchmarks, but:
    • These architectures motivated many design patterns (e.g., deep stacks, residual blocks) that are now applied to time series and tabular data as well.
    • Residual ideas are particularly relevant for improving optimization in deep financial models (see Part 8).
Financial Machine Learning · Lecture 03
[Diagram] Timeline of influential CNN architectures: LeNet (1998) → AlexNet (2012) → Inception (2014) → ResNet (2015, with skip connections).
Financial Machine Learning · Lecture 03

Summary

  • CNNs introduce local receptive fields and parameter sharing, making them efficient for data with spatial or temporal structure.

  • 1D convolutions are natural for time-series–like financial inputs, where nearby time points or maturities are strongly related.

  • A typical architecture stacks multiple convolutional layers and then uses fully connected layers for final predictions.

  • CNNs can be viewed as learned filters that detect recurring motifs in financial signals.

  • Limitations arise when the problem does not exhibit clear local patterns, or when long-range dependencies dominate, which motivates RNNs and attention.

  • More broadly, CNNs illustrate how network structure shapes what is easy to learn: with convolutional structure, the model is biased toward learning local, translation-invariant features.

Financial Machine Learning · Lecture 03
  • Problem
    • Revisit trend-based predictability by letting a model discover return-predictive price patterns, instead of pre-specifying momentum or reversal rules.
    • Use stock-level price charts as inputs and test whether machine-learned patterns beat standard trend signals.
  [Diagram] Price series + volume (1D data) → time series converted to a 2D image → 2D CNN (spatial pattern extraction) → P(up) probability → decile sorts (long/short) → performance (Sharpe / alpha).
  • Model / Algorithm
    • Convert recent daily OHLC prices, volume, and moving averages (over 5, 20, 60 days) into black–white images (OHLC bars + MA line + volume bars) with standardized vertical scaling.
    • Train 2D CNNs to classify whether the future return over the next 5/20/60 days is positive or not, using cross-entropy loss and standard CNN components (conv–activation–pooling, batch norm, dropout).
Financial Machine Learning · Lecture 03
  • Key Results
    • Sorting stocks into deciles by CNN “up” probability yields very high out-of-sample Sharpe ratios (e.g. equal-weight weekly long–short up to ≈7, value-weight ≈1.5), far above momentum or short-term reversal benchmarks.
    • CNN forecasts are only weakly correlated with existing technical indicators and firm characteristics; controlling for these, CNN-based signals remain strongly predictive.
    • Patterns transfer well across horizons and to international markets: U.S.-trained CNNs applied directly overseas still generate strong Sharpe ratios, often outperforming locally trained models.
  • Why suitable here
    • Price charts are inherently image-like with local geometric shapes, volatility ranges, and joint price–volume structure; CNNs are built for such 2D local/translation-invariant features.
    • The paper shows how image-based CNNs provide a rigorous, scalable implementation of technical analysis, identifying patterns beyond simple momentum/reversal.
Financial Machine Learning · Lecture 03

Case Study · Charting by Machines (Murray, Xia & Xiao, 2024, JFE)

  • Problem
    • Test weak-form market efficiency by asking whether only 12 months of price-path information (coarse “chart”) contain economically meaningful predictability in the cross-section of U.S. stock returns (1927–2022).
  [Diagram] Input sequence (length 12) → Conv1D layer (sliding filters) → optional pooling → feature sequence (abstracted time series) → LSTM layer (sequential processing) → output prediction.
  • Model / Algorithm
    • Input: 12 monthly cumulative excess returns (shape of the 1-year price path).
    • Architecture: CNN + LSTM (CNNLSTM) – CNN learns local shape filters along the 12-month path; LSTM captures how these shapes interact over time.
    • Target: normalized rank of next-month excess return; loss: MSE with equal stock and time weights to focus on cross-sectional prediction.
Financial Machine Learning · Lecture 03
  • Key Results
    • Sorting stocks by the CNNLSTM forecast delivers a D10–D1 equal-weight long–short of ≈1.08% per month (t≈5.5, Sharpe≈0.78) over 1963–2022; among the largest 500 stocks the spread is ≈0.72% per month (t≈4.4).
    • Alphas remain large relative to CAPM, FF, Carhart, FF5, and Q factors; more than half of the variation and predictive power comes from nonlinear interactions across months, not explainable by any linear or simple nonlinear functions of the 12 returns.
    • Predictive relations are very stable over time; even a model trained only on pre-1963 data performs strongly in 2015–2022.
  • Why suitable here
    • The 12-point path is a 1D sequence with local shapes + sequential dependence → natural for CNN filters plus recurrent aggregation.
    • The study shows that machine-learned chart patterns contain information beyond momentum, reversal, and known technical indicators, providing a disciplined test of technical analysis in an asset-pricing setting.
Financial Machine Learning · Lecture 03

Part 4 · Recurrent Neural Networks (RNN, LSTM, GRU)

Motivation

  • Many financial problems are fundamentally sequential:
    • Asset returns, volatility, liquidity, and order flow evolve over time.
    • Macro variables, credit indicators, and risk measures follow dynamic paths.
  • Standard MLPs and CNNs do not maintain an explicit state across arbitrary sequence length.
  • RNNs introduce a recurrent structure:
    • A hidden state $h_t$ is updated recursively from $h_{t-1}$ and the new input $x_t$.
    • This allows the network to accumulate information over time and condition predictions on history.
  • In other words, the architecture encodes the assumption that order and temporal dependence matter.
  • LSTM and GRU refine this idea with gating mechanisms to better capture medium- and long-term dependencies in financial time series.
Financial Machine Learning · Lecture 03

From Feedforward to Time Delay to Recurrent

  • Feedforward networks (FFNNs) take a fixed-size input vector and produce an output, with no explicit notion of time.
  • For sequence data, a simple extension is to feed time-lagged values into an FFNN:
    • Time Delay Neural Networks (TDNNs) or NARX-style models use lagged values $(x_t, x_{t-1}, \dots, x_{t-p})$ as inputs.
    • This is conceptually similar to autoregressive models but with nonlinear transformations.
  • Limitations:
    • The number of lags must be chosen manually, and the model cannot easily adapt to varying sequence lengths.
    • Long-range dependencies require a very large $p$, leading to high-dimensional inputs.
  • Recurrent Neural Networks (RNNs) address this by maintaining a hidden state that evolves with time, providing a more flexible way to capture history.
Financial Machine Learning · Lecture 03

RNN Basic Formulation

  • Vanilla RNN (many-to-one setup):

    $h_t = \phi\big(W_{xh} x_t + W_{hh} h_{t-1} + b_h\big), \qquad \hat{y} = g\big(W_{hy} h_T + b_y\big),$

    where:
    • $x_t$ is the input at time $t$.
    • $h_t$ is the hidden state.
    • $\phi$ is typically tanh or ReLU.
    • $g$ depends on the task (identity for regression, logistic/softmax for classification).
  • The network is unrolled over time during training, and backpropagation through time (BPTT) computes gradients with respect to all parameters.
  • The loss function is typically an average over time steps or sequences:

    $\mathcal{L}(\theta) = \dfrac{1}{N}\sum_{n=1}^{N} \ell\big(\hat{y}_n, y_n\big)$
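
A minimal many-to-one RNN in PyTorch matching these equations (dimensions are illustrative; nn.RNN runs the recursion over time internally):

```python
import torch
import torch.nn as nn

class ManyToOneRNN(nn.Module):
    def __init__(self, in_dim=3, hidden_dim=16):
        super().__init__()
        self.rnn = nn.RNN(in_dim, hidden_dim, nonlinearity="tanh", batch_first=True)
        self.readout = nn.Linear(hidden_dim, 1)   # g(.): linear readout for regression

    def forward(self, x):               # x: (batch, T, in_dim)
        out, h_T = self.rnn(x)          # h_T: final hidden state, shape (1, batch, hidden_dim)
        return self.readout(h_T[-1])    # predict from the last hidden state

model = ManyToOneRNN()
x = torch.randn(8, 40, 3)               # 8 sequences of length 40 with 3 features each
print(model(x).shape)                    # torch.Size([8, 1])
```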
Financial Machine Learning · Lecture 03

Unrolling RNNs and Backpropagation Through Time

  • During training, we often unroll an RNN across time steps:
    • The recurrent cell is replicated for $t = 1, \dots, T$, sharing parameters across time.
    • This yields a deep network whose depth equals the sequence length.
  • Backpropagation Through Time (BPTT):
    • We compute gradients by applying backpropagation on the unrolled network.
    • Gradients at time $t$ depend on errors at all future time steps through the recurrent connections.
  • Computational implications:
    • Memory usage grows with sequence length, because we must store intermediate states.
    • Truncated BPTT is often used, where gradients are propagated only for a fixed number of steps.
  • Conceptually, RNN training faces similar challenges as very deep feedforward nets, but along the time dimension.
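
A sketch of truncated BPTT: the sequence is processed in chunks and the hidden state is detached between chunks, so gradients flow back at most a fixed number of steps (the chunk length and the assumption that the model returns (outputs, hidden state) are placeholders):

```python
import torch

def truncated_bptt_step(model, loss_fn, optimizer, x, y, chunk=20):
    """x: (batch, T, features), y: (batch, T, 1). Gradients flow back at most `chunk` steps."""
    h = None
    for t0 in range(0, x.size(1), chunk):
        optimizer.zero_grad()
        out, h = model(x[:, t0:t0 + chunk], h)     # assumed interface: (inputs, state) -> (outputs, state)
        loss = loss_fn(out, y[:, t0:t0 + chunk])
        loss.backward()
        optimizer.step()
        h = h.detach()                             # cut the graph: no backprop into earlier chunks
```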
Financial Machine Learning · Lecture 03

Vanishing and Exploding Gradients in RNNs

  • In RNNs, gradients with respect to earlier time steps involve repeatedly multiplying by Jacobian matrices of the recurrent transition:

    $\dfrac{\partial h_t}{\partial h_k} = \prod_{j=k+1}^{t} \dfrac{\partial h_j}{\partial h_{j-1}}$

  • If the largest singular value of these Jacobians is:
    • Less than 1, gradients tend to vanish exponentially as $t - k$ grows.
    • Greater than 1, gradients can explode, causing numerical instability.
  • Consequences:
    • Vanilla RNNs struggle to learn long-term dependencies, focusing mostly on recent information.
  • Mitigation strategies:
    • Architectural changes (LSTM, GRU) that introduce better gradient flow.
    • Gradient clipping to control exploding gradients.
    • Careful initialization and activation choices.
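
Gradient clipping is a one-line addition to the training step; a self-contained sketch (the max_norm of 1.0 is a common but arbitrary choice, and the loss here is a placeholder just to produce gradients):

```python
import torch
import torch.nn as nn

model = nn.RNN(input_size=3, hidden_size=16, batch_first=True)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

x = torch.randn(8, 200, 3)              # a fairly long sequence
out, _ = model(x)
loss = out.pow(2).mean()                # placeholder loss

loss.backward()                         # gradients via backpropagation through time
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)   # rescale if the global norm exceeds 1
optimizer.step()
```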
Financial Machine Learning · Lecture 03

Application · Forecasting the Equity Premium: Mind the News! (Adämmer & Schüssler, 2020, RoF)

  • Problem
    • Can news-based information improve forecasts of the monthly U.S. equity premium beyond standard macro–financial predictors?
    • How important is the time-varying information flow from news relative to traditional predictors?
  [Diagram] News texts & market events → embedding (e.g., word2vec / BERT) → either (a) monthly aggregation of vectors into a standard ML model (Ridge / RF / OLS), or (b) preserving the full sequence and feeding it to an RNN/LSTM or attention model → equity premium prediction.
Financial Machine Learning · Lecture 03
  • Model / Algorithm
    • Construct rich news-based predictors from large news databases (counts, sentiment, topic indicators, surprise measures), aligned at monthly frequency.
    • Compare traditional predictive regressions with flexible ML methods (e.g., tree-based models, regularized regressions) that incorporate both standard predictors and news variables.
  • Key Results
    • News-based variables significantly improve out-of-sample forecasts of the equity premium relative to models using only macro and financial predictors.
    • Gains are robust across evaluation windows and different ML specifications, highlighting the incremental value of a high-dimensional information flow extracted from news.
  • Why this motivates deep sequence models
    • News arrive as a time-ordered sequence of text and events, with possible persistence, decay and interactions with the macro–financial state.
    • Current implementations often compress this rich sequence into static monthly features; RNNs/LSTMs and attention-based models are natural tools to capture how the timing, ordering and accumulation of news affect risk premia over multiple horizons.
Financial Machine Learning · Lecture 03

LSTM and GRU Formulation (Core Equations)

  • LSTM cell (core idea; biases omitted for brevity):

    $f_t = \sigma\big(W_f [h_{t-1}, x_t]\big), \quad i_t = \sigma\big(W_i [h_{t-1}, x_t]\big), \quad o_t = \sigma\big(W_o [h_{t-1}, x_t]\big)$
    $\tilde{c}_t = \tanh\big(W_c [h_{t-1}, x_t]\big), \quad c_t = f_t \odot c_{t-1} + i_t \odot \tilde{c}_t, \quad h_t = o_t \odot \tanh(c_t)$

  • GRU cell (simplified):

    $z_t = \sigma\big(W_z [h_{t-1}, x_t]\big), \quad r_t = \sigma\big(W_r [h_{t-1}, x_t]\big)$
    $\tilde{h}_t = \tanh\big(W_h [r_t \odot h_{t-1}, x_t]\big), \quad h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t$

  • Gates decide what to forget, what to update, and what to output, helping with long-term dependencies.
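
In practice these cells are rarely written by hand; a sketch using the built-in PyTorch modules (dimensions are illustrative):

```python
import torch
import torch.nn as nn

x = torch.randn(8, 60, 5)                    # (batch, T, features)

lstm = nn.LSTM(input_size=5, hidden_size=32, batch_first=True)
out, (h_T, c_T) = lstm(x)                    # out: hidden states at every step; (h_T, c_T): final states
print(out.shape, h_T.shape, c_T.shape)       # (8, 60, 32) (1, 8, 32) (1, 8, 32)

gru = nn.GRU(input_size=5, hidden_size=32, batch_first=True)
out_g, h_g = gru(x)                          # GRU keeps no separate cell state
print(out_g.shape, h_g.shape)                # (8, 60, 32) (1, 8, 32)
```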

Financial Machine Learning · Lecture 03

LSTM Cell Intuition

  • The Long Short-Term Memory (LSTM) cell introduces a cell state $c_t$ and several gates that control information flow:
    • Forget gate $f_t$: decides how much of the previous cell state to keep.
    • Input gate $i_t$ and candidate $\tilde{c}_t$: determine how much new information from $x_t$ and $h_{t-1}$ to write into the cell.
    • Output gate $o_t$: controls how much of the updated cell state is exposed as the hidden state $h_t$.
  • The key idea:
    • The cell state provides a highway along which information and gradients can flow across many time steps, with gates modulating updates.
    • This helps preserve long-term information while still allowing the network to forget irrelevant history.
  • In financial sequences, LSTMs can learn to “remember” important regimes or events and “forget” noise.
Financial Machine Learning · Lecture 03

GRU Cell Intuition

  • The Gated Recurrent Unit (GRU) simplifies the LSTM by merging some gates and removing the explicit cell state:
    • Update gate $z_t$: controls how much of the previous hidden state to keep.
    • Reset gate $r_t$: controls how much past information to use when computing the candidate state.
  • The hidden state update is

    $h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t,$

    so the update gate interpolates between the old state and the new candidate.
  • Intuition:
    • GRUs have fewer parameters and a simpler structure than LSTMs, often performing similarly in practice.
    • They provide a learned balance between remembering past and incorporating new observations.
  • In finance, GRUs can be attractive when model size and training speed are concerns.
Financial Machine Learning · Lecture 03

Stacked and Bidirectional RNNs

  • Stacked RNNs:
    • Use multiple recurrent layers on top of each other.
    • Lower layers capture simpler short-term patterns; higher layers capture more abstract and longer-term dynamics.
  • Bidirectional RNNs:
    • Process sequences in both forward and backward directions.
    • The final representation at each time step concatenates information from past and future.
  • In offline settings (e.g., risk assessment over historical windows), bidirectional models can exploit the entire sequence.
  • In online prediction (e.g., trading), only past information is available, so we typically use unidirectional architectures.
  • Deeper and bidirectional RNNs can improve performance but increase computational cost and risk of overfitting.
Financial Machine Learning · Lecture 03

Intuition, Pros & Cons, Finance Use Cases

Intuition

  • The hidden state $h_t$ acts as a compressed memory of past information.
  • Vanilla RNN can, in principle, represent complex sequence functions, but gradient propagation across many time steps is difficult.
  • LSTM/GRU add paths where gradients can flow more easily (via cell state or gated updates), reducing vanishing gradients.

Pros vs Cons

Aspect | Pros | Cons / Risks
Sequence | Natural for time series and sequences | Hard to parallelize across time
Memory | Can capture medium/long-term dependencies | Still struggles with very long range
Flexibility | Many variants (stacked, bidirectional) | Many hyperparameters, tuning heavy
Financial Machine Learning · Lecture 03

Finance uses:

  • Forecasting returns, volatility, or risk measures based on past time series.

  • Modeling order book dynamics and execution cost profiles.

  • Multi-step forecasting of macro-financial variables.

Financial Machine Learning · Lecture 03

Summary

  • RNNs are designed for sequence data, with a hidden state that evolves over time as new inputs arrive.

  • Vanilla RNNs are conceptually simple but face optimization issues for long sequences.

  • LSTM and GRU introduce gating mechanisms to help retain or forget information, improving the modeling of medium to long-term dependencies.

  • In finance, RNN-based architectures are natural choices whenever temporal dynamics and history dependence are central.

  • Compared with CNNs, RNNs emphasize ordered dependence over time rather than local patterns in a fixed grid; again, the network structure encodes what kind of regularity is easiest to learn.

Financial Machine Learning · Lecture 03

Part 5 · Attention Mechanisms (pre-Transformer)

Motivation

  • Even LSTM/GRU may struggle when:
    • Sequences are very long.
    • The relevant information at time $t$ is scattered across distant time steps.
  • Attention mechanisms allow models to focus selectively on parts of an input sequence (or set) when making predictions.
  • We focus on pre-Transformer attention:
    • Encoder–decoder frameworks with attention (Bahdanau-style, Luong-style).
    • Global attention over encoder hidden states.
  • In finance, the idea of attention is useful when predicting an outcome from a sequence where only certain time intervals or events are especially informative (e.g., crisis periods, announcements, regime shifts).
Financial Machine Learning · Lecture 03

Attention Basic Formulation

  • Suppose we have encoder hidden states $h_1, \dots, h_T$ summarizing a sequence (e.g., from an RNN).
  • For a decoder state $s_t$, attention computes:
    1. Scores:

      $e_{t,i} = \mathrm{score}(s_t, h_i), \quad i = 1, \dots, T,$

      where $\mathrm{score}(\cdot, \cdot)$ is a learned scoring function (e.g., dot product, small MLP).
    2. Attention weights via softmax:

      $\alpha_{t,i} = \dfrac{\exp(e_{t,i})}{\sum_{j=1}^{T} \exp(e_{t,j})}$

    3. Context vector as weighted sum:

      $c_t = \sum_{i=1}^{T} \alpha_{t,i}\, h_i$

  • The context $c_t$ is then combined with $s_t$ to produce the output at step $t$.
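
A sketch of dot-product attention implementing the three steps above (shapes are illustrative; a Bahdanau-style model would compute the scores with a small MLP instead):

```python
import torch
import torch.nn.functional as F

T, d = 60, 32
H = torch.randn(T, d)                 # encoder hidden states h_1, ..., h_T
s_t = torch.randn(d)                  # current decoder state

scores = H @ s_t                      # 1. dot-product scores e_{t,i}, shape (T,)
alpha = F.softmax(scores, dim=0)      # 2. attention weights: positive, sum to one
context = alpha @ H                   # 3. context vector c_t as a weighted sum, shape (d,)
print(context.shape, alpha.sum())     # torch.Size([32]) tensor(1.)
```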
Financial Machine Learning · Lecture 03

Intuition and Variants

  • Intuition:
    • Instead of compressing all past information into a single vector, attention allows the model to look back at all encoder states and assign relevance weights.
    • The softmax ensures weights are positive and sum to one, which can be viewed as a probability distribution over positions.
  • Variants:
    • Additive attention (Bahdanau): $\mathrm{score}(s, h) = v^\top \tanh(W_1 s + W_2 h)$.
    • Multiplicative / dot-product attention (Luong): $\mathrm{score}(s, h) = s^\top h$ or $s^\top W h$.
    • Global vs local attention:
      • Global: attends over all positions.
      • Local: attends over a window or subset.
  • Pre-Transformer attention is typically used on top of RNNs, not as a standalone sequence model.
Financial Machine Learning · Lecture 03

Pros, Cons, and Finance Uses

Pros vs Cons

Aspect | Pros | Cons / Risks
Long-range | Better handles long-range dependencies | Adds complexity and parameters
Interpretability | Weights can be visualized | Not always truly causal/explanatory
Flexibility | Works with RNN encoders/decoders, sets, etc. | Still sequence-length dependent
  • Finance-oriented uses:
    • Focusing on important historical windows when forecasting risk or returns (e.g., crisis episodes, recent shocks).
    • Combining multiple sources of sequential information (e.g., different horizons, different markets) by attending over their representations.
    • Although modern practice often uses Transformer-based self-attention, the pre-Transformer attention ideas already clarify how models can adaptively weight past information.
Financial Machine Learning · Lecture 03

Example: Sequence-to-Sequence with Attention

  • In a standard encoder–decoder setup:
    • An encoder RNN reads the input sequence and produces hidden states $h_1, \dots, h_T$.
    • A fixed-size summary (e.g., the last hidden state) is passed to a decoder RNN that generates outputs.
  • This bottleneck can be problematic for long or information-rich sequences.
  • Adding attention:
    • At each decoding step, the decoder computes attention weights over all encoder states and forms a context vector $c_t$.
    • The output then depends on both the current decoder state and $c_t$.
  • Intuitively, attention allows the model to look back at different parts of the input sequence when generating each output.
  • In finance, similar ideas can be used to forecast based on long historical windows while focusing on relevant subperiods (e.g., crisis episodes).
Financial Machine Learning · Lecture 03

Summary

  • Attention mechanisms augment sequence models (typically RNNs) with the ability to selectively focus on different parts of the input.

  • Technically, attention computes similarity scores between a “query” state and a set of “key–value” states, turning them into weights used to form a context vector.

  • Pre-Transformer attention arose in encoder–decoder architectures and remains a useful conceptual tool for designing finance models that handle long and complex sequences.

  • In later lectures (on big data and large language models), we will revisit attention in the context of Transformers; here we focus on its core idea and pre-Transformer form.

Financial Machine Learning · Lecture 03

Part 6 · Generative Models: VAE and GAN

Motivation

  • So far we focus on models that predict labels given inputs.
  • Generative models instead aim to model the data distribution itself and generate new realistic samples:
    • Scenario generation for risk management and stress testing.
    • Synthetic data for backtesting or robustness analysis.
  • Deep generative models also perform representation learning:
    • Variational Autoencoders (VAE) learn low-dimensional latent variables that summarize financial states.
    • Generative Adversarial Networks (GAN) train a discriminator that implicitly learns features distinguishing typical from atypical market conditions.
  • In finance, these models therefore play a dual role:
    • They generate plausible scenarios consistent with historical data.
    • They learn rich internal representations of complex joint dynamics, which can be reused for downstream tasks.

Financial Machine Learning · Lecture 03

VAE Formulation (Core Ideas)

  • Assume a latent variable model:

    $$p_\theta(x) = \int p_\theta(x \mid z)\, p(z)\, dz,$$

    where $p(z)$ is usually standard normal and $p_\theta(x \mid z)$ is a neural network decoder.
  • Goal: maximize the marginal likelihood $\log p_\theta(x)$, which is intractable due to integration over $z$.
  • Introduce an encoder (approximate posterior) $q_\phi(z \mid x)$ and optimize the ELBO:

    $$\mathcal{L}(\theta, \phi; x) = \mathbb{E}_{q_\phi(z \mid x)}\!\left[\log p_\theta(x \mid z)\right] - \mathrm{KL}\!\left(q_\phi(z \mid x) \,\|\, p(z)\right).$$

  • Use the reparameterization trick:

    $$z = \mu_\phi(x) + \sigma_\phi(x) \odot \epsilon, \quad \epsilon \sim \mathcal{N}(0, I),$$

    which allows gradient-based optimization.
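  • A minimal sketch (PyTorch) of the reparameterization trick and the two ELBO terms, assuming a Gaussian encoder and a unit-variance Gaussian decoder; layer sizes and data are placeholders:

```python
import torch
import torch.nn as nn

# Sketch of one VAE forward pass with the reparameterization trick.
# Dimensions, layer sizes, and the Gaussian decoder assumption are illustrative.
x_dim, z_dim = 50, 5
enc = nn.Sequential(nn.Linear(x_dim, 64), nn.ReLU(), nn.Linear(64, 2 * z_dim))  # outputs (mu, log_var)
dec = nn.Sequential(nn.Linear(z_dim, 64), nn.ReLU(), nn.Linear(64, x_dim))      # decoder mean

x = torch.randn(128, x_dim)                 # batch of observations (e.g., standardized features)
mu, log_var = enc(x).chunk(2, dim=-1)

# Reparameterization: z = mu + sigma * eps, eps ~ N(0, I)
eps = torch.randn_like(mu)
z = mu + torch.exp(0.5 * log_var) * eps

x_hat = dec(z)

# ELBO = reconstruction term - KL term (Gaussian decoder with unit variance, constants dropped)
recon = -0.5 * ((x - x_hat) ** 2).sum(dim=-1)
kl = 0.5 * (mu ** 2 + log_var.exp() - log_var - 1).sum(dim=-1)
elbo = (recon - kl).mean()
loss = -elbo                                # minimize negative ELBO with any SGD-type optimizer
```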
Financial Machine Learning · Lecture 03

VAE Key Takeaways

  • VAEs posit a latent-variable model $p_\theta(x \mid z)\, p(z)$ and approximate the intractable posterior $p_\theta(z \mid x)$ with an encoder $q_\phi(z \mid x)$.

  • Training maximizes a variational lower bound (ELBO) that balances:

    • Reconstruction quality via $\mathbb{E}_{q_\phi(z \mid x)}\!\left[\log p_\theta(x \mid z)\right]$.
    • Regularization via $\mathrm{KL}\!\left(q_\phi(z \mid x) \,\|\, p(z)\right)$.
  • The reparameterization trick enables gradient-based optimization.

  • The encoder computes a low-dimensional latent representation of financial observations (e.g., yield curves, market states, portfolios), which can be used even without sampling from the decoder.

  • Thus, VAEs provide both a generative model for scenarios and a tool for nonlinear dimension reduction and representation learning.

Financial Machine Learning · Lecture 03

GAN Formulation (Core Ideas)

  • GAN consists of:
    • Generator $G$ mapping noise $z \sim p(z)$ to synthetic samples $\hat{x} = G(z)$.
    • Discriminator $D(x)$ outputting the probability that $x$ is real.
  • Original minimax objective:

    $$\min_G \max_D \; \mathbb{E}_{x \sim p_{\text{data}}}\!\left[\log D(x)\right] + \mathbb{E}_{z \sim p(z)}\!\left[\log\!\left(1 - D(G(z))\right)\right].$$
  • Intuition:
    • Discriminator learns to distinguish real vs generated samples.
    • Generator learns to fool the discriminator by producing realistic samples.
  • Many variants improve stability (Wasserstein GAN, gradient penalties), but we keep to the basic concept here.
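  • A minimal sketch (PyTorch) of one discriminator/generator update under this objective; the networks, optimizers, and the non-saturating generator loss in the last step are illustrative choices:

```python
import torch
import torch.nn as nn

# Sketch of one generator/discriminator update for the original GAN objective.
# Network sizes, data, and optimizers are illustrative placeholders.
x_dim, z_dim = 20, 8
G = nn.Sequential(nn.Linear(z_dim, 64), nn.ReLU(), nn.Linear(64, x_dim))
D = nn.Sequential(nn.Linear(x_dim, 64), nn.ReLU(), nn.Linear(64, 1), nn.Sigmoid())
opt_G = torch.optim.Adam(G.parameters(), lr=1e-4)
opt_D = torch.optim.Adam(D.parameters(), lr=1e-4)
bce = nn.BCELoss()

x_real = torch.randn(256, x_dim)            # stand-in for a batch of real returns/features
z = torch.randn(256, z_dim)

# Discriminator step: maximize log D(x) + log(1 - D(G(z)))
d_loss = bce(D(x_real), torch.ones(256, 1)) + bce(D(G(z).detach()), torch.zeros(256, 1))
opt_D.zero_grad(); d_loss.backward(); opt_D.step()

# Generator step: in practice the "non-saturating" loss max log D(G(z)) is common
g_loss = bce(D(G(z)), torch.ones(256, 1))
opt_G.zero_grad(); g_loss.backward(); opt_G.step()
```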
Financial Machine Learning · Lecture 03

GAN Key Takeaways

  • GANs set up a two-player game between:

    • A generator that maps noise into synthetic samples.
    • A discriminator that tries to distinguish real from generated samples.
  • The minimax objective encourages the generator to match the real data distribution and the discriminator to become a powerful classifier.

  • Practical training uses variants (e.g., Wasserstein GAN) to improve stability and reduce mode collapse.

  • In finance, GANs can generate realistic joint scenarios of returns, volatilities, or yield curves for stress testing and data augmentation.

  • The discriminator also acts as a representation learner: its internal layers learn features that distinguish typical from atypical patterns in markets, which can sometimes be reused for risk monitoring or anomaly detection.

Financial Machine Learning · Lecture 03

Pros, Cons, and Finance Uses

Pros vs Cons

Model | Pros | Cons / Risks
VAE | Probabilistic, explicit latent structure | Reconstructions may be too “smooth”
GAN | Sharp, realistic samples | Training instability, mode collapse
  • Finance uses:
    • Scenario generation: simulate plausible joint paths of returns, volatilities, or yield curves under different regimes.
    • Stress testing: generate rare but coherent combinations of shocks for risk management.
    • Synthetic data: augment limited historical datasets while roughly preserving dependence structure.
    • Market microstructure: simulate order flow or limit order book snapshots.
  • These models can complement traditional stochastic models, but must be validated carefully to avoid misleading risk assessments.
Financial Machine Learning · Lecture 03

Summary

  • VAEs and GANs are central deep generative models that learn to approximate complex data distributions.

  • VAEs use a probabilistic latent-variable framework and optimize an ELBO via an encoder and decoder.

  • GANs frame generation as a two-player game between a generator and a discriminator.

  • In finance, they are primarily useful for scenario generation, stress testing, and data augmentation, not as direct replacements for structural models or risk factor frameworks.

Financial Machine Learning · Lecture 03

Application · Synthetic Data in Finance (Potluru et al., 2024)

  • Problem
    • Survey and evaluate how synthetic data can be used in finance: for model development, backtesting, stress testing, and privacy-preserving data sharing.
    • Identify where synthetic data add value, what risks they introduce, and how to design them responsibly.
  • Model / Algorithm
    • Review a broad range of generative approaches, including:
      • Classical parametric and copula-based simulation methods;
      • Deep generative models such as GANs, VAEs, and time-series GAN variants for returns, limit order books, and other financial time series.
    • Discuss design choices (conditioning on scenarios, domain constraints, evaluation metrics) that are critical for financial applications.
Financial Machine Learning · Lecture 03
  • Key Results / Insights

    • Synthetic data can augment scarce or sensitive datasets, enabling more robust model training and more extensive backtesting, especially for rare events and stressed regimes.
    • They support privacy and regulatory compliance by allowing institutions to share realistic but non-identifying data.
    • However, poorly designed generators can misrepresent tail risks or dependence structures, leading to misleading risk assessments; careful validation and domain knowledge are essential.
  • Why suitable for deep generative models

    • Financial data are high-dimensional, noisy, and exhibit nonlinear dependence (volatility clustering, heavy tails, cross-sectional correlations) – a natural domain for GANs and VAEs that can approximate complex joint distributions without rigid parametric forms.
    • Combining deep generative models with economic constraints offers a promising path to realistic yet controllable scenario generators for risk management and trading.
Financial Machine Learning · Lecture 03

Application · Generating Synergistic Alpha Collections via RL (Yu et al., 2023)

  • Problem

    • Quant researchers often design thousands of formulaic alpha signals (functions of prices, volumes, fundamentals, etc.).
    • This paper asks: can we use reinforcement learning (RL) to automatically generate a collection of alpha formulas that are individually useful and synergistic when combined into a portfolio?
  • Model / Algorithm

    • Define a large, structured “alpha space” (building blocks such as arithmetic operations, lags, rankings, cross-sectional operators).
    • Use deep RL to model an “alpha engineer” agent that sequentially constructs or selects alpha formulas; the environment evaluates each proposal by simulated (or historical) backtest performance and diversification properties.
    • The reward balances predictive power, robustness and correlation structure, pushing the agent toward sets of complementary alphas rather than a single best signal.
Financial Machine Learning · Lecture 03
flowchart LR
    %% 1. Alpha structure
    AlphaBlocks["Alpha Building Blocks<br/>(Features/Signals)"]
    RLAgent["RL Agent<br/>(Policy Network)"]
    CandidateAlpha["Candidate Alpha Formulas<br/>(Structured Outputs)"]
    BacktestEnv["Backtest Environment<br/>(Simulated Trading)"]
    Reward["Reward Signal"]
    PolicyUpdate["Policy Update"]

    %% 2. Main loop
    AlphaBlocks --> RLAgent
    RLAgent --> CandidateAlpha
    CandidateAlpha --> BacktestEnv
    BacktestEnv --> Reward
    Reward --> PolicyUpdate

    %% 3. Formula tree (alpha building blocks)
    subgraph AlphaTree["Alpha Formula Tree"]
        direction TB
        node1["Alpha Component 1"]
        node2["Alpha Component 2"]
        node3["Alpha Component 3"]
        node1 --> node2
        node1 --> node3
        %% Example formula; note the hierarchical structure
        Formula1[("Alpha = f(Component1, Component2, Component3)")]
        node1 --> Formula1
        node2 --> Formula1
        node3 --> Formula1
    end

    %% Styling
    style AlphaBlocks fill:#bbdefb,stroke:#2196f3,stroke-width:2px
    style RLAgent fill:#e1bee7,stroke:#8e24aa
    style CandidateAlpha fill:#fff3e0,stroke:#ef6c00
    style BacktestEnv fill:#e8f5e9,stroke:#2e7d32
    style Reward fill:#ffebee,stroke:#c62828,stroke-width:2px
    style PolicyUpdate fill:#fff9c4,stroke:#fbc02d
    style AlphaTree fill:#f3e5f5,stroke:#7b1fa2,stroke-dasharray: 5 5
Financial Machine Learning · Lecture 03
  • Key Results

    • RL-generated alpha collections achieve higher portfolio Sharpe ratios and more stable performance than naïve or manually curated formula sets built from the same building blocks.
    • The learned alphas exhibit non-obvious combinations and transformations of basic signals, illustrating that the search space is too large for manual trial-and-error.
  • Why this is a natural deep/RL problem

    • The alpha design space is combinatorially huge and structured; it is natural to frame it as a sequential decision problem where each step adds an operator or modifies a formula.
    • Deep RL can learn to generate structured objects (alpha formulas, alpha sets) under noisy long‑horizon rewards—an instance of “generative modeling for strategies” rather than just data.
Financial Machine Learning · Lecture 03

Part 7 · Graph Neural Networks (GNN)

Motivation

  • Many financial systems are naturally represented as graphs:
    • Interbank lending and exposure networks.
    • Ownership and control links among firms and funds.
    • Supply-chain networks and trade relationships.
  • Traditional approaches often compress graphs into hand-crafted features (degrees, centrality measures).
  • GNNs take a different approach: the graph structure itself guides learning.
    • Nodes aggregate information from their neighbors through message passing.
    • After several layers, node representations capture multi-hop relational information.
  • This inductive bias makes GNNs suitable when who is connected to whom and how strongly matters for outcomes such as default, contagion, or spillovers.
Financial Machine Learning · Lecture 03

Message Passing Formulation

  • A typical message passing GNN layer updates node features as:
    1. Message aggregation from neighbors:

       $$m_i^{(l)} = \sum_{j \in \mathcal{N}(i)} \psi\!\left(h_i^{(l)}, h_j^{(l)}, e_{ij}\right),$$

       where $e_{ij}$ are edge features.
    2. Node update:

       $$h_i^{(l+1)} = \phi\!\left(h_i^{(l)}, m_i^{(l)}\right).$$

  • In Graph Convolutional Networks (GCN), a simplified form is:

    $$H^{(l+1)} = \sigma\!\left(\tilde{D}^{-1/2} \tilde{A} \tilde{D}^{-1/2} H^{(l)} W^{(l)}\right),$$

    where:
    • $\tilde{A} = A + I$ is the adjacency matrix with self-loops.
    • $\tilde{D}$ is the corresponding degree matrix.
    • $H^{(l)}$ stacks node features.
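  • A minimal numerical sketch (PyTorch) of the simplified GCN propagation rule above, on a made-up four-node graph; all sizes and weights are illustrative:

```python
import torch

# Toy GCN layer: H' = sigma(D^{-1/2} (A + I) D^{-1/2} H W)
# The 4-node graph, feature dimension, and weights are illustrative.
A = torch.tensor([[0., 1., 0., 0.],
                  [1., 0., 1., 1.],
                  [0., 1., 0., 0.],
                  [0., 1., 0., 0.]])       # e.g., interbank exposure links
H = torch.randn(4, 8)                      # node features (banks x characteristics)
W = torch.randn(8, 16)                     # learnable layer weights

A_hat = A + torch.eye(4)                   # add self-loops
deg = A_hat.sum(dim=1)
D_inv_sqrt = torch.diag(deg.pow(-0.5))

H_next = torch.relu(D_inv_sqrt @ A_hat @ D_inv_sqrt @ H @ W)   # shape (4, 16)
```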
Financial Machine Learning · Lecture 03

Intuition, Pros & Cons, Finance Uses

Intuition

  • Each GNN layer allows nodes to exchange information with their neighbors.
  • After $L$ layers, node $i$'s representation reflects information from its $L$-hop neighborhood.
  • This resembles iterative contagion or diffusion, but with learned aggregation and transformation functions.

Pros vs Cons

Aspect | Pros | Cons / Risks
Structure | Respects network topology | Needs graph data and quality edges
Flexibility | Learns complex neighborhood interactions | Over-smoothing with many layers
Finance | Natural for systemic risk, contagion, spillover | Interpretability can be challenging
Financial Machine Learning · Lecture 03

Finance uses:

  • Systemic risk: predict default probabilities or losses using interbank networks.

  • Credit risk: incorporate supply-chain or ownership networks.

  • Market structure: model spillovers among firms or sectors linked by customer–supplier relationships.

Financial Machine Learning · Lecture 03

Summary

  • GNNs generalize neural networks to graph-structured data via message passing and aggregation over neighbors.

  • They learn representations of nodes, edges, or entire graphs that implicitly capture network structure and interactions.

  • In finance, GNNs are promising tools for modeling interconnected systems, such as banking networks, ownership structures, and supply-chain relationships.

  • Compared with MLPs on tabular data, GNNs encode a strong prior that outcomes depend critically on relations rather than only on standalone characteristics.

  • Again, network architecture shapes what is easy to learn: with a graph structure, the model is biased toward capturing relational patterns and contagion effects.

Financial Machine Learning · Lecture 03

Part 8 · Practical Issues & Summary


  • Optimization for Deep Networks
  • Regularization, Overfitting, and Explainability
  • Overall Summary and Outlook
Financial Machine Learning · Lecture 03

Optimization for Deep Networks

  • Deep networks are trained using stochastic gradient-based methods (a minimal sketch follows this list):
    • SGD:

      $$\theta_{t+1} = \theta_t - \eta \, \nabla_\theta \frac{1}{|B_t|} \sum_{i \in B_t} \ell_i(\theta_t),$$

      where $B_t$ is a mini-batch and $\eta$ the learning rate.
    • Momentum, RMSProp, Adam: adaptive or momentum-based variants.
  • Practical considerations:
    • Learning rate scheduling (decay, warm restarts, etc.).
    • Batch size: trade-off between stability and generalization.
    • Initialization (Xavier/He) and normalization (batch norm, layer norm) can improve convergence.
  • In finance, optimization must balance:
    • Predictive performance.
    • Stability across rolling windows.
    • Robustness to distribution shifts (regime changes, crises).
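  • A minimal sketch (PyTorch) of these ingredients (mini-batch updates, an adaptive optimizer, a learning-rate schedule); the model, data, and schedule are placeholders:

```python
import torch
import torch.nn as nn

# Illustrative training loop: mini-batch updates with Adam and a step-decay schedule.
# Model, data, batch size, and schedule choice are placeholders, not recommendations.
model = nn.Sequential(nn.Linear(30, 64), nn.ReLU(), nn.Linear(64, 1))
X, y = torch.randn(5000, 30), torch.randn(5000, 1)      # e.g., characteristics -> next-period return
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
sched = torch.optim.lr_scheduler.StepLR(opt, step_size=10, gamma=0.5)  # learning-rate decay
loss_fn = nn.MSELoss()

for epoch in range(30):
    perm = torch.randperm(len(X))
    for i in range(0, len(X), 256):                      # mini-batches B_t
        idx = perm[i:i + 256]
        loss = loss_fn(model(X[idx]), y[idx])
        opt.zero_grad(); loss.backward(); opt.step()
    sched.step()                                         # adjust learning rate each epoch
```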
Financial Machine Learning · Lecture 03

Vanishing / Exploding Gradients in Deep Networks

  • Deep feedforward and recurrent networks share a common challenge:
    • Gradients are propagated through many layers (or time steps), each involving multiplication by weight matrices and application of nonlinearities.
  • If the effective Jacobians tend to contract, gradients vanish; if they tend to expand, gradients explode.
  • Practical symptoms:
    • Training loss stops decreasing or becomes extremely unstable.
    • Early layers or early time steps receive almost no learning signal.
  • Mitigation strategies:
    • Use non-saturating activations (e.g., ReLU family) instead of sigmoid/tanh in hidden layers.
    • Careful initialization (e.g., Xavier, He) and proper normalization layers.
    • Gradient clipping, especially for RNNs.
    • Architectural changes such as residual connections, which provide shortcuts for gradient flow.
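  • As a concrete illustration of one mitigation, the sketch below (PyTorch) applies gradient clipping inside a training step for an LSTM; the network, data, and clipping threshold are illustrative:

```python
import torch
import torch.nn as nn

# Sketch: gradient clipping inside a training step (common for RNNs).
# The RNN, data, and clipping threshold are illustrative.
rnn = nn.LSTM(input_size=10, hidden_size=32, batch_first=True)
head = nn.Linear(32, 1)
opt = torch.optim.Adam(list(rnn.parameters()) + list(head.parameters()), lr=1e-3)

x = torch.randn(64, 250, 10)                  # (batch, time steps, features)
y = torch.randn(64, 1)

out, _ = rnn(x)                               # out: (batch, time, hidden)
pred = head(out[:, -1, :])                    # use the last hidden state
loss = nn.functional.mse_loss(pred, y)

opt.zero_grad()
loss.backward()
# Cap the global gradient norm before the parameter update.
torch.nn.utils.clip_grad_norm_(list(rnn.parameters()) + list(head.parameters()), max_norm=1.0)
opt.step()
```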
Financial Machine Learning · Lecture 03

Residual Connections to Ease Optimization

  • Residual networks (ResNets) introduce skip connections between layers:
    • Instead of learning a mapping $H(x)$ directly, the network learns a residual function $F(x) = H(x) - x$ and outputs

      $$y = F(x) + x.$$

  • Benefits:
    • If the optimal mapping is close to the identity, learning the residual $F(x)$ is easier than learning $H(x)$ directly.
    • Gradients can flow directly through the skip connection, reducing vanishing gradient issues in very deep networks.
  • Although originally developed for image CNNs, residual ideas are widely used in:
    • Deep MLPs for tabular data.
    • Deep sequence models.
  • In finance, residual connections can help train deeper architectures that capture complex interactions, while keeping optimization stable.
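  • A minimal sketch (PyTorch) of a residual block for tabular inputs, with widths and activations chosen purely for illustration:

```python
import torch
import torch.nn as nn

# Sketch of a residual block for tabular/MLP-style inputs: y = x + F(x).
# Width and activation choices are illustrative.
class ResidualBlock(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.f = nn.Sequential(
            nn.Linear(dim, dim), nn.ReLU(),
            nn.Linear(dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.f(x)        # skip connection lets gradients bypass F

# Stack several blocks to obtain a deeper but still trainable network.
net = nn.Sequential(nn.Linear(30, 64), ResidualBlock(64), ResidualBlock(64), nn.Linear(64, 1))
out = net(torch.randn(16, 30))
```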
Financial Machine Learning · Lecture 03

Regularization in Deep Networks

  • Deep models have many parameters and are prone to overfitting, especially in financial datasets with limited effective sample size.
  • Common regularization strategies:
    • Weight decay: L2 penalty on weights, equivalent to a Gaussian prior in a Bayesian view.
    • Early stopping: monitor validation performance and stop training when it starts to deteriorate.
    • Dropout: randomly zero out a fraction of units during training, forcing the network to learn more robust representations.
  • Additional techniques:
    • Data augmentation when applicable (e.g., slight perturbations of inputs).
    • Ensemble methods (averaging predictions of multiple trained models).
  • Effective regularization is essential for models to generalize across market regimes rather than just fitting historical noise.
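  • A minimal sketch (PyTorch) combining weight decay, dropout, and early stopping; the data split, patience, and architecture are placeholders:

```python
import torch
import torch.nn as nn

# Sketch combining three regularizers discussed above: dropout, weight decay, early stopping.
# Data split, patience, and architecture are illustrative.
model = nn.Sequential(nn.Linear(30, 64), nn.ReLU(), nn.Dropout(p=0.3), nn.Linear(64, 1))
opt = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)   # L2 penalty on weights
loss_fn = nn.MSELoss()

X_tr, y_tr = torch.randn(4000, 30), torch.randn(4000, 1)
X_va, y_va = torch.randn(1000, 30), torch.randn(1000, 1)

best_val, patience, bad_epochs = float("inf"), 10, 0
for epoch in range(200):
    model.train()
    loss = loss_fn(model(X_tr), y_tr)
    opt.zero_grad(); loss.backward(); opt.step()

    model.eval()
    with torch.no_grad():
        val = loss_fn(model(X_va), y_va).item()
    if val < best_val:
        best_val, bad_epochs = val, 0
    else:
        bad_epochs += 1
        if bad_epochs >= patience:      # early stopping: validation loss stopped improving
            break
```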
Financial Machine Learning · Lecture 03

Bayesian View of Neural Networks (Very Brief)

  • In a Bayesian interpretation, network weights are treated as random variables with prior distributions:
    • For example, weight decay corresponds to a Gaussian prior on weights.
  • Instead of a single point estimate $\hat{\theta}$, we consider a posterior distribution $p(\theta \mid \mathcal{D})$ given data $\mathcal{D}$.
  • Predictions integrate over parameter uncertainty:

    $$p(y \mid x, \mathcal{D}) = \int p(y \mid x, \theta)\, p(\theta \mid \mathcal{D})\, d\theta.$$
  • Exact Bayesian neural networks are computationally challenging, but:
    • Approximate methods (e.g., variational inference, dropout-as-Bayesian) provide uncertainty estimates.
  • For finance, a Bayesian perspective highlights parameter uncertainty and predictive uncertainty, which are important for risk management and decision-making under uncertainty.
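  • One such approximation, Monte Carlo dropout, keeps dropout active at prediction time and averages repeated stochastic forward passes; a minimal sketch (PyTorch), with an illustrative model and number of draws:

```python
import torch
import torch.nn as nn

# Sketch of Monte Carlo dropout: keep dropout active at prediction time and
# average over stochastic forward passes to approximate predictive uncertainty.
model = nn.Sequential(nn.Linear(30, 64), nn.ReLU(), nn.Dropout(p=0.3), nn.Linear(64, 1))
x_new = torch.randn(1, 30)

model.train()                      # train() keeps dropout stochastic at prediction time
with torch.no_grad():
    draws = torch.stack([model(x_new) for _ in range(200)])   # 200 stochastic passes

pred_mean = draws.mean(dim=0)      # approximate predictive mean
pred_std = draws.std(dim=0)        # rough measure of predictive uncertainty
```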
Financial Machine Learning · Lecture 03

Regularization, Overfitting, and Explainability

  • Deep models have many parameters, so regularization is critical:
    • Weight decay (L2 penalty on weights).
    • Dropout: randomly zeroing activations during training to prevent co-adaptation.
    • Early stopping: monitor validation loss and stop before overfitting.
    • Data augmentation (where meaningful for financial data).
  • Overfitting in finance is particularly dangerous because:
    • Sample sizes are often smaller than in internet applications.
    • Markets and regimes change over time, leading to nonstationarity.
  • Explainability:
    • Feature importance via perturbation or gradient-based attribution.
    • Comparing deep outputs with simple benchmarks (linear models, trees).
    • For regulated contexts (credit, consumer finance), simpler models may still be preferred despite deep learning’s flexibility.
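  • As an illustration of gradient-based attribution, the sketch below (PyTorch) computes a simple saliency measure; the model, input, and number of top features are placeholders:

```python
import torch
import torch.nn as nn

# Sketch of gradient-based attribution (saliency): the gradient of the prediction
# with respect to each input feature indicates local sensitivity.
# Model and data are placeholders.
model = nn.Sequential(nn.Linear(30, 64), nn.ReLU(), nn.Linear(64, 1))
x = torch.randn(1, 30, requires_grad=True)

model(x).sum().backward()            # d(prediction) / d(inputs)
saliency = x.grad.abs().squeeze()    # larger value = feature matters more for this input

top_features = torch.topk(saliency, k=5).indices   # five locally most influential features
```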
Financial Machine Learning · Lecture 03

Hybrid Architectures in Finance

  • In practice, many successful deep learning systems in finance are hybrid architectures:
    • CNN + LSTM:
      • CNN extracts local patterns from structured inputs (e.g., limit order book snapshots, term-structure slices).
      • LSTM models the temporal evolution of these extracted features.
      flowchart LR
          %% From LOB snapshots to a price-impact forecast
          LOB["LOB Images<br/>(Limit Order Book)"]
          CNN["CNN Layer<br/>(Feature Extraction)"]
          LSTM["LSTM Layer<br/>(Temporal Modeling)"]
          PriceImpact["Price Impact Forecast"]
          %% Connections
          LOB --> CNN
          CNN --> LSTM
          LSTM --> PriceImpact
          %% Styling
          style LOB fill:#bbdefb,stroke:#2196f3,stroke-width:2px
          style CNN fill:#e1bee7,stroke:#8e24aa
          style LSTM fill:#e8f5e9,stroke:#2e7d32
          style PriceImpact fill:#ffebee,stroke:#c62828,stroke-width:2px
    • GNN + MLP/RNN:
      • GNN aggregates information over financial networks (exposures, ownership, supply chains).
      • MLP or RNN maps node embeddings into predictions (default probabilities, volatility, losses).
      flowchart LR
          %% From supply-chain graph to default/return prediction
          SupplyChain["Supply-Chain Graph<br/>(Graph Structure)"]
          GNN["Graph Neural Network<br/>(Node Embedding)"]
          FirmEmbedding["Firm Embedding<br/>(Feature Vector)"]
          MLP_RNN["MLP / RNN Layer<br/>(Prediction Model)"]
          DefaultReturn["Default / Return Prediction"]
          %% Connections
          SupplyChain --> GNN
          GNN --> FirmEmbedding
          FirmEmbedding --> MLP_RNN
          MLP_RNN --> DefaultReturn
          %% Styling
          style SupplyChain fill:#fff9c4,stroke:#fbc02d,stroke-width:2px
          style GNN fill:#ffe0b2,stroke:#ff9800
          style FirmEmbedding fill:#e8f5e9,stroke:#2e7d32
          style MLP_RNN fill:#f3e5f5,stroke:#8e24aa
          style DefaultReturn fill:#ffebee,stroke:#c62828,stroke-width:2px

Financial Machine Learning · Lecture 03
  • These combinations leverage different inductive biases:
    • Convolution for local patterns.
    • Recurrence for temporal dependence.
    • Graph structure for network effects.
  • When designing a model, think in terms of which architecture fits which part of the data, rather than searching for a single “best” deep model.
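  • As a concrete illustration of the CNN + LSTM pattern, a minimal sketch (PyTorch); the input shape (a short history of limit-order-book snapshots), channel counts, and layer sizes are invented for illustration:

```python
import torch
import torch.nn as nn

# Sketch of a CNN + LSTM hybrid: a CNN extracts local patterns from each
# order-book snapshot, and an LSTM models their evolution over time.
# All shapes (levels, history length, channels) are illustrative.
class CNNLSTM(nn.Module):
    def __init__(self):
        super().__init__()
        self.cnn = nn.Sequential(                       # per-snapshot feature extractor
            nn.Conv1d(in_channels=4, out_channels=16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),                    # pool over price levels -> (16, 1)
        )
        self.lstm = nn.LSTM(input_size=16, hidden_size=32, batch_first=True)
        self.head = nn.Linear(32, 1)                    # e.g., price-impact forecast

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, channels, levels), e.g. bid/ask price & size over 10 levels
        b, t, c, l = x.shape
        feats = self.cnn(x.reshape(b * t, c, l)).reshape(b, t, -1)   # (batch, time, 16)
        out, _ = self.lstm(feats)
        return self.head(out[:, -1, :])                 # prediction from the last time step

pred = CNNLSTM()(torch.randn(8, 50, 4, 10))             # 8 samples, 50 snapshots, 4 channels, 10 levels
```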
Financial Machine Learning · Lecture 03

When to Use Deep Learning in Finance (and When Not)

  • Deep learning is useful when:
    • We have large datasets with rich cross-sectional and/or temporal variation.
    • Relationships are highly nonlinear and difficult to capture with simple models.
    • The main goal is predictive performance, and interpretability requirements are moderate.
    • Inputs have complex structure (sequences, networks, high-frequency microstructure data).
  • It is less attractive when:
    • Sample size is small, or effective data size is limited by regime shifts and structural breaks.
    • Regulatory or business constraints require high interpretability and transparency.
    • Simple models (e.g., linear, tree-based) already achieve satisfactory performance.
  • In the next lecture, we move from predictive modeling to decision-making (reinforcement learning), where model complexity must be balanced against stability, interpretability, and control.
Financial Machine Learning · Lecture 03

Overall Summary and Outlook

  • We introduced a toolkit of deep learning architectures for finance:
    • MLP for general high-dimensional nonlinear prediction on tabular data.
    • CNN for structured arrays and local patterns.
    • RNN / LSTM / GRU for sequences and temporal dynamics.
    • Pre-Transformer attention for selectively focusing on important parts of sequences.
    • VAE / GAN for generative modeling and scenario simulation.
    • GNN for networked and relational financial data.
  • Across architectures, we emphasized:
    • Core formulations and intuitions.
    • Typical procedures for training.
    • High-level pros/cons and finance applications.
  • In the next lecture, we move from supervised learning with fixed data to Reinforcement Learning, where agents interact with dynamic financial environments (markets, portfolios, trading systems).
Financial Machine Learning · Lecture 03
