How I would start Quantitative Analysis from scratch.

How I would start Quantitative Analysis from scratch.

Publish Date: Jul 16
0 0

Beyond the Random Walk:
If you've ever tried to learn quantitative analysis, you've likely been met with a wall of Greek letters and concepts like Brownian motion, Ito's Lemma, and autoregressive models. The traditional approach is rooted in a single, core idea: financial markets are, for the most part, a "random walk." The goal, then, is to find the tiny, fleeting signals hidden within that noise.

This is a valid approach, but it's not the only one. And for a beginner, it can be mathematically daunting and intuitively unsatisfying.

What if, instead of starting with randomness, you started with information? What if, instead of looking for linear patterns, you looked for the hidden shape of the market? This article proposes a different starting point for your quant journey, one that leverages unique, powerful concepts that are more intuitive and, arguably, more fundamental.

Ditch the Brownian Motion, Embrace the Surprise: Starting with Information Theory
Traditional finance models the path. Information theory models the surprise at each step of that path. The core idea was developed by Claude Shannon, the father of the information age, and it's called Entropy.

In simple terms, Shannon Entropy is a measure of uncertainty or surprise.

A predictable event (a biased coin that always lands on heads) has zero entropy. There is no surprise.

A completely unpredictable event (a fair coin flip) has maximum entropy.

Instead of modelling price, what if we modelled the information content of market data?

A Unique Algorithm: Order Flow Entropy
Price is a lagging indicator; it's the result of thousands of buy and sell orders. The real information lies in the order flow. Imagine you could see the stream of buy and sell orders for a stock.

High Entropy State: Buyers and sellers are evenly matched. The flow is chaotic and unpredictable. The next move is uncertain.

Low Entropy State: A huge wave of buy orders suddenly floods the market. The system becomes highly predictable for a short time. The uncertainty collapses. There is less "surprise."

The Hypothesis: A sudden, sharp drop in the entropy of order flow could be a powerful signal that a large, informed entity is making a move, creating a predictable pattern just before a significant price change. You are no longer predicting price based on its past; you are predicting price based on a measurable change in the market's underlying information structure.

This concept is more direct. It's about measuring the conviction of market participants, not just the noisy outcome of their actions.

Finding the "Shape" of Market Fear: An Introduction to Topological Data Analysis (TDA)
The second unconventional tool is Topological Data Analysis (TDA). While standard statistics is great at finding lines, clusters, and correlations, it struggles with more complex structures. TDA is a field of mathematics that studies the "shape" of data, looking for things like loops, voids, and other intricate structures.

The core algorithm here is Persistent Homology.

Imagine your data is a cloud of points in space. Persistent homology works by "inflating" a ball around each data point. As the balls grow, they start to connect, forming shapes. Some of these shapes are fleeting, but others persist for a long time. TDA tracks how long these topological features—like connected components, loops, and voids—last.

A Unique Application: The Shape of Systemic Risk
How can we use this? Instead of a 2D plot of price vs. time, imagine a 3D point cloud where each point represents a single trading day, defined by:

X-axis: S&P 500 daily return

Y-axis: VIX Index level (a measure of volatility or "fear")

Z-axis: 10-Year US Treasury yield change

The Hypothesis:
In a "normal" market, the shape of this data cloud might be a dense, formless blob.
In the lead-up to a financial crisis, the data points might start to form a distinct loop or void. This shape could represent a dangerous feedback loop where high volatility (high VIX) is persistently associated with negative returns and a flight to safety (falling bond yields). The market gets "stuck" in this topological feature.
This "shape of fear" is a multi-dimensional pattern that simple correlation analysis would completely miss. Detecting the formation and persistence of such a shape could be a powerful, non-obvious indicator of systemic risk.
Your First "Unconventional" Quant Project
You don't need a PhD to start exploring these ideas. Here’s a simple roadmap:

The Mindset Shift: Your goal isn't to predict the price of Apple tomorrow. Your goal is to measure a property of the market itself. Is the market's information content changing? Is its "shape" deforming?
Gather Data: Get free daily data from Yahoo Finance for the SPY (S&P 500 ETF) and the ^VIX (Volatility Index).

Calculate "Surprise": Write a simple Python script to calculate the daily percentage change of the VIX. Then, create a histogram of these changes. Is it a perfect bell curve, or is it skewed? A skewed distribution implies that "surprise" is not symmetrical—large spikes in fear are more common than large drops. This is your first, simple information-theoretic insight.

Visualise the "Shape": Create a 2D scatter plot of (SPY daily return, VIX daily change). Now, create this plot for different years. Compare the shape of the data cloud for a calm year (like 2017) versus a volatile year (like 2008 or 2020). You will physically see the shape of the market change. This is the core intuition behind TDA.

Therefore, starting your quantitative analysis journey with these concepts has a distinct advantage. It forces you to think about the market not as a series of numbers to be fit by an equation, but as a complex system with its structure, information flow, and geometry.

It's a more challenging, but ultimately more rewarding, path. Please stop trying to predict the random walk and start trying to understand the shape of the system itself.

Comments 0 total

    Add comment