Spread Decomposition: Huang-Stoll (1997)
The previous articles developed the three theoretical reasons a spread exists:
- Inventory risk — the maker bears directional exposure (ho-stoll-inventory-model)
- Adverse selection — some counterparties are informed (glosten-milgrom-model, kyle-lambda)
- Order processing — the operational cost of running a market-making business (technology, clearing, regulatory compliance)
Huang and Stoll (1997) unified these into an empirical framework that decomposes the observed spread into its constituent parts. This is where theory meets data.
The Decomposition Framework
Let be the observed spread. Huang-Stoll model the transaction price as:
where is the efficient (true) price and indicates the trade direction (buy or sell).
The efficient price evolves as:
where is the adverse selection component — the fraction of the half-spread that represents permanent information. The innovation captures public information arrivals.
The full model decomposes the half-spread into three components with weights summing to one:
| Component | Symbol | Economic Content | Observable Signature |
|---|---|---|---|
| Adverse selection | Permanent price impact of trades — information that does not revert | Trade direction predicts future price level | |
| Inventory | Transient price impact — quote adjustment to manage position | Trade direction predicts next quote change, but effect reverts | |
| Order processing | Pure cost — no price impact, no quote adjustment | Bid-ask bounce with no predictive content |
Estimation
The model generates a testable relationship between successive price changes and trade directions. The autocovariance structure of as a function of lagged identifies and .
In practice, estimation proceeds via GMM or OLS on the regression:
The coefficient on gives (total informative component), and the coefficient on identifies (inventory reversion). The residual is order processing.
Typical empirical findings for US equities (pre-decimalization):
- Adverse selection: 30-50% of the spread
- Inventory: 10-30%
- Order processing: 20-40%
Post-decimalization and with electronic trading, order processing costs collapsed. Adverse selection now dominates for most liquid names.
For how the three spread components manifest in decentralized exchanges, see constant-product-amm and impermanent-loss.
Practical Application
Spread decomposition is not just academic. It directly informs:
- Execution quality measurement: regulators and institutional desks use adverse selection estimates to evaluate broker performance
- Market maker strategy: knowing which component dominates tells you whether to improve hedging () or flow selection ()
- Protocol design: any venue designing fee structures should estimate the adverse selection share — a flat fee that covers order processing but not adverse selection will erode market maker capital
Connecting the Module
This article synthesizes the theoretical models into an empirical toolkit:
- ho-stoll-inventory-model provides the theory behind the component
- glosten-milgrom-model and kyle-lambda provide the theory behind the component
- trading-fundamentals introduced the spread as the price of immediacy — now we know that “immediacy” bundles three distinct costs
- order-books and trading-venues showed where these costs manifest across different venue types
The decomposition closes the loop: we started with “why do spreads exist?” and now have both theoretical models and an empirical method to answer “how much of the spread is due to each cause?”
Companion notebook: notebook — simulate Huang-Stoll tick data, estimate α/β via OLS, visualize the three-way spread decomposition.
Questions to sit with:
- If adverse selection dominates the spread for a given asset, what does that imply about the profitability of passive market making on that asset?
- If the adverse selection component of the spread increases but the order processing component decreases (e.g., due to lower exchange fees), what happens to total quoted spreads? What does this predict about market maker profitability?