Spread Decomposition: Huang-Stoll (1997)

The previous articles developed the three theoretical reasons a spread exists:

  1. Inventory risk — the maker bears directional exposure (ho-stoll-inventory-model)
  2. Adverse selection — some counterparties are informed (glosten-milgrom-model, kyle-lambda)
  3. Order processing — the operational cost of running a market-making business (technology, clearing, regulatory compliance)

Huang and Stoll (1997) unified these into an empirical framework that decomposes the observed spread into its constituent parts. This is where theory meets data.

The Decomposition Framework

Let be the observed spread. Huang-Stoll model the transaction price as:

where is the efficient (true) price and indicates the trade direction (buy or sell).

The efficient price evolves as:

where is the adverse selection component — the fraction of the half-spread that represents permanent information. The innovation captures public information arrivals.

The full model decomposes the half-spread into three components with weights summing to one:

ComponentSymbolEconomic ContentObservable Signature
Adverse selectionPermanent price impact of trades — information that does not revertTrade direction predicts future price level
InventoryTransient price impact — quote adjustment to manage positionTrade direction predicts next quote change, but effect reverts
Order processingPure cost — no price impact, no quote adjustmentBid-ask bounce with no predictive content

Estimation

The model generates a testable relationship between successive price changes and trade directions. The autocovariance structure of as a function of lagged identifies and .

In practice, estimation proceeds via GMM or OLS on the regression:

The coefficient on gives (total informative component), and the coefficient on identifies (inventory reversion). The residual is order processing.

Typical empirical findings for US equities (pre-decimalization):

  • Adverse selection: 30-50% of the spread
  • Inventory: 10-30%
  • Order processing: 20-40%

Post-decimalization and with electronic trading, order processing costs collapsed. Adverse selection now dominates for most liquid names.

For how the three spread components manifest in decentralized exchanges, see constant-product-amm and impermanent-loss.

Practical Application

Spread decomposition is not just academic. It directly informs:

  • Execution quality measurement: regulators and institutional desks use adverse selection estimates to evaluate broker performance
  • Market maker strategy: knowing which component dominates tells you whether to improve hedging () or flow selection ()
  • Protocol design: any venue designing fee structures should estimate the adverse selection share — a flat fee that covers order processing but not adverse selection will erode market maker capital

Connecting the Module

This article synthesizes the theoretical models into an empirical toolkit:

The decomposition closes the loop: we started with “why do spreads exist?” and now have both theoretical models and an empirical method to answer “how much of the spread is due to each cause?”


Companion notebook: notebook — simulate Huang-Stoll tick data, estimate α/β via OLS, visualize the three-way spread decomposition.

Questions to sit with:

  1. If adverse selection dominates the spread for a given asset, what does that imply about the profitability of passive market making on that asset?
  2. If the adverse selection component of the spread increases but the order processing component decreases (e.g., due to lower exchange fees), what happens to total quoted spreads? What does this predict about market maker profitability?