1. Motivation: Why Particle Filters Exist
Many real-world systems evolve over time, but their true internal state cannot be observed directly. Examples include the position of a robot, the health state of a patient, the volatility of a financial market, or the location of a moving target. We observe noisy measurements and must infer the hidden state sequentially.
Classical filters such as the Kalman filter work beautifully for linear-Gaussian systems. However, many practical systems are nonlinear, non-Gaussian, multimodal, or include complicated likelihoods. In these cases, exact Bayesian filtering is usually analytically intractable.
Why this matters
- Nonlinearity: particle filters do not require linear system dynamics.
- Non-Gaussian noise: they can handle heavy-tailed, skewed, mixture, or discrete-continuous noise models.
- Multimodality: they can represent several plausible state hypotheses at once.
- Online inference: they update estimates sequentially as new observations arrive.
- Flexible modeling: they can work with complex simulation-based transition models and arbitrary likelihoods.
2. Basic Concept of Particle Filtering
The particle filter approximates a probability distribution by many samples. Each sample represents one possible hidden state, and its weight represents how plausible that state is after observing the data. Instead of storing only a mean and covariance, the filter stores an empirical distribution.
| Element | Meaning | Role in particle filtering |
|---|---|---|
| Particle | A sampled hypothesis for the hidden state. | Represents one possible state of the system. |
| Weight | A normalized importance value attached to a particle. | Measures how well the particle explains the observation. |
| Transition model | The model describing how the state evolves over time. | Used to propagate particles forward. |
| Observation likelihood | The probability of the observation given a state. | Used to update particle weights. |
| Resampling | A selection step that duplicates high-weight particles and removes low-weight particles. | Controls weight degeneracy and focuses computation on promising regions. |
Particle representation
The filtering posterior is represented as a weighted empirical distribution:
3. State-Space Model
Particle filters are usually formulated for hidden Markov or state-space models. The hidden state evolves over time, and observations are generated from the hidden state with measurement noise.
| Symbol | Meaning |
|---|---|
| x_t | Hidden state at time t. |
| y_t | Observation or measurement at time t. |
| f_t(x_t | x_{t−1}) | Transition density describing the system dynamics. |
| g_t(y_t | x_t) | Observation likelihood describing how likely a measurement is for a given state. |
| p(x_t | y_1:t) | Filtering posterior: the distribution of the current state given all observations so far. |
Filtering objective
The main objective is to recursively estimate the posterior distribution of the current hidden state:
This posterior combines prior information from the dynamics with new information from the latest observation.
4. Mathematical Formulation
Bayesian filtering consists of two conceptual steps: prediction and update. Particle filters approximate these steps using Monte Carlo samples.
4.1 Prediction step
Before observing y_t, the previous posterior is propagated through the transition model:
4.2 Update step
After observing y_t, Bayes' rule updates the predicted distribution:
4.3 Monte Carlo expectation
Any posterior expectation can be approximated by a weighted sum over particles:
4.4 Posterior mean estimate
A common point estimate is the weighted posterior mean:
4.5 Importance weight update
In a general sequential importance sampling formulation, particles may be sampled from a proposal distribution q rather than directly from the transition model. The weights correct the mismatch between the proposal and the target distribution.
5. Basic Particle Filter Algorithm
The most common introductory algorithm is the bootstrap particle filter, also called the sampling-importance-resampling filter. It uses the transition model as the proposal distribution.
5.1 Bootstrap particle filter
- Initialize: draw particles x₀⁽ⁱ⁾ from the initial distribution p(x₀).
- Propagate: sample new particles from the dynamics, x_t⁽ⁱ⁾ ~ f_t(x_t | x_{t−1}⁽ⁱ⁾).
- Weight: assign each particle a likelihood weight, w̃_t⁽ⁱ⁾ = g_t(y_t | x_t⁽ⁱ⁾).
- Normalize: divide each weight by the sum of all weights.
- Resample: sample N particles from the current particle set according to normalized weights.
- Estimate: compute posterior summaries such as mean, variance, quantiles, or MAP estimate.
5.2 Effective sample size
Weight degeneracy is commonly monitored using effective sample size:
If all particles have equal weight, ESS is close to N. If one particle dominates, ESS is close to 1. Many implementations resample only when ESS drops below a threshold such as N/2.
5.3 Resampling methods
| Method | Main idea | Comment |
|---|---|---|
| Multinomial resampling | Draw N ancestor indices independently according to particle weights. | Simple but can have high variance. |
| Residual resampling | Deterministically allocate copies based on integer parts of Nw, then sample the residual. | Lower variance than multinomial in many cases. |
| Stratified resampling | Divide the unit interval into N strata and sample once from each stratum. | Reduces sampling variability. |
| Systematic resampling | Use one random offset and equally spaced points on the cumulative weight distribution. | Very common because it is simple and efficient. |
6. Main Variants and Extensions
The basic bootstrap filter is only one member of a large family of sequential Monte Carlo methods. Many variants were developed to reduce degeneracy, use better proposals, handle structure, or perform smoothing and parameter estimation.
| Variant | Main idea | Best used when |
|---|---|---|
| Bootstrap particle filter | Samples from the transition model and weights by the observation likelihood. | A simple baseline is needed and transition simulation is easy. |
| Sequential importance sampling | Uses a proposal distribution and updates weights sequentially. | The proposal can be designed to better match the posterior. |
| Auxiliary particle filter | Uses look-ahead information to select particles likely to explain the next observation. | Observations are informative and the bootstrap proposal is inefficient. |
| Regularized particle filter | Adds kernel smoothing or jittering after resampling. | Sample impoverishment is severe. |
| Rao-Blackwellized particle filter | Samples only part of the state and marginalizes tractable components analytically. | The model has conditional linear-Gaussian or discrete structure. |
| Resample-move particle filter | Adds MCMC rejuvenation steps after resampling. | Particle diversity must be restored after repeated resampling. |
| Particle smoother | Estimates full trajectories p(x₁:T | y₁:T), not only the current state. | Offline reconstruction or retrospective analysis is needed. |
| Particle MCMC | Uses particle filters inside MCMC algorithms for full Bayesian inference. | Joint state and parameter inference is required. |
| SMC² / nested SMC | Uses particles over parameters and state trajectories. | Online parameter learning is needed in complex state-space models. |
Why variants matter
A basic particle filter may fail if particles are proposed in regions that are inconsistent with the latest observation. More advanced filters attempt to use observations earlier, exploit model structure, or rejuvenate particles after resampling.
Better proposals
Use y_t when proposing x_t so particles are more likely to land in high-likelihood regions.
Auxiliary PF Optimal proposalModel structure
Marginalize tractable state components analytically instead of sampling everything.
Rao-Blackwellization Kalman submodelDiversity restoration
Use jittering, kernels, or MCMC moves to avoid collapse into repeated copies.
Regularization Resample-moveBeyond filtering
Extend particle methods to smoothing, parameter estimation, and full Bayesian inference.
PMCMC SMC²7. Applications of Particle Filters
Particle filters are used in many domains where hidden states must be estimated sequentially from noisy data and where nonlinear or non-Gaussian effects are important.
| Application area | Why particle filters are useful | Example use |
|---|---|---|
| Robotics and localization | Robot position posteriors can be multimodal, especially under ambiguous sensor readings. | Monte Carlo localization using laser, vision, or range sensors. |
| Target tracking | Targets can move nonlinearly and observations can be cluttered or intermittent. | Radar, sonar, visual object tracking, and multi-target tracking. |
| Navigation | Sensor fusion often involves nonlinear motion and observation models. | GPS/INS fusion, pedestrian tracking, and vehicle localization. |
| Computer vision | Visual tracking often has occlusion, appearance changes, and multimodal hypotheses. | Tracking faces, hands, objects, or articulated body pose. |
| Econometrics and finance | Latent volatility and regime-switching models can be nonlinear and non-Gaussian. | Stochastic volatility estimation and financial time-series filtering. |
| Signal processing | Dynamic signals may involve nonlinear hidden states and nonstandard noise. | Channel estimation, speech processing, and source localization. |
| Biomedicine | Physiological states are hidden and measurements are noisy or sparse. | Disease progression, neural decoding, and patient-state monitoring. |
| Environmental systems | Physical systems can be nonlinear and uncertain, with partial observations. | Hydrology, meteorology, ecology, and data assimilation. |
8. Challenges and Limitations
Particle filters are flexible, but they are not automatic. Their performance depends strongly on the number of particles, proposal distribution, observation informativeness, state dimension, and resampling strategy.
Weight degeneracy
After several iterations, most particles may have nearly zero weight, so only a few particles meaningfully represent the posterior.
Central problem ESS diagnosticSample impoverishment
Resampling duplicates high-weight particles and discards low-weight ones, which can reduce particle diversity.
Resampling side effect Low process noiseCurse of dimensionality
In high-dimensional spaces, particle weights can collapse so that one particle dominates unless the particle count is extremely large.
High-dimensional failure ScalabilityPoor proposal design
If particles are proposed without considering the latest observation, many particles may land in low-likelihood regions.
Bootstrap weakness Need better qComputational cost
The cost grows with particle number, time horizon, state dimension, and likelihood complexity.
O(NT) Expensive likelihoodsPath degeneracy
For smoothing, repeated resampling causes particles to share common ancestors, making early trajectory estimates unreliable.
Smoothing issue Genealogical collapseImportant research gaps
- How can particle filters be made reliable in very high-dimensional systems?
- How can proposal distributions be learned or adapted automatically?
- How can resampling preserve diversity without introducing excessive variance?
- How should particle filters be combined with neural networks and differentiable simulators?
- How can uncertainty, robustness, and calibration be preserved in real-time deployments?
9. Practical Guidelines
Successful particle filtering is usually a matter of careful modeling and diagnostics. The following guidelines are useful when implementing or evaluating a particle filter.
| Question | Recommended practice | Reason |
|---|---|---|
| How many particles? | Start small for debugging, then increase N while monitoring ESS and estimation stability. | Too few particles cause degeneracy; too many particles increase computation. |
| When to resample? | Use adaptive resampling based on ESS rather than resampling at every time step. | This reduces unnecessary sample impoverishment. |
| Which proposal? | Use the transition model for simple cases, but consider observation-informed proposals when measurements are strong. | Better proposals reduce weight variance. |
| Which resampling scheme? | Systematic or stratified resampling is often preferred over basic multinomial resampling. | They are simple and usually lower variance. |
| How to handle structure? | Use Rao-Blackwellization when some state components can be integrated analytically. | Sampling fewer variables improves efficiency and accuracy. |
| How to diagnose failure? | Track ESS, weight variance, particle diversity, log-likelihood estimates, and posterior visualizations. | Particle collapse can otherwise be hidden by a plausible point estimate. |
When to use particle filters
Good fit
- Low- to moderate-dimensional hidden state.
- Nonlinear or non-Gaussian dynamics.
- Multimodal posterior distributions.
- Sequential, online inference.
- Simulation from the transition model is easy.
Poor fit
- Very high-dimensional state spaces.
- Extremely sharp likelihoods with few particles.
- Nearly deterministic dynamics with little process noise.
- Expensive likelihood evaluations.
- Static parameter learning without advanced methods.
10. Conclusion
Particle filters are best understood as a general Monte Carlo solution to recursive Bayesian filtering. They approximate the posterior distribution of a hidden state using weighted samples, making them suitable for nonlinear and non-Gaussian systems where analytic filters are unavailable.
The strongest explanation structure is therefore: motivation → state-space model → Bayesian recursion → particle approximation → algorithm → variants → applications → limitations.
Their greatest strength is flexibility: particles can approximate complex posterior shapes. Their greatest weakness is scalability: weight degeneracy, sample impoverishment, and high-dimensional collapse can make naive particle filters unreliable. In practice, good performance requires careful proposal design, adaptive resampling, diagnostics, and sometimes model-specific structure such as Rao-Blackwellization or MCMC rejuvenation.
References and Key Papers
- Gordon, N. J., Salmond, D. J., & Smith, A. F. M. (1993). Novel approach to nonlinear/non-Gaussian Bayesian state estimation. IEE Proceedings F, 140(2), 107–113.
- Kitagawa, G. (1996). Monte Carlo filter and smoother for non-Gaussian nonlinear state space models. Journal of Computational and Graphical Statistics, 5(1), 1–25.
- Liu, J. S., & Chen, R. (1998). Sequential Monte Carlo methods for dynamic systems. Journal of the American Statistical Association, 93(443), 1032–1044.
- Doucet, A., de Freitas, N., & Gordon, N. J. (Eds.). (2001). Sequential Monte Carlo Methods in Practice. Springer.
- Arulampalam, M. S., Maskell, S., Gordon, N., & Clapp, T. (2002). A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Transactions on Signal Processing, 50(2), 174–188.
- Pitt, M. K., & Shephard, N. (1999). Filtering via simulation: Auxiliary particle filters. Journal of the American Statistical Association, 94(446), 590–599.
- Doucet, A., Godsill, S., & Andrieu, C. (2000). On sequential Monte Carlo sampling methods for Bayesian filtering. Statistics and Computing, 10, 197–208.
- Cappé, O., Godsill, S. J., & Moulines, E. (2007). An overview of existing methods and recent advances in sequential Monte Carlo. Proceedings of the IEEE, 95(5), 899–924.
- Doucet, A., & Johansen, A. M. (2011). A tutorial on particle filtering and smoothing: Fifteen years later. In The Oxford Handbook of Nonlinear Filtering.
- Andrieu, C., Doucet, A., & Holenstein, R. (2010). Particle Markov chain Monte Carlo methods. Journal of the Royal Statistical Society: Series B, 72(3), 269–342.
- Bengtsson, T., Bickel, P., & Li, B. (2008). Curse-of-dimensionality revisited: Collapse of the particle filter in very large scale systems. IMS Collections, 2, 316–334.
- Snyder, C., Bengtsson, T., Bickel, P., & Anderson, J. (2008). Obstacles to high-dimensional particle filtering. Monthly Weather Review, 136(12), 4629–4640.
- Kantas, N., Doucet, A., Singh, S. S., Maciejowski, J., & Chopin, N. (2015). On particle methods for parameter estimation in state-space models. Statistical Science, 30(3), 328–351.
- Chopin, N., & Papaspiliopoulos, O. (2020). An Introduction to Sequential Monte Carlo. Springer.
- Särkkä, S. (2013). Bayesian Filtering and Smoothing. Cambridge University Press.
- Wills, A., & Schön, T. B. (2023). Sequential Monte Carlo: A Unified Review. Annual Review of Control, Robotics, and Autonomous Systems, 6, 159–182.