Overconfident \(R_t\) estimates from the Poisson renewal equation method

paper-summary
methods
epidemiological-parameters
Authors
Affiliation

Barbora Němcová

Karlsruhe Institute of Technology

Karlsruhe Institute of Technology

Published

January 21, 2026

This blog post is based on a preprint we recently posted, summarizing the main message. The figure used here is ours.

The problem of overconfidence in the Poisson renewal equation method

Estimating the effective reproductive number, \(R_t\), is often done using methods based on the Poisson renewal equation, which makes use of the recent incidence data. A part of the appeal of the renewal equation approach is that it is simple to understand and apply in practice. However, the underlying Poisson assumption enforces equidispersion and therefore cannot handle the overdispersion, commonly observed in infectious disease data. As a result, the method often issues too narrow confidence intervals for the \(R_t\) estimates. A highly popular implementation, the EpiEstim package by Cori et al. (2013), is notably affected by this issue, as are several related \(R_t\) estimation tools, including EpiInvert, ern, estimateR and Rtglm. While this limitation has been pointed out empirically on several occasions (Goldstein, Wakefield, and Minin 2023; Brockhaus et al. 2023), we summarise it concisely in our preprint to bring it to the attention of the community.

The renewal equation and the Poisson assumption

The Poisson renewal equation can be expressed as:

\[ \begin{split} X_t \mid \text{Past} & \sim \text{Pois}(\mu_t),\ t = 1, \dots, T \\ \mu_t & = R\times \Lambda_t,\\ \Lambda_t & = \sum_{d = 1}^D w_d \times X_{t - d}. \end{split} \]

Here, \(X_t\) denotes the incidence at time \(t\), and \(w_1, \, \dots, \,w_D\) represents the discrete-time serial interval distribution. The reproductive number \(R\) is assumed constant over an estimation window of width \(T\) days (which is why we omit the index \(_t\)). If we denote the conditional variance \(\text{Var}(X_t\mid \text{Past})\) as \(\sigma^2_t\), then the Poisson distribution implies equidispersion: \(\sigma^2_t = \mu_t\). In practice, however, we often observe overdispersion, where \(\sigma^2_t > \mu_t\). In this case, applying a Poisson model leads to incorrect quantification of the estimation uncertainty.

The renewal equation can be interpreted as a Poisson regression model with an identity link, no intercept, and a single covariate \(\Lambda_t\). We can therefore use standard GLM theory to study its properties. Under a frequentist approach, we can calculate the maximum likelihood estimate of \(R\) as

\[ \hat{R}_{\text{Po}} = \frac{\sum_{t = 1}^T X_t}{\sum_{t = 1}^T \Lambda_t}, \] while the standard error of \(\hat{R}_{\text{Po}}\) based on the Poisson assumption is

\[ \widehat{\text{se}}(\hat{R}_\text{Po}) = \frac{\hat{R}_\text{Po}}{\sqrt{\sum_{t = 1}^T X_t}}. \] The standard error of the estimate thus depends only on the magnitude of the incidence, not on its variability or “noisiness”. Arguments from quasi-Poisson regression imply that if overdispersion is present, the above expression will underestimate the true standard error of \(\hat{R}_{\text{Po}}\) by a factor of \(\sqrt{\phi}\), with

\[ \phi = \frac{\sum_{t = 1}^T \sigma^2_t}{\sum_{t = 1}^T \mu_t}. \]

Illustrative case studies

We can use the regression model formulation of the renewal equation model to relax the equidispersion assumption. One option is to use quasi-Poisson regression instead of Poisson, which yields identical point estimates, but allows for overdispersion. Another possible approach is to replace the Poisson distribution with a negative binomial distribution. In this paper we used two different parametrizations, which we call NegBin-L and NegBin-Q. In our naming convention, L stands for the linear mean-variance relationship, while Q indicates a quadratic mean-variance relationship.

To compare estimates from the Poisson model with those from models that account for overdispersion, we selected three datasets corresponding to different pathogens and locations where the Poisson renewal equation, specifically the EpiEstim method, was used in the past – Influenza among the active U.S. military personnel (Nash et al. 2023), COVID-19 in Austria (Richter et al. 2022) and Ebola in Guinea (Green, Ferguson, and Cori 2022).

Top: incidence from the three case studies. Middle: comparison of reproductive number estimates issued by the Poisson and quasi-Poisson models. Bottom: comparison of reproductive number estimates issued by the NegBin-L and NegBin-Q models.

As shown in the second row of the figure, for Influenza and COVID-19, the Poisson confidence intervals (green) were visually indistinguishable from the point estimates, while the quasi-Poisson intervals (orange) are much wider. This is the consequence of the formula for \(\widehat{\text{se}}(\hat{R}_\text{Po})\), which in the Poisson case is dominated by large incidence values in the denominator. For Ebola, the Poisson intervals are wider. However, they rarely include \(R = 1.0\), implying high confidence about epidemic growth or decline and the quasi-Poisson model is still more uncertain. Comparing the negative binomial models, we can notice some disagreement in point estimates and the confidence interval widths for the Influenza example, which is caused by strong weekday effects. Nonetheless, in all examples, confidence intervals are much wider than those from the Poisson model. In the paper we also conduct a simulation study, which shows that all three models accounting for overdispersion (quasi-Poisson, NegBin-L, and NegBin-Q) work reasonably well, and that it is not crucial whether the mean variance relationship is specified correctly, unless inference about the disease dynamics is required.

The key takeaway

In the paper, we demonstrate that \(R_t\) estimates based on the Poisson renewal equation are overconfident in presence of overdispersion; i.e., the associated uncertainty intervals are too narrow. We formulate the renewal equation model as a GLM to derive, by how much the uncertainty is underestimated. This effect can be very pronounced in practice. Therefore, we do not recommend relying on the Poisson renewal equation model to assess the uncertainty of \(R_t\) estimates. Consequently, we discourage using the current form of EpiEstim and the related R packages discussed above for interval estimation. Alternative tools like EpiNow2 (Abbott et al. 2020) and EpiLPS (Gressani et al. 2022) that account for overdispersion are favorable for the uncertainty quantification, even though their added complexity may present a challenge for some users.

References

Abbott, S, J Hellewell, RN Thompson, K Sherratt, HP Gibbs, NI Bosse, JD Munday, et al. 2020. “Estimating the Time-Varying Reproduction Number of SARS-CoV-2 Using National and Subnational Case Counts.” Wellcome Open Research 5 (112). https://doi.org/10.12688/wellcomeopenres.16006.2.
Brockhaus, E. K., D. Wolffram, T. Stadler, M. Osthege, T. Mitra, J. M. Littek, E. Krymova, et al. 2023. “Why Are Different Estimates of the Effective Reproductive Number so Different? A Case Study on COVID-19 in Germany.” PLOS Computational Biology 19 (11): 1–27. https://doi.org/10.1371/journal.pcbi.1011653.
Cori, A., N. M. Ferguson, C. Fraser, and S. Cauchemez. 2013. “A New Framework and Software to Estimate Time-Varying Reproduction Numbers During Epidemics.” American Journal of Epidemiology 178 (9): 1505–12. https://doi.org/10.1093/aje/kwt133.
Goldstein, I. H., J. Wakefield, and V. M. Minin. 2023. “Incorporating Testing Volume into Estimation of Effective Reproduction Number Dynamics.” Journal of the Royal Statistical Society Series A: Statistics in Society 187 (2): 436–53. https://doi.org/10.1093/jrsssa/qnad128.
Green, WD, NM Ferguson, and A Cori. 2022. “Inferring the Reproduction Number Using the Renewal Equation in Heterogeneous Epidemics.” Journal of The Royal Society Interface 19 (188): 20210429. https://doi.org/10.1098/rsif.2021.0429.
Gressani, O, J Wallinga, C L. Althaus, N Hens, and C Faes. 2022. EpiLPS: A Fast and Flexible Bayesian Tool for Estimation of the Time-Varying Reproduction Number.” PLOS Computational Biology 18 (10): 1–27. https://doi.org/10.1371/journal.pcbi.1010618.
Nash, RK, S Bhatt, A Cori, and P Nouvellet. 2023. “Estimating the Epidemic Reproduction Number from Temporally Aggregated Incidence Data: A Statistical Modelling Approach and Software Tool.” PLOS Computational Biology 19 (8): 1–14. https://doi.org/10.1371/journal.pcbi.1011439.
Richter, L, A Chalupka, D Schmid, A Chakeri, S Maritschnik, S Pfeiffer, and E Stadlober. 2022. Epidemiologische Parameter des COVID19 Ausbruchs - Update 01.04.2022, Österreich, 2020–2022.” Austrian Agency for Health; Food Safety (AGES). https://www.ages.at/fileadmin/Corona/Epidemiologische-Parameter/updates/Update_Epidemiologische_Parameter_des_COVID19_Ausbruchs_2022-04-01.pdf.