← Back to feed
2026-05-25datacommunity code

Statistical Inference for Stochastic Gradient Descent Beyond Finite Variance

Jose Blanchet, Peter Glynn, Wenhao Yang

PDF preview unavailable
Read on arXiv →

Key claim

New method for confidence regions in SGD trajectories.

This paper introduces a model-agnostic method for creating confidence regions from stochastic gradient descent (SGD) trajectories, addressing challenges in statistical inference when gradients have infinite variance. The key result is that the proposed method is straightforward to implement and provides asymptotically valid confidence regions in both finite- and infinite-variance scenarios.

In plain English

This paper introduces a model-agnostic method for creating confidence regions from stochastic gradient descent (SGD) trajectories, addressing challenges in statistical inference when gradients have infinite variance. The key result is that the proposed method is straightforward to implement and provides asymptotically valid confidence regions in both finite- and infinite-variance scenarios.

Novelty
7.0/10

The paper presents a new methodology for constructing confidence regions from SGD trajectories, which is a meaningful extension of existing statistical learning techniques.

Reliability
8.0/10

The claims are well-supported by simulation studies demonstrating reliable coverage across various settings.

Deep reliability assessment

The methodology provides a practical tool for constructing confidence regions in both finite- and infinite-variance regimes, but its robustness to varying tail indices and real-world applicability may be overclaimed without further empirical validation.

Reproducibility

No open source code or dataset is mentioned in the paper.

Discussion questions

  1. 1.How does the assumption of heavy-tailed noise impact the generalizability of the proposed method to different types of datasets?
  2. 2.What are the practical implications of using this method for uncertainty quantification in real-world machine learning applications?
  3. 3.What specific conditions or empirical results would falsify the claims made about the robustness of the proposed inference procedure?

Key figure

Figure 1 illustrates the average coverage rate and length of confidence intervals for linear regression with different subsample sizes.

GitHub1 repo
ganluannj/Spatial_SGD_InferenceCommunity