Do We Need Asynchronous SGD? On the Near-Optimality of Synchronous Methods

Authors: Grigory Begunov, Alexander Tyurin

Published: Feb 3, 2026

Abstract

Modern distributed optimization methods mostly rely on traditional synchronous approaches, despite substantial recent progress in asynchronous optimization. We revisit Synchronous SGD and its robust variant, called $m$-Synchronous SGD, and theoretically show that they are nearly optimal in many heterogeneous computation scenarios, which is somewhat unexpected. We analyze the synchronous methods under random computation times and adversarial partial participation of workers, and prove that their time complexities are optimal in many practical regimes, up to logarithmic factors. Synchronous methods are not universal solutions, and there exist tasks where asynchronous methods may be necessary; nevertheless, we show that synchronous methods are sufficient for many modern heterogeneous computation scenarios.

Category: Distributed, Parallel, and Cluster Computing
