Soheil Behnezhad

I am an Assistant Professor of Computer Science at Northeastern University.

I am broadly interested in theoretical computer science. Much of my work focuses on the theoretical foundations of big data algorithms. This includes sublinear algorithms, parallel algorithms, streaming algorithms, dynamic algorithms, and graph sparsification.

Before joining Northeastern, I had a wonderful year at Stanford as a Motwani postdoc hosted by Moses Charikar, Aviad Rubinstein, Amin Saberi, and Li-Yang Tan. I got my PhD from UMD where I was advised by MohammadTaghi Hajiaghayi and my BSc from Sharif University.

Office: WVH 348

E-mail: s.behnezhad@northeastern.edu

CV, DBLP, Google Scholar

Teaching

(Fall '24) CS3000: Algorithms and Data
(Spr '24) CS3000: Algorithms and Data
(Fall '23) CS5800: Algorithms
(Fall '22) CS7880: Special Topics in TCS — Algorithms for Big Data

Advising

(PhD '22—) Amir Azarmehr
(PhD '22—) Mohammad Saneian
(PhD '23—) Alma Ghafari

Service

PC: STOC 2025, SODA 2025, SOSA 2024, ESA 2023, SWAT 2022
Junior PC: AAAI 2021, AAAI 2020, NeurIPS 2019, ICML 2019

Misc

Recent Preprints

Vizing’s Theorem in Near-Linear Time (Oct 2024)

Sepehr Assadi, Soheil Behnezhad, Sayan Bhattacharya, Martin Costa, Shay Solomon, Tianyi Zhang

Summary, arXiv

Vizing's theorem from 1964 states that any graph of maximum degree ∆ admits a (∆+1)-edge coloring. We present the first near-linear time algorithm that finds such a coloring.
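To make the statement concrete, here is a minimal Python sketch of the folklore greedy baseline, which may need up to 2∆−1 colors, together with a checker for proper edge colorings. It is not the algorithm from the paper. The function names are illustrative assumptions, and the input is assumed to be a simple graph given as a list of vertex pairs.

```python
from collections import defaultdict


def greedy_edge_coloring(edges):
    """Give each edge the smallest color unused at both of its endpoints.

    Folklore greedy baseline (at most 2*Delta - 1 colors); this is NOT the
    near-linear time (Delta + 1) algorithm of the paper.
    """
    used = defaultdict(set)  # vertex -> colors already used on incident edges
    coloring = {}
    for u, v in edges:
        c = 0
        while c in used[u] or c in used[v]:
            c += 1
        coloring[(u, v)] = c
        used[u].add(c)
        used[v].add(c)
    return coloring


def is_proper_edge_coloring(coloring):
    """Return True if no two edges sharing an endpoint have the same color."""
    seen = set()  # (vertex, color) pairs already occupied
    for (u, v), c in coloring.items():
        if (u, c) in seen or (v, c) in seen:
            return False
        seen.add((u, c))
        seen.add((v, c))
    return True


def max_degree(edges):
    """Maximum degree Delta of the graph described by the edge list."""
    deg = defaultdict(int)
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return max(deg.values(), default=0)
```

Comparing the number of colors this baseline uses with `max_degree(edges) + 1` shows the gap that a Vizing coloring closes.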

Publications

Settling the Pass Complexity of Approximate Matchings in Dynamic Graph Streams ( SODA '25 )

Sepehr Assadi, Soheil Behnezhad, Christian Konrad, Kheeran Naidu, Janani Sundaresan

Summary, arXiv

We settle the pass complexity of approximate maximum matching in dynamic streams. Our upper bound exponentially improves the number of passes from the previous logarithmic bounds to O(log log n), and our lower bound matches it.

Massively Parallel Minimum Spanning Tree in General Metric Spaces ( SODA '25 )

Amir Azarmehr, Soheil Behnezhad, Rajesh Jayaram, Jakub Łącki, Vahab Mirrokni, and Peilin Zhong

Summary, arXiv

This paper presents improved upper and lower bounds for finding the minimum spanning tree (MST) of a general metric space.

Fully Dynamic (∆ + 1) Coloring Against Adaptive Adversaries ( SODA '25 )

Soheil Behnezhad, Rajmohan Rajaraman, and Omer Wasim

Summary

We present improved bounds for (∆ + 1) vertex coloring of fully dynamic graphs against adaptive adversaries.

Fully Dynamic Matching and Ordered Ruzsa-Szemerédi Graphs ( FOCS '24 )

Soheil Behnezhad and Alma Ghafari

Summary, arXiv

We relate the complexity of maintaining an approximate maximum matching in a dynamic graph to an extremal combinatorics object that we call Ordered Ruzsa-Szemerédi (ORS) graphs.

Bipartite Matching in Massive Graphs: A Tight Analysis of EDCS ( ICML '24 )

Amir Azarmehr, Soheil Behnezhad, and Mohammad Roghani

Summary, arXiv

We present a tight analysis of EDCS, which is a versatile matching sparsifier, in bipartite graphs. We show that, surprisingly, the EDCS obtains a strictly better than 2/3-approximation with certain parameters.

Streaming Edge Coloring with Asymptotically Optimal Colors ( ICALP '24 )

Soheil Behnezhad and Mohammad Saneian

Summary, arXiv

We present the first streaming algorithm that finds an O(∆) edge-coloring using subquadratic space.

Sublinear Algorithms for TSP via Path Covers ( ICALP '24 )

Soheil Behnezhad, Mohammad Roghani, Aviad Rubinstein, and Amin Saberi

Summary, arXiv

We present improved sublinear time algorithms for estimating the cost of TSP in graphic and (1,2) metrics.

Approximating Maximum Matching Requires Almost Quadratic Time ( STOC '24 )

Soheil Behnezhad, Mohammad Roghani, and Aviad Rubinstein

Summary, arXiv

We prove that near-quadratic in n time is needed to approximate the size of maximum matching within an additive error of εn.

Fully Dynamic Matching: (2-√2)-Approximation in Polylog Update Time ( SODA '24 )

Amir Azarmehr, Soheil Behnezhad, and Mohammad Roghani

Summary, arXiv

We present a (2-√2)-approximate algorithm for maximum matching in general graphs which has implications in dynamic and streaming settings. Our algorithm removes a large gap that previously existed between general and bipartite graphs.

Local Computation Algorithms for Maximum Matching: New Lower Bounds ( FOCS '23 )

Soheil Behnezhad, Mohammad Roghani, and Aviad Rubinstein

Summary, arXiv

We prove that any local computation algorithm (LCA) for computing a (1-ε)-approximate maximum matching in graphs of maximum degree ∆ needs to spend at least ∆^Ω(1/ε) time per query. This resolves a decade-old open problem in the area.

Robust Communication Complexity of Matching: EDCS Achieves 5/6 Approximation ( ICALP '23 )

Amir Azarmehr and Soheil Behnezhad

Summary, arXiv

We study the "robust communication complexity" of the maximum matching problem. The edges of an adversarially chosen n-vertex graph G are partitioned randomly between Alice and Bob. Alice has to send a single message to Bob, using which Bob has to output an approximate maximum matching of G. We present a new (tight) analysis of a known protocol by Bernstein, proving that it obtains a 5/6 approximation with O(n) communication, significantly improving prior approximations.

Sublinear Time Algorithms and Complexity of Approximate Maximum Matching ( STOC '23 )

Soheil Behnezhad, Mohammad Roghani, and Aviad Rubinstein

Summary, arXiv, blog post on property testing review

In this paper, we present the first lower bound that is superlinear in n for approximating the maximum matching size. We also present improved upper bounds.

On Regularity Lemma and Barriers in Streaming and Dynamic Matching ( STOC '23 )

Sepehr Assadi, Soheil Behnezhad, Sanjeev Khanna, and Huan Li

Summary, arXiv

This paper shows that Szemerédi's celebrated regularity lemma can be used to obtain non-trivial, albeit slight, improvements over longstanding barriers for matchings in streaming and dynamic graphs.

Dynamic Algorithms for Maximum Matching Size ( SODA '23 )

Soheil Behnezhad

Summary, arXiv

Best Paper Award at SODA '23.

Invited to TALG, special issue for SODA '23.

Invited to Highlights of Algorithms (HALG) 2024.

This paper improves two longstanding approximation/update-time trade-offs for maximum matching in fully dynamic graphs when the goal is to maintain just the size of the matching.

Beating Greedy Matching in Sublinear Time ( SODA '23 )

Soheil Behnezhad, Mohammad Roghani, Aviad Rubinstein, and Amin Saberi

Summary, arXiv, blog post on property testing review

We give an algorithm that (1/2+Ω(1))-approximates the size of a maximum matching in O(n^(1+ε)) time. No subquadratic time algorithm for beating 1/2-approximation was known prior to our work.

Single-Pass Streaming Algorithms for Correlation Clustering ( SODA '23 )

Soheil Behnezhad, Moses Charikar, Weiyun Ma, and Li-Yang Tan

Summary

This paper presents improved algorithms for min-disagreement correlation clustering in a single pass of the streaming setting.

Almost 3-Approximate Correlation Clustering in Constant Rounds ( FOCS '22 )

Soheil Behnezhad, Moses Charikar, Weiyun Ma, and Li-Yang Tan

Summary, arXiv

We show that a (3+ε)-approximation of correlation clustering can be found in O(1/ε) rounds. This is a culminating point for the rich literature on parallel correlation clustering as the approximation matches a natural bound and the round complexity is essentially constant.

New Trade-Offs for Fully Dynamic Matching via Hierarchical EDCS ( SODA '22 )

Soheil Behnezhad and Sanjeev Khanna

Summary, arXiv

We study the maximum matching problem in fully dynamic graphs where a graph is undergoing both edge insertions and deletions, and the goal is to efficiently maintain a large matching after each edge update. This problem has received considerable attention in recent years. The known algorithms naturally exhibit a trade-off between the quality of the matching maintained (i.e., the approximation ratio) and the time needed per update. While several interesting results have been obtained, the optimal behavior of this trade-off remains largely unclear. Our main contribution is a new approach to designing fully dynamic approximate matching algorithms that, in a unified manner, not only (essentially) recovers all previously known trade-offs that were achieved via very different techniques, but also reveals some new ones.

Stochastic Vertex Cover with Few Queries ( SODA '22 )

Soheil Behnezhad, Avrim Blum, and Mahsa Derakhshan

Summary, arXiv

We study the problem of finding a minimum vertex cover (MVC) of a random subgraph of a given graph. The algorithm is unaware of this random subgraph but can learn if an edge of the base graph exists in it by querying it. The goal is to find an approximate MVC by querying few edges. This stochastic setting has been studied extensively for various problems such as minimum spanning trees, matroids, shortest paths, and matchings. However, no non-trivial bound was known for MVC prior to our work. We show in this work that a constant number of queries per vertex suffices to obtain good approximations for the stochastic MVC problem.

Modern Large-Scale Algorithms for Classical Graph Problems ( PhD Thesis )

Soheil Behnezhad

Charles A. Caramello Award for Best Thesis in Math, Physical Sciences, and Engineering at UMD, '21.

Larry S. Davis Award for Best PhD Thesis in CS at UMD, 2021.

Summary, PDF

Although computing power has advanced at an astonishing rate, it has been far outpaced by the growing scale of data. This has led to an abundance of algorithmic problems where the input tends to be, by orders of magnitude, larger than the memory available on a single machine. The challenges of data processing at this scale are inherently different from those of traditional algorithms. For instance, without having the whole input properly stored in the memory of a single machine, it is unrealistic to assume that any arbitrary location of the input can be accessed at the same cost; an assumption that is essential for traditional algorithms. In this thesis, we focus on modern computational models that capture these challenges more accurately, and devise new algorithms for several classical graph problems.

Specifically, we study models of computation that only allow the algorithm to use sublinear resources (such as time, space, or communication). Examples include (i) massively parallel computation algorithms where the workload is distributed among several machines each with sublinear space/communication, (ii) sublinear-time algorithms that take time sublinear in the input size, (iii) streaming algorithms that take only a few passes over the input with access to sublinear space, and (iv) dynamic algorithms that maintain a property of a dynamically changing input using sublinear time per update.

We propose new algorithms for classical graph problems such as maximum/maximal matching, maximal independent set, minimum vertex cover, and graph connectivity in these models that substantially improve upon the state-of-the-art and are in many cases optimal. Many of our algorithms build on model-independent tools and ideas that are of independent interest and lead to improved bounds in more than one of the aforementioned settings.

Time-Optimal Sublinear Algorithms for Matching and Vertex Cover ( FOCS '21 )

Soheil Behnezhad

Invited to Highlights of Algorithms (HALG) 2022.

Summary, arXiv, blog post on property testing review

This paper gives a near-tight analysis of the average "query complexity" of the randomized greedy maximal matching algorithm. This leads to a number of time-optimal algorithms for approximating the size of maximum matching and minimum vertex cover in sublinear time.
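For readers unfamiliar with it, the randomized greedy maximal matching algorithm can be sketched as follows. This is a plain sequential version that reads the whole edge list, so it does not reflect the sublinear-time simulation or the query-complexity analysis of the paper. The function name and the `seed` parameter are illustrative assumptions.

```python
import random


def randomized_greedy_matching(edges, seed=None):
    """Scan the edges in a uniformly random order and greedily match them.

    An edge joins the matching if neither endpoint is matched yet, so the
    output is a maximal matching of the input graph.
    """
    rng = random.Random(seed)  # seed is only for reproducibility (assumption)
    order = list(edges)
    rng.shuffle(order)

    matched = set()   # vertices already covered by the matching
    matching = []
    for u, v in order:
        if u != v and u not in matched and v not in matched:
            matching.append((u, v))
            matched.update((u, v))
    return matching
```

Any maximal matching has at least half the size of a maximum matching, and its endpoints form a vertex cover of at most twice the minimum size, which is why estimating its size yields approximations for both problems.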

On the Robust Communication Complexity of Bipartite Matching ( RANDOM '21 )

Sepehr Assadi and Soheil Behnezhad

Summary, PDF, Talk@RANDOM by Sepehr

We study the "robust communication complexity" of the maximum bipartite matching problem. The edges of an adversarially chosen n-vertex bipartite graph G are partitioned randomly between Alice and Bob. Alice has to send a single message to Bob, using which Bob has to output an approximate maximum matching of G. We present a new protocol obtaining a 0.716 approximation with O(n) communication, improving over the previous close-to-2/3 approximations.

Beating Two-Thirds For Random-Order Streaming Matching ( ICALP '21 )

Sepehr Assadi and Soheil Behnezhad

Summary, arXiv, Talk@IRIF, Talk@ICALP

We study matchings in the random-order streaming setting. We prove that for an absolute constant ε>0, one can find a (2/3+ε)-approximate maximum matching of any n-vertex graph using O(n log n) space with high probability. This breaks the natural boundary of 2/3 for this problem prevalent in the prior work and resolves an open problem of Bernstein [ICALP '20].

Stochastic Weighted Matching: (1 - ε) Approximation ( FOCS '20 )

Soheil Behnezhad and Mahsa Derakhshan

Summary, arXiv, Talk@FOCS

In this paper, we settle the approximation factor of the stochastic weighted matching problem, showing a (1-ε) approximation is achievable for any constant ε > 0. Previously, only close to 0.5-approximations were known.

Stochastic Matching with Few Queries: (1 - ε) Approximation ( STOC '20 )

Soheil Behnezhad, Mahsa Derakhshan, and Mohammad Hajiaghayi

52nd Annual ACM Symposium on Theory of Computing

Summary, arXiv, Talk@STOC (short, full), Talk@WOLA

In this paper, we settle the approximation factor of the unweighted stochastic matching problem. We provide an analysis of a non-adaptive algorithm showing that it obtains a (1-ε) approximation for any constant ε > 0. The best known approximation prior to our work was 2/3.

Parallel Graph Algorithms in Constant Adaptive Rounds: Theory meets Practice ( VLDB '20 )

Soheil Behnezhad, Laxman Dhulipala, Hossein Esfandiari, Jakub Łącki, Vahab Mirrokni, and Warren Schudy

Summary, arXiv

In this paper, we propose theoretically improved algorithms for several important graph problems in the Adaptive Massively Parallel Computations (AMPC) setting, which we introduced in an earlier paper, and evaluate them empirically.

Fully Dynamic Matching: Beating 2-Approximation in ∆^ε Update Time ( SODA '20 )

Soheil Behnezhad, Jakub Łącki, and Vahab Mirrokni

The 31st Annual ACM-SIAM Symposium on Discrete Algorithms

Summary, arXiv

For fully dynamic graphs, we have algorithms that maintain a 2-approximation of maximum matching extremely fast. But all known algorithms for general graphs maintaining a better-than-2 approximate matching require a large polynomial update time. In this paper, we show that in this regime, the update time can be improved to any arbitrarily small polynomial. Namely, we give an algorithm that for any desirably small constant ε>0, takes only O(∆^ε) update-time.

Exponentially Faster Massively Parallel Maximal Matching ( FOCS '19, JACM '23 )

Soheil Behnezhad, Mohammad Hajiaghayi, and David Harris

60th Annual IEEE Symposium on Foundations of Computer Science

Summary, arXiv, blog post on property testing review

This paper presents exponentially faster Massively Parallel Computation (MPC) algorithms for maximal matching. This is achieved by providing an analysis of a natural algorithm which was conjectured to work by Czumaj et al. [STOC '18]. The analysis reveals surprising connections between MPC and the notion of query-complexity developed for sublinear algorithms.

Fully Dynamic Maximal Independent Set with Polylogarithmic Update Time ( FOCS '19 )

Soheil Behnezhad, Mahsa Derakhshan, Mohammad Hajiaghayi, Cliff Stein, and Madhu Sudan

60th Annual IEEE Symposium on Foundations of Computer Science

Summary, arXiv

This paper presents the first algorithm for maintaining a maximal independent set (and related problems) in fully dynamic graphs that takes polylogarithmic time per update. Previous algorithms required polynomial update time.

Near-Optimal Massively Parallel Graph Connectivity ( FOCS '19 )

Soheil Behnezhad, Laxman Dhulipala, Hossein Esfandiari, Jakub Łącki, and Vahab Mirrokni

60th Annual IEEE Symposium on Foundations of Computer Science

Summary, arXiv

This paper presents a strongly sublinear space Massively Parallel Computation algorithm for graph connectivity that takes O(log D) rounds on graphs with diameter D > log^ε n and O(log log n) rounds on other graphs. This improves upon an algorithm of Andoni et al. [FOCS '18] and almost settles the problem due to an Ω(log D) conditional lower bound.

Streaming and Massively Parallel Algorithms for Edge Coloring ( ESA '19 )

Soheil Behnezhad, Mahsa Derakhshan, Mohammad Hajiaghayi, Marina Knittel, and Hamed Saleh

27th Annual European Symposium on Algorithms

Summary, PDF, Talk@ESA

This paper initiates the study of edge coloring in the streaming model. In addition, an improved Massively Parallel Computation algorithm for edge coloring is presented.

Stochastic Matching on Uniformly Sparse Graphs ( SAGT '19 )

Soheil Behnezhad, Mahsa Derakhshan, Alireza Farhadi, Mohammad Hajiaghayi, and Nima Reyhani

12th International Symposium on Algorithmic Game Theory

Massively Parallel Computation of Matching and MIS in Sparse Graphs ( PODC '19 )

Soheil Behnezhad, Sebastian Brandt, Mahsa Derakhshan, Manuela Fischer, Mohammad Hajiaghayi, Richard Karp, and Jara Uitto

ACM Symposium on Principles of Distributed Computing

Summary, arXiv 1, arXiv 2

This paper presents strongly sublinear space Massively Parallel Computation algorithms for maximal matching and maximal independent set that for graphs with polylogarithmic arboricity (which includes most families of sparse graphs such as minor-free graphs and bounded-genus graphs) take O(log² log n) rounds. Previous algorithms required polylogarithmic rounds.

Optimal Strategies of Blotto Games: Beyond Convexity ( EC '19 )

Soheil Behnezhad, Avrim Blum, Mahsa Derakhshan, Mohammad Hajiaghayi, Christos Papadimitriou, and Saeed Seddighin

The 20th ACM Conference on Economics and Computation

Summary, arXiv

This paper presents algorithms providing near-optimal strategies for the Colonel Blotto game that in addition have small support size. Previous algorithms produced strategies with large support. While bounding the support size makes the solution space non-convex (and thus prevents the use of convex programming), we show through a set of structural results that the space can be decomposed into polynomially many disjoint convex polytopes that can be solved independently.

Massively Parallel Computation via Remote Memory Access ( SPAA '19, TOPC )

Soheil Behnezhad, Laxman Dhulipala, Hossein Esfandiari, Jakub Łącki, Vahab Mirrokni, and Warren Schudy

The 31st ACM Symposium on Parallelism in Algorithms and Architectures

Invited to TOPC, special issue for SPAA '19.

Summary, arXiv

This paper introduces the Adaptive Massively Parallel Computations (AMPC) model which is an extension of the Massively Parallel Computations (MPC) model. Compared to MPC, AMPC allows the machines to also adaptively query a read-only data store within a round, subject to the same communication bounds as in MPC. This feature is inspired by RDMA systems that are widely available. We give AMPC algorithms that are significantly faster than the state-of-the-art for MPC.

Stochastic Matching with Few Queries: New Algorithms and Tools ( SODA '19 )

Soheil Behnezhad, Alireza Farhadi, Mohammad Hajiaghayi, and Nima Reyhani

The 30th Annual ACM-SIAM Symposium on Discrete Algorithms

Summary, arXiv, Talk@Simons

This paper presents improved non-adaptive algorithms for the stochastic matching problem. This is achieved by analysing a natural randomized algorithm that differs from those in the literature in a fundamental way. The improvement for unweighted graphs is significant: from the previous close-to-half approximations to 0.6568 approximation. For weighted graphs, the approximation factor is at least 0.501, which is the first to break half.

Almost Optimal Stochastic Weighted Matching With Few Queries ( EC '18 )

Soheil Behnezhad and Nima Reyhani

The 19th ACM Conference on Economics and Computation

Summary, arXiv

This paper presents improved stochastic matching algorithms for weighted graphs. The main result is an adaptive algorithm that obtains a (1-ε) approximation using a constant number of queries per vertex. Previous algorithms for weighted graphs required super-constant queries.

Spatio-Temporal Games Beyond One Dimension ( EC '18 )

Soheil Behnezhad, Mahsa Derakhshan, Mohammad Hajiaghayi, and Saeed Seddighin

The 19th ACM Conference on Economics and Computation

Summary

This paper considers a generalization of spatio-temporal games from a one-dimensional space to graphs. An approximation algorithm (that relaxes the number of resources) is given for general graphs and a dependent-rounding technique is also shown to provide an algorithm for the one-dimensional case that simplifies the prior work.

Brief Announcement: MapReduce Algorithms on Massive Trees ( ICALP '18 )

MohammadHossein Bateni, Soheil Behnezhad, Mahsa Derakhshan, Mohammad Hajiaghayi, and Vahab Mirrokni

The 45th International Colloquium on Automata, Languages, and Programming

arXiv

Winning Strategies of Blotto and Auditing Games ( SODA '18 )

Soheil Behnezhad, Avrim Blum, Mahsa Derakhshan, Mohammad Hajiaghayi, Mohammad Mahdian, Christos Papadimitriou, Ron Rivest, Saeed Seddighin, and Philip Stark

The 29th Annual ACM-SIAM Symposium on Discrete Algorithms

Summary, PDF

This paper introduces a notion of (u, p)-maxmin strategies, which guarantee receiving a minimum utility of u with probability at least p. It is shown that (u, p)-maxmin strategies of the Colonel Blotto and auditing games can be well-approximated in polynomial time.
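As a small illustration of the definition, the sketch below computes, for a finite two-player game given explicitly as a payoff matrix, the probability p with which a mixed strategy guarantees utility at least u. The explicit-matrix representation and the function name are assumptions made for illustration; the paper's algorithms for Blotto and auditing games are not reproduced here.

```python
def guaranteed_probability(payoff, strategy, u):
    """Largest p such that `strategy` earns utility >= u with probability >= p.

    payoff[i][j] is the row player's utility when the row player plays pure
    strategy i and the opponent plays pure strategy j (hypothetical input
    format). `strategy` is a probability vector over the rows.

    It suffices to check pure opponent strategies: against a mixed opponent
    the success probability is a convex combination of the per-column values.
    """
    num_cols = len(payoff[0])
    return min(
        sum(prob for row, prob in zip(payoff, strategy) if row[j] >= u)
        for j in range(num_cols)
    )
```

For instance, with `payoff = [[1, 0], [0, 1]]`, the uniform strategy `[0.5, 0.5]` secures utility 1 with probability 0.5, whereas any pure strategy secures it with probability 0.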

Affinity Clustering: Hierarchical Clustering at Scale ( NIPS '17 )

MohammadHossein Bateni, Soheil Behnezhad, Mahsa Derakhshan, Mohammad Hajiaghayi, Raimondas Kiveris, Silvio Lattanzi, and Vahab Mirrokni

The 31st Annual Conference on Neural Information Processing Systems

Summary, PDF, Spotlight Video, On Google AI Blog

This paper proposes affinity clustering, a hierarchical clustering algorithm based on Borůvka's minimum spanning tree algorithm. It is shown to be superior to many other clustering methods on a number of public and private data sets. In addition, the paper presents fast large-scale algorithms for affinity clustering.
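The Borůvka-style merging idea can be sketched in a few lines of Python. This is a sequential, single-machine sketch under the assumption that the input is a weighted edge list over points 0..n-1. The function names and the tie-breaking rule are illustrative, and the large-scale algorithms from the paper are not reproduced.

```python
def boruvka_round(edges, labels):
    """One round: each cluster picks its cheapest outgoing edge, then
    clusters connected by the picked edges are merged.

    edges  -- list of (weight, u, v) tuples
    labels -- labels[x] is the current cluster id of point x
    """
    best = {}  # cluster id -> cheapest edge leaving that cluster
    for w, u, v in edges:
        cu, cv = labels[u], labels[v]
        if cu == cv:
            continue
        for c in (cu, cv):
            # Tuple comparison breaks weight ties consistently.
            if c not in best or (w, u, v) < best[c]:
                best[c] = (w, u, v)

    parent = {c: c for c in set(labels)}  # union-find over cluster ids

    def find(c):
        while parent[c] != c:
            parent[c] = parent[parent[c]]  # path halving
            c = parent[c]
        return c

    for _, u, v in best.values():
        ru, rv = find(labels[u]), find(labels[v])
        if ru != rv:
            parent[ru] = rv
    return [find(c) for c in labels]


def cluster_hierarchy(num_points, edges):
    """Repeat rounds until no cluster has an outgoing edge.

    Returns one labeling per level, from singletons to the coarsest clusters.
    """
    labels = list(range(num_points))
    levels = [labels]
    while True:
        merged = boruvka_round(edges, labels)
        if merged == labels:
            return levels
        labels = merged
        levels.append(labels)
```

Each round at least halves the number of clusters that still have an outgoing edge, so the hierarchy has at most logarithmically many levels.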

A Polynomial Time Algorithm for Spatio-Temporal Games ( EC '17 )

Soheil Behnezhad, Mahsa Derakhshan, Mohammad Hajiaghayi, and Aleksandrs Slivkins

The 18th ACM Conference on Economics and Computation

Summary, arXiv, Talk@EC

This paper presents the first polynomial time algorithm for a well-studied variant of security games that is played out in space and time. Prior work obtained polynomial-time algorithms only for special cases of the problem, e.g., when there is a constant number of timesteps.

Brief Announcement: Graph Matching in Massive Datasets ( SPAA '17 )

Soheil Behnezhad, Mahsa Derakhshan, Hossein Esfandiari, Elif Tan, and Hadi Yami

The 29th ACM Symposium on Parallelism in Algorithms and Architectures

Summary, PDF

This paper presents a remarkably simple algorithm for approximate maximum matching of bipartite graphs in the streaming and Massively Parallel Computations models. The algorithm requires 1/ε rounds and O(n^1.5) central space to achieve (1-ε) approximation.

Faster and Simpler Algorithm for Optimal Strategies of Blotto Game ( AAAI '17 )

Soheil Behnezhad, Sina Dehghani, Mahsa Derakhshan, Mohammad Hajiaghayi, and Saeed Seddighin

The 31st AAAI Conference on Artificial Intelligence

Summary, arXiv

This paper presents a linear programming (LP) formulation for the well-studied Colonel Blotto game with polynomially many constraints. Prior work relied on the ellipsoid method to solve an exponential size LP. It is further shown that the number of constraints of the formulation is optimal.