PLAN: Variance-Aware Private Mean Estimation

Authors: Martin Aumüller (IT University of Copenhagen), Christian Janos Lebeda (Basic Algorithms Research Copenhagen and IT University of Copenhagen), Boel Nelson (Aarhus University), Rasmus Pagh (Basic Algorithms Research Copenhagen and University of Copenhagen)

Volume: 2024
Issue: 3
Pages: 606–625
DOI: https://doi.org/10.56553/popets-2024-0095

Download PDF

Abstract: Differentially private mean estimation is an important building block in privacy-preserving algorithms for data analysis and machine learning. Though the trade-off between privacy and utility is well understood in the worst case, many datasets exhibit structure that could potentially be exploited to yield better algorithms. In this paper we present Private Limit Adopted Noise (PLAN), a family of differentially private algorithms for mean estimation in the setting where inputs are independently sampled from a distribution D over R^d, with coordinate-wise standard deviations σ in R^d. Similar to mean estimation under Mahalanobis distance, PLAN tailors the shape of the noise to the shape of the data, but unlike previous algorithms the privacy budget is spent non-uniformly over the coordinates. Under a concentration assumption on D, we show how to exploit skew in the vector σ, obtaining a (zero-concentrated) differentially private mean estimate with l_2 error proportional to ||σ||_1. Previous work has either not taken σ into account, or measured error in Mahalanobis distance --- in both cases resulting in l_2 error proportional to sqrt{d}||σ||_2, which can be up to a factor sqrt{d} larger. To verify the effectiveness of PLAN, we empirically evaluate accuracy on both synthetic and real-world data.

Keywords: differential privacy, mean estimation

Copyright in PoPETs articles are held by their authors. This article is published under a Creative Commons Attribution 4.0 license.