Unlearning Clients, Features and Samples in Vertical Federated Learning
Authors: Ayush K. Varshney (Umeå University), Konstantinos Vandikas (Ericsson Research, Ericsson), Vicenç Torra (Umeå University)
Volume: 2025
Issue: 2
Pages: 39–53
DOI: https://doi.org/10.56553/popets-2025-0048
Abstract: Federated Learning (FL) has emerged as a prominent distributed learning paradigm that allows multiple users to collaboratively train a model without sharing their data, thus preserving privacy. Within the scope of privacy preservation, information privacy regulations such as the GDPR entitle users to request the removal (or unlearning) of their contribution from a service that hosts the model. A server hosting an ML model must therefore be able to unlearn certain information, for example in cases of copyright infringement or security issues that make the model vulnerable or degrade the performance of a service built on it. While most unlearning approaches in FL focus on Horizontal Federated Learning (HFL), where clients share the feature space and the global model, Vertical Federated Learning (VFL) has received less attention from the research community. In VFL, clients (passive parties) share the sample space among them while not having access to the labels. In this paper, we explore unlearning in VFL from three perspectives: unlearning passive parties, unlearning features, and unlearning samples. To unlearn passive parties and features, we introduce VFU-KD, which is based on knowledge distillation (KD); to unlearn samples, we introduce VFU-GA, which is based on gradient ascent (GA). To provide evidence of approximate unlearning, we use a Membership Inference Attack (MIA) to audit the effectiveness of our unlearning approaches. Our experiments across six tabular datasets and two image datasets demonstrate that VFU-KD and VFU-GA achieve performance comparable to or better than both retraining from scratch and the benchmark R2S method in many cases, with improvements of 0–2%. In the remaining cases, utility scores remain comparable, with a modest utility loss of 1–5%. Unlike existing methods, VFU-KD and VFU-GA require no communication between active and passive parties during unlearning; however, they do require the active party to store the previously communicated embeddings.
Keywords: Federated learning; Unlearning; Vertical federated learning; Auditing; Membership Inference Attack (MIA)
Copyright in PoPETs articles is held by their authors. This article is published under a Creative Commons Attribution 4.0 license.
