TPDP 2026 – Theory and Practice of Differential Privacy

Workshop Information

TPDP 2026 will take place on June 1 and 2 at the Northeastern University Curry Student Center (CSC) Ballroom (260) and 2nd Floor Suites. TPDP is co-located with the OpenDP Differential Privacy for Health and Genomics workshop (June 2-3) and the Foundations of Responsible Computing conference (June 3-5). We hope you will attend!

Registration: Registration is now closed.

Invitation Letter: If you require an invitation letter for your visa application, please fill out this request form.

Note: The 2026 FIFA World Cup will be partially hosted in Boston, with matches beginning on June 13. We encourage attendees to book accommodations well in advance.

Program (Tentative)

All times are Eastern Daylight Time. Sessions will be held in the CSC Ballroom (260) unless specified below.

Monday, June 1

8:30-9:00		Breakfast
9:00-9:05		Welcome note from chairs!
9:05-9:50		Keynote #1 For several years, on-device federated learning (FL) was the most common approach for training machine learning (ML) models on private, distributed user data. Despite this, on-device training has several drawbacks: (1) foundation models have become far too large to train on client devices, (2) on-device training is communication- and computation-intensive, and (3) on-device training can be difficult to debug and deploy. To address these problems, we study a pipeline in which models are trained at a central server on differentially-private synthetic data from client devices. We show how a recent algorithm called Private Evolution can outperform traditional federated learning baselines in utility and cost. We provide early theoretical analysis of its properties, including distributional convergence guarantees. Finally, we show how the Private Evolution algorithm can be reformulated as a preference optimization problem, thereby significantly improving the performance of private synthetic data relative to on-device baselines and prior synthetic data baselines. Giulia Fanti's website
9:50-10:45		Contributed Talks: The Theory of εverything Channeling a bit of Stephen Hawking, this session probes the deepest laws of the differential privacy cosmos--where epsilon is small, lower bounds are inevitable and proofs curve spacetime. Private Evolution (PE) is a differentially private algorithm for synthetic data generation. While it can be viewed as a Wasserstein learning algorithm, it performs much better in practice than worst‑case Wasserstein analyses would predict. We recast PE as a generative model‑augmented Wasserstein learning. We show theoretically that when we take into account the use of a generative model that is able to capture something about the true distribution, then we can obtain much better performance bounds. For example, if the generator gives samples in the same low-dimensional space as the distribution, then sample complexity depends on intrinsic, not ambient, dimension. We also show that standard variants of PE can fail to converge on simple well-clustered instances, and propose a new geometry-aware version of PE with provable convergence on such instances. Experimentally, we show that our new algorithm has consistent empirical gains over standard baselines. Bilevel optimization, in which one optimization problem is nested inside another, underlies many machine learning applications with a hierarchical structure -- such as meta-learning and hyperparameter optimization. Such applications often involve sensitive training data, raising pressing concerns about individual privacy. Motivated by this, we study differentially private bilevel optimization. We first focus on settings where the outer-level objective is convex, and provide novel upper and lower bounds on the excess empirical risk for both pure and approximate differential privacy. These bounds are nearly tight and essentially match the optimal rates for standard single-level differentially private ERM, up to additional terms that capture the intrinsic complexity of the nested bilevel structure. We also provide population loss bounds for bilevel stochastic optimization. The bounds are achieved in polynomial time via efficient implementations of the exponential and regularized exponential mechanisms. A key technical contribution is a new method and analysis of log-concave sampling under inexact function evaluations, which may be of independent interest. In the non-convex setting, we develop novel algorithms with state-of-the-art rates for privately finding approximate stationary points. Notably, our bounds do not depend on the dimension of the inner problem. We study the computational cost of differential privacy in terms of memory efficiency. While the trade-off between accuracy and differential privacy is well-understood, the inherent cost of privacy regarding memory use remains largely unexplored. This paper establishes for the first time an unconditional space lower bound for user-level differential privacy by introducing a novel proof technique based on a multi-player communication game. We study differentially private continual release of the number of distinct items in a turnstile stream, where items may be both inserted and deleted. We show that existing polynomial lower bounds on the additive error required for privacy can be circumvented when some multiplicative error is also allowed. We give algorithms for continual estimation of the number of distinct elements or F2 moment of the stream with polylogarithmic additive error at the cost of a small multiplicative error.
10:45-11:00		Break
11:00-12:00		Poster Session A (2nd Floor Suites)
12:00-1:30		Lunch (on your own)
1:30-2:45		Contributed Talks: A.I. Artificial Intelligencε As exciting as the Steven Spielberg blockbuster, this session brings together AI and differential privacy--no robot children were harmed during the making of these papers. The widespread adoption of AI assistants has prompted the development of privacy-aware platforms designed to extract insights from real-world usage. Their privacy protections primarily rely on layering multiple heuristic techniques, such as PII redaction, clustering, aggregation, and LLM-based privacy auditing. In this paper, we put their privacy claims to the test by presenting CLIOPATRA, the first attack against "privacy-preserving" LLM-based insights systems. Differential privacy (DP) has a wide range of applications for protecting data privacy, but designing and verifying DP algorithms requires expert-level reasoning, creating a high barrier for non-expert practitioners. Prior works either rely on specialized verification languages that demand substantial domain expertise or remain semi-automated and require human-in-the-loop guidance. In this work, we investigate whether large language models (LLMs) can automate DP reasoning. We introduce DPrivBench, a benchmark in which each instance asks whether a function or algorithm satisfies a stated DP guarantee under specified assumptions. The benchmark is carefully designed to cover a broad range of DP topics, span diverse difficulty levels, and resist shortcut reasoning through trivial pattern matching. Experiments show that while the strongest models handle textbook mechanisms well, all models struggle with advanced algorithms, revealing substantial gaps in current DP reasoning capabilities. Our benchmark provides a solid foundation for developing and evaluating such methods, and complements existing benchmarks for mathematical reasoning. Differentially private (DP) text synthesis promises to unlock sensitive corpora for model training, but it remains unclear whether DP synthetic data transmits genuinely new knowledge and capabilities present only in those corpora. This is because existing evaluations rely on tasks that are nearly solvable without training, so strong benchmark performance does not establish that DP synthesis can substitute original data access. Thus, we introduce ContinuousBench, a continuously and automatically-regenerated benchmark that measures capability gain from DP synthetic text. Each quarter, a new release pairs a never-before-seen training corpus with a derived QA set, constructed to be: (1) unsolvable sans-corpus; and (2) learnable under DP, as the tested knowledge is supported by hundreds of independent records. Researchers produce DP synthetic data from the training corpus and run our standardized training and evaluation harness on their synthetic data to measure gains. We instantiate two tracks: Geminon, a procedurally-generated dataset about fictional creatures; and News, a stream of newly scraped public news articles. Although standard benchmarks are nearly saturated, on ContinuousBench we find that non-private synthesis transfers substantial knowledge from the original corpus, while state-of-the-art DP synthesis methods generally fail to do so, even at ε = 100. ContinuousBench is available at https://huggingface.co/ContinuousBench Are there any conditions under which a generative model's outputs are guaranteed not to infringe the copyrights of its training data? This is the question of "provable copyright protection" first posed by Vyas, Kakade, and Barak (ICML 2023). They define near access-freeness (NAF) and propose it as sufficient for protection. This paper revisits the question and establishes new foundations for provable copyright protection -- foundations that are firmer both technically and legally. First, we show that NAF alone does not prevent infringement. In fact, NAF models can enable verbatim copying, a blatant failure of copyright protection that we dub being tainted. Then, we introduce our blameless copyright protection framework for defining meaningful guarantees, and instantiate it with clean-room copyright protection. Clean-room copyright protection allows a user to control their risk of copying by behaving in a way that is unlikely to copy in a counterfactual "clean-room setting." Finally, we formalize a common intuition about differential privacy and copyright by proving that DP implies clean-room copyright protection when the dataset is golden, a copyright deduplication requirement. We study coalition formation for data sharing under differential privacy when agents have heterogeneous privacy preferences. We study a fully decentralized data sharing mechanism where each agent holds a sensitive data point and decides whether to participate in a data-sharing coalition and how much noise to add to their data. Privacy choices induce a fundamental trade-off: higher privacy reduces individual data-sharing costs but degrades data utility and statistical accuracy for the coalition. These choices generate externalities across agents, making both participation and privacy levels strategic. Our goal is to understand which coalitions are stable, how privacy choices shape equilibrium outcomes, and how fully decentralized data sharing compares to a centralized, socially optimal benchmark when the number of players is large. We provide a comprehensive analysis across a range of privacy-cost regimes, from decreasing costs (privacy amplification from pooling data) to increasing costs (greater exposure to privacy attacks in larger coalitions), characterizing: i) which regimes offer non-trivial improvements in accuracy and social cost; and ii) the efficiency gap between the centralized and decentralized mechanisms. The main insight is that full decentralization is often highly inefficient, primarily due to players being risk-averse and selfishly choosing highly stringent privacy levels for themselves.
2:45-3:00		Break
3:00-4:00		Poster Session B (2nd Floor Suites)
4:00-4:20		Break
4:20-5:30		Contributed Talks: The Practicε This session shares more than just the Boston zipcode with the legal drama--we promise the same thrill as a courtroom showdown as differential privacy becomes the star witness in practical deployment. While differential privacy provides strong mathematical guarantees, practical implementations often suffer from subtle bugs that invalidate these theoretical protections. To address this, we introduce a novel auditing framework, Re:cord-play, that inspects the internal states of DP algorithms to overcome the limitations of standard black-box testing. By isolating the privacy mechanisms, our approach can deterministically catch data leaks into data-independent logic or flag sensitivity miscalculations. In this presentation, we will detail the framework's methodology and showcase real-world vulnerabilities discovered through audits of popular open-source DP libraries, revealing actionable privacy violations. JAX-Privacy is a library designed to simplify the deployment of robust and performant mechanisms for differentially private machine learning. Guided by design principles of usability, flexibility, and efficiency, JAX-Privacy serves both researchers requiring deep customization and practitioners who want a more out-of-the-box experience. The library provides verified, modular primitives for critical components for all aspects of the mechanism design including batch selection, gradient clipping, noise addition, accounting, and auditing, and brings together a large body of recent research on differentially private ML. Differential Privacy (DP) bounds the privacy leakage of a mechanism against worst-case membership inference, but the precise tradeoff between complex adversarial models and DP protections remains poorly understood. In this paper, we present a unified framework that generalizes the patchwork of existing bounds across membership inference, attribute inference, and data reconstruction attacks. There is a need in the community of privacy practitioners for a trustworthy, collaborative shared database of differentially private deployments, to help foster norms about best practices. We propose a set of guidelines aimed at groups seeking to develop such a database. Such a governance resource to (1) help industry grow to consensus on best practices, (2) provide public snapshots of the privacy landscape so that regulators can judge new deployments in context and shape guidance accordingly, and (3) incentivize industry to make their choices public. We describe an initial schema to systematize this information and an editorial and governance process to ensure this information is reliable, and demonstrate a prototype interface. The 2020 United States Census adopted differential privacy to protect individual confidentiality, adding calibrated noise to billions of demographic measurements spanning six geographic levels from nation to census block. The deployed post-processing method, TopDown, uses a series of heuristic optimizations to reconcile these noisy measurements into self-consistent population tables. We introduce BlueDown, a new post-processing algorithm that improves accuracy while achieving the same privacy guarantee. BlueDown is derived by constraining the best linear unbiased estimator (BLUE), which is efficiently computed across all geographic levels by exploiting the symmetries and block-hierarchical structure of the measurement queries. On 2020 Census data, BlueDown reduces estimation error by 8–45% for queries at the county and tract levels while satisfying all structural constraints. These gains come at no cost to confidentiality and could directly improve the downstream analyses used to guide the distribution of over $1.5 trillion in annual spending across hundreds of federal programs that rely on census data.
5:30-7:00		Job Market Session and Social Hour

Tuesday, June 2

8:30-9:00		Breakfast
9:00-9:45		Keynote #2 Algorithmic predictions are increasingly used to target benefits to individuals, particularly in low- and middle-income countries where traditional data sources are limited. In these settings, practitioners often utilize non-traditional data -- such as mobile phone metadata and other remotely sensed signals -- across a range of welfare-enhancing programs, including emergency response, anti-poverty targeting, and expanding financial inclusion. In this talk, we revisit a real-world anti-poverty program that used machine learning models trained on mobile phone metadata to allocate aid to over 800,000 individuals, and examine it through the lens of differential privacy. We show that the operational and policy constraints of this setting motivate two application-specific adaptations of standard differential privacy definitions to enable accurate targeting while providing formal privacy guarantees. We then characterize the resulting privacy-program effectiveness tradeoffs, highlighting how design choices shape both statistical performance and welfare outcomes. We conclude with concrete recommendations for structuring targeting programs to support privacy-preserving deployment in practice. Nitin Kohli's website
9:45-10:00		Break
10:00-11:00		Panel: 20 Years of DP! Panelists: Cynthia Dwork (Harvard), Salil Vadhan (Harvard), Adam Smith (Boston University), Jonathan Ullman (Northeastern University) Moderator: Amrita Roy Chowdhury (University of Michigan, Ann Arbor)
11:00-12:00		Poster Session C (2nd Floor Suites)
12:00-1:00		Lunch (on your own)
1:00-1:30		Level Setting: DP for Health (hosted by OpenDP) Speakers: Salil Vadhan (Harvard), Rachel Cummings (Columbia University), Hoon Cho (Yale University), Li Xiong (Emory University)
1:30-2:30		OpenDP Talk Session Healthcare Data Sharing: Privacy-Protecting Technical and Policy Controls Lucila Ohno-Machado A Legal Perspective on Differential Privacy for Biomedical Data Sharing Alexandra Wood Privately Fine-Tuned LLMs Preserve Temporal Dynamics in Tabular Data Lucas Rosenblatt, Peihan Liu, Ryan McKenna, Natalia Ponomareva Fairness, Privacy, and the Ethical Use of Personal Data Cynthia Dwork
2:30-3:00		Break
3:00-4:00		Panel: Deployments of DP in Health -- Lessons Learned and Future Needs (Hosted by OpenDP) Panelists: Paul Comerford (ICO), Joe Near (University of Vermont), Peter Kairouz (Google), Lucila Ohno-Machado (Yale Univeristy) Moderator: James Honaker (Harvard)
4:00-4:30		Break
4:30-5:25		DP for Health Talks: Grεy’s Anatomy This session is as binge-worthy as the medical drama--but fortunately, no code blues, only rigorous privacy guarantees. Sharing health and behavioral data raises significant privacy concerns, as conventional de-identification methods are susceptible to privacy attacks. Differential Privacy (DP) provides formal guarantees against re-identification risks, but practical implementation necessitates balancing privacy protection and the utility of data. We demonstrate the use of DP to protect individuals in a real behavioral health study, while making the data publicly available and retaining high utility for downstream users of the data. We use the Adaptive Iterative Mechanism (AIM) to generate DP synthetic data for Phase 1 of the Lived Experiences Measured Using Rings Study (LEMURS). The LEMURS dataset comprises physiological measurements from wearable devices (Oura rings) and self-reported survey data from firstyear college students. We evaluate the synthetic datasets across a range of privacy budgets, ε = 1 to 100, focusing on the trade-off between privacy and utility. In statistical applications it has become increasingly common to encounter data structures that live on non-linear spaces such as manifolds. Classical linear regression, one of the most fundamental methodologies of statistical learning, captures the relationship between an independent variable and a response variable which both are assumed to live in Euclidean space. Thus, geodesic regression emerged as an extension where the response variable lives on a Riemannian manifold. The parameters of geodesic regression, as with linear regression, capture the relationship of sensitive data and hence one should consider the privacy protection practices of said parameters. We consider releasing Differentially Private (DP) parameters of geodesic regression via the K-Norm Gradient (KNG) mechanism for Riemannian manifolds. We derive theoretical bounds for the sensitivity of the parameters showing they are tied to their respective Jacobi fields and hence the curvature of the space. This corroborates, and extends, recent findings of differential privacy for the Fr\'echet mean. We demonstrate the efficacy of our methodology on the sphere, $S_2\subset\mbR^3$, the space of symmetric positive definite matrices, and Kendall's planar shape space. Our methodology is general to any Riemannian manifold, and thus it is suitable for data in domains such as medical imaging and computer vision. Epidemiologic studies of infectious diseases often rely on models of contact networks to capture the complex interactions that govern disease spread, and ongoing projects aim to vastly increase the scale at which such data can be collected. However, contact networks may include sensitive information, such as sexual relationships or drug use behavior. Protecting individual privacy while maintaining the scientific usefulness of the data is crucial. We propose a privacy-preserving pipeline for disease spread simulation studies based on a sensitive network that integrates differential privacy (DP) with statistical network models such as stochastic block models (SBMs) and exponential random graph models (ERGMs). Our pipeline comprises three steps: (1) compute network summary statistics using node-level DP (which corresponds to protecting individuals' contributions); (2) fit a statistical model, such as an ERGM, using these summaries, which allows generating synthetic networks reflecting the structure of the original network; and (3) simulate disease spread on the synthetic networks using an agent-based model. We evaluate the effectiveness of our approach using a simple Susceptible-Infected-Susceptible (SIS) disease model under multiple configurations. We compare both numerical results, such as simulated disease incidence and prevalence, as well as qualitative conclusions such as intervention effect size, on networks generated with and without differential privacy constraints. Our experiments are based on egocentric sexual network data from the ARTNet study (a survey about HIV-related behaviors). Our results show that the noise added for privacy is small relative to the other sources of error (sampling and model misspecification, for example). This suggests that, in principle, curators of such sensitive data can provide valuable epidemiologic insights while protecting privacy. De-identification is still the standard approach to privacy in biomedical data sharing, despite well-known demonstrations of its vulnerabilities. Differential privacy is rarely adopted in practice, in part because its standard (ε, δ) parameterization does not map directly to the concrete inference risks discussed by data-protection guidelines, and when these parameters are mapped to risks, we need high values of ε to achieve reasonable utility. This paper presents a unifying threat modeling framework grounded in f-DP for bounding membership inference, re-identification, attribute inference, and data reconstruction risks using a single trade-off curve, and enables direct calibration of differentially private mechanisms to a target level of inference risk as in the standard guidelines such as from ISO and European Medicines Agency. We demonstrate the framework on clinical language modeling tasks and show that we can preserve reasonable utility.
5:25-5:30		Thank you note from chairs!
5:30-7:00		Reception

Accepted Papers

Poster Session A

A Bayesian Approach to Membership Inference for Statistical Release
Lisa Oakley, Sam Stites, Cameron Moy, Steven Holtzen, Alina Oprea, Marco Gaboardi

A Fast, Timing-Attack-Resistant Discrete Sampler for Verifying DP, DP-ML, & Beyond
Zoë Ruha Bell, Avishay Tal

An Efficient Gaussian Mechanism under Continual Observation
Rasmus Pagh, Sia Sejer

Can Graphical Tools Help Analysts Implement Differential Privacy?
Onyinye Dibia, Mengyi Lu, Prianka Bhattacharjee, Chuck McCallum, Joseph P. Near, Yuanyuan Feng

Computation-Utility-Privacy Tradeoffs in Bayesian Estimation
Sitan Chen, Jingqiu Ding, Mahbod Majid, Walter McKelvie

Differentially Private and Affine-Equivariant Median Estimation
Gavin Brown, John Duchi, Saminul Haque, Sewoong Oh

Differentially Private Minimum Spanning Tree and Clustering in Euclidean Graphs
Zongrui Zou, Alessandro Epasto, Chenglin Fan, Rudrajit Das

Differentially Private Secure Multiplication: Beyond Two Multiplicands
Haoyang Hu, Viveck R. Cadambe

DP-$\lambda$CGD: Efficient Noise Correlation for Differentially Private Model Training
Nikita Kalinin, Ryan McKenna, Rasmus Pagh, Christoph Lampert

Efficient Distributed Differentially Private Synthetic Data with High Utility via Secure Aggregation
Ratang Sedimo, Joseph P. Near

Efficient DP-SGD for LLMs with Randomized Clipping
Enayat Ullah, Sai Aparna Aketi, Devansh Gupta, Huanyu Zhang, Meisam Razaviyayn

Efficient Privacy Loss Accounting for Subsampling and Random Allocation
Moshe Shenfeld, Vitaly Feldman

Equivariant Differentially Private Deep Learning: Exploiting Symmetry under Privacy
Margarita Ionides

Fast-MWEM: Private Data Release in Sublinear Time
Themistoklis Haris, Steve Choi, Mutiraj Laksanawisit

"Having Confidence in My Confidence Intervals": How Data Users Engage with Privacy-Protected Wikipedia Data
Harold Triedman, Jayshree Sarathy, Priyanka Nanayakkara, Rachel Cummings, Gabriel Kaptchuk, Sean Kross, Elissa Redmiles

Inference and Unbiased Estimation for Differentially Private Ratio Statistics
Brian Finley

Interpreting the Error of Differentially Private Median Queries through Randomization Intervals
Thomas Humphries, Tim Li, Shufan Zhang, Karl Knopf, Xi He

Keeping a Secret Requires a Good Memory: Space Lower-Bounds for Private Algorithms
Alessandro Epasto, Xin Lyu, Pasin Manurangsi

LAPRAS : Learning-Augmented PRivate Answering for linear query Streams
Pranay Mundra, Adam Sealfon, Ziteng Sun, Quanquan C. Liu

Local Differential Privacy with Correlated Noise Achieves Central-DP Optimal Cost
Madhura Pathegama, Srikanth Avasarala, Viveck Cadambe, Juba Ziani

Lower Bounds for Private Hierarchical Clustering via Reconstruction Attacks
Jacob Imola, Lukas Retschmeier

Making Privacy Public: Toward a Differential Privacy Deployment Registry
Priyanka Nanayakkara, Elena Ghazi, Salil Vadhan

MAPLE: Metadata Augmented Private Language Evolution
Eli Chien, Yuzheng Hu, Ryan McKenna, Shanshan Wu, Zheng Xu, Peter Kairouz

Model Agnostic Differentially Private Causal Inference
Christian Janos Lebeda, Mathieu Even, Aurélien Bellet, and Julie Josse

Near-Optimal Private Tests for Simple and MLR Hypotheses
Yu-Wei Chen, Raghu Pasupathy, Jordan Awan

Observational Auditing of Label Privacy
Iden Kalemaj, Luca Melis, Maxime Boucher, Ilya Mironov, Saeed Mahloujifar

On Differential Privacy and Caching
Badih Ghazi, Ravi Kumar, Pasin Manurangsi, Jelani Nelson, Adam Sealfon, Samson Zhou

Optimal Domain-Aware Privacy Mechanisms for Synthetic Data Generation
Sajani Vithana, Sangwon Jung, Haoyang Hu, Viveck R. Cadambe, Flavio P. Calmon, Haewon Jeong

Phantoms and Disclosures: a Causal Framework for Auditing Synthetic Data
K. Amin, R. Das, A. Epasto, A. Javanmard, D. Kraft, M. Ribero, S. Vassilvitskii

Privacy and Utility Tradeoffs in Quantum Information Processing
Theshani Nuradha, Sujeet Bhalerao, Felix Leditzky

Privacy Filters are Captured by Residues: A Characterization of Free Natural Filters and the Cost of Adaptivity
Matthew Regehr, Bingshan Hu, Ethan Leeman, Pasin Manurangsi, Pierre Tholoniat, Mathias Lécuyer

Privacy in Theory, Bugs in Practice: Grey-Box Auditing of Differential Privacy Libraries
Tudor Cebere, David Erb, Damien Desfontaines, Aurélien Bellet, Jack Fitzsimons

Privacy, Prediction, and Allocation
Ben Jacobsen, Nitin Kohli

Private Preference Recovery: Aggregate User Intent via Differentially Private Preference Optimization
Tsubasa Takahashi

ReBound: Reuse-Aware Privacy For Interactive Decision Support
Nada Lahjouji, Shufan Zhang, Xi He, Sharad Mehrotra

Separating Oblivious and Adaptive Differential Privacy under Continual Observation
Mark Bun, Marco Gaboardi, Connor Wagaman

Skirting Additive Error Barriers for Private Turnstile Streams
Anders Aamand, Justin Y. Chen, Sandeep Silwal

The Access–Similarity Lens: An Operational Copyright Framework for Generative Models
Amit Saha, Yinan Huang, Pan Li, Eli Chien

The Sample Complexity of Membership Inference and Privacy Auditing
Mahdi Haghifam, Adam Smith, Jonathan Ullman

Towards Differentially Private Reinforcement Learning with General Function Approximation
Yi He, Xingyu Zhou

Unbiased Estimators from the Discrete Laplace Mechanism
Quentin Hillebrand, Jacob Imola, Rasmus Pagh, and Sia Sejer

Weighted Fourier Factorizations: Optimal Gaussian Noise for Differentially Private Marginal and Product Queries
Christian Lebeda, Aleksandar Nikolov, Haohua Tang

Poster Session B

ACME: Approximate Chebyshev Moment Estimation for Differentially Private Synthetic Data
Lucas Rosenblatt, Apoorv Vikram Singh, Christopher Musco

Adaptively Robust Resettable Streaming
Edith Cohen, Elena Gribelyuk, Jelani Nelson, Uri Stemmer

An Efficient and Practical Method for Exact Privacy Accounting in the 2020 U.S. Decennial Census
Buxin Su, Weijie Su, Chendi Wang

An Õptimal Differentially Private PAC Learner for Concept Classes with VC Dimension 1
Chao Yan

Balanced Additive Randomized Encodings with Application to Computational Differential Privacy in the Shuffle Model
Yu Wei, Jaspal Singh, Adya Agrawal, Vassilis ZIkas

Better Marginal Measurement via Residual Decomposition
Brett Mullins, Miguel Fuentes, Cecilia Ferrando, Cameron Musco, Daniel Sheldon

ContinuousBench: Can Differentially Private Synthetic Text Improve Capabilities?
Peihan Liu, Lucas Rosenblatt, Weiwei Kong, Natalia Ponomareva, Gautam Kamath, Rachel Cummings, Roxana Geambasu, Yu Gan, Lillian Tsai, Alex Bie

CLIOPATRA: Extracting Private Information from LLM Insights
Meenatchi Sundaram Muthu Selva Annamalai, Emiliano De Cristofaro, Peter Kairouz

Data Sharing with Endogenous Choices over Differential Privacy Levels
Raef Bassily, Kate Donahue, Diptangshu Sen, Annuo Zhao, Juba Ziani

Denoising the US Census: Succinct Block Hierarchical Regression
Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Adam Sealfon

Differentially Private Bilevel Optimization: Efficient Algorithms with Near-Optimal Rates
Andrew Lowy, Daogao Liu

Differentially Private Graph Coloring
Michael Xie, Jiayi Wu, Dung Nguyen, Aravind Srinivasan

Differentially Private Inference for Longitudinal Linear Regression
Getoar Sopa, Marco Avella-Medina, Cynthia Rush

Differentially Private Insights into AI Use
E. Cohen, V. Doroshenko, B. Ghazi, C. Harrison, P. Kamath, A. Knop, R. Kumar, E. Leeman, P. Manurangsi, A. Sealfon, C. Zhang, P. Kairouz, D. Liu, D. Yu

Differentially Private Language Generation in the Limit
Anay Mehrotra, Grigoris Velegkas, Xifan Yu, Felix Zhou

Differentially Private Linear Regression and Synthetic Data Generation with Statistical Guarantees
Shurong Lin, Aleksandra Slavkovic, Deekshith Reddy Bhoomireddy

Differentially Private Model Merging
Qichuan Yin, Manzil Zaheer, Tian Li

Differentially Private Multimodal In-Context Learning
Ivoline C. Ngong, Zarreen Reza, Joseph P. Near

DP-Blockly: A Block-Based Framework with Type-Guided Checking for Differential Privacy
Juko Yamamoto, Misato Nakabayashi

DPrivBench: Benchmarking LLMs' Reasoning for Differential Privacy
Erchi Wang, Pengrun Huang, Eli Chien, Om Thakkar, Kamalika Chaudhuri, Yu-Xiang Wang, Ruihan Wu

Extracting Training Data from Differentially Private Pre-trained LLM
Nirav Diwan, Gang Wang, Daniel Alabi

Formalizing Local Differential Privacy in Lean
Perryn Chang, Robert Shlyakhtenko, Renee Tetlow, Aras Yilmaz

Fundamental Limits of Reconstruction from Repeated DP Aggregates: A Cramer–Rao Perspective
Chenyue Zhang, Andrew Campbell, Anna Scaglione, Sean Peisert

Hardening Confidential Federated Compute against Side-channel Attacks
James Bell-Clark, Albert Cheu, Adria Gascon, Jonathan Katz

High-Probability Bounds For Heterogeneous Local Differential Privacy
Maryam Aliakbarpour, Alireza Fallah, Swaha Roy, Ria Stevens

Improved Accuracy for Private Continual Cardinality Estimation in Fully Dynamic Streams via Matrix Factorization
Joel Daniel Andersson, Palak Jain, Satchit Sivakumar

Barriers to Counterfactual Credit Attribution for Autoregressive Models
Aloni Cohen, Chenhao Zhang

Less Noise, Same Certificate: Retain Sensitivity for Unlearning
Carolin Heinzler, Kasra Malihi, Amartya Sanyal

Local Node Differential Privacy
Sofya Raskhodnikova, Adam Smith, Connor Wagaman, Anatoly Zavyalov

Near-Optimal Private Linear Regression via Iterative Hessian Mixing
Omri Lev, Moshe Shenfeld, Vishwak Srinivasan, Katrina Ligett, Ashia Wilson

Membership Inference Attack on Tabular Data
Prianka Bhattacharje, Mohsen Ghasemizade, Protiva Sen, Steven Baldasty, Mengyi Lu, Juniper Lovato, Joseph P. Near

Optimal Conversion from Rényi Differential Privacy to $f$-Differential Privacy
Anneliese Riess, Juan Felipe Gomez, Flavio du Pin Calmon, Julia Anne Schnabel, Georgios Kaissis

Optimal Guarantees for Auditing Rényi Differentially Private Machine Learning
Benjamin D. Kim, Lav R. Varshney, Daniel Alabi

PRIME: A Modular Approach for Private Synthetic Data
Miguel Fuentes, Brett Mullins, Cecilia Ferrando, Cameron Musco, Daniel Sheldon

Privacy Amplification for BandMF via b-Min-Sep Subsampling
Andy Dong, Arun Ganesh

Private Linear Regression via a Privacy to Down-Sensitivity Reduction
Ittai Rubinstein, Chris Ge, Samuel Hopkins

Privately Estimating Monotone Statistics in Polynomial Time
Gavin Brown, Ephraim Linder, Mahbod Majid, Vikrant Singhal

Publishing Below-Threshold Triangle Counts under Local Weight Differential Privacy
Kevin Pfisterer, Quentin Hillebrand, Vorapong Suppakitpaisarn

Scalable K-clique Estimation with Differential Privacy
Dung Nguyen, Ritwick Mishra, Anil Vullikanti

SPARTA: An Optimization Framework for Differentially Private Sparse Fine-Tuning
Mehdi Makni, Rahul Mazumder, Kayhan Behdin, Gabriel Afriat, Natalia Ponomareva, Zheng Xu, Sergei Vassilvitski, Hussein Hazimeh

Tight Auditing of Differential Privacy in MST and AIM
Georgi Ganev, Meenatchi Sundaram Mutu Selva Annamalai, Bogdan Kulynych

Tight Bounds for Answering Adaptively Chosen Concentrated Queries
Emma Rapoport, Edith Cohen, Uri Stemmer

Understanding Private Evolution as Learning-Augmented Clustering
Audra McMillan, Kunal Talwar, Felix Zhou

VDDP: Verifiable Distributed Differential Privacy under the Client-Server-Verifier Setup
Haochen Sun, Xi He

Poster Session C

A Community-Driven Differential Privacy Deployment Registry
Micah Altman, Sharon Gibbons, Rachel Cummings, Damien Desfontaines, Jack Fitzsimons, Elena Ghazi, Andrew Gruen, James Honaker, Gary Howarth, Nitin Kohli, Chuck McCallum, Priyanka Nanayakkara, Joseph P. Near, Robert Pisarczyk, Salil Vadhan

A Law of Data Reconstruction for Random Features (and Beyond)
Leonardo Iurada, Simone Bombari, Tatiana Tommasi, Marco Mondelli

A Unified Framework for Adversary-Aware Differential Privacy Bounds
Marika Swanberg, Meenatchi Sundaram Muthu Selva Annamalai, Jamie Hayes, Borja Balle, Adam Smith

ACTG-ARL: Differentially Private Conditional Text Generation with RL-Boosted Control
Yuzheng Hu, Ryan McKenna, Da Yu, Shanshan Wu, Han Zhao, Zheng Xu, Peter Kairouz

Aim High, Stay Private: Differentially Private Synthetic Data Enables Public Release of Behavioral Health Information with High Utility
Mohsen Ghasemizade, Juniper Lovato, Christopher M. Danforth, Peter Sheridan Dodds, Laura S P Bloomfield, Matthew Price, Joseph Near

Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents
Toan Tran, Olivera Kotevska, Li Xiong

Black-Box Differentially Private Cheap-Subsample Confidence Intervals
Soufiane Fafe, Mohammed Jamal

Blameless Users in a Clean Room: Defining Copyright Protection for Generative Models
Aloni Cohen

Certified Machine Unlearning for High Dimensional Models
Haolin Zou, Arnab Auddy, Yongchan Kwon, Kamiar Rahnama Rad, Arian Maleki

Characterizing Online and Private Learnability under Distributional Constraints via Generalized Smoothness
Moise Blanchard, Abhishek Shetty, Alexander Rakhlin

Computationally Efficient Replicable Learning of Parities
Moshe Noivirt,Jessica Sorrell,Eliad Tsfadia

Developing Unsupervised Learning Techniques for Finding Information Leakage in US Equities Trading Data
A. Americo, A. Bishop, P. Cesaretti, G. Grogan, S. Markelon, R. Moss, L. Oakley, A. Shahi, M. Shokri

Differentially Private Geodesic Regression
Aditya Kulkarni, Carlos Soto

Differentially Private High-Dimensional Variable Selection via Integer Programming
Petros Prastakos, Kayhan Behdin, Rahul Mazumder

Differentially Private Sparse Reward Estimation with Preference Feedback
Meng Ding, Mingxi Lei, Jie Zhang, Jinyan Liu, Di Wang

DPSQL+: A Differentially Private SQL Library with a Minimum Frequency Rule
Tomoya Matsumoto, Shokichi Takakura, Shun Takagi, Satoshi Hasegawa

Efficient and Optimal Learning of Discrete Distributions with Person-Level Privacy
Gautam Kamath, Mahbod Majid, Ankur Moitra, Argyris Mouzakis, and Jonathan Ullman

f-Differential Privacy Filters: Validity and Approximate Solutions
Long Tran, Antti Koskela, Ossi Räisä, Antti Honkela

Geometric Garbling: Efficient Two-Party Computation for Differentially Private Mechanisms
Arisa Tajima, Adam O'Neill, Wei Jiang, Gerome Miklau

Hamiltonian Monte Carlo for Bayesian Inference on Privatized Data
Arin Chang, Jordan Awan, Vinayak Rao

How to Motivate Differential Privacy Adoption in New Domains?
Lu Xian, Jayshree Sarathy, Sean Kross, Gabriel Kaptchuk

JAX-Privacy: A Library for Differentially Private Machine Learning
Ryan McKenna, Galen Andrew, Borja Balle, Vadym Doroshenko, Arun Ganesh, Weiwei Kong, Alex Kurakin, Brendan McMahan, Mikhail Pravilov

LoRA and Privacy: When Random Projections Help (and When They Don't)
Yaxi Hu, Johanna Düngler, Bernhard Schölkopf, Amartya Sanyal

Normalized Square Root: Sharper Matrix Factorization Bounds for Differentially Private Continual Counting
Monika Henzinger, Nikita P. Kalinin, Jalaj Upadhyay

On User-Level Differential Privacy and Timing Attacks
Elena Ghazi, Zachary Ratliff

Optimal Partition Selection with Renyi Differential Privacy
Charlie Harrison, Pasin Manurangsi

Oracle Efficiency for Differential Privacy and Small Loss Online Learning: A Gaussian Process Perspective
Adam Block, Abhishek Shetty

Privacy-Preserving Information Sharing in Oligopoly Competitions
Yuxin Liu, M. Amin Rahimian

P4T: Accuracy-Aware Differentially Private Query Processing with Foreign Keys
Shufan Zhang, Max Tang, Yuhan Liu, Xi He

Practical and Accurate Local Edge Differentially Private Graph Algorithms
Pranay Mundra, Charalampos Papamanthou, Julian Shun, Quanquan C. Liu

Private Prediction via Shrinkage
Chao Yan

Privately Fine-Tuned LLMs Preserve Temporal Dynamics in Tabular Data
Lucas Rosenblatt, Peihan Liu, Ryan McKenna, Natalia Ponomareva

Protecting the Undeleted in Machine Unlearning
Aloni Cohen, Refael Kohen, Kobbi Nissim, Uri Stemmer

Query-Efficient Locally Private Hypothesis Selection via the Scheffé Graph
Gautam Kamath, Alireza F. Pour, Matthew Regehr, David P. Woodruff

Random Variable Commitments for Any Sampleable Distribution, and a Certified Laplace Mechanism
Fredrik Meisingseth, Christian Rechberger, Fabian Schmid

Reconciling Differnetially Private Medical Data and Model Sharing with Data Protection
Bogdan Kulynych, Farah Briki, Jean Louis Raisaro

Robust and Differentially Private Principal Component Analysis
Minwoo Kim, Sungkyu Jung

Structural Privacy via Orbit Invariance: Mixing Amplification for Terminal and Transition-Flow Releases
Fengnan Deng, Anand Vidyashankar

The Normal Distributions Indistinguishability Spectrum and its Application to Privacy-Preserving Machine Learning
Yu Wei, Yun Lu, Malik Magdon-Ismail, Vassilis Zikas

To Count or Not to Count: Practical DP Mean Estimation with Unknown Dataset Size
Marcel Neunhoeffer, Shlomi Hod, Joerg Drechsler

Adaptive Sampling for Private Worst-case Group Optimization
Max Cairney-Leeming, Amartya Sanyal, Christoph H. Lampert

Virtual Poster Presentations

A Simplified Approach for Tradeoffs in Differential Privacy
Mohamad Senno, Jihad Fahs, Razane Tajeddine
[Click Here for Video]

An Efficient Private Algorithm for Community Detection
Vincent Cohen-Addad, Alessandro Epasto, Haim Kaplan, Hanna Komlós, Silvio Lattanzi
[Click Here for Video]

Clipping Calibration Under Structured Sparsity for Noise-Efficient Differentially Private Training
Deepak Singh Kalhan

Differential Privacy Configurations in the Real World: A Comparative Analysis
Michael Khavkin, Eran Toch
[Click Here for Video]

DP-SPRT: Differentially Private Sequential Probability Ratio Tests
Thomas Michel, Debabrota Basu, Emilie Kaufmann
[Click Here for Video]

GEM+: Scalable Differentially Private Synthetic Data with Generator Networks
Samuel Maddock, Shripad Gade, Graham Cormode, Will Bullock

Metric-Aware Private Approximate Near Neighbors
Martin Aumüller, Nikolaj Munk Binder Jensen
[Click Here for Video]

On the Curse of Dimensionality in Private Sparse Covariance Estimation and PCA
Syamantak Kumar, Shourya Pandey, Purnamrita Sarkar, Kevin Tian
[Click Here for Video]

One-Shot Private Confidence Regions via Resampling
Po-Ling Loh, Debepsita Mukherjee, Shourya Pandey, Purnamrita Sarkar

Private Adaptive Covariance Estimation via Gaussian Graphical Models
Cecilia Ferrando, Miguel Fuentes, Brett Mullins, Daniel Sheldon
[Click Here for Video]

Prophet Inequalities under Local Differential Privacy
Achraf Azize, Mathieu Molina, Hugo Richard, Vianney Perchet

Refined Differentially Private Linear Regression via Extension of a Free Lunch result
Sasmita Harini S, Anshoo Tandon
[Click Here for Video]

Call for Papers

Differential privacy (DP) is the leading framework for data analysis with rigorous privacy guarantees. In the last two decades, it has transitioned from the realm of pure theory to large scale, real world deployments.

Differential privacy is an inherently interdisciplinary field, drawing researchers from a variety of academic communities including machine learning, statistics, security, theoretical computer science, databases, and law. The combined effort across a broad spectrum of computer science is essential for differential privacy to realize its full potential. To this end, this workshop aims to stimulate discussion among participants about both the state-of-the-art in differential privacy and the future challenges that must be addressed to make differential privacy more practical.

New this year! We will be hosting a special session on "Differential Privacy for Health" and are especially encouraging submissions aligned with this theme.

Specific topics of interest for the workshop include (but are not limited to):

Theory of DP
DP and security
Privacy preserving machine learning
DP and statistics
DP and data analysis
Trade-offs between privacy protection and analytic utility
DP and surveys
Programming languages for DP
Relaxations of DP
Relation to other privacy notions and methods
Experimental studies using DP
DP implementations
DP and policy making
Applications of DP
Reconstruction attacks and memorization

Submissions: Authors are invited to submit a short abstract of new work or work published since June 2025 (the most recent TPDP submission deadline). Submissions must be 4 pages maximum, not including references. Submissions may also include appendices, but these are only read at reviewer's discretion. There is no prescribed style file, but authors should ensure a minimum of 1-inch margins and 10pt font. Submissions are not anonymized, and should include author names and affiliations.

Submissions will undergo a lightweight review process and will be judged on originality, relevance, interest, and clarity. Based on the volume of submissions to TPDP 2025 and the workshop's capacity constraints, we expect that the review process will be somewhat more competitive than in years past. Accepted abstracts will be presented at the workshop either as a talk or a poster.

The workshop will not have formal proceedings and is not intended to preclude later publication at another venue. In-person attendance is encouraged, though authors of accepted abstracts who cannot attend in person will be invited to submit a short video to be linked on the TPDP website.

Selected papers from the workshop will be invited to submit a full version of their work for publication in a special issue of the Journal of Privacy and Confidentiality.

Important Dates

Abstract Submission: February 18, 2026 (AoE)
Notification: April 2, 2026
Workshop: June 1-2, 2026

Diamond Tier Sponsor

Apple logo

Platinum Tier Sponsor

Google logo

Submission website

https://tpdp26.cs.uchicago.edu

For concerns regarding submissions, please contact tpdp.chairs@gmail.com

Organizing and Program Committee

Amrita Roy Chowdhury (co-chair)
University of Michigan
Jayshree Sarathy (co-chair)
Northeastern University
Adam Sealfon
Google Research
Adam Smith
Boston University
Ajinkya Kiran Mulay
Meta
Alejandro Russo
Dpella
Aleksandar Nikolov
University of Toronto
Alessandro Epasto
Google Research
Alexandra Wood
Purdue University
Amin Rahimian
University of Pittsburgh
Andrew Lowy
CISPA Helmholtz Center
Antti Honkela
University of Helsinki
Antti Koskela
Nokia Bell Labs
Anupama Nandi
Yale University
Arun Ganesh
Google Research
Audra McMillan
Apple
Ayelet Gordon-Tapiero
Hebrew University
Brett Mullins
UMass Amherst
Chiké Abuah
Walla Walla University
Christian Janos Lebeda
Inria
Connor Wagaman
Boston University
Daniel Kifer
Penn State University
Daogao Liu
Google
Dima Usynin
Huawei Research / Technical University of Munich
Dung Nguyen
Haverford College
Edo Roth
Google
Edwige Cyffers
CNRS
Eli Chien
National Taiwan University
Enayat Ullah
Meta
Fan Wu
University of Illinois Urbana-Champaign
Felix Zhou
Yale University
Georgi Ganev
UCL / SAS
Gerome Miklau
Umass Amherst / LinkedIn
Hal Triedman
Cornell University
Hao Wu
University of Waterloo
Hilal Asi
Apple
Jacob Imola
University of Waterloo
Jalaj Upadhyay
Rutgers University
Jatan Loya
Google
Jiayuan Ye
National University of Singapore
Joann Chen
San Diego State University
Joel Daniel Andersson
Institute of Science and Technology Austria
Joerg Drechsler
Institute for Employment Research
Johes Bater
Tufts University
John Abascal
Northeastern University
Jonathan Ullman
Northeastern University
Jordan Awan
University of Pittsburgh
Joseph Near
University of Vermont
Kunal Talwar
Apple
Lucas Rosenblatt
New York University
Ludmila Glinskih
Google
Lukas Retschmeier
University of Copenhagen
Lydia Zakynthinou
Johns Hopkins University
Marika Swanberg
Google
Mahdi Haghifam
TTIC
Matthew Joseph
Google Research
Miguel Fuentes
University of Massachusetts Amherst
Myna Vajha
IIT Hyderabad
Nitin Kohli
UC Berkeley
Onyinye Dibia
University of Vermont
Or Sheffet
Bar Ilan University
Palak Jain
Boston University
Pasin Manurangsi
Google Research
Peter Kairouz
Google
Pierre Tholoniat
Google
Prajjwal Gupta
Cloudflare Research
Quentin Hillebrand
University of Copenhagen
Rachel Cummings
Columbia University
Ruihan Wu
Openai
Ryan McKenna
Google Research
Shengyuan Hu
Meta
Shlomi Hod
Weizenbaum Institute
Shubhankar Mohapatra
University of Waterloo
Shufan Zhang
University of Waterloo
Sofya Raskhodnikova
Boston University
Stacey Truex
Denison University
Tal Wagner
Tel Aviv University and Amazon
Tamalika Mukherjee
Max Planck Institute for Security and Privacy
Tatsuki Koga
Apple
Thomas Steinke
Google DeepMind
Tianhao Wang
University of Virginia
Tudor Cebere
Inria
Vikrant Singhal
University of Copenhagen
Vitaly Feldman
Apple
Viveck Cadambe
Georgia Institute of Technology
Xingyu Zhou
Wayne State University
Youssef Allouah
Stanford University
Yuzheng Hu
University of Illinois Urbana-Champaign
Zachary Ratliff
Harvard University
Zeyu Ding
Binghamton University

TPDP 2026 - Theory and Practice of Differential Privacy

Boston - June 1-2, 2026