How to Secure Data When Computing with PAC Privacy

By Dick Weisinger

Data privacy is a major concern for many applications of machine learning, especially when sensitive information such as medical records or personal images is involved. How can we share useful models without revealing the data they were trained on?

A new technique developed by MIT researchers could offer a solution. It is called Probably Approximately Correct (PAC) Privacy, and it allows users to automatically determine the minimal amount of noise that needs to be added to a model to protect the data from adversaries.

Unlike other privacy approaches, PAC Privacy does not require knowledge of the model’s architecture or training process. It only focuses on the output of the model and how hard it would be for an adversary to reconstruct any part of the data from it.

For example, if the data are images of human faces, PAC Privacy could measure whether an adversary could extract a recognizable silhouette of a face from the model, rather than just whether they could tell if a face was in the dataset or not.

The user can specify their desired level of confidence and accuracy for the privacy guarantee. For instance, they may want to ensure that an adversary will not be more than 1% confident that they have successfully reconstructed the data to within 5% of its actual value. The PAC Privacy algorithm will then tell the user the optimal amount of noise that needs to be added to the model before it is shared publicly.

The researchers show that PAC Privacy can significantly reduce the amount of noise needed to protect sensitive data, compared to other methods. This could help preserve the accuracy and utility of machine-learning models in real-world settings, while still ensuring data privacy.

PAC Privacy is a novel and powerful framework that exploits the uncertainty or entropy of the data in a meaningful way. It could enable engineers and scientists to share their models with confidence, without compromising the privacy of their data sources.

August 3rd, 2023

Category: Computing, Privacy, Security

Leave a Reply Cancel reply

Legal Terms & Disclaimers

This blog site is accessed from the website of Formtek, Inc. All visitors to or users of this blog site are subject to the terms and conditions and privacy policy that govern the Formtek website, links for which are provided above.

Some of the individuals posting to this blog site, including the moderators, work for Formtek. Postings by these individuals are the personal opinions of these individuals, not of Formtek. Their posted content is provided for informational purposes only and is not meant to be an endorsement or representation by Formtek or any other party. Postings to this blog site may be outdated, invalid or inaccurate by the time you read them. Individuals posting to this blog site make no statements, representations or warranties as to the timing, validity, accuracy or reliability of their postings.

This blog site may contain links to third party sites. Access to any third party site linked to this blog site is at your own risk. None of Formtek, the blog site moderator(s) and the individuals posting on this blog site that work for Formtek is responsible for the timing, validity, accuracy or reliability of any information, data, opinions, advice or statements made on these third party sites. These links are provided merely as a convenience and do not imply any endorsement.

Postings to this blog site are available to the public. You should not post, link to or otherwise upload any information considered confidential to this blog site. All postings to this blog site are moderated. Postings will appear if and when they are approved by the moderator. Notwithstanding any approval by the moderator, by posting information to this blog site, you agree to be solely responsible for the information you post, link to, or otherwise upload to the blog site. You agree to release Formtek from any liability related to that information or to your use of the blog site. You grant Formtek a worldwide, perpetual, irrevocable, royalty-free, fully-paid, and transferable (including rights to sublicense) right to exercise all copyright, publicity, and moral rights with respect to any information you post, link to or otherwise upload to this blog site.

How to Secure Data When Computing with PAC Privacy

Leave a Reply Cancel reply

Company

Products and Services

News

Resources

Legal Terms & Disclaimers