liuyugeng / ml-doctor

Code for ML Doctor

License: Apache License 2.0

machine-learning membership-inference-attack attribute-inference-attack model-inversion-attacks model-stealing differential-privacy knowledge-distillation


ML-Doctor Demo Code


This is the demo code for our USENIX Security 2022 paper "ML-Doctor: Holistic Risk Assessment of Inference Attacks Against Machine Learning Models".

Please find the updated version in our lab's repo.

Building Datasets

We prefer that users provide their own data loader, but we include a demo data loader in the code. Due to their size, the datasets are not uploaded to GitHub.

For UTKFace, the UTKFace folder contains two subfolders downloaded from the official website. The first is the "processed" folder, which contains the three landmark_list files (also available from the official website); it is used to quickly obtain the image names, since all image attributes can be read from the file names. The second is the "raw" folder, which contains all the aligned and cropped images.

For the CelebA dataset, the "celeba" folder contains one subfolder and three files. The "img_celeba" folder holds all the images downloaded from the official website, which we align and crop ourselves. The three files, "identity_CelebA.txt", "list_attr_celeba.txt", and "list_eval_partition.txt", are used to obtain the attributes and file names. The crop center is [89, 121], but it is fine if users prefer not to crop, because a resize in the transforms keeps the input shapes consistent.
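As a minimal sketch, the crop box around the [89, 121] center mentioned above can be computed like this. The helper name and the 128-pixel square crop size are illustrative assumptions, not taken from the repository:

```python
# Hedged sketch: derive a (left, upper, right, lower) box, as used by
# PIL's Image.crop(), from the CelebA crop center [89, 121].
# The 128-pixel crop size is an assumption for illustration.

def crop_box(center_xy, crop_size):
    """Return (left, upper, right, lower) for a square crop around a center."""
    cx, cy = center_xy
    half = crop_size // 2
    return (cx - half, cy - half, cx - half + crop_size, cy - half + crop_size)

box = crop_box((89, 121), 128)  # (25, 57, 153, 185)
```

Because the demo pipeline resizes afterwards, the exact crop size chosen here does not affect the model input shape.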

For FMNIST and STL10, PyTorch offers datasets that can be employed directly.

Preparing

Users should install Python 3 and PyTorch first. To train differentially private shadow models, opacus is also required; following the official documentation, we recommend installing it with conda.

Or directly run pip install -r requirements.txt.

Testing

python demo.py --attack_type X --dataset_name Y

Attack Type    0         1         2          3
Name           MemInf    ModInv    AttrInf    ModSteal

For --dataset_name, there are four datasets in the code: CelebA, FMNIST (Fashion-MNIST), STL10, and UTKFace.

For AttrInf, users should provide two attributes on the command line in the format "X_Y"; only CelebA and UTKFace contain two attributes, e.g., python demo.py --attack_type 2 --dataset_name UTKFace --attributes race_gender
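A minimal sketch of parsing the "X_Y" attribute argument described above; the function name is illustrative only, not the repository's actual parser:

```python
# Hedged sketch: split an AttrInf attribute argument such as "race_gender"
# into its two attribute names, rejecting anything that is not exactly two.

def parse_attributes(arg):
    attrs = arg.split("_")
    if len(attrs) != 2:
        raise ValueError("AttrInf expects exactly two attributes, e.g. 'race_gender'")
    return attrs

print(parse_attributes("race_gender"))  # ['race', 'gender']
```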

For MemInf

We have four modes in this function

Mode    0                  1                   2                   3
Name    BlackBox Shadow    BlackBox Partial    WhiteBox Partial    WhiteBox Shadow

When building the attack dataset

When using mode 0 or mode 3, i.e., with shadow models, users should choose the get_attack_dataset_with_shadow function. For the others (mode 1 and mode 2), use the get_attack_dataset_without_shadow function.

When choosing the attack model

When using mode 0, attack_model should be ShadowAttackModel, while PartialAttackModel is the attack_model for mode 1 in black-box. For white-box (mode 2 and mode 3), users need to change attack_model to WhiteBoxAttackModel. Users can also define their own attack models, so we did not fix the models here.

Note: ShadowAttackModel and PartialAttackModel are identical in the code.
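The mode table and the model choices above can be tied together in one lookup. This dictionary is purely illustrative (the actual wiring lives in the demo script), using the function and class names quoted above:

```python
# Hedged sketch: each MemInf mode selects an attack-dataset builder and an
# attack model class. Names are taken from the README text; the mapping
# itself is an illustration, not the repository's code.

MEMINF_MODES = {
    0: ("BlackBox Shadow",  "get_attack_dataset_with_shadow",    "ShadowAttackModel"),
    1: ("BlackBox Partial", "get_attack_dataset_without_shadow", "PartialAttackModel"),
    2: ("WhiteBox Partial", "get_attack_dataset_without_shadow", "WhiteBoxAttackModel"),
    3: ("WhiteBox Shadow",  "get_attack_dataset_with_shadow",    "WhiteBoxAttackModel"),
}

mode_name, dataset_fn, attack_model = MEMINF_MODES[0]
```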

For ModInv

For the Secret Revealer method, users should pre-train an evaluation model with a ResNet18 architecture and name it <model name> + "_eval.pth" (e.g., "UTKFace_eval.pth"), stored in the same path as the target model.
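A tiny sketch of the naming convention described above, assuming the evaluation checkpoint shares the target model's path prefix (the example path is hypothetical):

```python
# Hedged sketch: build the evaluation-model path from the common model path
# prefix, mirroring the "_eval.pth" suffix convention described above.

def eval_model_path(base_path):
    """e.g. "models/UTKFace" -> "models/UTKFace_eval.pth" (example path)."""
    return base_path + "_eval.pth"

eval_model_path("models/UTKFace")  # 'models/UTKFace_eval.pth'
```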

For AttrInf and ModSteal

There are two general modes, i.e., partial and shadow. Users can change the training set in the main function.

Citation

Please cite this paper in your publications if it helps your research:

@inproceedings{LWHSZBCFZ22,
author = {Yugeng Liu and Rui Wen and Xinlei He and Ahmed Salem and Zhikun Zhang and Michael Backes and Emiliano De Cristofaro and Mario Fritz and Yang Zhang},
title = {{ML-Doctor: Holistic Risk Assessment of Inference Attacks Against Machine Learning Models}},
booktitle = {{USENIX Security Symposium (USENIX Security)}},
pages = {4525-4542},
publisher = {USENIX},
year = {2022}
}

License

ML-Doctor is freely available for non-commercial use and may be redistributed under these conditions. For commercial queries, please send an e-mail to [email protected]; we will send you the detailed agreement.

ml-doctor's People

Contributors: ioo0s, liuyugeng, mishuni


ml-doctor's Issues

Needs migrating to Opacus 1.0

The changes introduced in the Opacus 1.0 API mean that this code does not work with the current version and needs to be migrated. For example, simply following the README instructions results in the following error:

ImportError: cannot import name 'module_modification' from 'opacus.utils'
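As a migration sketch for the error above: in Opacus >= 1.0, opacus.utils.module_modification was removed, and ModuleValidator.fix() is the replacement for converting DP-SGD-incompatible layers such as BatchNorm. The helper below is illustrative, with the import guarded so it degrades to a no-op when Opacus is absent:

```python
# Hedged migration sketch (not the repository's code): replace the removed
# opacus.utils.module_modification usage with ModuleValidator from
# Opacus >= 1.0, if it is installed.
try:
    from opacus.validators import ModuleValidator  # Opacus >= 1.0 API
    HAVE_OPACUS = True
except ImportError:
    HAVE_OPACUS = False

def make_dp_compatible(model):
    """Replace layers unsupported by DP-SGD (e.g. BatchNorm), if Opacus is available."""
    if not HAVE_OPACUS:
        return model  # no-op in environments without Opacus
    return ModuleValidator.fix(model)
```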

When I used the membership inference attack, the matrix calculation during attack model training was incorrect

$ python demo.py --attack_type 0 --dataset_name mnist

Traceback (most recent call last):
  File "demo.py", line 250, in <module>
    main()
  File "demo.py", line 227, in main
    test_meminf(TARGET_PATH, device, num_classes, target_train, target_test, shadow_train, shadow_test, target_model, shadow_model, train_shadow, use_DP, noise, norm, delta)
  File "demo.py", line 104, in test_meminf
    attack_mode0(PATH + "_target.pth", PATH + "_shadow.pth", PATH, device, attack_trainloader, attack_testloader, target_model, shadow_model, attack_model, 1, num_classes)
  File "/mnt/f/PYProject/doctor/meminf.py", line 748, in attack_mode0
    res_train = attack.train(flag, RESULT_PATH)
  File "/mnt/f/PYProject/doctor/meminf.py", line 289, in train
    results = self.attack_model(output, prediction)
  File "/home/ddq/miniconda3/envs/opacus/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/mnt/f/PYProject/utils/define_models.py", line 48, in forward
    Prediction_Component_result = self.Prediction_Component(prediction)
  File "/home/ddq/miniconda3/envs/opacus/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/ddq/miniconda3/envs/opacus/lib/python3.7/site-packages/torch/nn/modules/container.py", line 141, in forward
    input = module(input)
  File "/home/ddq/miniconda3/envs/opacus/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/ddq/miniconda3/envs/opacus/lib/python3.7/site-packages/torch/nn/modules/linear.py", line 103, in forward
    return F.linear(input, self.weight, self.bias)
RuntimeError: mat1 and mat2 shapes cannot be multiplied (1x64 and 1x128)

I just added the following code to the dataloader, and I can't find the problem:

elif dataset_name.lower() == "mnist":
    num_classes = 10
    transform = transforms.Compose([
        transforms.Resize((64, 64)),
        transforms.ToTensor(),
        transforms.Normalize((0.1307,), (0.3081,))
    ])

    train_set = torchvision.datasets.MNIST(
        root=root, train=True, download=True, transform=transform)
    test_set = torchvision.datasets.MNIST(
        root=root, train=False, download=True, transform=transform)

    dataset = train_set + test_set
    input_channel = 1

UTKFace dataset error

When trying to run the demo/tutorial code with the UTKFace dataset, it produces the following error:

ML-Doctor/demoloader/dataloader.py", line 73, in __getitem__
    gender = int(attrs[1])
ValueError: invalid literal for int() with base 10: ''

Printing the attrs shows that the problem is the missing gender field:

['53', '', '0', '20170116184028385.jpg']

which seems to be a problem on line 4172 of the data file landmark_list_part2.txt: 53__0_20170116184028385.jpg

There seem to be two entries for that jpg, since it is repeated on line 4213 as 62_1_0_20170116184028385.jpg.

Removing line 4172 fixes the error.

Is this the correct way to preprocess the UTKFace data? If so, perhaps a note in the README would help others avoid the pain?
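Instead of deleting the offending line by hand, the malformed entry discussed above could be filtered out when reading the landmark list. This is a hedged sketch (the function name is hypothetical), assuming the UTKFace file-name scheme of underscore-separated age, gender, and race fields:

```python
# Hedged sketch: skip landmark_list entries whose age/gender/race fields are
# incomplete, such as "53__0_20170116184028385.jpg" (empty gender field).

def valid_utkface_entry(line):
    attrs = line.strip().split("_")
    # expect age, gender, race, then the file name
    return len(attrs) >= 4 and all(attrs[i] != "" for i in range(3))

valid_utkface_entry("53__0_20170116184028385.jpg")   # False (missing gender)
valid_utkface_entry("62_1_0_20170116184028385.jpg")  # True
```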

How did you train UTKFace_eval.pth?

Hi,
I am trying to run the Secret Revealer mode of the ModInv attack, but I am confused about how to train the evaluation model.

As far as I know, the evaluation model should be trained to detect the identity of the face in each file.
However, the UTKFace dataset is not labeled with the identity.

Could you tell me how you trained the evaluation model for UTKFace?
