Here are the details for preprocessing datasets in 2 steps. We provide the preprocess tools for
- celeba
- facescrub
- ffhq64
- ffhq256
- metfaces256
- afhqdog256
Note that when using the celeba64
and facescrub64
datasets you can directly use the transform Resize((64,64))
in torchvision on celeba112
and facescrub112
datasets respectively.
Download celeba dataset from here.
The structure of the dataset is as follows:
<DOWNLOAD_PATH>
├── img_align_celeba
├── identity_CelebA.txt
├── list_attr_celeba.txt
├── list_bbox_celeba.txt
├── list_eval_partition.txt
├── list_landmarks_align_celeba.txt
└── list_landmarks_celeba.txt
For celeba
with low resolution, you can directly use your download file above for step 3.
For celeba
with high resolution (e.g. img_align_celeba
.
python align.py --crop_size_h 224 --crop_size_w 224 --order 3 --save_format png --face_factor 0.65 --n_worker 32
Use this script to download facescrub and some links are unavailable.
The structure of the dataset is as follows:
<DOWNLOAD_PATH>
├── actors
│ └── faces
└── actresses
└── faces
For ffhq64
, download thumbnails128x128.
For ffhq256
, download images1024x1024.
Download here.
Follow StyleGAN2-ada to download afhqdog dataset.
python dataset_tool.py --source=~/downloads/afhq/train/dog --dest=~/datasets/afhqdog.zip
Fill the relative path for relative scripts in examples/standard/datasets and run the scripts. Note that FaceScrub dataset do not need to be preprocessed. The parameters are as follows:
- src_path: The path for the dataset you download.
- dst_path: The path for the preprocessed dataset.
- split_file_path: Only
celeba
need this parameter. We provide split files to split the dataset into train and test subset forceleba
. Split files are available at here. Note that you need to unzip the file.
The file structure of split files for celeba
is as follows:
split_files/
├── private_test.txt
├── private_train.txt
└── public.txt