Sampling Strategies for GAN Synthetic Data

Binod Bhattarai, Seungryul Baek, Rumeysa Bodur, Tae-Kyun Kim

Research output: Chapter in Book/Report/Conference proceedingPublished conference contribution

Abstract

Generative Adversarial Networks (GANs) have been used widely to generate large volumes of synthetic data. This data is being utilised for augmenting with real examples in order to train deep Convolutional Neural Networks (CNNs). Studies have shown that the generated examples lack sufficient realism to train deep CNNs and are poor in diversity. Unlike previous studies of randomly augmenting the synthetic data with real data, we present our simple, effective and easy to implement synthetic data sampling methods to train deep CNNs more efficiently and accurately. To this end, we propose to maximally utilise the parameters learned during training of the GAN itself. These include discriminator's realism confidence score and the confidence on the target label of the synthetic data. In addition to this, we explore reinforcement learning (RL) to automatically search a subset of meaningful synthetic examples from a large pool of GAN synthetic data. We evaluate our method on two challenging face attribute classification data sets viz. AffectNet and CelebA. Our extensive experiments clearly demonstrate the need of sampling synthetic data before augmentation, which also improves the performance of one of the state-of-the-art deep CNNs in vitro.
Original languageEnglish
Title of host publication2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Pages2303-2307
Number of pages5
DOIs
Publication statusPublished - 2020
Externally publishedYes
EventICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Barcelona, Spain , Barcelona, Spain
Duration: 4 May 20208 May 2020

Conference

ConferenceICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Country/TerritorySpain
CityBarcelona
Period4/05/208/05/20

Fingerprint

Dive into the research topics of 'Sampling Strategies for GAN Synthetic Data'. Together they form a unique fingerprint.

Cite this