Shivam Aggarwal

Followers

Following

Public Views

Uploads

Papers by Shivam Aggarwal

Robust and Resource-Efficient Data-Free Knowledge Distillation by Generative Pseudo Replay

Proceedings of the AAAI Conference on Artificial Intelligence

Data-Free Knowledge Distillation (KD) allows knowledge transfer from a trained neural network (te... more Data-Free Knowledge Distillation (KD) allows knowledge transfer from a trained neural network (teacher) to a more compact one (student) in the absence of original training data. Existing works use a validation set to monitor the accuracy of the student over real data and report the highest performance throughout the entire process. However, validation data may not be available at distillation time either, making it infeasible to record the student snapshot that achieved the peak accuracy. Therefore, a practical data-free KD method should be robust and ideally provide monotonically increasing student accuracy during distillation. This is challenging because the student experiences knowledge degradation due to the distribution shift of the synthetic data. A straightforward approach to overcome this issue is to store and rehearse the generated samples periodically, which increases the memory footprint and creates privacy concerns. We propose to model the distribution of the previously ...

Download

Robust and Resource-Efficient Data-Free Knowledge Distillation by Generative Pseudo Replay

Proceedings of the AAAI Conference on Artificial Intelligence

Download

Shivam Aggarwal

Uploads

Papers by Shivam Aggarwal

Log In