This repository contains the code used in the paper "Target-specific Dataset Pruning for Compression of Audio Tagging Models".
The proposed method combines model compression and domain adaptation to obtain more efficient audio tagging models. It consists of three steps: data pruning, knowledge distillation, and a final fine-tuning step.
This method was used for a submission to Task 1 of the DCASE Challenge 2024.
Install the requirements from requirements.txt and then install the package itself (e.g. with `pip install -e .`).
Make sure to adapt the system path configuration, or add your own, in target_distillation/conf/system/*.yaml.
Refer to audio-data to obtain a webdataset version of AudioSet.
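For orientation, iterating over WebDataset shards with the webdataset library looks roughly like this (the shard pattern below is a placeholder, not the layout produced by audio-data):

```python
import webdataset as wds

# Placeholder shard pattern; point this at wherever your AudioSet
# shards actually live.
dataset = wds.WebDataset("audioset-train-{000000..000099}.tar")

for sample in dataset:
    # Without a decoder, each sample is a dict mapping file extensions
    # to raw bytes, plus a "__key__" entry identifying the clip.
    print(sample["__key__"], list(sample.keys()))
    break
```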
ESC-50 can be downloaded from its official repository (https://github.com/karolpiczak/ESC-50).
Create logit datasets for both AudioSet and the target datasets using target_distillation/create_ensemble_embeddings.py.
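The core of such a logit dataset is the averaged pre-softmax output of the teacher ensemble for every clip; a minimal sketch (the `teachers` list and its models are assumptions, not the script's exact setup):

```python
import torch

@torch.no_grad()
def ensemble_logits(teachers, waveform):
    """Average the logits of all teacher models for one audio clip."""
    logits = torch.stack([teacher(waveform) for teacher in teachers])
    return logits.mean(dim=0)  # per-clip logits, stored for distillation
```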
The notebook in domain_classifier can be used to train a domain classifier model for a given dataset. For new datasets, a dataset class needs to be added in target_distillation/data (see the sketch below).
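A hypothetical skeleton of such a dataset class, assuming a standard PyTorch `Dataset` (the actual base class and signatures in target_distillation/data may differ):

```python
import torchaudio
from torch.utils.data import Dataset

class MyTargetDataset(Dataset):
    """Illustrative dataset over a list of (audio_path, label) pairs."""

    def __init__(self, file_list, sample_rate=32000):
        self.file_list = file_list
        self.sample_rate = sample_rate

    def __len__(self):
        return len(self.file_list)

    def __getitem__(self, idx):
        path, label = self.file_list[idx]
        waveform, sr = torchaudio.load(path)
        if sr != self.sample_rate:
            # Resample to the rate the models expect.
            waveform = torchaudio.functional.resample(waveform, sr, self.sample_rate)
        return waveform, label
```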
With the trained domain classifier, AudioSet can be filtered using the create_wds.py script.
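Conceptually, the pruning step keeps only the AudioSet clips that the domain classifier scores as close to the target domain; a sketch of that decision (the classifier interface and the 0.5 threshold are illustrative assumptions):

```python
import torch

@torch.no_grad()
def keep_clip(domain_classifier, waveform, threshold=0.5):
    """Return True if the clip is predicted to belong to the target domain."""
    p_target = torch.sigmoid(domain_classifier(waveform)).item()
    return p_target >= threshold
```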
Use the ex_distill.py script to distill a model; a fine-tuning run is performed automatically after the training finishes.
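Since audio tagging is a multi-label task, distillation in this kind of setup typically matches the teacher's per-class probabilities; a minimal sketch (the exact loss and weighting in ex_distill.py may differ):

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits):
    """Distill by matching the teacher's per-class (sigmoid) probabilities."""
    return F.binary_cross_entropy_with_logits(
        student_logits, torch.sigmoid(teacher_logits)
    )
```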
If any files or packages are missing, please let me know.
The project was funded by the Federal Ministry of Education and Research (BMBF) under grant no. 01IS22094E WEST-AI.