This repository includes the implementations of "MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection"(accepted by Interspeech 2024).
The code will be publicly available after the interspeech2024 conference(September 1st to 5th, 2024).