Giter Site home page Giter Site logo

xiaojieli0903 / maskagain Goto Github PK

View Code? Open in Web Editor NEW
23.0 2.0 0.0 424 KB

Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)

Home Page: https://dl.acm.org/doi/10.1145/3581783.3612129

License: Other

Shell 3.69% Python 96.31%
knowledge-distillation masked-video-modeling video-representation-learning

maskagain's Introduction

Official PyTorch Implementation of Mask Again: Masked Knowledge Distillation for Masked Video Modeling (ACM MM 2023).

MMKD Framework

Mask Again: Masked Knowledge Distillation for Masked Video Modeling
Xiaojie Li^1,2, Shaowei He^1, Jianlong Wu^1*, Yue Yu^2, Liqiang Nie^1*, Min Zhang^1
^1Harbin Institute of Technology, Shenzhen, ^2Peng Cheng Laboratory *Corresponding Author

🚀 Main Results

✨ Kinetics-400

Method Extra Data Backbone Resolution #Frames x Clips x Crops Top-1 Top-5
VideoMAE no ViT-S 224x224 16x5x3 78.7 93.6
VideoMAE no ViT-B 224x224 16x5x3 81.0 94.6

✨ UCF101 & HMDB51

Method Extra Data Backbone UCF101 HMDB51
VideoMAE Kinetics-400 ViT-S 92.9 72.0
VideoMAE Kinetics-400 ViT-B 96.2 77.1

🔨 Installation

Please follow the instructions in INSTALL.md.

📍Model Zoo

We provide pre-trained and fine-tuned models in MODEL_ZOO.md.

👀 Visualization

We provide the script for visualization in vis_kd.sh.

✏️ Citation

If you find this project useful for your research, please considering leaving a star⭐️ and citing our paper:

@inproceedings{li2023mask,
  title={Mask Again: Masked Knowledge Distillation for Masked Video Modeling},
  author={Li, Xiaojie and He, Shaowei and Wu, Jianlong and Yu, Yue and Nie, Liqiang and Zhang, Min},
  booktitle={Proceedings of the 31st ACM International Conference on Multimedia},
  pages={2221--2232},
  year={2023}
}

🔒 License

This project is made available under the Apache 2.0 license.

👍 Acknowledgements

This project is built upon VideoMAE. Thanks to the contributors of this great codebase.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.