Giter Site home page Giter Site logo

audiosetzsl's Introduction

AudioSetZSL

This repsoitory conatins the audio-visual dataset proposed for the task of multi-modal zeroshot learning.

The dataset is curated from a large dataset, AudioSet. While the original dataset was multilabel, the example videos were selected such that every video in AudioSetZSL has only one label, ie. it is a multiclass dataset. For more details on creation of the dataset, refer to our paper.

Here, we provide the Youtube IDs for each class in the folder youtube-id. The dataset is divided into 2-parts for a broader use for both the task of classification and zero-shot learning.

The examples for each class has been divided into three subsets namely, train, test and val.

Similary, for the task of ZSL the classes in the dataset is divided into seen and unseen.

We also provide the pre-trained features for both audio and video. The features are so obtained that it can be used for the task of ZSL as there is no unseen class overlap with the pre-training of the network (refer to our paper for the detailed process of the dataset split). To download the pretrained feature follow the link : Download

Contact

Kindly contact [email protected] for any issues, comments etc.

Disclaimer

  1. The dataset collection was done at IIT Kanpur.
  2. The dataset is intended to be used for academic research only.
  3. The links are YouTube links and the user is responsible for compliance with YouTube's terms and conditions.
  4. The videos are the property of the respective YouTube uploader. If any video belongs to you and you would like to have it removed kindly let us know and we will remove it from the dataset.

audiosetzsl's People

Contributors

krantiparida avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

xiaokangpeng

audiosetzsl's Issues

CJME code

Hello, when can the CJME code be published? Through reading your article, I am very interested in the GZSL task, and I am very eager to use this code for academic research. Thank you very much!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.