cdjkim / audiocaps Goto Github PK

View Code? Open in Web Editor NEW

127.0 127.0 16.0 40.55 MB

🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps

Home Page: https://audiocaps.github.io/

License: MIT License

Python 96.68% Shell 3.32%

audiocaps's People

Contributors

Stargazers

Watchers

Forkers

ishine garden1984 rohan1561 ankitshah009 janberg1 xinhaomei suzhiba ppunia74 keunwoochoi twashuettl konstantin-hadzh mrsndmn kingfener rescenic petersunlab hackayan

audiocaps's Issues

How to Download AudioCaps

Hi,

Thanks so much for this great and impressive resource!
I am relatively new to the field of audio captioning, so apologies if my question is basic :)
I was wondering if you have a piece of code to download the relevant files?
Or do I need to download the entire AudioSet data? If so, can you please point me to a code that does that reliably?

Thanks in advance,
Felix

About audio class on Audiocaps

Thanks a lot for your great contribution!
May I ask that could you release the audio classes (or sound events) responding to each sample on Audiocaps?

Inconsistent number of files in AudioCaps dataset

Hi all,

Thank you for creating and sharing the AudioCaps dataset. I found it to be very useful.

However, I noticed that the number of files in each set (training, validation, and test) is very different from the numbers presented in the official repository. Here are the number of files I obtained:

Set	Number of files
Training	45458
Validation	2245
Test	4440

However, the original values in the repository are:

Set	Number of files
Training	49,838
Validation	495
Test	975
Total	51,308

I also noticed that the csv files contain more rows than what is proposed as the validation and test set, and are more similar to the number of files I obtained.

I am wondering if there is something I am missing or if there is an issue with the original values provided in the repository. Please let me know if there is any clarification needed or if there are any updates to the dataset.

I also created a python package to download the dataset very easily: https://github.com/MorenoLaQuatra/audiocaps-download

Thank you for your time and for providing this valuable resource.

Best regards,

Moreno La Quatra

Best,
Yapeng

Pretrained model download error

hello, pre-trained model does not download (google driver link is error)
so, Could I download pre-trained model audiocaps ?
thank you

Video Captions

Really enjoyed reading your work. Are the video captioning sentences available by any chance?

cdjkim / audiocaps Goto Github PK

audiocaps's People

Contributors

Stargazers

Watchers

Forkers

audiocaps's Issues

How to Download AudioCaps

About audio class on Audiocaps

Inconsistent number of files in AudioCaps dataset

Is the duration of each audio clip in AudioCaps 10 seconds?

dataset

Request final 115K video ids with word labels

Pretrained model download error

Video Captions

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent