stonemo / ez-vsl Goto Github PK
View Code? Open in Web Editor NEWOfficial Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
License: Apache License 2.0
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
License: Apache License 2.0
How to use this on our custom videos ? Please guide
Hi, thanks for sharing the codes.
I read your paper enjoyably. It was very well-written. I have one question and appreciate your reply.
In the "Experimental Setup" section of the paper, you mentioned that a single image frame is extracted (with the 20s of audio) from each video clip. Did you extract the frame randomly? or you chose the middle frame?
Thanks :)
In ur setting, I wonder that how do u use the Flickr dataset, train from raw data or soundnet feature ?
File "/data2/scripts/EZ-VSL/audio_io.py", line 66, in load_audio_av
audio = np.concatenate([af[1] for af in chunks], 1)
File "<__array_function__ internals>", line 180, in concatenate
ValueError: need at least one array to concatenate
I realized the error comes from audio_io.py
at line 59
.
chunks.append((chunk_start_time, resampler.resample(frame).to_ndarray()))
should be changed to
chunks.append((chunk_start_time, resampler.resample(frame)[0].to_ndarray()))
Hi, I have tried many ways to download the two datasets you mentioned in the article but they all failed. Could you kindly provide a way to download both datasets?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.