Giter Site home page Giter Site logo

Visual Token of HowTo100M about vimpac HOT 3 OPEN

zhengsipeng avatar zhengsipeng commented on August 17, 2024
Visual Token of HowTo100M

from vimpac.

Comments (3)

zhengsipeng avatar zhengsipeng commented on August 17, 2024 1

We pre-extracted the tokens and used them during pre-training. The pre-extraction script is provided here: video2token.

I do not have the exact number of disk space for now. It should take 100~200G for saving all the tokens since the original video is largely compressed.

We pre-extracted the tokens and used them during pre-training. The pre-extraction script is provided here: video2token.

I do not have the exact number of disk space for now. It should take 100~200G for saving all the tokens since the original video is largely compressed.

Hi, Can you privode data processing code for HowTo100M Pretraining? It seems a bit different from datasets?

from vimpac.

airsplay avatar airsplay commented on August 17, 2024

We pre-extracted the tokens and used them during pre-training. The pre-extraction script is provided here: video2token.

I do not have the exact number of disk space for now. It should take 100~200G for saving all the tokens since the original video is largely compressed.

from vimpac.

dongzhiwu avatar dongzhiwu commented on August 17, 2024

Hi, is there code for HowTo100M video process?
Because it seems that the video2token only provide the process code for downstream dataset

from vimpac.

Related Issues (2)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.