Giter Site home page Giter Site logo

egorlakomkin / ktspeechcrawler Goto Github PK

View Code? Open in Web Editor NEW
151.0 151.0 38.0 782 KB

Automatically constructing corpus for automatic speech recognition from YouTube videos

Home Page: https://arxiv.org/abs/1903.00216

License: MIT License

Python 75.87% HTML 20.92% Shell 3.21% Dockerfile 0.01%
asr crawler speech-recognition youtube

ktspeechcrawler's People

Contributors

egorlakomkin avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

ktspeechcrawler's Issues

why GOOGLE_TEST default OK? Why didn't added GoogleRandomSubsetWERFilter class in process.py?

why GOOGLE_TEST default OK?

Why didn't added GoogleRandomSubsetWERFilter class in process.py pipeline?

first up all thanks for Given this Project as Open-Source. Awesome work. thank you so much KTSpeechCrawler team.:)

i was tried KTSpeechCrawler project to collecting youtube audio datasets for ASR Speech-to-text task.

i was collected and finished entire steps. after that i was tested transcipt with corresponding audio files (.wav, .txt).

here i getting 11/100 audios are mistakes.

if we will apply google_speech_test , and validate to remove less than the threshold means (threshold=0.85) we can get good proper audiofiles and transcipt.

can you please tell where i need to start and add this module to do google_speech_test?

Here any complexity will come, for using google_speech_test?

that pipeline module,

pipeline = Pipeline([
    OverlappingSubtitlesRemover(),
    SubtitleCaptionTextFilter(),
    CaptionNormalizer(),
    CaptionRegexMatcher(good_chars_regexp),
    CaptionLengthFilter(min_length=5),
    CaptionLeaveOnlyAlphaNumCharacters(),
    SubtitleMerger(max_len_merged_sec=10),
    CaptionDurationFilter(min_length=1, max_length=20.0)
])

here which place i need add that module? last is enough?

Thank you sir :)

Failed to download

Tried to run the script both on local machine and google colab, but failed. Please see the error message as below.

截屏2021-03-04 下午4 29 34

Question: Where exactly is Gentle being used

First of all great work. Kudos!
In your paper you have written than Gentle is being used for forced alignment in the post processing stage. But in the code, I wasn't able to find out where it was being used. Can you please help?

0 videos downloaded

I tried running this to collect sample data. But for every keyword i am getting same issue.

[youtube:search_url] in: Downloading webpage
[download] Downloading playlist: in
[youtube:search_url] playlist in: Downloading 0 videos
[download] Finished downloading playlist: in
[youtube:search_url] is: Downloading webpage
[download] Downloading playlist: is
[youtube:search_url] playlist is: Downloading 0 videos

whats wrong I might have done ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.