This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Nice project !!
From my test, rVADfast performs good segmentation of a medium sized wav file ( > 10/15 seconds )
However, If we cut the input audio (ex: to 5 sec. ), the results are not the same, why is a posteriori analysis done?
It would be nice to apply this library in realtime systems, where server analyzes audio stream, that is short packets of bit-audio in sequence.