Giter Site home page Giter Site logo

flutydeer / audio-slicer Goto Github PK

View Code? Open in Web Editor NEW

This project forked from openvpi/audio-slicer

1.2K 1.2K 161.0 716 KB

A simple GUI application that slices audio with silence detection

License: MIT License

Python 97.78% PowerShell 2.22%
audio-processing gui pyside6 qt6

audio-slicer's People

Contributors

flutydeer avatar jobsecond avatar lbrooney avatar sinestriker avatar wangs-offical avatar yqzhishen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

audio-slicer's Issues

can i obtain each segment's timestamps?

Thank you for your great work. However, considering the unique nature of the data I'm working with, the automated output isn't precisely tailored to my needs. I'm interested in learning how to extract timestamps from the processed audio segments so that I can manually refine and merge them as necessary. Could you please provide guidance on how to achieve this? Your help is greatly appreciated!

Unhandled exception in script

I get this error when trying to run slicer-gui.exe:

Unhandled exception in script
Failed to execute script 'slicer-gui' due to unhandled exception:
IMPORTANT: PLEASE READ THIS FOR ADVICE ON HOW TO SOLVE THIS ISSUE!

Importing the numpy C-extensions failed. This error can happen for many reasons, often due to issues with your setup or how NumPy was installed."

I tried uninstalling and re-installing NumPy, and it did not fix the error.

No module named qdarktheme [OSX]

Traceback (most recent call last): File "/Users/alvarom2/Documents/RVC/AudioslicerGUI/audio-slicer/slicer-gui.py", line 4, in <module> import qdarktheme ModuleNotFoundError: No module named 'qdarktheme'

Cloned the repo and did run requirements installation but when launching it show's this error...
how to fix this?

请问能否增加一个批量操作的命令

我有许多目录需要进行处理,担心手动操作会有失误。
我看了下slicer.py 和slicer2.py,好像都是对单个文件进行操作的。
于是我写了个linux命令,
find . -type f -name "*.wav" -exec python slicer.py {} \; -exec rm -f {} \;
这个命令可以处理,并删除旧文件。
但是执行效率非常低,用gui 1秒就能处理200多个文件,在命令行5秒只能处理1个文件。
不知道是因为wsl效率低还是python的问题。

issue when running python slicer-gui.py

i got this after running python slicer-gui.py :
qt.qpa.xcb: could not connect to display
qt.qpa.plugin: Could not load the Qt platform plugin "xcb" in "" even though it was found.
This application failed to start because no Qt platform plugin could be initialized. Reinstalling the application may fix this problem.

Available platform plugins are: minimalegl, offscreen, xcb, wayland, eglfs, wayland-egl, minimal, vnc, linuxfb, vkkhrdisplay.

Aborted (core dumped)

How to fix this?

Cannot run slicer-gui.py

I have installed all requirements, but I cannot run slicer-gui.py:
[qdarktheme.qtpy] [WARNING] Failed to import QtCore, QtGui, QtSvg and QtWidgets. Traceback (most recent call last): File "F:\audio-slicer\audio-slicer-main\slicer-gui.py", line 5, in <module> from PySide6.QtWidgets import QApplication, QStyleFactory ImportError: DLL load failed while importing QtWidgets

[Proposal] Generate mapping file of slices and original audio files

Summary

As a video editor, I want to slice videos based on the audio part. I need to remove silence parts from my live playbacks. If this tool can produce a file that maps sliced audios to time spans of the original audio, I can use the mapping file to slice my videos to remove silence parts automatically with video processing tools (like ffmpeg). I can also use speech to text tools to filter audio slices, then filter spans of my videos based on the mapping file.

File format

The following file maps time spans of 2 original audios to 5 audio slices. The output path of the mapping file needs to be specified from GUI before slicing.

{
  "outputFolder": "C:\\Users\\UserName\\Videos\\PlaybackSlices",
  "tasks": [
    {
      "originalFile": "C:\\Users\\UserName\\Videos\\Playback202405030234.wav",
      "slices": [
        {
          "start": 0,
          "end": 1780,
          "file": "Playback202405030234_0.wav"
        },
        {
          "start": 1780,
          "end": 2460,
          "file": "Playback202405030234_1.wav"
        }
      ]
    },
    {
      "originalFile": "C:\\Users\\UserName\\Videos\\Playback202405040330.wav",
      "slices": [
        {
          "start": 0,
          "end": 2790,
          "file": "Playback202405040330_0.wav"
        },
        {
          "start": 3150,
          "end": 9460,
          "file": "Playback202405040330_1.wav"
        },
        {
          "start": 12800,
          "end": 14690,
          "file": "Playback202405040330_2.wav"
        }
      ]
    }
  ]
}

Note

I'm not sure whether I can implement this feature by myself. Because I'm new to Python. If I managed to implement this feature, I'll open a pull request.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.