Giter Site home page Giter Site logo

audiocraft-webui's Introduction

Discord

audiocraft-webui v2.0!

Local webui for Facebook's Audiocraft model: https://github.com/facebookresearch/audiocraft

Features:

  • Long Audio: Make audio as long as you like.
  • Processing Queue: Add as many different prompts to the processing queue as you like, go have a cup of coffee, come back to sweet sweet audio.
  • Generation history: Playback what you generated and see the settings used to get the gen.

Install:

If you'd like gpu acceleration and do not have torch installed, visit https://pytorch.org/get-started/locally/ for instructions on installing gpu torch correctly.

pip install -r requirements.txt (If you encounter errors with audiocraft installing, please refer to their docs here: https://github.com/facebookresearch/audiocraft)

Run:

python webui.py

There's no need to download any external models, pick a model in the dropdown and when you hit run for the first time it will be automatically downloaded via audiocraft. If you want to use the melody mode, select the Melody model and a selector for your melody audio file will appear.

Notes:

Files are saved to the statc/audio/ directory.

The currently active model stays loaded in memory by default, if you want it to be unloaded after each generation, launch with python webui.py --unload-after-gen

The UI is in desperate need of an actual UI design if anyone wants to take on the task.

Parameters:

  • Top-K: Higher top-k increases the amount of possible instrumentation/continuations will happen. Higher top-k combines with higher temperature to make more interesting music but could make things really odd. I generally tune this in combination with temperature.
  • Top-P: Recommended around 0.7, this means we are sampling from the top 70% of the possible continuations. Higher top-p will be less interesting and creative, and might also introduce static and noise. Recommendation is to leave this around 0.7 and not think about it much.
  • Duration: Length of generated music.
  • CFG/Classifier Free Guidance: The higher this is, the more strongly it will match what you prompted. Generally recommend this between 3-5.
  • Temperature: How much randomness to introduce, recommended around 1.05-1.5, if you want chaotic music this should be higher, more regular/repition like club beats this should be lower.

Changelog:

Feb-25-2024:

  • Rewrote everything.
  • Added a history of generation parameters for generated music
  • Removed a bunch of garbage dependencies
  • Removed overlap and segments as they were antiquated parameters.

audiocraft-webui's People

Contributors

coffeevampir3 avatar synopsis99 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.