Giter Site home page Giter Site logo

abysz_lab's Introduction

Abysz AI temporal coherence lab. Gradio GUI.

ABYSZLAB

This is a project under construction. Currently, only a basic use of DFI (Differential Frame Interpolation) has been added.

DFI video2video postprocessing.

Basic guide:

This tool analyzes the stability of the original video, and processes the generated video with that information. Example, if your original background is static, it will force the generated video to respect that, acting as a deflicker. It is an aggressive process, for which we need and will have a lot of control.

Gui version 0.0.2 includes the following parameters.

Frame refresh frequency: Every how many frames the interpolation is reduced. It allows to keep more information of the generated video, and avoid major ghosting.

Refresh Strength: Opacity % of the interpolated information. 0 refreshes the entire frame, with no changes. Here you control how much change you allow overall.

DFI Strength: Amount of information that tries to force. 4-6 recommended.

DFI Deghost: A variable that generally reduces the areas affected by DFI. This can reduce ghosting without changing DFI strength.

Smooth: Smoothes the interpolation. High values reduce the effectiveness of the process.

USE STRATEGIES:

Basic:

The simplest use is to find the balance between deflicking and deghosting. However, this is not efficient.

Multipass:

The most efficient way to use this tool is to allow a certain amount of corruption and ghosting, in exchange for more stable video. Once we have that base, we must use a second step in Stable Diffusion, at low denoising (1-4). In most cases, this brings back much of the detail, but retains the stability we've gained.

Multibatch-controlnet:

The best, best way to use this tool is to use our "stabilized" video in img2img, and the original video in controlnet HED. Then use a parallel batch to retrieve details. This considerably improves the multipass technique. Unfortunately, that function is not available in the controlnet gui as of this writing.

TODO

Automatic1111 extension. Given my limited knowledge of programming, I had trouble getting my script to interact within A1111. I hope soon to solve details to integrate this tool. Also, there are many important utilities that are in development, waiting to be added soon, such as polar rendering (front/back), luminosity deficker, blend deflicker, gif viewer, source analysis, preprocessing, etc.

Requirements

OpenCV: pip install opencv-python

Imagemagick library: https://imagemagick.org/script/download.php

abysz_lab's People

Contributors

abyszone avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.