Giter Site home page Giter Site logo

cog-wd-tagger's Introduction

WD Image Tagger ๐Ÿท๏ธ๐Ÿ–ผ๏ธ

Replicate - WD Image Tagger

The WD Image Tagger is a powerful AI model that automatically analyzes and tags your images with descriptive labels. It's trained on a large dataset of anime-style images and can recognize a wide range of content, including general attributes, characters, and age ratings.

This tool was developed using resources and models available on SmilingWolf's wd-tagger Hugging Face Space, ensuring state-of-the-art performance and ease of use.

Whether you're managing a large image library, looking to generate accurate prompts for an AI art model, or want to quickly filter out potentially sensitive content, the WD Image Tagger can help streamline your workflow.

Features

  • ๐ŸŒŸ Pre-trained on a diverse dataset of anime images
  • ๐Ÿท๏ธ Tags images with general attributes, characters, and content ratings
  • ๐Ÿ” Supports multiple state-of-the-art model architectures like SwinV2, ConvNext, and ViT
  • โš™๏ธ Adjustable tag probability thresholds for fine-grained control over results
  • ๐Ÿงฎ Optional MCut algorithm for automatic threshold optimization
  • ๐Ÿ—‚๏ธ Filter tags by category to focus on what's most relevant to you
  • ๐Ÿ”Œ Easy integration into existing applications via a simple API

Getting Started

To start tagging your images with the WD Image Tagger:

  1. Upload your image
  2. Select the pre-trained model you'd like to use
  3. Adjust the tag probability thresholds and category filters as needed
  4. Let the model analyze your image and output the relevant tags

The model will return a list of tags, each with a confidence score and category label (general, character, or rating).

Pre-trained Models

The WD Image Tagger comes with several pre-trained model options, each with its own strengths:

  • SwinV2: A powerful and accurate model architecture well-suited for most use cases
  • ConvNext: An efficient model that offers a good balance of speed and accuracy
  • ViT (Vision Transformer): A transformer-based model that excels at capturing global context

Models are provided in both the latest Dataset v3 series and the earlier v2 series. The v3 models were trained on a larger and more diverse dataset, while the v2 models offer compatibility with older workflows.

Acknowledgments

The WD Image Tagger was trained using the SW-CV-ModelZoo toolkit, with TPUs generously provided by the TRC program. Special thanks to the researchers and engineers who made this powerful tool possible!

Learn More

For more technical details on the available models and their expected performance, check out the WD Image Tagger GitHub repository.


We hope the WD Image Tagger helps make your image analysis workflows faster and more effective. If you have any questions or feedback, don't hesitate to reach out!

cog-wd-tagger's People

Contributors

zsxkib avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.