Giter Site home page Giter Site logo

Comments (4)

amyeroberts avatar amyeroberts commented on July 18, 2024

Hi @gmanlan, thanks for opening an issue!

Regarding the docs, the model was deprecated in between the release of v4.41.0 and the new release today v4.42.0. The docs should show the latest release, v4.42.0 but seems to be still showing for the previous one. If you go onto the docs for the development branch, you will see a message highlighting that the model has been deprecated, and the links to source code point to the new, deprecated path. Thanks for highlighting that the docs are doing this! We'll look into trying to fix this as soon as possible as incorrect docs are both frustrating and confusing.

In terms of why models are deprecated, this is a maintenance decision. The transformers library is huge and currently has over 200 models. Every one of those models has an active maintenance cost in terms of things like compute for running tests, fixing bugs, updates to align with the rest of the library, reviewing new PRs etc. The reality is the transformers team is pretty small and in order for us to have time to develop new features, add new models, and maintain the library it's necessary to prioritise our time and resources. One way we reduce our maintenance burden is through periodic deprecations of lower use models.

We deprecate models which are at least a year old and have had a low number of checkpoint downloads from the hub over the past 30 days. This serves a proxy for lower community usage and hence reduced effort to impact ratio for on-going maintenance. This isn't a judgement on whether the model is useful, which is why we still keep it in the library: importable and runnable.

from transformers.

gmanlan avatar gmanlan commented on July 18, 2024

Thanks @amyeroberts for the clarification. Glad to be of help WRT documentation, and I now have a better understanding on the burden HF has WRT maintenance here. I'm glad to hear we are not claiming these models are useless.

One general thought is that since HF has become a central hub for most AI practitioners, it would be great to keep model (model architecture specifically) diversity in mind as we move forward. Most people will simply rank the models by downloads/popularity and run with that, which creates a less competitive environment (if 99% of people are using the same Swin Transformer or Llama model...) and gives new/lesser known options/models a hard time proving they could be better given the opportunity.

Thanks again! Keep up the good work.

from transformers.

amyeroberts avatar amyeroberts commented on July 18, 2024

@gmanlan The docs should be up-to-date now!

Yes, having many different models available is both good and something we want. There's a wider question about how we can make lesser-known models more discoverable, but this is quite a difficult question and will probably require many different approaches.

The good news is that it's easier than ever for people to add their own models directly onto the hub e.g. like florence2. This removes the entry barrier of the PR process into the transformers library and relieves maintenance responsibilities for the transformers team meaning more models can be added quickly. Hopefully this helps in increasing diversity of models available for everyone in the community.

from transformers.

gmanlan avatar gmanlan commented on July 18, 2024

That's wonderful - very excited for the future of huggingface, thank you @amyeroberts!

from transformers.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.