Hey I was looking to set up something where I am loading models from

Just download a model from HuggingFace? about bumblebee HOT 5 CLOSED

lawik commented on June 10, 2024

Just download a model from HuggingFace?

from bumblebee.

Comments (5)

josevalim commented on June 10, 2024

The load model should be fast compared to the download, as it does not compile anything, and it will help validate you have downloaded the right artifact. And I believe safe tensors, which we want to make the default, load parameters lazily, so it should be even less work.

from bumblebee.

lawik commented on June 10, 2024

Maybe the right idea but this makes it kind of rough if you just want to load/upload the model on machine that isn't set up for inference. Or at least with the times I'm seeing.

Example: https://huggingface.co/google-bert/bert-base-cased

Download, clocked by counting out loud while the progress bars were going:

~14 seconds

Total load_model execution time:

76 seconds

Tested with:

:timer.tc(fn -> Bumblebee.load_model({:hf, "google-bert/bert-base-cased"}) end) |> elem(0) |> then(& &1 / 1000) |> IO.inspect(label: "ms")

No configuration done at all.

from bumblebee.

lawik commented on June 10, 2024

Ideally I'd love to stream the download from hugging face to an S3-compatible but that is further out of scope from what Bumblebee is about.

from bumblebee.

jonatanklosko commented on June 10, 2024

You can take files from HF repository and put in S3 or wherever, then when you download onto the local machine use {:file, path_to_repo_dir} (just make sure you don't copy parameter files in multiple formats, as that would be unnecessary).

In the future we may have our own serialisation format for things, but I don't think we should be exposing the download of hf/transformers files.

76 seconds

You'd need to use EXLA.Backend, because there are some transformations that are going to be slow otherwise.

from bumblebee.

lawik commented on June 10, 2024

That was a lot faster. I can make do.

from bumblebee.

Recommend Projects

Just download a model from HuggingFace? about bumblebee HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent