Taking the spoken Italian model as an example, the process of loading the model into m

Model loading is very memory hungry about someweta HOT 5 CLOSED

ianroberts commented on July 28, 2024

Model loading is very memory hungry

from someweta.

Comments (5)

tsproisl commented on July 28, 2024

I've been unhappy with that for a long time. There are mainly two reasons why I haven't changed the model format, so far:

While it is annoying, it does not seem to be a problem in practice as most people seem to have enough RAM (or do not complain if they haven't).
I don't want to render any existing models useless.

Of course, dealing with 2 is just a matter of making the tagger recognize the format and handle the model file appropriately. It's just that it hasn't been a top priority for me.

from someweta.

ianroberts commented on July 28, 2024

Sure. It has only become an issue for me because I'm working with a project that wants to expose a web service based on your tagger in a platform that uses Kubernetes. I need to apply memory limits to the pod definitions but for this service I have to make the pod request 4GB even though it only needs 1.7GB after the startup phase.

For this particular use case I've developed a workaround where I transform the model into a gzipped pickle format file, which is quite a bit larger than the original gzipped JSON but loads faster and with virtually no additional memory overhead. However it occurred to me today that it's actually possible to implement a more efficient streaming load of the current model format using ijson, I can submit a PR for this if you like?

from someweta.

tsproisl commented on July 28, 2024

Ah, the ijson solution is nice! A PR would be most welcome. The only thing that needs to be taken into account is that this will produce garbage on Python versions <3.7 that have ijson installed. I see two possible solutions: Either always fall back to the standard parser for these older versions or use a collections.OrderedDict instead of a dict if the version is <3.7.

from someweta.

ianroberts commented on July 28, 2024

PR submitted - I've made it use the optimised algorithm on CPython 3.6+ or (any) Python 3.7+, which are the ones where dict iteration order is guaranteed, and fall back to the original algorithm on earlier versions.

from someweta.

tsproisl commented on July 28, 2024

Thank you! I've updated the README and created a new release.

from someweta.

Model loading is very memory hungry about someweta HOT 5 CLOSED

Comments (5)

Related Issues (10)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent