Comments (3)
Hello, you should initialise the model only once and assign it to a variable, so that it isn't reloaded on every iteration:
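import pandas as pd
from detoxify import Detoxify

model = Detoxify("original", device=device)  # load the model once, outside the loop
comments_list = comments["text"].to_list()
batches = []
for i in range(0, len(comments_list), 32):
    comms = comments_list[i : i + 32]
    results = model.predict(comms)  # scores for this batch of comments
    batches.append(pd.DataFrame(results))
# DataFrame.append was removed in pandas 2.0, so concatenate the batches instead
df = pd.concat(batches, ignore_index=True) if batches else pd.DataFrame()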
Now you should be able to use a bigger batch size as well.
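If you want to experiment with the batch size, here is a rough sketch of the same loop with the size pulled out as a parameter. It assumes `predict` returns a dict mapping each label to a list of scores, one per input text, which is how the snippet above treats the result. `predict_in_batches` and `fake_predict` are hypothetical names made up for this example, and 128 is just an arbitrary value to try; `fake_predict` only stands in for `model.predict` so the sketch runs without downloading a model.

```python
from typing import Callable, Dict, List


def predict_in_batches(
    predict: Callable[[List[str]], Dict[str, List[float]]],
    texts: List[str],
    batch_size: int = 128,
) -> Dict[str, List[float]]:
    """Call `predict` on slices of `texts` and merge the per-label score lists."""
    merged: Dict[str, List[float]] = {}
    for start in range(0, len(texts), batch_size):
        batch = texts[start : start + batch_size]
        scores = predict(batch)
        for label, values in scores.items():
            # Keep scores in the same order as the input texts.
            merged.setdefault(label, []).extend(values)
    return merged


def fake_predict(batch: List[str]) -> Dict[str, List[float]]:
    # Hypothetical stand-in for model.predict: one made-up score per text.
    return {"toxicity": [min(len(text) / 100, 1.0) for text in batch]}


texts = [f"comment {n}" for n in range(300)]
scores = predict_in_batches(fake_predict, texts, batch_size=128)
print(len(scores["toxicity"]))
```

With the real model you would pass `model.predict` in place of `fake_predict` and wrap the returned dict in `pd.DataFrame(...)`. How large a batch you can use depends on your GPU memory, so treat the value as something to tune.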
Hope this helps!
from detoxify.
Thanks for the suggestion and for pointing that out. The change worked insanely well.
Previously it took around 7 minutes on average to get all the predictions; now everything is done in about 8 seconds.
Great, glad it helped!