hooshvare / parsgpt Goto Github PK
View Code? Open in Web Editor NEWPersian GPT2
Home Page: https://huggingface.co/HooshvareLab/gpt2-fa
License: Apache License 2.0
Persian GPT2
Home Page: https://huggingface.co/HooshvareLab/gpt2-fa
License: Apache License 2.0
hey im using your model and when i set num_sents
to a number bigger than two , is see this
"Setting pad_token_id
to eos_token_id
:5 for open-end generation."
how and where can I adjust this limitation?
tnx
Is it possible to fine-tune the general Persian GPT-2 model on the question answering task with a custom dataset? I couldn't find any sources describing how to fine-tune a GPT model on a custom dataset. (except using the gpt-2-simple library which doesn't support Farsi or other huggingface models)
Hi,
Thank you for providing this useful resource. I would like to know more about the characteristics of this transformer compared to English ones. For example, how many parameters it has and on home many texts it was trained or even the genre and types of text. If possible compare these statistics with a model in English.
Thank you!
Hi,
Firstly thank you very much for creating such amazing models and explanations of each.
Secondly, I had to report a bug. In the Persian GPT2 - Visualization.ipynb
notebook, after running the second cell (lm = ecco.from_pretrained('HooshvareLab/gpt2-fa')
) we get the following error:
The model 'HooshvareLab/gpt2-fa' is not defined in Ecco's 'model-config.yaml' file and so is not explicitly supported yet.
I'd really appreciate it if you guys could take a look at this issue. The Ecco library can be a lot of help in understanding this model.
Thanks!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.