Comments (8)
Hi. The major difference lies in the character classes. (1) Prepare a character list for the new language; its length tells you the total number of character classes. In the config file, set MODEL.TRANSFORMER.VOC_SIZE to the number of character classes. (2) Prepare your data. In the json file, 'rec' is the list of character indices converted from the text transcript. (3) Remember to change the character list in the evaluation code and the visualization code as well.
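To make step (2) concrete, here is a minimal sketch of turning a transcript into a 'rec' index list. It is not the repo's own conversion script: the helper names `build_vocab` and `encode_text`, the fixed length `max_len`, and the choice of placing the "unknown" index right after the last listed character and the padding index right after that are all assumptions for illustration. Compare them against an existing annotation file before relying on them.

```python
def build_vocab(chars):
    """Map each character to its position in the character list."""
    return {ch: i for i, ch in enumerate(chars)}


def encode_text(text, char_to_idx, max_len=25):
    """Convert a transcript into a fixed-length list of character indices.

    Assumptions (hypothetical, verify against your own annotations):
      * characters missing from the list map to an "unknown" index
        placed right after the last listed character;
      * the list is padded to max_len with the index after "unknown".
    """
    num_chars = len(char_to_idx)
    unknown_idx = num_chars
    pad_idx = num_chars + 1
    rec = [char_to_idx.get(ch, unknown_idx) for ch in text[:max_len]]
    rec += [pad_idx] * (max_len - len(rec))
    return rec


# Example: a 36-character English list (lowercase letters and digits).
chars = list("abcdefghijklmnopqrstuvwxyz0123456789")
char_to_idx = build_vocab(chars)
voc_size = len(chars) + 1  # 36 listed characters plus the "unknown" class
rec = encode_text("abc1", char_to_idx, max_len=8)
```

For a new language, only `chars` changes; the value you put in MODEL.TRANSFORMER.VOC_SIZE should follow from the same list.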
Note: In the CTC decoding part of the evaluation and visualization code, the check "if c < self.voc_size - 1" is used because voc_size additionally counts an "unknown" class, which does not appear in the character list and can be ignored during inference. Otherwise, for example, if a new English setup has 36 character classes rather than 37 (i.e., the "unknown" class is not included), "if c < self.voc_size" is the correct check. Remember to verify this for a new dataset.
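The two cases in the note can be sketched as a small greedy CTC decoder. This is an illustration only, not the repo's evaluation code: the function name `ctc_decode` and the way it infers whether voc_size counts the "unknown" class are assumptions, and any index at or above the threshold is treated as non-character (blank, padding, or unknown).

```python
def ctc_decode(indices, chars, voc_size):
    """Greedy CTC decoding: collapse repeats, skip non-character indices.

    chars     -- the character list (without the "unknown" class)
    voc_size  -- either len(chars) + 1 (counts "unknown") or len(chars)
    """
    if voc_size == len(chars) + 1:
        limit = voc_size - 1  # "unknown" is counted, so exclude it
    elif voc_size == len(chars):
        limit = voc_size      # "unknown" is not counted
    else:
        raise ValueError("voc_size does not match the character list")

    out = []
    last = None
    for c in indices:
        c = int(c)
        if c < limit:
            if c != last:  # collapse consecutive repeats
                out.append(chars[c])
                last = c
        else:
            last = None    # a non-character index resets repeat tracking
    return "".join(out)


chars = list("abc")
with_unknown = ctc_decode([0, 0, 4, 0, 1, 3, 1, 2], chars, voc_size=4)
without_unknown = ctc_decode([0, 0, 3, 0, 1, 1, 2], chars, voc_size=3)
```

Either way the threshold ends up equal to the length of the character list, so a quick sanity check for a new dataset is that the threshold in the decoding code matches the number of entries in your list.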
from deepsolo.
@ymy-k Thanks a lot. Now I know I can refer to the ABCNet model's "chn_cls_list.txt", but I'm not sure whether I can use the pretrained ViTAEv2-S model. It doesn't seem suitable because the voc_size doesn't match.
It's pretrained on English data, so it's not a good choice here. The voc_size doesn't match, and the linear layer for character classification is not usable.
@ymy-k So kind of you. BTW, when will the Chinese model be available?
Maybe this week. I will update the Chinese model first.
Thank you, and I hope it is released soon.
Hi, the code and models for ReCTS have been updated.
Thank you so much!!