Comments (10)
How can I create an LMDB dataset for Chinese characters?
from fudanocr.
The dataset is available at http://www.nlpr.ia.ac.cn/databases/handwriting/Home.html
Hi, Chen, thanks for your reply. I have downloaded the dataset, but I don't understand why the code loops over a string here. The code is:
def get_data_package():
    train_dataset = []
    # 'train_dataset': './data/mydata/train_1000' why loop this path?
    for dataset_root in config['train_dataset'].split(','):
        _, dataset = get_dataloader(dataset_root, shuffle=True)
        train_dataset.append(dataset)
What type of dataset should I use in place of the path './data/mydata/train_1000'?
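The loop is there so that multiple datasets can be concatenated. The config value is a comma-separated list of paths, so a single path simply makes the loop run once. For example, several datasets can be specified like this: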
'train_dataset': './data/mydata/train_1000,./data/mydata/train_1500,./data/mydata/train_2000'
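To illustrate why the loop also works for a single path, here is a minimal sketch in plain Python. The `load_dataset` function is a hypothetical stand-in for the repository's `get_dataloader` (it returns fake samples rather than reading LMDB), and `split_dataset_roots` and `build_train_set` are illustrative names, not functions from the repository.

```python
def split_dataset_roots(spec):
    """Split a comma-separated config value into individual dataset paths."""
    return [root.strip() for root in spec.split(',') if root.strip()]


def load_dataset(root):
    # Hypothetical stand-in for get_dataloader(): returns two fake
    # (path, index) samples instead of reading an LMDB directory.
    return [(root, i) for i in range(2)]


def build_train_set(spec):
    # Load one dataset per path, then concatenate them into one training set.
    datasets = [load_dataset(root) for root in split_dataset_roots(spec)]
    return [sample for dataset in datasets for sample in dataset]


# A single path yields a one-element list, so the loop runs exactly once.
single = build_train_set('./data/mydata/train_1000')
# Two paths yield two datasets that are merged into one.
multi = build_train_set('./data/mydata/train_1000,./data/mydata/train_1500')
print(len(single), len(multi))  # prints: 2 4
```

With real PyTorch datasets, the concatenation step would more likely use `torch.utils.data.ConcatDataset` than a flat list; the list here only keeps the sketch dependency-free.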
As for what should replace the path './data/mydata/train_1000': it should point to a dataset in LMDB format.
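For readers unfamiliar with LMDB, the following is a rough sketch of writing image/label pairs with the third-party `lmdb` Python package. The key layout (`image-%09d`, `label-%09d`, `num-samples`) is an assumption borrowed from the convention common in CRNN-style OCR code; this thread does not confirm which keys the repository's loader expects, so check the loader before relying on it. The function names `build_records` and `write_lmdb` are illustrative.

```python
def build_records(samples):
    """Map (image_bytes, label) pairs to LMDB key/value byte pairs.

    ASSUMPTION: 1-based 'image-%09d' / 'label-%09d' keys plus a
    'num-samples' counter, as used by many CRNN-style OCR loaders.
    """
    records = {}
    for index, (image_bytes, label) in enumerate(samples, start=1):
        records[b'image-%09d' % index] = image_bytes
        records[b'label-%09d' % index] = label.encode('utf-8')
    records[b'num-samples'] = str(len(samples)).encode('utf-8')
    return records


def write_lmdb(records, out_dir, map_size=1 << 30):
    """Write the records into an LMDB environment at out_dir."""
    import lmdb  # third-party package: pip install lmdb

    # map_size is the maximum database size in bytes (1 GiB here).
    env = lmdb.open(out_dir, map_size=map_size)
    with env.begin(write=True) as txn:
        for key, value in records.items():
            txn.put(key, value)
    env.close()


# Placeholder bytes stand in for real encoded image files.
demo = build_records([(b'<png bytes>', '中'), (b'<png bytes>', '文')])
```

Calling `write_lmdb(demo, './data/mydata/train_1000')` would then create the LMDB directory; in practice each `image_bytes` value would be the raw contents of an image file read in binary mode.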
Thank you very much, it has helped me a lot!
Hello, have you successfully converted the data to LMDB format? I would like to know how to do the conversion; I have tried many methods without success.
Hi, please see #57.
Hello, I am a little confused about looping over and concatenating multiple datasets. Why is this done, and how does it differ from training directly on a single dataset? Thank you very much; I would appreciate any help. The example I am referring to is:
'train_dataset': './data/mydata/train_1000,./data/mydata/train_1500,./data/mydata/train_2000'