Comments (1)
Hi @lukas-blecher, thank you for your efforts and for answering our questions in the issues section.
This is an important question, and we would appreciate your guidance.
I want to fine-tune the facebook/nougat-base model on a custom dataset of images and their corresponding text. How can I build a dataset from these pairs so that I can continue training the model on my own data?
I also tried the given method, but the JSON file it creates is missing many keys, and as a result the generated index.jsonl is empty.
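To make the question concrete, here is a minimal sketch of what I am trying to do by hand: write one JSON record per image/text pair, then check each record for missing keys. The key names (`image`, `markdown`, `meta`), the `"[]"` placeholder for `meta`, and the helper names `write_index` and `find_incomplete` are my assumptions for illustration, not the confirmed format the training code expects.

```python
import json

# Assumed key names; please correct me if the training code expects others.
REQUIRED_KEYS = ("image", "markdown", "meta")


def write_index(pairs, out_path):
    """Write one JSON object per line for each (image_path, text) pair."""
    written = 0
    with open(out_path, "w", encoding="utf-8") as f:
        for image_path, text in pairs:
            # "meta" is filled with an empty placeholder (assumption).
            record = {"image": str(image_path), "markdown": text, "meta": "[]"}
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
            written += 1
    return written


def find_incomplete(jsonl_path):
    """Return (line_number, missing_keys) for each record lacking a required key."""
    problems = []
    with open(jsonl_path, encoding="utf-8") as f:
        for number, line in enumerate(f, start=1):
            if not line.strip():
                continue  # skip blank lines
            record = json.loads(line)
            missing = [key for key in REQUIRED_KEYS if key not in record]
            if missing:
                problems.append((number, missing))
    return problems
```

Running `find_incomplete` on the generated file would at least show which records are missing which keys, which might explain why the index ends up empty.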