Comments (11)
Yes!! tesseract causes R 4.2.0 to crash. Works okay with 4.1.x. Tested on multiple machines.
Does package need new binaries on CRAN for new 4.2.0?
from tesseract.
I am experiencing this problem also.
R version: 4.2.0 (2022-04-22 ucrt)
Running: Windows 11 Home 21H2
from tesseract.
Same problem here
tesseract 5.0.0
R version 4.2.0 (2022-04-22 ucrt)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running: Windows 11 Pro 21H2
from tesseract.
Hmm I cannot reproduce this. Seems to load fine on my Windows...
from tesseract.
Update: On both work and home machine, library(tesseract) take about 5 - 6 minutes to load. So R didn't crash. Just it takes an unreasonably long time to load. Can this be fixed? Home machine:
R version 4.2.0 (2022-04-22 ucrt)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 22000)
from tesseract.
I tried building a new version of tesseract. Can you please test if the problem still appears with tesseract 5.1.0? To install the new test version, run this in a clean empty r session
install.packages("tesseract", repos = 'https://ropensci.r-universe.dev')
from tesseract.
The package is still taking a long time to load, but maybe a little bit faster! On home machine, I deleted the tesseract directory from my windows library folder and installed from ropensci repo. I will test on work machine and report.
from tesseract.
The newly compiled package might even be slower. This morning, I timed with stopwatch (smartphone) and got approximately five and half minutes. Now...
beginTime = Sys.time()
library(tesseract)
endTime = Sys.time()
endTime - beginTime
Time difference of 6.699568 mins
sessionInfo()
R version 4.2.0 (2022-04-22 ucrt)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 19042)
from tesseract.
still getting a fatal error and aborted session with the new package,
same R version, Platform and Windows as before
from tesseract.
This has been fixed now in the upstream libtesseract code: tesseract-ocr/tesseract#3830
from tesseract.
I have deployed a fixed version to r-universe, and will submit to CRAN soon:
install.packages("tesseract", repos = 'https://ropensci.r-universe.dev')
from tesseract.
Related Issues (20)
- Feature Request: Get all characters with confidence >x HOT 1
- Multi langague text HOT 2
- Failed loading language 'osd' with tessedit_pageseg_mode = 1 HOT 1
- Undo locale workaround for Engine 4.1 +
- Failed to extract text~~ HOT 2
- Unable to install tesseract (R package) on docker HOT 1
- Not installable for R 3.6 HOT 1
- The text is not recognized from png HOT 3
- Tesseract package installation issue from R studio in CentOS 7 server HOT 4
- train own tesseract model HOT 4
- Add ocr'ed text back to image and generate a PDF HOT 1
- Tesseract_download() Error
- tesseract very slow in R HOT 1
- Can't compile on Linux CentOS 7 HOT 5
- Mass Converting PDF Files into Text HOT 1
- build with tesseract 5.0.0 failed HOT 4
- Installation error: fatal error: /usr/local/include/leptonica/allheaders.h HOT 2
- PDF To OCR To CSV In R
- Configure fails to get leptonica include dir HOT 8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tesseract.