Bindings to 'Tesseract': a powerful optical character recognition (OCR) engine that supports over 100 languages. The engine is highly configurable in order to tune the detection algorithms and obtain the best possible results.
Version: | 5.2.1 |
Imports: | Rcpp (≥ 0.12.12), pdftools (≥ 1.5), curl, rappdirs, digest |
LinkingTo: | Rcpp |
Suggests: | magick (≥ 1.7), spelling, knitr, tibble, rmarkdown |
Published: | 2023-11-20 |
DOI: | 10.32614/CRAN.package.tesseract |
Author: | Jeroen Ooms [aut,
cre] tesseract author details |
Maintainer: | Jeroen Ooms <jeroen at berkeley.edu> |
BugReports: | https://github.com/ropensci/tesseract/issues |
License: | Apache License 2.0 |
URL: | https://docs.ropensci.org/tesseract/ (website) https://github.com/ropensci/tesseract (devel) |
NeedsCompilation: | yes |
SystemRequirements: | Tesseract >= 3.03 (libtesseract-dev / tesseract-devel) and Leptonica (libleptonica-dev / leptonica-devel). On Debian you need to install the English training data separately (tesseract-ocr-eng) |
Language: | en-US |
Materials: | NEWS |
In views: | NaturalLanguageProcessing |
CRAN checks: | tesseract results |
Reference manual: | tesseract.pdf |
Vignettes: |
Using the Tesseract OCR engine in R |
Package source: | tesseract_5.2.1.tar.gz |
Windows binaries: | r-devel: tesseract_5.2.1.zip, r-release: tesseract_5.2.1.zip, r-oldrel: tesseract_5.2.1.zip |
macOS binaries: | r-release (arm64): tesseract_5.2.1.tgz, r-oldrel (arm64): tesseract_5.2.1.tgz, r-release (x86_64): tesseract_5.2.1.tgz, r-oldrel (x86_64): tesseract_5.2.1.tgz |
Old sources: | tesseract archive |
Reverse suggests: | camtrapR, imagerExtra, inlpubs, magick, pdftools, poldis |
Please use the canonical form https://CRAN.R-project.org/package=tesseract to link to this page.