
Component: OCR

Recognizes text in images.

See also WindowsOCR functions for Windows and Vision functions for macOS.

| Version | macOS | Windows | Linux | Server | iOS SDK |
|---|---|---|---|---|---|
| 2.9 / 11.3 | ✅ Yes | ✅ Yes | ✅ Yes | ✅ Yes | ✅ Yes |

| Function | Description | Platform | Version |
|---|---|---|---|
| OCR.Cleanup | Shuts down the engine and frees all memory. | All | 2.9 |
| OCR.Clear | Frees recognition results and any stored image data, without freeing recognition data that would be time-consuming to reload. | All | 2.9 |
| OCR.GetBoxText | Returns the recognized text in the same format as a box file used in training. | All | 2.9 |
| OCR.GetHOCRText | Builds an HTML-formatted string with hOCR markup from the internal data structures. | All | 2.9 |
| OCR.GetPageSegMode | Queries the page segmentation mode. | All | 2.9 |
| OCR.GetText | Returns the recognized text. | All | 2.9 |
| OCR.GetTextWithCoordinates | Queries the text with coordinates. | All | 6.5 |
| OCR.GetVariable | Queries a variable. | All | 7.5 |
| OCR.Initialize | Initializes Tesseract. | All | 2.9 |
| OCR.IsInitialized | Checks whether the OCR library has been initialized. | All | 4.2 |
| OCR.IsLoaded | Checks whether the newer Tesseract is loaded. | All | 11.3 |
| OCR.Language | Returns the language used in the last valid initialization. | All | 2.9 |
| OCR.Load | Loads the newer Tesseract engine. | All | 11.3 |
| OCR.MeanTextConf | Returns the average confidence value, between 0 and 100. | All | 2.9 |
| OCR.Recognize | Recognizes the image. | All | 2.9 |
| OCR.SetImage | Provides an image for Tesseract to recognize. | All | 2.9 |
| OCR.SetImageContainer | Loads a picture from a container. | All | 11.3 |
| OCR.SetImageFile | Loads a picture directly from a file. | All | 11.3 |
| OCR.SetPageSegMode | Sets the page segmentation mode. | All | 2.9 |
| OCR.SetRectangle | Restricts recognition to a sub-rectangle of the image. | All | 2.9 |
| OCR.SetResolution | Sets the resolution of the source image in pixels per inch. | All | 5.1 |
| OCR.SetVariable | Sets a variable. | All | 7.5 |
| OCR.Version | Queries the version of the Tesseract library. | All | 11.3 |
| OCR.WriteToPDF | Writes text on the PDF. | All | 3.1 |

24 functions shown.
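
OCR.GetBoxText is described above as returning text in the box file format used in training. The following Python sketch shows one way such text could be post-processed outside the plugin. It assumes the usual Tesseract box layout of one symbol per line followed by left, bottom, right, top and page number, with coordinates measured from the bottom-left corner of the image; the names `BoxEntry` and `parse_box_text` are hypothetical and not part of the plugin.

```python
from typing import List, NamedTuple


class BoxEntry(NamedTuple):
    symbol: str
    left: int
    top: int      # distance from the top edge of the image
    right: int
    bottom: int   # distance from the top edge of the image
    page: int


def parse_box_text(box_text: str, image_height: int) -> List[BoxEntry]:
    """Parse box-file text and flip y values to a top-left origin.

    Assumes each line reads "<symbol> <left> <bottom> <right> <top> <page>"
    with y measured upward from the bottom edge of the image.
    """
    entries = []
    for line in box_text.splitlines():
        # Split from the right so the symbol stays intact even if it
        # consists of several characters.
        parts = line.rsplit(" ", 5)
        if len(parts) != 6:
            continue  # skip blank or malformed lines
        symbol = parts[0]
        try:
            left, bottom, right, top, page = (int(p) for p in parts[1:])
        except ValueError:
            continue  # skip lines whose coordinates are not integers
        entries.append(
            BoxEntry(
                symbol=symbol,
                left=left,
                top=image_height - top,
                right=right,
                bottom=image_height - bottom,
                page=page,
            )
        )
    return entries
```

Flipping the y values is only needed when the consumer expects a top-left origin, as most image libraries do; check the actual output of your plugin version before relying on this layout.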

4 functions require a paid license (17%) and 20 functions are free to use.
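
The hOCR markup produced by OCR.GetHOCRText can be processed with any HTML parser. The Python sketch below uses only the standard library to collect words with their bounding boxes and confidence values. It assumes the markup follows the common hOCR convention of `ocrx_word` spans whose `title` attribute carries `bbox` and `x_wconf` properties; the sample markup in the usage note and the names `HocrWordParser`, `parse_title` and `extract_words` are illustrative, not part of the plugin.

```python
from html.parser import HTMLParser


def parse_title(title):
    """Extract bbox and confidence from an hOCR title attribute."""
    props = {}
    for prop in title.split(";"):
        fields = prop.split()
        if not fields:
            continue
        if fields[0] == "bbox" and len(fields) == 5:
            # bbox is given as left, top, right, bottom in pixels
            props["bbox"] = tuple(int(v) for v in fields[1:])
        elif fields[0] == "x_wconf" and len(fields) == 2:
            props["confidence"] = float(fields[1])
    return props


class HocrWordParser(HTMLParser):
    """Collects the text of every span with class ocrx_word."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._current = None  # word currently being read, if any

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        classes = (attr.get("class") or "").split()
        if tag == "span" and "ocrx_word" in classes:
            self._current = {"text": ""}
            self._current.update(parse_title(attr.get("title") or ""))

    def handle_data(self, data):
        if self._current is not None:
            self._current["text"] += data

    def handle_endtag(self, tag):
        # Assumes word spans do not contain nested span elements.
        if tag == "span" and self._current is not None:
            self._current["text"] = self._current["text"].strip()
            self.words.append(self._current)
            self._current = None


def extract_words(hocr):
    parser = HocrWordParser()
    parser.feed(hocr)
    parser.close()
    return parser.words
```

A call such as `extract_words("<span class='ocrx_word' title='bbox 10 10 60 40; x_wconf 96'>Hello</span>")` returns a list with one dictionary holding the keys `text`, `bbox` and `confidence`. Words with low confidence can then be filtered or flagged for review.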

Release notes

  • Version 12.3
    • Updated Tesseract support so that the OCR functions work with version 5.1.
  • Version 11.3
  • Version 11.2
    • Changed the OCR functions to better separate different threads on the server.
  • Version 7.5

