Sectors: Finance • Information Technoloy • Government • Healthcare • Industries • Education • (show all)
To recognize text only:
s = ocrEngine.recognize("test.png", -1, -1, -1, -1, -1,
OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
To recognize barcode only:
s = ocrEngine.recognize("test.png", -1, -1, -1, -1, -1,
RECOGNIZE_TYPE_BARCODE, OCR_OUTPUT_FORMAT_PLAINTEXT)
In some cases, you might not want to OCR the whole image. In that case, you can OCR on part of the image to save time:
s = ocrEngine.recognize("test.png", -1, 0, 0, 400, 200,
OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
The above code OCR the top left part of the image with width 400 pixels and height 200 pixels.
s = ocrEngine.recognize("test1.png;test2.png", -1, -1, -1, -1, -1,
OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
A TIFF file may contain multiple pages. If you need to recognize only a certain page, you can specify the page number as following:
s = ocrEngine.recognize("test.tif", 2, -1, -1, -1, -1,
OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)
Note 2 means the second page (the page number of the first page is 1).
You use the following method to perform OCR on a PDF input file:
s = ocrEngine.recognize("test.pdf", -1, -1, -1, -1, -1,
OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)