5

When I try to detect text on my jpeg, it shows correctly all areas where it suspects text and images, but when I export it to ODT it only creates an ODT with empty text- and imageframes.

Do I have to configure tesseract somehow?

(I use Ubuntu 14.10 32bit)

rubo77
  • 34,024
  • 52
  • 172
  • 299

2 Answers2

3

Try this:

Open the ocrfeeder program.

Edit the engine: Click Tools - OCR Engine

Select the Tesseract engine and click Edit

Where it says arguments engine changed the script for this:

$IMAGE $FILE -l eng -psm 3 > /dev/null 2> /dev/null; cat $FILE.txt; rm $FILE $FILE.txt

To export the document click File - Export

Select the desired output format.

If the document has pictures I advise using the html format text.

If only has text the best is to use the format plain text txt .

kyodake
  • 17,808
0

I have better results with : $IMAGE $FILE -l eng > /dev/null 2> /dev/null; cat $FILE.txt; rm $FILE $FILE.txt in tools > Roc engines > tesseract > modify > parameters of the engine.