Highest Voted 'optical-character-recognition' Questions - Artificial Intelligence Stack Exchange

24

votes

3 answers

Why can't OCR be perceived as a good example of AI?

On the Wikipedia page about AI, we can read: Optical character recognition is no longer perceived as an exemplar of "artificial intelligence" having become a routine technology. On the other hand, the MNIST database of handwritten digits is…

philosophy definitions optical-character-recognition

asked Aug 06 '16 at 01:57

kenorb

10,525
6
45
95

10

votes

3 answers

Are there any textual CAPTCHA challenges which can fool AI, but not humans?

Are there any modern techniques for generating textual CAPTCHA challenges (where a person needs to type the right text) that can easily fool AI with some visual obfuscation methods, but that a human can still solve without any struggle? For…

image-recognition research optical-character-recognition

asked Aug 05 '16 at 01:45

kenorb

10,525
6
45
95

7

votes

2 answers

Effective algorithms for OCR

I am using Google's OCR to extract text from images, like receipts and invoices. What are examples of techniques used to make sense of the text? For example, I would like to extract the date, name of the business, address, total amount, etc. Before…

machine-learning reference-request optical-character-recognition

asked Nov 21 '17 at 21:34

Abhay Naik

179
2
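One simple baseline for the kind of field extraction asked about above is rule-based matching on the OCR output. The following is only a minimal sketch under stated assumptions: the patterns, the `extract_fields` helper, and the sample receipt text are all hypothetical, and real receipts would need locale-aware date and currency handling.

```python
import re

# Hypothetical patterns: numeric dates such as 12/03/2017 or 12-03-17,
# and an amount that follows the standalone word "total".
DATE_RE = re.compile(r"\b(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})\b")
TOTAL_RE = re.compile(r"\btotal\b[^\d\n]*(\d+[.,]\d{2})", re.IGNORECASE)


def extract_fields(ocr_text):
    """Return the first date and total found in raw OCR text, or None for each."""
    date = DATE_RE.search(ocr_text)
    total = TOTAL_RE.search(ocr_text)
    return {
        "date": date.group(1) if date else None,
        "total": total.group(1) if total else None,
    }


# Made-up OCR output, used only for illustration.
sample = "ACME STORE\n12/03/2017\nSubtotal 9.50\nTax 0.50\nTOTAL 10.00\n"
fields = extract_fields(sample)
print(fields)
```

The word boundary in front of "total" keeps the pattern from matching inside "Subtotal", which is a common failure of naive keyword search on receipts.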

6

votes

1 answer

How should the racing agent take into account the velocity of the vehicle, given the images with a speedometer?

I'm developing a game AI, which tries to master racing simulations. I already trained a CNN (AlexNet) on in-game footage of me playing the game and the pressed keys as the target. As the CNN is only making predictions on a frame-to-frame basis, and…

convolutional-neural-networks game-ai optical-character-recognition alexnet optical-flow

asked Sep 12 '17 at 17:59

TheJD

103
5

5

votes

1 answer

Why are object detection algorithms poor at optical character recognition?

OCR is still a very hard problem. We don't have universal powerful solutions. We use the CTC loss function (see "An Intuitive Explanation of Connectionist Temporal Classification", Towards Data Science, and "Sequence Modeling With CTC", Distill), which is very…

object-detection object-recognition optical-character-recognition ctc-loss

asked Apr 19 '21 at 14:04

user40943
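For readers unfamiliar with CTC, the decoding side can be illustrated with a short sketch. It assumes a recogniser has already produced one best label per time step, with label 0 reserved for the CTC blank; the `ctc_greedy_decode` name, the toy alphabet, and the frame sequence are hypothetical. This is greedy (best-path) decoding only, not the CTC loss itself.

```python
from itertools import groupby


def ctc_greedy_decode(frame_labels, blank=0):
    """Collapse consecutive repeated labels, then drop blank labels."""
    collapsed = [label for label, _ in groupby(frame_labels)]
    return [label for label in collapsed if label != blank]


# Toy alphabet and per-frame predictions, made up for illustration.
alphabet = {1: "c", 2: "a", 3: "t"}
frames = [1, 1, 0, 2, 2, 2, 0, 0, 3, 3]

decoded = ctc_greedy_decode(frames)
text = "".join(alphabet[i] for i in decoded)
print(text)
```

The blank label is what lets the output contain genuine double letters: the frames `[1, 0, 1]` decode to two separate occurrences of label 1, whereas `[1, 1]` collapses to one.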

5

votes

1 answer

In OCR, how should I deal with the warped text on the sides of oval objects?

Consider an image that contains one can (or bottle, or any similar oval object), which has text all over it. In the image below, I have many bottles, but you can assume that each image only contains one such object. As we can see, in each can, the…

python image-processing data-preprocessing optical-character-recognition

asked Jan 06 '21 at 09:09

Red

175
6

5

votes

2 answers

How can we recognise musical notes in low-resolution or blurry images?

I was looking for an approach to recognise musical notes from photos. I found this repository https://github.com/mpralat/notesRecognizer. However, it doesn't seem good enough. If you look into the bad folder, you can see that just tiny variations of…

image-recognition optical-character-recognition

asked Mar 24 '19 at 23:20

Toskan

151
1
4

4

votes

1 answer

How should I define the loss function for a multi-object detection problem?

I'm trying to create a text recognition project using a CNN. I need help regarding the text detection task. I have the training images and bounding box details for them. But I'm unable to figure out how to create the loss function. Can anyone help…

deep-learning convolutional-neural-networks classification image-recognition optical-character-recognition

asked Apr 28 '20 at 15:03

h4x

41
1

3

votes

0 answers

Zonal or template OCR for reading invoices

I'd like to explore the possibilities of applying artificial intelligence to OCR. Basic OCR invoice processing lets me convert only 30% of them. The main purpose is to define invoice areas by training an AI, then process those areas with…

neural-networks topology optical-character-recognition

asked Nov 05 '18 at 08:32

Gab

31
1

3

votes

2 answers

How could I use machine learning to detect text and non-text regions in scanned documents?

I have a collection of scanned documents (which come from newspapers, books, and magazines) with complex alignments for the text, i.e. the text could be at any angle w.r.t. the page. I can do a lot of processing to extract different features.…

machine-learning natural-language-processing pattern-recognition optical-character-recognition

asked Oct 17 '18 at 10:54

bipul kalita

79
5

3

votes

2 answers

How to improve the performance of Easy OCR

I am working on a project that requires me to identify a product on a grocery shelf. For that, I am trying to use text recognition and localization to spot a product. I tried Easy OCR and Tesseract OCR because they are giving me accurate results,…

deep-learning optical-character-recognition text-detection

asked Jul 29 '22 at 18:09

Suresh Nayak

31
1
2

3

votes

0 answers

Is there a deep learning-based architecture for digit localisation?

I'm new to object detectors and segmentation. I want to localize digits on a plate as fast as possible. All images of the dataset are normalized to $300 \times 60$. There are different approaches to solve the problem. For example, binarization +…

deep-learning optical-character-recognition image-segmentation

asked Oct 21 '19 at 16:18

Babak.Abad

131
3

3

votes

1 answer

Attempting to solve an optical character recognition task using a feed-forward network

I am doing some experimentation on neural networks, and for that I am trying to program a plain OCR task. I have learned CNNs are the best choice, but for the time being and due to my inexperience, I want to go step by step and start with feedforward…

feedforward-neural-networks optical-character-recognition

asked Mar 04 '19 at 21:13

Chal.lo

51
1

3

votes

0 answers

How does a neural network output text box location data?

I'm interested in creating a convolutional neural network or LSTM to locate text in an image. I don't want to OCR the text yet, just find the text regions. Yes, I know Tesseract and other systems can do this, but I want to learn how it works by…

convolutional-neural-networks long-short-term-memory optical-character-recognition

asked Jan 27 '19 at 20:21

Matthew Bishop

31
1

2

votes

1 answer

How do I go about performing OCR text extraction from thousands of PDFs for training an AI model?

I have lots of data (PDFs) that I want to train an AI model to extract info from. All of them are a little different but have the same key data points. Is it possible to train an AI on the PDFs I have so that it would be able to recognize other PDFs…

training ai-design optical-character-recognition

asked May 25 '23 at 14:01

Alen Ramic

21
1

Questions tagged [optical-character-recognition]