Application of Machine Learning in Text Recognition : Part III (END)

Abhishek Ghosh

By Abhishek Ghosh August 24, 2018 2:15 pm Updated on August 24, 2018

Application of Machine Learning in Text Recognition : Part III (END)

In First Part of Application of Machine Learning in Text Recognition, we have clarified the basic terms from the field of machine learning and text recognition, explained various types of machine learning in brief. In Second Part of Application of Machine Learning in Text Recognition, we have discussed about the types of text recognition and application text recognition. Third part will complete the series.In This Third and Final Part of Application of Machine Learning in Text Recognition, We Have Discussed About Machine Learning as Part of Text Recognition and Draw Conclusion.

Application of Machine Learning in Text Recognition

Instance-Based Learning (IBL)

As described in previous articles of this series, the Instance-Based Learning is based on instances. These are described by n-many attributes and represent a point in an n-dimensional instance space. Each attribute is defined by a set of ordered numeric values as well as unordered symbolic values. Supported by methods such as supervised learning, Instance-Based Learning (IBL) algorithms also learn a concept of something by entering training instances that are already classified or categorized.

Course of the Learning Process

Algorithms based on instance-based learning basically consist of the following four functions:

Normalization
Similarities
Prediction
Memory update

The first function normalizes each numeric value of an attribute to allow the second function to assume that each attribute has the same range of possible values. Once a new input is made, repeated normalization is required.

IBL Phase 1

The second function returns a numeric value that expresses the degree of agreement or commonality of the new instance versus the classification of the concepts already known. This is done by a simple commonality function such as the inverse of the Euclidean distance between two instances x and y, where n is the number of attributes.

IBL Phase 2

The third function uses the values obtained so far to make a prediction of the most likely values to be expected. The prediction function for the numeric values then calculates a weighted similarity to the most similar instances.

IBL Phase 3

In the fourth functional step, the memory of the algorithm is finally updated, i.e. all processed and updated training data are written into the target concept.

Use in Handwriting Recognition

Sequence HCR with IBL

After a scanner has scanned, binarized, and saved the text to be processed as a bitmap, the system searches for lines. Rows in this context are basically rectangular areas containing sequences of letters. A page is thus separated horizontally by lines. Once all rows have been found, they will search for more rectangular sub-areas that will vertically separate the rows and contain isolated letters. The next phase is about learning. Each found letter is normalized in an area of the bitmap which is 16×16 pixels in size. If one letter can not be correctly separated from another and normalized, it is possible to manually set the boundaries of the letter to other letters. The obtained 16×16 bitmap now receives a numeric code. A normalized letter is represented by 256 (16×16) dots. Each point can assume the state 0 (white) or 1 (black). Since letters can be scanned from texts in various resolutions (dpi) (75 dpi – 600 dpi), normalization into a 16×16 pixel bitmap is inevitable. For each of the 256 points in the normalized bitmap it is necessary to assign a corresponding part of the letter.

Letters can be classified in different ways. The system described here (“template matching”) uses five methods to compare the normalized bitmap (ie the known instance) with the bitmap of the input letter:

Compare adjacent ones and zeroes with a simple XOR query
Exclusive comparison of the ones by an AND query
Exclusive comparison of the zeroes by an OR query
The Euclidean distance of the circle around a letter
The Euclidean distance of the circle around the normalized letter

Summary and Conclusion

In this scientific work, it has been that explained how today’s text recognition relies on machine learning and it’s effects on the successful recognition of texts. In the analysis of largely English technical contributions, it quickly became apparent that machine learning has no insignificant influence on the quality of text recognition. While in classical text recognition of machine typeface the systems still manage quite well without learned knowledge and are usually limited to pure comparison methods, it is almost impossible in the recognition of handwritten characters to correctly recognize these without experience from previous analyzes, so that In this case, different learning methods are used, which are no longer based purely on comparison.

It uses different machine learning methods, each of which has its advantages and disadvantages. Overall, however, these methods can be divided into 2 groups. Once the completely self-learning methods and methods based on existing knowledge and learning with the help of people. Fully self-learning methods are based on artificial intelligence.

It can also be stated that, above all, instance-Based Learning is very well suited to the recognition of handwritten characters. Depending on the quality of the graphics to be read, this method still achieves an accuracy of more than 80%. For printed characters, almost 100%. These values will continue to increase over time, because every time the system rediscovers a character or reads it out for the first time, it trains itself. If you then add the context-based correction to the whole then the accuracy would improve again not insignificant .

In the present day, when artificial intelligence and machine learning already have a big impact on our lives, machine learning in text recognition will evolve. Thus, in the future, further artificial intelligence technologies will be developed, which are expected to provide even better analysis results. It can be surmised that these technologies will continue to evolve until they can guarantee a 100% correct selection.

It can also be stated that the technologies will continue to move away from the comparison methods and to the artificial intelligence, since this delivers substantially better results in the text recognition. For example, international competitions have been won with the technology of artificial intelligence and deep learning against the comparative methods.

Tagged With how to combine letters for machine learning text recognition , application of machine learning in text recognition : part iii , recognise most important words in text machine learning , text recognition machine learning

About Abhishek Ghosh

Here’s what we’ve got for you which might like :

Take The Conversation Further ...

Get new posts by email: