• Home
  • Archive
  • Tools
  • Contact Us

The Customize Windows

Technology Journal

  • Cloud Computing
  • Computer
  • Digital Photography
  • Windows 7
  • Archive
  • Cloud Computing
  • Virtualization
  • Computer and Internet
  • Digital Photography
  • Android
  • Sysadmin
  • Electronics
  • Big Data
  • Virtualization
  • Downloads
  • Web Development
  • Apple
  • Android
Advertisement
You are here: Home » Application of Machine Learning in Text Recognition : Part III (END)

By Abhishek Ghosh August 24, 2018 2:15 pm Updated on August 24, 2018

Application of Machine Learning in Text Recognition : Part III (END)

Advertisement

In First Part of Application of Machine Learning in Text Recognition, we have clarified the basic terms from the field of machine learning and text recognition, explained various types of machine learning in brief. In Second Part of Application of Machine Learning in Text Recognition, we have discussed about the types of text recognition and application text recognition. Third part will complete the series.In This Third and Final Part of Application of Machine Learning in Text Recognition, We Have Discussed About Machine Learning as Part of Text Recognition and Draw Conclusion.

 

Application of Machine Learning in Text Recognition

 

Instance-Based Learning (IBL)

As described in previous articles of this series, the Instance-Based Learning is based on instances. These are described by n-many attributes and represent a point in an n-dimensional instance space. Each attribute is defined by a set of ordered numeric values ​​as well as unordered symbolic values. Supported by methods such as supervised learning, Instance-Based Learning (IBL) algorithms also learn a concept of something by entering training instances that are already classified or categorized.

Advertisement

---

Course of the Learning Process

Algorithms based on instance-based learning basically consist of the following four functions:

  1. Normalization
  2. Similarities
  3. Prediction
  4. Memory update

The first function normalizes each numeric value of an attribute to allow the second function to assume that each attribute has the same range of possible values. Once a new input is made, repeated normalization is required.

IBL Phase 1

The second function returns a numeric value that expresses the degree of agreement or commonality of the new instance versus the classification of the concepts already known. This is done by a simple commonality function such as the inverse of the Euclidean distance between two instances x and y, where n is the number of attributes.

IBL Phase 2

The third function uses the values ​​obtained so far to make a prediction of the most likely values ​​to be expected. The prediction function for the numeric values ​​then calculates a weighted similarity to the most similar instances.

IBL Phase 3

In the fourth functional step, the memory of the algorithm is finally updated, i.e. all processed and updated training data are written into the target concept.

Use in Handwriting Recognition

Sequence HCR with IBL

After a scanner has scanned, binarized, and saved the text to be processed as a bitmap, the system searches for lines. Rows in this context are basically rectangular areas containing sequences of letters. A page is thus separated horizontally by lines. Once all rows have been found, they will search for more rectangular sub-areas that will vertically separate the rows and contain isolated letters. The next phase is about learning. Each found letter is normalized in an area of ​​the bitmap which is 16×16 pixels in size. If one letter can not be correctly separated from another and normalized, it is possible to manually set the boundaries of the letter to other letters. The obtained 16×16 bitmap now receives a numeric code. A normalized letter is represented by 256 (16×16) dots. Each point can assume the state 0 (white) or 1 (black). Since letters can be scanned from texts in various resolutions (dpi) (75 dpi – 600 dpi), normalization into a 16×16 pixel bitmap is inevitable. For each of the 256 points in the normalized bitmap it is necessary to assign a corresponding part of the letter.

Letters can be classified in different ways. The system described here (“template matching”) uses five methods to compare the normalized bitmap (ie the known instance) with the bitmap of the input letter:

  1. Compare adjacent ones and zeroes with a simple XOR query
  2. Exclusive comparison of the ones by an AND query
  3. Exclusive comparison of the zeroes by an OR query
  4. The Euclidean distance of the circle around a letter
  5. The Euclidean distance of the circle around the normalized letter
Machine Learning in Text Recognition

 

Summary and Conclusion

 

In this scientific work, it has been that explained how today’s text recognition relies on machine learning and it’s effects on the successful recognition of texts. In the analysis of largely English technical contributions, it quickly became apparent that machine learning has no insignificant influence on the quality of text recognition. While in classical text recognition of machine typeface the systems still manage quite well without learned knowledge and are usually limited to pure comparison methods, it is almost impossible in the recognition of handwritten characters to correctly recognize these without experience from previous analyzes, so that In this case, different learning methods are used, which are no longer based purely on comparison.

It uses different machine learning methods, each of which has its advantages and disadvantages. Overall, however, these methods can be divided into 2 groups. Once the completely self-learning methods and methods based on existing knowledge and learning with the help of people. Fully self-learning methods are based on artificial intelligence.

It can also be stated that, above all, instance-Based Learning is very well suited to the recognition of handwritten characters. Depending on the quality of the graphics to be read, this method still achieves an accuracy of more than 80%. For printed characters, almost 100%. These values ​​will continue to increase over time, because every time the system rediscovers a character or reads it out for the first time, it trains itself. If you then add the context-based correction to the whole then the accuracy would improve again not insignificant .

In the present day, when artificial intelligence and machine learning already have a big impact on our lives, machine learning in text recognition will evolve. Thus, in the future, further artificial intelligence technologies will be developed, which are expected to provide even better analysis results. It can be surmised that these technologies will continue to evolve until they can guarantee a 100% correct selection.

It can also be stated that the technologies will continue to move away from the comparison methods and to the artificial intelligence, since this delivers substantially better results in the text recognition. For example, international competitions have been won with the technology of artificial intelligence and deep learning against the comparative methods.

Tagged With how to combine letters for machine learning text recognition , application of machine learning in text recognition : part iii , recognise most important words in text machine learning , text recognition machine learning

This Article Has Been Shared 238 Times!

Facebook Twitter Pinterest

Abhishek Ghosh

About Abhishek Ghosh

Abhishek Ghosh is a Businessman, Surgeon, Author and Blogger. You can keep touch with him on Twitter - @AbhishekCTRL.

Here’s what we’ve got for you which might like :

Articles Related to Application of Machine Learning in Text Recognition : Part III (END)

  • Integration of Apache Hadoop With OpenStack Swift

    The Topic Integration of Apache Hadoop With OpenStack Swift Is Not Exactly New. You Can Follow Our Guide Specially For Handling OpenStack.

  • Join/Merge Multiple Log Files For Big Data Analysis

    Here Are The Ways To Join/Merge Multiple Log Files For Big Data Analysis, Store Them To OpenStack Based Cloud Storage And Delete Old Files.

  • How To Install Apache Pig On Ubuntu 16.04

    Apache Pig is intended for analyzing large data sets. Usually we combine Pig with Hadoop. Here is How To Install Apache Pig On Ubuntu 16.04.

  • How To Learn Big Data : Free Courses and Resources

    How To Learn Big Data With Free Courses and Resources? First You Need to Know the Proper Websites, Second You Need to Know What Not to Learn.

  • How To Install Apache Flink on Ubuntu Server

    Apache Flink is a big data processing engine which can run in both streaming & batch mode.Here Are the Steps on How To Install Apache Flink on Ubuntu Server.

Additionally, performing a search on this website can help you. Also, we have YouTube Videos.

Take The Conversation Further ...

We'd love to know your thoughts on this article.
Meet the Author over on Twitter to join the conversation right now!

If you want to Advertise on our Article or want a Sponsored Article, you are invited to Contact us.

Contact Us

Subscribe To Our Free Newsletter

Get new posts by email:

Please Confirm the Subscription When Approval Email Will Arrive in Your Email Inbox as Second Step.

Search this website…

 

Popular Articles

Our Homepage is best place to find popular articles!

Here Are Some Good to Read Articles :

  • Cloud Computing Service Models
  • What is Cloud Computing?
  • Cloud Computing and Social Networks in Mobile Space
  • ARM Processor Architecture
  • What Camera Mode to Choose
  • Indispensable MySQL queries for custom fields in WordPress
  • Windows 7 Speech Recognition Scripting Related Tutorials

Social Networks

  • Pinterest (24.3K Followers)
  • Twitter (5.8k Followers)
  • Facebook (5.7k Followers)
  • LinkedIn (3.7k Followers)
  • YouTube (1.3k Followers)
  • GitHub (Repository)
  • GitHub (Gists)
Looking to publish sponsored article on our website?

Contact us

Recent Posts

  • What is Voice User Interface (VUI) January 31, 2023
  • Proxy Server: Design Pattern in Programming January 30, 2023
  • Cyberpunk Aesthetics: What’s in it Special January 27, 2023
  • How to Do Electrical Layout Plan for Adding Smart Switches January 26, 2023
  • What is a Data Mesh? January 25, 2023

About This Article

Cite this article as: Abhishek Ghosh, "Application of Machine Learning in Text Recognition : Part III (END)," in The Customize Windows, August 24, 2018, February 1, 2023, https://thecustomizewindows.com/2018/08/application-of-machine-learning-in-text-recognition-part-iii-end/.

Source:The Customize Windows, JiMA.in

PC users can consult Corrine Chorney for Security.

Want to know more about us? Read Notability and Mentions & Our Setup.

Copyright © 2023 - The Customize Windows | dESIGNed by The Customize Windows

Copyright  · Privacy Policy  · Advertising Policy  · Terms of Service  · Refund Policy

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
Do not sell my personal information.
Cookie SettingsAccept
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT