• Home
  • Archive
  • Tools
  • Contact Us

The Customize Windows

Technology Journal

  • Cloud Computing
  • Computer
  • Digital Photography
  • Windows 7
  • Archive
  • Cloud Computing
  • Virtualization
  • Computer and Internet
  • Digital Photography
  • Android
  • Sysadmin
  • Electronics
  • Big Data
  • Virtualization
  • Downloads
  • Web Development
  • Apple
  • Android
Advertisement
You are here: Home » Application of Machine Learning in Text Recognition : Part I

By Abhishek Ghosh August 23, 2018 9:37 am Updated on August 23, 2018

Application of Machine Learning in Text Recognition : Part I

Advertisement

In today’s digital, increasingly interconnected and fast paced world, the boundaries between what people have been what computers can do for us. While computers have been better in computing tasks from simple mathematics faster than the human brain for several decades, the type and efficiency of how human brain identifies and correlates patterns is still unsuccessful, despite various researches. However, with the increased power of up-to-date hardware, programming advances, and improved algorithms, computers today can detect faces and emotions in them. Using Google Image Search to display information about a highly charged image. Also in the pattern recognition of the publication, on the manuscript to the handwriting, now there is great progress. In order to improve the pattern recognition and the underlying algorithms of machines in these areas, we can use the principles of machine learning. The aim of this series of article is to discuss application of machine learning in text recognition which can be used to achieve improvements in applications. After a short presentation and classification of the topic, as well as the clarification of basic terms from the field of machine learning and text recognition, various types of machine learning that can be helpful for the support and improvement of text recognition will be analyzed and evaluated.

 

Application of Machine Learning in Text Recognition : Revision of Basics

 

Machine Learning

The notion of learning is commonly defined as the process in which humans, or animals, gain knowledge, understanding, or abilities through study, guidance, or experience in a field. In machine learning, due to existing parallels to the way animals learn, animals are attempting to adapt the knowledge gained in animals about their learning behavior to that of machines. Interestingly enough, the parallels between the two types of learning also allow conclusions to be drawn about the other way round, i.e. we can gain insights from machine learning that broaden our knowledge of biological learning. Machines learn whenever possible, based on input and output, data, structure, or program, and as a result improve the expected future performance. Accordingly, text recognition is well suited as a field of application for the machine learning application.

Advertisement

---

Algorithmic Approaches

In machine learning, there are several practical implementations using algorithmic approaches to learning. In the following, five possible and partly consecutive Algorithmic Solutions will be briefly presented and explained by means of examples.

Application of Machine Learning in Text Recognition Part I

Supervised learning

In supervised learning, the goal is to learn a concept based on classifiable examples. An example would be pictures of means of transport with wheels: car and bicycle. After the learner has seen a number of classified examples, he should be able to independently formulate hypotheses about the examples he continues to present and to approach the target concept as well as possible. Possible associations in this example would be the presence of wheels, which alone have direct ground contact. So he could make a statement about a motorcycle, which will also be a means of transportation, while he could also rule out that, for example, a watermill is a means of transportation. If you transfer this type of learning to a machine-processable algorithm, Here, too, a “teacher” of the machine provides several inputs from a topic area. First, the machine is provided with a correct function value, i.e. in our example pictures of: car, bicycle = means of transport with wheels. According to some examples, after the learning phase, the algorithm should also be able to independently formulate hypotheses about submitted examples. Monitored learning is therefore well-suited to improve handwriting recognition of algorithms. While machine-made block letters require only the font and uppercase and lowercase letters to be learned, a human’s handwriting follows the same principles, but is very individual in design.

Unsupervised learning

In contrast to supervised learning, unsupervised learning does not provide correct function values ​​or already classified outputs in order to create associations. Thus, in unsupervised learning, the algorithm also receives inputs, but does not receive any monitored expected outputs. Instead, it is left to the algorithm itself to find patterns in the obtained data and to make probable assumptions in order to create, classify, and output one or more models from the input data. In all other input data is then searched for the found patterns and classifications, or the existing patterns and classifications expanded. Unmonitored learning can thus be seen as a form of learning that creates a probabilistic model from the data provided. In models in which the timing and order of the data obtained are irrelevant, the algorithm can be used to monitor and detect outliers. In an example, thin of the sensor data of the temperature detectors of a data center. The algorithm could recognize a range of normal values ​​as a pattern from all previous data and detect an anomaly in the case of outliers in the form of temperature differences of the amount.

Naive Bayes

Naive Bayes classifications are renowned for their efficiency despite their simplicity. The probabilistic model of naive Bayes classifications is based on the theorem of Bayes. “Naive” comes in this context from the assumption that the characteristics of an input data set are independent. While this assumption is often refuted, naive Bayes algorithms are nevertheless very successful in practice and often achieve better results than supposedly more powerful algorithms. The result and the success of this algorithm depends strongly on its field of application. An example for the application of a naive Bayes based algorithm would be the classification of an apple: As a parameter is a classification (trained by eg supervised learning):

Fruit: apple
Circumference: ~ 10 cm
Red color
Shape: round

In all other inputs, the algorithm will be able to make relatively quickly relatively correct classification, which is a clear advantage. Among the classifications could, however, possibly, for example, appear a tomato, since their parameters also corresponds to those of the learned classification, which in turn to be Disadvantage.

Nearest Neighbor Algorithm

The Nearest Neighbor Algorithm (KNN) uses trained training examples and classifications through supervised learning. If a new object with an unknown classification is entered, its closest neighbors will search for the most similar objects with the highest match and classify the new object according to the attributes of the known objects. This approach can be extended by not only considering the set of matching nearest neighbor attributes, but also weighting them. Thus, neighbors who are “further away”, or have a smaller amount, but higher weighting of attributes, can be used as a classification example.

Instance-Based Learning

In instance-based learning (IBL), to avoid confusion in terminology over other methods that have been referred to as examples, patterns, or instances, instances are spoken here. IBL has great similarities to supervised learning and derives from the nearest neighbor concept. Again with this method, the source for the classifications of an instance is an external input. However, unlike supervised learning, IBL algorithms do not expect a specific input pair, but passively use external processes or data that already exists. The instances are described by pairs of attribute values. This pair of attribute values ​​and their similarities, or differences, is stored as a learned relationship and can be used for future predictions (classifications), or be extended by these. Compared to other approaches, this approach stands out for its general simplicity and simple way of analyzing attribute pairs. A consequent disadvantage may be a lack of missing relationships between attribute pairs, but they belong together. This has a negative effect on the precision of the output or prediction. An application example for the IBL can be found in handwriting recognition.

 

Conclusion of Part I

 

This article’s revision is probably helpful to the newbies with not much sound technical background. We learned the basic classification of ways of learning, which essentially based on human learning and which can be converted to logical thoughts i.e. programming. We would suggest the newbies to read our another series on deep learning. In the next part of this series, we will discuss about types of text recognition and application text recognition.

Tagged With machine learning text recognition , machine learning for text recognition , how to do speech to text recognition in machine learning , machine learning recognizing text , machine learning to detect if a pattern exists in a dynamically generated text , speech recognition

This Article Has Been Shared 908 Times!

Facebook Twitter Pinterest

Abhishek Ghosh

About Abhishek Ghosh

Abhishek Ghosh is a Businessman, Surgeon, Author and Blogger. You can keep touch with him on Twitter - @AbhishekCTRL.

Here’s what we’ve got for you which might like :

Articles Related to Application of Machine Learning in Text Recognition : Part I

  • Big Data as a Service (BDaaS) Basics

    Big Data as a Service or BDaaS, is as if combination of SaaS, PaaS and DaaS. Self Hosting Big Data platform is time consuming and costly.

  • Real-time Big Data Analytics in Health Care Using Tools From IBM

    Goal of the article Real-time Big Data Analytics in Health Care Using Tools From IBM is to provide understanding of big data in the health.

  • How to Include Jupyter Notebooks in WordPress Posts

    How to Include Jupyter Notebooks in WordPress Posts? There Are Several Ways. It Depends on What Exactly Your Blogs Are About.

  • How To Install Apache Avro On Ubuntu Running Apache Hadoop

    Here is How To Install Apache Avro On Ubuntu Running Apache Hadoop. Avro is a Framework Supporting RPC and Data Serialization.

  • Configure Apache Tika With WordPress to Search, Get Meta of PDF/Doc Files

    Apache Tika Can Be Combined With PHP. Here is How to Configure Apache Tika With WordPress to Search, Get Meta of PDF/Doc/Excel/Text and Other Type of Files.

Additionally, performing a search on this website can help you. Also, we have YouTube Videos.

Take The Conversation Further ...

We'd love to know your thoughts on this article.
Meet the Author over on Twitter to join the conversation right now!

If you want to Advertise on our Article or want a Sponsored Article, you are invited to Contact us.

Contact Us

Subscribe To Our Free Newsletter

Get new posts by email:

Please Confirm the Subscription When Approval Email Will Arrive in Your Email Inbox as Second Step.

Search this website…

 

Popular Articles

Our Homepage is best place to find popular articles!

Here Are Some Good to Read Articles :

  • Cloud Computing Service Models
  • What is Cloud Computing?
  • Cloud Computing and Social Networks in Mobile Space
  • ARM Processor Architecture
  • What Camera Mode to Choose
  • Indispensable MySQL queries for custom fields in WordPress
  • Windows 7 Speech Recognition Scripting Related Tutorials

Social Networks

  • Pinterest (24.3K Followers)
  • Twitter (5.8k Followers)
  • Facebook (5.7k Followers)
  • LinkedIn (3.7k Followers)
  • YouTube (1.3k Followers)
  • GitHub (Repository)
  • GitHub (Gists)
Looking to publish sponsored article on our website?

Contact us

Recent Posts

  • What is Voice User Interface (VUI) January 31, 2023
  • Proxy Server: Design Pattern in Programming January 30, 2023
  • Cyberpunk Aesthetics: What’s in it Special January 27, 2023
  • How to Do Electrical Layout Plan for Adding Smart Switches January 26, 2023
  • What is a Data Mesh? January 25, 2023

About This Article

Cite this article as: Abhishek Ghosh, "Application of Machine Learning in Text Recognition : Part I," in The Customize Windows, August 23, 2018, January 31, 2023, https://thecustomizewindows.com/2018/08/application-of-machine-learning-in-text-recognition-part-i/.

Source:The Customize Windows, JiMA.in

PC users can consult Corrine Chorney for Security.

Want to know more about us? Read Notability and Mentions & Our Setup.

Copyright © 2023 - The Customize Windows | dESIGNed by The Customize Windows

Copyright  · Privacy Policy  · Advertising Policy  · Terms of Service  · Refund Policy

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
Do not sell my personal information.
Cookie SettingsAccept
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT