• Home
  • Archive
  • Tools
  • Contact Us

The Customize Windows

Technology Journal

  • Cloud Computing
  • Computer
  • Digital Photography
  • Windows 7
  • Archive
  • Cloud Computing
  • Virtualization
  • Computer and Internet
  • Digital Photography
  • Android
  • Sysadmin
  • Electronics
  • Big Data
  • Virtualization
  • Downloads
  • Web Development
  • Apple
  • Android
Advertisement
You are here: Home » Knowledge Discovery in Databases : Part I

By Abhishek Ghosh August 26, 2018 2:52 pm Updated on August 26, 2018

Knowledge Discovery in Databases : Part I

Advertisement

Data which can be stored almost limitless in today’s world. Knowledge discovery in databases is a good way to extract knowledge from data. The aim of this article is to inform business related peoples about Knowledge Discovery in Databases, without the need to have greater knowledge in the areas of foundation basics in computer science. Commercial devices and scientific instruments, such as scanners, cash registers and telescopes are generating ever-increasing amounts of data. These data contain potential knowledge that far exceeds human capacity.
This is where knowledge discovery in databases (KDD) is used. The knowledge discovery in databases has the motivation to filter out the potential knowledge from this large amount of data. Knowledge discovery in databases works at the interface of statistics and database systems. In addition, knowledge discovery is a subset of machine learning. Knowledge discovery in databases is the process of semi-automatic extraction of knowledge from databases, which is valid data, previously unknown and is potentially useful for a given purpose.

For the sake of this article, initially the basics of statistics and database systems for knowledge discovery in databases will be explained. After discussing the basics, the specific details of the topic will be discussed and the aims and methods of the knowledge discovery in databases (KDD) will be explained. Later, the steps in the Knowledge Discovery process flow into databases will be explained. After the basic process has been explained, which methods are used in the KDD that will be discussed. In the final analysis, a conclusion will be drawn and a number of reasons will be discussed to point out why this area will continue to gain importance in the coming future.

Knowledge Discovery in Databases Part I

 

Fundamentals of Database Systems

 

Database system

Advertisement

---

A database system is an electronic data management system. The main task is to save large amounts of data efficiently, consistently and permanently. This is called both persistent storage and consistent (correct) storage.
Database systems consist of a database and a database management system.

Database

The database (DB) is a collection of all existing data with their descriptions or attributes.

Database Management System

The database management system (DBMS) is a program for managing the database. With this program the access to the data is controlled and the existing data can be supplemented or changed. Database systems have three levels of abstraction. There are the external level, the conceptual level and the internal level. This is also called the ANSI 3-plane model.

External level

The external level allows different users or user groups to have a different view of the data stock. Not every user may or wants to see all data.

Conceptual level

The conceptual level specifies the overall logical view of all data. The overall view describes the relationships and objects between data. Their goal is to ensure completely redundancy-free storage of all data in the database. The data is stored in the third form of normalization according to the relational database schema.

Internal level

The internal level deals with the physical implementation of the Conceptual Scheme. Information about the physical memory data structures and access mechanisms are part of the internal layer. Therefore, the internal level is also called the physical level. In addition, base tables are often provided in non-normalized form to speed up access to the data. Performance intensive aggregation tables are usually calculated overnight and stored in additional tables. This allows the user to retrieve the results very quickly during the day.

Database language

As a relational database language, the language SQL has prevailed in the industry standard. SQL consists of three parts:

Data Definition Language (DDL) : The DDL is the part of the language of SQL to create databases, tables, views, etc. (CREATE), make structure changes (ALTER) or delete databases or database objects (DROP).
Data Manipulation Language (DML) : The DML is the part of the SQL language that makes it possible to insert records (INSERT), to change (UPDATE), to delete (DELETE) or to query (SELECT).
Data Control Language (DCL) : The DCL is the part of the language of SQL to assign access rights (GRANT) or withhold (REVOKE).

 

Fundamentals of Statistics and Big Data

 

Statistics is divided into three basic tasks. The descriptive statistics, inductive statistics and the exploratory statistics.

Descriptive statistics

In descriptive statistics, the available data are described and graphically compressed in tables, graphs or figures. These contain parameters such as mean and scatter as well as diagrams and curves. With this statistic slightly inconsistent data can be filtered out. The descriptive statistics work with the question: How can one describe a distribution of a feature?

Inductive statistics

The inductive statistics try to determine from a small part of the data mass characteristics of the population over which one wants to make a statement. This is analyzed and implemented using stochastic models. Inductive statistics work with the question: How can a sample be identified as a property for all relevant population data?

Exploratory statistics

Exploratory statistics is an intermediate form of descriptive and inductive statistics. It uses descriptive methods and inductive test methods to detect possible relationships or differences between data. The results found are called hypotheses. The explorative statistics work with the question: What is remarkable or unusual about a distribution of a feature?

Data Warehouse

A data warehouse (DW) is a persistent, integrated collection of data from multiple sources for the purpose of analysis and decision support. For a more comprehensive introduction to the topic, read our topics in this website on fundamentals of Big Data, Data Warehouse, Data Lake, Data Mining by using the search function.

Architecture of a Data Warehouse

The Data Warehouse collects the data from various operational database systems, such as purchasing, order processing and marketing. The complex process of integrating these operational database systems creates metadata for describing the data warehouse. Data marts are extracted from the data warehouse, which represent certain partial views/extracts as a copy of the actual data. The Data Warehouse is used to process ad hoc requests, for online analytical processing and for data mining, thus laying the foundation for data mining.

 

Fundamentals of Knowledge Discovery in Databases

 

The aim of the Knowledge Discovery in Databases is to independently discover decision-relevant, but previously unknown relationships and links in the data of large amounts of data and present them to the analyst or the user in a clear format. These gained relationships provide a knowledge gain. A key statement is that it is a non-trivial process whose purpose is to extract patterns from large data sets. These patterns should also have the properties that they are valid for a large part of the database (valid) and describe previously unknown (potentially useful) and easily understandable (ultimately understandable) relationships within the database.

 

Conclusion of Part I

 

In this first part, we introduced the readers to Knowledge Discovery in Databases (KDD). In the second part of this article, we will discuss the process of the Knowledge Discovery in Databases and Methods of the Knowledge Discovery in Databases.

Tagged With 1NNI , knowledge discovery databases explained

This Article Has Been Shared 793 Times!

Facebook Twitter Pinterest

Abhishek Ghosh

About Abhishek Ghosh

Abhishek Ghosh is a Businessman, Surgeon, Author and Blogger. You can keep touch with him on Twitter - @AbhishekCTRL.

Here’s what we’ve got for you which might like :

Articles Related to Knowledge Discovery in Databases : Part I

  • Install Apache Kafka on Ubuntu 16.04 : Single Cloud Server

    Here Are The Steps On How To Install Apache Kafka on Ubuntu 16.04 Running One Single Cloud Server Instance. Apache Kafka

  • Install Apache Zeppelin On Ubuntu 16.04

    Zeppelin Can Be Pre-Built Package Or Can Be Build From Source. Here Is How To Install Apache Zeppelin On Ubuntu 16.04 Building From Source.

  • WordPress Interactive Python Widget For Python (Like JSFiddle For Jupyter Notebook)

    Here is WordPress Interactive Python Widget For Python With DataCamp Light For Peoples Who Hunt Something Like JSFiddle For Jupyter Notebook.

  • How To Learn Big Data For Beginners

    Many Are Interested to Learn Big Data Analytics and Other Works But Clueless Where From to Start. Here is How To Learn Big Data For Beginners.

  • How to Install Apache Ignite on Ubuntu Server

    How to Install Apache Ignite on Ubuntu Depends on the Purpose. Apache Ignite is a distributed database, also is a caching and processing platform.

Additionally, performing a search on this website can help you. Also, we have YouTube Videos.

Take The Conversation Further ...

We'd love to know your thoughts on this article.
Meet the Author over on Twitter to join the conversation right now!

If you want to Advertise on our Article or want a Sponsored Article, you are invited to Contact us.

Contact Us

Subscribe To Our Free Newsletter

Get new posts by email:

Please Confirm the Subscription When Approval Email Will Arrive in Your Email Inbox as Second Step.

Search this website…

 

Popular Articles

Our Homepage is best place to find popular articles!

Here Are Some Good to Read Articles :

  • Cloud Computing Service Models
  • What is Cloud Computing?
  • Cloud Computing and Social Networks in Mobile Space
  • ARM Processor Architecture
  • What Camera Mode to Choose
  • Indispensable MySQL queries for custom fields in WordPress
  • Windows 7 Speech Recognition Scripting Related Tutorials

Social Networks

  • Pinterest (24.3K Followers)
  • Twitter (5.8k Followers)
  • Facebook (5.7k Followers)
  • LinkedIn (3.7k Followers)
  • YouTube (1.3k Followers)
  • GitHub (Repository)
  • GitHub (Gists)
Looking to publish sponsored article on our website?

Contact us

Recent Posts

  • Cyberpunk Aesthetics: What’s in it Special January 27, 2023
  • How to Do Electrical Layout Plan for Adding Smart Switches January 26, 2023
  • What is a Data Mesh? January 25, 2023
  • What is Vehicular Ad-Hoc Network? January 24, 2023
  • Difference Between Panel Light, COB Light, Track Light January 21, 2023

About This Article

Cite this article as: Abhishek Ghosh, "Knowledge Discovery in Databases : Part I," in The Customize Windows, August 26, 2018, January 29, 2023, https://thecustomizewindows.com/2018/08/knowledge-discovery-in-databases-part-i/.

Source:The Customize Windows, JiMA.in

PC users can consult Corrine Chorney for Security.

Want to know more about us? Read Notability and Mentions & Our Setup.

Copyright © 2023 - The Customize Windows | dESIGNed by The Customize Windows

Copyright  · Privacy Policy  · Advertising Policy  · Terms of Service  · Refund Policy

We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
Do not sell my personal information.
Cookie SettingsAccept
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT