• Home
  • Archive
  • Tools
  • Contact Us

The Customize Windows

Technology Journal

  • Cloud Computing
  • Computer
  • Digital Photography
  • Windows 7
  • Archive
  • Cloud Computing
  • Virtualization
  • Computer and Internet
  • Digital Photography
  • Android
  • Sysadmin
  • Electronics
  • Big Data
  • Virtualization
  • Downloads
  • Web Development
  • Apple
  • Android
Advertisement
You are here: Home » Knowledge Discovery in Databases : Part I

By Abhishek Ghosh August 26, 2018 2:52 pm Updated on August 26, 2018

Knowledge Discovery in Databases : Part I

Advertisement

Data which can be stored almost limitless in today’s world. Knowledge discovery in databases is a good way to extract knowledge from data. The aim of this article is to inform business related peoples about Knowledge Discovery in Databases, without the need to have greater knowledge in the areas of foundation basics in computer science. Commercial devices and scientific instruments, such as scanners, cash registers and telescopes are generating ever-increasing amounts of data. These data contain potential knowledge that far exceeds human capacity.
This is where knowledge discovery in databases (KDD) is used. The knowledge discovery in databases has the motivation to filter out the potential knowledge from this large amount of data. Knowledge discovery in databases works at the interface of statistics and database systems. In addition, knowledge discovery is a subset of machine learning. Knowledge discovery in databases is the process of semi-automatic extraction of knowledge from databases, which is valid data, previously unknown and is potentially useful for a given purpose.

For the sake of this article, initially the basics of statistics and database systems for knowledge discovery in databases will be explained. After discussing the basics, the specific details of the topic will be discussed and the aims and methods of the knowledge discovery in databases (KDD) will be explained. Later, the steps in the Knowledge Discovery process flow into databases will be explained. After the basic process has been explained, which methods are used in the KDD that will be discussed. In the final analysis, a conclusion will be drawn and a number of reasons will be discussed to point out why this area will continue to gain importance in the coming future.

Knowledge Discovery in Databases Part I

 

Fundamentals of Database Systems

 

Database system

Advertisement

---

A database system is an electronic data management system. The main task is to save large amounts of data efficiently, consistently and permanently. This is called both persistent storage and consistent (correct) storage.
Database systems consist of a database and a database management system.

Database

The database (DB) is a collection of all existing data with their descriptions or attributes.

Database Management System

The database management system (DBMS) is a program for managing the database. With this program the access to the data is controlled and the existing data can be supplemented or changed. Database systems have three levels of abstraction. There are the external level, the conceptual level and the internal level. This is also called the ANSI 3-plane model.

External level

The external level allows different users or user groups to have a different view of the data stock. Not every user may or wants to see all data.

Conceptual level

The conceptual level specifies the overall logical view of all data. The overall view describes the relationships and objects between data. Their goal is to ensure completely redundancy-free storage of all data in the database. The data is stored in the third form of normalization according to the relational database schema.

Internal level

The internal level deals with the physical implementation of the Conceptual Scheme. Information about the physical memory data structures and access mechanisms are part of the internal layer. Therefore, the internal level is also called the physical level. In addition, base tables are often provided in non-normalized form to speed up access to the data. Performance intensive aggregation tables are usually calculated overnight and stored in additional tables. This allows the user to retrieve the results very quickly during the day.

Database language

As a relational database language, the language SQL has prevailed in the industry standard. SQL consists of three parts:

Data Definition Language (DDL) : The DDL is the part of the language of SQL to create databases, tables, views, etc. (CREATE), make structure changes (ALTER) or delete databases or database objects (DROP).
Data Manipulation Language (DML) : The DML is the part of the SQL language that makes it possible to insert records (INSERT), to change (UPDATE), to delete (DELETE) or to query (SELECT).
Data Control Language (DCL) : The DCL is the part of the language of SQL to assign access rights (GRANT) or withhold (REVOKE).

 

Fundamentals of Statistics and Big Data

 

Statistics is divided into three basic tasks. The descriptive statistics, inductive statistics and the exploratory statistics.

Descriptive statistics

In descriptive statistics, the available data are described and graphically compressed in tables, graphs or figures. These contain parameters such as mean and scatter as well as diagrams and curves. With this statistic slightly inconsistent data can be filtered out. The descriptive statistics work with the question: How can one describe a distribution of a feature?

Inductive statistics

The inductive statistics try to determine from a small part of the data mass characteristics of the population over which one wants to make a statement. This is analyzed and implemented using stochastic models. Inductive statistics work with the question: How can a sample be identified as a property for all relevant population data?

Exploratory statistics

Exploratory statistics is an intermediate form of descriptive and inductive statistics. It uses descriptive methods and inductive test methods to detect possible relationships or differences between data. The results found are called hypotheses. The explorative statistics work with the question: What is remarkable or unusual about a distribution of a feature?

Data Warehouse

A data warehouse (DW) is a persistent, integrated collection of data from multiple sources for the purpose of analysis and decision support. For a more comprehensive introduction to the topic, read our topics in this website on fundamentals of Big Data, Data Warehouse, Data Lake, Data Mining by using the search function.

Architecture of a Data Warehouse

The Data Warehouse collects the data from various operational database systems, such as purchasing, order processing and marketing. The complex process of integrating these operational database systems creates metadata for describing the data warehouse. Data marts are extracted from the data warehouse, which represent certain partial views/extracts as a copy of the actual data. The Data Warehouse is used to process ad hoc requests, for online analytical processing and for data mining, thus laying the foundation for data mining.

 

Fundamentals of Knowledge Discovery in Databases

 

The aim of the Knowledge Discovery in Databases is to independently discover decision-relevant, but previously unknown relationships and links in the data of large amounts of data and present them to the analyst or the user in a clear format. These gained relationships provide a knowledge gain. A key statement is that it is a non-trivial process whose purpose is to extract patterns from large data sets. These patterns should also have the properties that they are valid for a large part of the database (valid) and describe previously unknown (potentially useful) and easily understandable (ultimately understandable) relationships within the database.

 

Conclusion of Part I

 

In this first part, we introduced the readers to Knowledge Discovery in Databases (KDD). In the second part of this article, we will discuss the process of the Knowledge Discovery in Databases and Methods of the Knowledge Discovery in Databases.

Tagged With 1NNI , knowledge discovery databases explained

This Article Has Been Shared 682 Times!

Facebook Twitter Pinterest
Abhishek Ghosh

About Abhishek Ghosh

Abhishek Ghosh is a Businessman, Orthopaedic Surgeon, Author and Blogger. You can keep touch with him on Twitter - @AbhishekCTRL.

Here’s what we’ve got for you which might like :

Articles Related to Knowledge Discovery in Databases : Part I

  • Install Apache Kafka on Ubuntu 16.04 : Single Cloud Server

    Here Are The Steps On How To Install Apache Kafka on Ubuntu 16.04 Running One Single Cloud Server Instance. Apache Kafka

  • Install Apache Zeppelin On Ubuntu 16.04

    Zeppelin Can Be Pre-Built Package Or Can Be Build From Source. Here Is How To Install Apache Zeppelin On Ubuntu 16.04 Building From Source.

  • WordPress Interactive Python Widget For Python (Like JSFiddle For Jupyter Notebook)

    Here is WordPress Interactive Python Widget For Python With DataCamp Light For Peoples Who Hunt Something Like JSFiddle For Jupyter Notebook.

  • How To Learn Big Data For Beginners

    Many Are Interested to Learn Big Data Analytics and Other Works But Clueless Where From to Start. Here is How To Learn Big Data For Beginners.

  • How to Install Apache Ignite on Ubuntu Server

    How to Install Apache Ignite on Ubuntu Depends on the Purpose. Apache Ignite is a distributed database, also is a caching and processing platform.

Additionally, performing a search on this website can help you. Also, we have YouTube Videos.

Take The Conversation Further ...

We'd love to know your thoughts on this article.
Meet the Author over on Twitter to join the conversation right now!

If you want to Advertise on our Article or want a Sponsored Article, you are invited to Contact us.

Contact Us

Subscribe To Our Free Newsletter

You can subscribe to our Free Once a Day, Regular Newsletter by clicking the subscribe button below.

Click To Subscribe

Please Confirm the Subscription When Approval Email Will Arrive in Your Email Inbox as Second Step.

Search this website…

 

Popular Articles

Our Homepage is best place to find popular articles!

Here Are Some Good to Read Articles :

  • Cloud Computing Service Models
  • What is Cloud Computing?
  • Cloud Computing and Social Networks in Mobile Space
  • ARM Processor Architecture
  • What Camera Mode to Choose
  • Indispensable MySQL queries for custom fields in WordPress
  • Windows 7 Speech Recognition Scripting Related Tutorials

Social Networks

  • Pinterest (21K Followers)
  • Twitter (5.3k Followers)
  • Facebook (5.7k Followers)
  • LinkedIn (3.7k Followers)
  • YouTube (1.3k Followers)
  • GitHub (Repository)
  • GitHub (Gists)
Looking to publish sponsored article on our website?

Contact us

Recent Posts

  • What is Standard Software February 28, 2021
  • WordPress Link to text Fragment February 27, 2021
  • How to Protect IP Cameras From Hackers February 25, 2021
  • 6 Sectors That Have Undergone Revamps in Digital Landscape February 24, 2021
  • What You Can Control with a Smartwatch and ESP32 February 23, 2021

 

About This Article

Cite this article as: Abhishek Ghosh, "Knowledge Discovery in Databases : Part I," in The Customize Windows, August 26, 2018, March 1, 2021, https://thecustomizewindows.com/2018/08/knowledge-discovery-in-databases-part-i/.

Source:The Customize Windows, JiMA.in

 

This website uses cookies. If you do not want to allow us to use cookies and/or non-personalized Ads, kindly clear browser cookies after closing this webpage.

Read Cookie Policy.

PC users can consult Corrine Chorney for Security.

Want to know more about us? Read Notability and Mentions & Our Setup.

Copyright © 2021 - The Customize Windows | dESIGNed by The Customize Windows

Copyright  · Privacy Policy  · Advertising Policy  · Terms of Service  · Refund Policy