• Home
  • Archive
  • Tools
  • Contact Us

The Customize Windows

Technology Journal

  • Cloud Computing
  • Computer
  • Digital Photography
  • Windows 7
  • Archive
  • Cloud Computing
  • Virtualization
  • Computer and Internet
  • Digital Photography
  • Android
  • Sysadmin
  • Electronics
  • Big Data
  • Virtualization
  • Downloads
  • Web Development
  • Apple
  • Android
Advertisement
You are here:Home » What Are Different Tasks of Data Mining?

By Abhishek Ghosh December 4, 2023 8:22 am Updated on December 4, 2023

What Are Different Tasks of Data Mining?

Advertisement

As explained in earlier articles, data mining is the systematic application of statistical methods to large data sets (especially “big data“) with the aim of creating new Identify cross-connections and trends. Due to their size, such databases are processed using computer-aided methods. In practice, the sub-term data mining has been applied to the entire process of so-called “knowledge discovery in databases“, KDD), which also includes steps such as pre-processing and evaluation, while data mining in the narrower sense only refers to the actual processing step of the process.

What Are Different Tasks of Data Mining

 

Data Mining Tasks

 

Typical tasks of data mining are:

  • Outlier Detection: Identification of unusual records: outliers, errors, changes
  • Cluster Analysis: Grouping Objects Based on Similarities
  • Classification: Elements that have not yet been assigned to classes are assigned to the existing classes.
  • Association analysis: Identification of relationships and dependencies in the data in the form of rules such as “A and B usually follow C”.
  • Regression analysis: Identification of relationships between (several) dependent and independent variables
  • Summary: Reduction of the data set to a more compact description without significant loss of information

These tasks can be roughly divided into observation problems (outlier detection, cluster analysis) and forecasting problems (classification, regression analysis).

Advertisement

---

Outlier Detection

This task looks for data objects that are inconsistent with the rest of the data, for example, by having unusual attribute values or deviating from a general trend. For example, the Local Outlier Factor method searches for objects that have a density that differs significantly from their neighbors, this is referred to as “density-based outlier detection”.

Identified outliers are often subsequently manually verified and hidden from the dataset, as they can worsen the results of other procedures. In some use cases, such as fraud detection, however, it is precisely the outliers that are the objects of interest.

Cluster Analysis

Cluster analysis is about identifying groups of objects that are in some way more similar to each other than other groups. Often these are accumulations in the data room, which is where the term cluster comes from. However, in a densely connected cluster analysis such as DBSCAN or OPTICS, the clusters can take any shape. Other methods, such as the EM algorithm or k-means algorithm, prefer spherical clusters.

Objects that have not been assigned to a cluster can be interpreted as outliers in the sense of the outlier detection mentioned above.

Classification

Similar to cluster analysis, classification is about assigning objects to groups (here referred to as classes). In contrast to cluster analysis, however, the classes are usually predefined (e.g. bicycles, cars) and machine learning methods are used to assign previously unassigned objects to these classes.

Association Analysis

In association analysis, frequent correlations in the data sets are searched for and usually formulated as inferential rules.

Regression Analysis

Regression analysis is used to model the statistical relationship between different attributes. This allows, among other things, the prediction of missing attribute values, but also the analysis of the deviation analogous to outlier detection. Using insights from cluster analysis and calculating separate models for each cluster, better forecasts can typically be made. If a strong correlation is established, this knowledge can also be used well for the summary.

Summary

Since data mining is often applied to large and complex amounts of data, an important task is also to reduce this data to a manageable amount for the user. In particular, outlier detection identifies individual objects that may be important; Cluster analysis identifies groups of objects that are often sufficient to examine only on the basis of a sample, which significantly reduces the number of data objects to be examined. Regression analysis allows redundant information to be removed, thus reducing the complexity of the data. Classification, association analysis and regression analysis (in some cases also cluster analysis) also provide more abstract models of the data.

With the help of these approaches, both the analysis of the data and, for example, its visualization (through sampling and reduced complexity) are simplified.

Tagged With foughtmy6
Facebook Twitter Pinterest

Abhishek Ghosh

About Abhishek Ghosh

Abhishek Ghosh is a Businessman, Surgeon, Author and Blogger. You can keep touch with him on Twitter - @AbhishekCTRL.

Here’s what we’ve got for you which might like :

Articles Related to What Are Different Tasks of Data Mining?

  • Uses of Text Mining in Web Content Mining : Part I

    This series will examine one of the discipline of knowledge discovery, that is Text Mining, and present the application possibilities of Web Content Mining.

  • What Is Data Mining? Examples of Data Mining Software

    Data mining is the systematic application of statistical methods to large databases with the aim of identifying new patterns and trends.

  • What is Pattern Recognition?

    Pattern recognition is the science that deals with the processes of engineering, computing, and mathematics related to physical or abstract objects, with the purpose of extracting information that allows properties to be set between sets of such objects. Pattern recognition also called pattern reading, figure identification, and shape recognition—consists of the recognition of signal patterns. […]

  • Knowledge Discovery in Databases : Part II

    In Part I of Knowledge Discovery in Databases, we discussed about the database systems, fundamentals of statistics and Big Data and fundamentals of knowledge discovery in databases. In this second part of Knowledge Discovery in Databases, we will discuss the process of the Knowledge Discovery in Databases and Methods of the Knowledge Discovery in Databases. […]

performing a search on this website can help you. Also, we have YouTube Videos.

Take The Conversation Further ...

We'd love to know your thoughts on this article.
Meet the Author over on Twitter to join the conversation right now!

If you want to Advertise on our Article or want a Sponsored Article, you are invited to Contact us.

Contact Us

Subscribe To Our Free Newsletter

Get new posts by email:

Please Confirm the Subscription When Approval Email Will Arrive in Your Email Inbox as Second Step.

Search this website…

 

vpsdime

Popular Articles

Our Homepage is best place to find popular articles!

Here Are Some Good to Read Articles :

  • Cloud Computing Service Models
  • What is Cloud Computing?
  • Cloud Computing and Social Networks in Mobile Space
  • ARM Processor Architecture
  • What Camera Mode to Choose
  • Indispensable MySQL queries for custom fields in WordPress
  • Windows 7 Speech Recognition Scripting Related Tutorials

Social Networks

  • Pinterest (24.3K Followers)
  • Twitter (5.8k Followers)
  • Facebook (5.7k Followers)
  • LinkedIn (3.7k Followers)
  • YouTube (1.3k Followers)
  • GitHub (Repository)
  • GitHub (Gists)
Looking to publish sponsored article on our website?

Contact us

Recent Posts

  • Cloud-Powered Play: How Streaming Tech is Reshaping Online GamesSeptember 3, 2025
  • How to Use Transcribed Texts for MarketingAugust 14, 2025
  • nRF7002 DK vs ESP32 – A Technical Comparison for Wireless IoT DesignJune 18, 2025
  • Principles of Non-Invasive Blood Glucose Measurement By Near Infrared (NIR)June 11, 2025
  • Continuous Non-Invasive Blood Glucose Measurements: Present Situation (May 2025)May 23, 2025
PC users can consult Corrine Chorney for Security.

Want to know more about us?

Read Notability and Mentions & Our Setup.

Copyright © 2026 - The Customize Windows | dESIGNed by The Customize Windows

Copyright  · Privacy Policy  · Advertising Policy  · Terms of Service  · Refund Policy