• Home
  • Archive
  • Tools
  • Contact Us

The Customize Windows

Technology Journal

  • Cloud Computing
  • Computer
  • Digital Photography
  • Windows 7
  • Archive
  • Cloud Computing
  • Virtualization
  • Computer and Internet
  • Digital Photography
  • Android
  • Sysadmin
  • Electronics
  • Big Data
  • Virtualization
  • Downloads
  • Web Development
  • Apple
  • Android
Advertisement
You are here:Home » Data Mining Issues

By Abhishek Ghosh December 4, 2023 7:01 pm Updated on December 4, 2023

Data Mining Issues

Advertisement

In our earlier discussions, we have clarified the data mining tasks. While most data mining methods try to be able to deal with data that is as general as possible, there are also specializations for more specialized data types.

  • Text mining: Text mining is about the analysis of large textual data sets. This can be used, for example, to detect plagiarism or to classify the text inventory.
  • Web mining: Web mining is about the analysis of distributed data as it is represented by websites. In order to detect clusters and outliers, however, not only the pages themselves are considered here, but also the relationships (hyperlinks) of the pages to each other. Due to the constantly changing content and the non-guaranteed availability of data, additional challenges arise. This topic area is also closely related to information retrieval.
  • Time series analysis: In time series analysis, temporal aspects and relationships play a major role. Here, existing data mining methods can be used by means of special distance functions such as dynamic time warping distance, but specialized methods are also being developed. An important challenge is to identify series with a similar trajectory, even if it is slightly offset in time, but still has similar characteristics.

Data Mining Issues
Next set of questions arise around issues with data mining.

 

What Are the Issues of Data Mining

 

Data Defects

Many of the problems with data mining stem from inadequate pre-processing of the data, or from systematic errors and biases in its collection. These problems are often statistical in nature and need to be solved at the time of collection: representative results cannot be obtained from non-representative data. Similar aspects must be taken into account here as when creating a representative sample.

Advertisement

---

Parameterization

The algorithms used in data mining often have several parameters that are suitable to choose. With all parameters, they provide valid results, and choosing the parameters in such a way that the results are also useful is a task of the user.

Evaluation

The evaluation of data mining results presents the user with the problem that on the one hand he wants to gain new insights, on the other hand it is difficult to evaluate processes automatically. For forecasting problems such as classification, regression analysis, and association analysis, the forecast on new data can be used for evaluation. This is more difficult for description issues such as outlier detection and cluster analysis. Clusters are usually evaluated internally or externally, i.e. based on their mathematical compactness or their agreement with known classes. The results of outlier detection methods are compared with known outliers. In both cases, however, the question arises as to whether this evaluation really fits the task of the “new findings” and does not ultimately evaluate the “reproduction of old knowledge”.

Interpretation

As statistical methods, the algorithms analyze the data without any background knowledge about its meaning. Therefore, the methods can usually only provide simple models such as groups or mean values. Often, the results are no longer comprehensible as such. However, these machine-generated results must then be interpreted by the user before they can really be called knowledge.

Facebook Twitter Pinterest

Abhishek Ghosh

About Abhishek Ghosh

Abhishek Ghosh is a Businessman, Surgeon, Author and Blogger. You can keep touch with him on Twitter - @AbhishekCTRL.

Here’s what we’ve got for you which might like :

Articles Related to Data Mining Issues

  • Uses of Text Mining in Web Content Mining : Part I

    This series will examine one of the discipline of knowledge discovery, that is Text Mining, and present the application possibilities of Web Content Mining.

  • What Is Data Mining? Examples of Data Mining Software

    Data mining is the systematic application of statistical methods to large databases with the aim of identifying new patterns and trends.

  • What is Text Mining?

    Text mining or textual data mining, is a bundle of algorithm-based analysis methods for the discovery of meaning structures from unstructured or weakly structured text data. Using statistical means, text mining software opens up structures from texts that are intended to enable users to quickly recognize core information of the processed texts. Ideally, text mining […]

  • Knowledge Discovery in Databases : Part II

    In Part I of Knowledge Discovery in Databases, we discussed about the database systems, fundamentals of statistics and Big Data and fundamentals of knowledge discovery in databases. In this second part of Knowledge Discovery in Databases, we will discuss the process of the Knowledge Discovery in Databases and Methods of the Knowledge Discovery in Databases. […]

performing a search on this website can help you. Also, we have YouTube Videos.

Take The Conversation Further ...

We'd love to know your thoughts on this article.
Meet the Author over on Twitter to join the conversation right now!

If you want to Advertise on our Article or want a Sponsored Article, you are invited to Contact us.

Contact Us

Subscribe To Our Free Newsletter

Get new posts by email:

Please Confirm the Subscription When Approval Email Will Arrive in Your Email Inbox as Second Step.

Search this website…

 

vpsdime

Popular Articles

Our Homepage is best place to find popular articles!

Here Are Some Good to Read Articles :

  • Cloud Computing Service Models
  • What is Cloud Computing?
  • Cloud Computing and Social Networks in Mobile Space
  • ARM Processor Architecture
  • What Camera Mode to Choose
  • Indispensable MySQL queries for custom fields in WordPress
  • Windows 7 Speech Recognition Scripting Related Tutorials

Social Networks

  • Pinterest (24.3K Followers)
  • Twitter (5.8k Followers)
  • Facebook (5.7k Followers)
  • LinkedIn (3.7k Followers)
  • YouTube (1.3k Followers)
  • GitHub (Repository)
  • GitHub (Gists)
Looking to publish sponsored article on our website?

Contact us

Recent Posts

  • Cloud-Powered Play: How Streaming Tech is Reshaping Online GamesSeptember 3, 2025
  • How to Use Transcribed Texts for MarketingAugust 14, 2025
  • nRF7002 DK vs ESP32 – A Technical Comparison for Wireless IoT DesignJune 18, 2025
  • Principles of Non-Invasive Blood Glucose Measurement By Near Infrared (NIR)June 11, 2025
  • Continuous Non-Invasive Blood Glucose Measurements: Present Situation (May 2025)May 23, 2025
PC users can consult Corrine Chorney for Security.

Want to know more about us?

Read Notability and Mentions & Our Setup.

Copyright © 2026 - The Customize Windows | dESIGNed by The Customize Windows

Copyright  · Privacy Policy  · Advertising Policy  · Terms of Service  · Refund Policy