Process Mining is an innovative discipline among Big Data technologies that makes it possible to analyze and map processes out of data. While process mining is already widely used, it is often underestimated in terms of process management potential, and when applied properly it creates a high degree of process transparency. What is Process Mining? The thing which … [Read more...]
What is Process Mining?
Process Mining is a process management technique that enables business processes to be reconstructed and evaluated on the basis of digital traces in IT systems. Process Mining is a discipline of business data analytics designed to improve processes in a company's operations. The individual steps of the process stored in the systems are combined and the process is visualized in its entirety. … [Read more...]
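As a rough illustration of how a process can be reconstructed from digital traces, the sketch below counts which activity directly follows which within each case of a small event log. The event log, its column layout (case id, activity, timestamp) and the function name `directly_follows` are assumptions made for this example and are not taken from the post; real process mining tools use far richer discovery algorithms.

```python
from collections import Counter, defaultdict

# Hypothetical event log: one (case_id, activity, timestamp) row per recorded step.
EVENT_LOG = [
    ("order-1", "Create Order", 1),
    ("order-1", "Approve Order", 2),
    ("order-1", "Ship Goods", 3),
    ("order-2", "Create Order", 1),
    ("order-2", "Ship Goods", 2),
    ("order-3", "Create Order", 5),
    ("order-3", "Approve Order", 6),
    ("order-3", "Ship Goods", 9),
]


def directly_follows(event_log):
    """Count how often activity A is immediately followed by activity B in a case."""
    # Group activities per case, ordered by timestamp, to obtain one trace per case.
    traces = defaultdict(list)
    for case_id, activity, _timestamp in sorted(event_log, key=lambda e: (e[0], e[2])):
        traces[case_id].append(activity)

    # Count each pair of consecutive activities across all traces.
    counts = Counter()
    for activities in traces.values():
        for current, following in zip(activities, activities[1:]):
            counts[(current, following)] += 1
    return counts


if __name__ == "__main__":
    for (current, following), n in sorted(directly_follows(EVENT_LOG).items()):
        print(f"{current} -> {following}: {n}")
```

The resulting pair counts can be read as the weighted edges of a simple process map, which makes deviations such as orders shipped without approval visible.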
OLTP versus OLAP
In our previous posts, we discussed the basics of OLTP and OLAP. OLTP (On-line Transaction Processing) and OLAP (On-line Analytical Processing) are often compared as a theoretical topic. Data from OLTP databases can be ingested into OLAP systems through the extract, transform, load (ETL) process. ETL tools help users collect data from several sources and send it … [Read more...]
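A minimal sketch of the extract, transform, load idea, assuming two in-memory SQLite databases stand in for the OLTP source and the OLAP target. The table names `orders` and `monthly_sales`, the sample rows, and the helper functions are hypothetical and chosen only for this illustration; production ETL tools handle many sources, scheduling, and error handling that are left out here.

```python
import sqlite3


def build_demo_oltp():
    """Create a hypothetical transactional table with one row per order."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders ("
        "id INTEGER PRIMARY KEY, product TEXT, order_date TEXT, amount REAL)"
    )
    conn.executemany(
        "INSERT INTO orders (product, order_date, amount) VALUES (?, ?, ?)",
        [
            ("widget", "2024-01-05", 10.0),
            ("widget", "2024-01-20", 15.0),
            ("gadget", "2024-01-11", 40.0),
            ("widget", "2024-02-02", 5.0),
        ],
    )
    conn.commit()
    return conn


def run_etl(oltp, olap):
    """Copy order data from the OLTP store into an aggregated OLAP table."""
    # Extract: read the raw transactional rows.
    rows = oltp.execute("SELECT product, order_date, amount FROM orders").fetchall()

    # Transform: aggregate revenue per (product, month); month is 'YYYY-MM'.
    totals = {}
    for product, order_date, amount in rows:
        key = (product, order_date[:7])
        totals[key] = totals.get(key, 0.0) + amount

    # Load: write the aggregated rows into the analytical table.
    olap.execute(
        "CREATE TABLE IF NOT EXISTS monthly_sales ("
        "product TEXT, month TEXT, revenue REAL, PRIMARY KEY (product, month))"
    )
    olap.executemany(
        "INSERT OR REPLACE INTO monthly_sales VALUES (?, ?, ?)",
        [(product, month, revenue) for (product, month), revenue in totals.items()],
    )
    olap.commit()


if __name__ == "__main__":
    source = build_demo_oltp()
    target = sqlite3.connect(":memory:")
    run_etl(source, target)
    query = "SELECT product, month, revenue FROM monthly_sales ORDER BY product, month"
    for row in target.execute(query):
        print(row)
```

The transactional table keeps one row per order, while the analytical table stores pre-aggregated figures that are cheaper to query for reporting.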
What is OLTP (Online Transaction Processing)?
OLTP stands for Online Transaction Processing. OLTP is a type of processing that facilitates and manages transactional applications for data entry and retrieval. Software packages for OLTP are based on the client-server architecture, as they are usually used by companies with a distributed computer network. The term can be understood in the context of "computational transactions" or "database … [Read more...]
How to Install PyCharm and Anaconda for Data Science
PyCharm is a cross-platform IDE developed and maintained by JetBrains. PyCharm provides tools for Python development. It is common not only among developers but also in introductory to intermediate Python and computer science college courses, and it is available for free. We have already discussed the installation of Anaconda on Ubuntu. The installation of JDK 8 and of JDK 11 on Windows has been discussed … [Read more...]
Fields of Application of Big Data : Part VII (END)
This is the final part of the series Fields of Application of Big Data. The previous (6th) part of this series can be read by clicking here. In this part, we draw a conclusion and provide an outlook. Although the development of big data is still in its infancy and much research has only begun in recent years, the spectrum of application fields is already so broad that a … [Read more...]
Fields of Application of Big Data : Part VI
This is the continuation of part V of the series Fields of Application of Big Data. Even a child has a tendency to act independently within the framework of their own possibilities and to determine, without external control, the goals and procedures of their own actions. This fact shows that the need for autonomy is given high priority, which … [Read more...]
Fields of Application of Big Data : Part V
This is the continuation of the fourth part of Fields of Application of Big Data. Illnesses increasingly occur in old age. Even if the causes differ, health usually deteriorates only among older people. The latencies of diseases are often high, so that they occur only in middle and old age. In addition, secondary diseases are possible due to previous illnesses. This allows the big … [Read more...]
How to Install Apache Druid (Single Server)
Apache Druid is a column-oriented analytics database which can be used to build a data warehouse. Druid has APIs (via HTTP and JDBC) for loading, managing, and querying data, and the user can also interact with Druid via the console. An instance or computer with a mere 4 GB of RAM is sufficient to test-run the nano-quickstart configuration of Druid, and a 16 GB RAM instance can run it in production. Druid can … [Read more...]
How Data Sciences Can Determine the Future of Technology World
In the future, it is likely that individual products such as the iPhone or Amazon Echo speakers will no longer be at the centre of attention. The focus will instead be on networking and the accompanying flood of data in a software-defined world that runs with the help of artificial intelligence. We can already see the progress of wearable consumer products like Samsung Galaxy LTE. Ultimately the consistent … [Read more...]
What is OLAP and What is it Used For?
OLAP stands for Online Analytical Processing. OLAP is an application used to ease query processing on a data warehouse. The term OLAP was created by modifying the traditional database-related term online transaction processing (OLTP). OLTP facilitates and manages transaction-oriented applications. OLAP is an approach to multi-dimensional analytical (MDA) queries and is a part of … [Read more...]