In order to follow this guide, one need to install Java (preferably Sun Java) installed, MySQL 5.x and Hadoop cluster. We already have published separate guides on how to install Percona MySQL on Ubuntu server and how to install Hadoop on single server. Chukwa visualization interface requires HBase. Chukwa is a system designed for reliable log collection and processing with Hadoop. Chukwa cluster … [Read more...]
Advantages of Graph Databases Over the Other Databases
Choosing the correct type of database is an important part of developing a new application. In our earlier publications, we have discussed about four common type of databases used in different data science related applications, which are Key-Value Database, Graph Database, Document-Oriented Database and Column-oriented Database. In addition, there is traditional RDMS, such as MySQL and the new … [Read more...]
What is Graph Database?
In the previous articles we have discussed about three common databases used in data sciences - Column-oriented Database, Key-Value Database and Document-Oriented Database. A graph database portrays the data like they are viewed conceptually. This graph is of mathematics. A graph in mathematics is not a bar or line chart. It is like a mind map. The graph paradigm is a reimagination of what is … [Read more...]
What is Column-oriented Database?
In our previous articles, we have discussed about Key-Value Database and Document-Oriented Database. The Column-oriented Database is another major type of database which is used in data sciences. The Column-oriented Databases store data tables by column (not row). These can use query languages like SQL and can serve data for extract, transform, load (ETL) and data visualization. The result, … [Read more...]
What Do Employees Gain from Using Advanced Search Analytics?
We have become accustomed to finding an answer to any question in seconds - all it takes is a mobile device and an internet connection. Entering queries into search engines multiple times a day is now second nature — whether it’s the height of an actor in a show you are streaming, the latest update on a current event, or fact-checking a piece of trivia someone just dropped on you. A similar … [Read more...]
What is Document-Oriented Database (Doument Store)?
In our previous article, we have discussed about Key-Value Database, which is type of NoSQL database. The document-oriented databases are designed to store, retrieve & manage document-oriented semi structured data. It is normal to compare the document-store with key-value database (we gave some example use cases in caching, image storage, financial sectors including forex trading) as some of … [Read more...]
What is Key-Value Database?
A key-value database is a type of NoSQL database which uses a key-value method to store data. Key-value databases work differently from the relational databases. They store data as a collection of key-value pairs where a key serves as a unique identifier. The key-value databases are designed for storing, retrieving, and managing a data structure known as a hash table (or dictionaries). They … [Read more...]
Fields of Application of Big Data : Part IV
Up to the third part on fields of application of Big Data, we have discussed about all topics which can be put under the header “Business Life Application”. Namely – (i) Research and product development (ii) Financial and Risk Controlling (iii) Production (iv) Marketing and Sales and (v) Distribution and logistics. In the this part, we will discuss the use cases related to personal life including … [Read more...]
Fields of Application of Big Data : Part III
In the second part of Fields of Application of Big Data, we have discussed about some topics which can be put under the header “Business Life Application”. They were (i) Research and product development (ii) Financial and Risk Controlling (iii) Production. In this 3rd part of of Application of Big Data, we have discussed use cases in two fields - marketing and sales, distribution and … [Read more...]
What is the Present Situation of Hadoop?
Hadoop is the pioneer platform which started the revolution of Big Data. Hadoop is one of the software used as an example of how big data works are handled. Hadoop is still used by many companies to store their data. The field of big data is rapidly progressing giving birth to cheap alternatives. So, our guides such as how to install Hadoop may not be useful in professional life. Presently, Hadoop … [Read more...]
Difference Between Standalone Database and Distributed Database
The obvious difference between the two is the number of servers. The standalone database requires one server while distributed databases require multiple servers (at least two). In the case of one-node (or one-server) centralized databases, all data are managed by a single DBMS. That is the common setup for almost everything from web software like WordPress to analytical software. Data consistency … [Read more...]