InfluxDB is a time series database written in the Golang programming language. It is easy to install for use in specific purposes, use SQL-like query language. It supports HTTP API, also can use InfluxDB to store metrics of our Internet of Things (IoT) projects. InfluxDB is optimized for fast, high-availability storage and retrieval of time series data in fields. It supports processing data from … [Read more...]
What is Time Series Database (TSDB)?
The time-series database systems provide various features based on time such as calculating at a high speed, executing query analysis from continuous values. Time-series databases are associated with industrial applications. They can store values from sensors. Compression algorithms may be used to manage the data. The design of these time series systems uses time as a key index which makes the … [Read more...]
Difference Between Apache Solr and Apache Lucene
One user have installation guide for Apache Solr but we do not have guide for Apache Lucene. It is kind of deliberate omission. There is an easy way to use Lucence on the command line. In most of the situations possibly you'll use Apache Solr and when you'll realize the need to utilize Apache Lucene, then you'll not need our help on installation. Solr is based on a Java library which is known as … [Read more...]
How to Install CockroachDB on Ubuntu Server
CockroachDB is an Open Source NewSQL DBMS. If you are new to the NewSQL world - CockroachDB can run on Raspberry Pi to large cluster of servers. Distributed Online Transaction Processing (OLTP) is one of the widely known use case of CockroachDB. CockroachDB is an interesting DBMS. It uses PostgreSQL protocol and you'll feel to find some MySQL syntax. It has good things of both RDBMS and NoSQL. It … [Read more...]
How to Package and Run Apache Twill (on Hadoop YARN)
YARN stands for Yet Another Resource Negotiator. YARN is a cluster management technology since Hadoop 2.0. YARN is being extensively used for writing applications by Hadoop Developers. Most of the readers of this website are beginners around big data. For that reason, we point towards the basic theoretical articles such as differences between batch processing and stream processing, differences … [Read more...]
How to Install Apache Crunch (on Hadoop)
By now, we know the differences between batch processing and stream processing, differences between Hadoop and Spark. Here are the steps to install the Apache Crunch on Hadoop. Crunch is used to creating pipelines that are composed of many user-defined functions. Crunch can run with Hadoop MapReduce and Apache Spark. Apache Crunch library is for tasks which are difficult to implement with just … [Read more...]
What is NewSQL?
NewSQL is a group of newer relational database management system which promises to achieve the scalable performance like NoSQL while being ACID compliant. ACID is a set of properties, namely Atomicity, Consistency, Isolation, Durability. That "standard" intended to guarantee validity facing events such as power failure. Although, NewSQL systems vary widely in their internal architectures, their … [Read more...]
Fields of Application of Big Data : Part II
This is the second part of the series Fields of Application of Big Data. In the first part, we have discussed the basics. We are starting this part discussing the marketability of Big Data. It can give the impression that the results of big data are equally common in every field of application and weighted equally. However, all fields of application are diverse, so that in some areas specific … [Read more...]
List of Python Packages for Deep Learning
Previously, we published a simillar list named List of Python Libraries For Data Science & Machine Learning. Just to repeat the same sayings - knowing basics around Python is a need for development around Data Science. We guess, most of the readers already used with installing Jupyter and working with Jupyter. This article is not exclusively related to Jupyter Notebook but is very important … [Read more...]
Fields of Application of Big Data : Part I
The possibilities with regard to the networking on the Internet means that a lot of data exchange takes place between the devices. This exchange includes both structured and unstructured data. More efficient process design and profit maximization should now be the focus of most business-driven companies. In order to be able to position the company position and the specially developed products … [Read more...]
Use of Bitcoins in Online Trading : Part III
This is the third and final part of the series Bitcoins in Online Trading. The second part of the article can be read here. Through the Internet and its commercial use, the demands on cryptography have increased. Due to these requirements, a separate area has developed whose main task is new information systems that protect against unauthorized access. The aim of these crypto methods is to … [Read more...]