Apache Hbase is column-oriented distributed datastore. Previously, we have shown installing Apache Accumulo. Accumulo is distributed key/value store, built on Hadoop. In another guide we have shown how to install Apache Cassandra. Cassandra is a column-oriented distributed datastore, inspired by BigTable. We have sown how to install Hadoop on single server instance. We can install Hbase without … [Read more...]
How to Install Apache Ignite on Ubuntu Server
Previously we described How Install Apache Cassandra. Apache Ignite is a distributed database, caching and processing platform. Apache Ignite utilizes RAM as it's default storage and processing tier making it included in the class of in-memory computing platforms. Data in Ignite is stored in the form of key-value pairs. The database component scales horizontally. How to Install Apache Ignite on … [Read more...]
How Install Apache Cassandra on Ubuntu (Single Cloud Server Instance)
Apache Cassandra is a wide column store NoSQL database management system. Cassandra provides high availability with no single point of failure, robust support for clusters spanning multiple datacenters. Facebook released Cassandra for Facebook inbox search feature. Main features of Apache Cassandra is it is - Distributed, Replication strategies are configurable, Scalable, Fault-tolerant, Has … [Read more...]
How to Install Apache Accumulo on Ubuntu (Single Cloud Server Instance)
Accumulo provides robust, scalable data storage. It is a scalable, distributed key-value store based on Google's Bigtable and built on top of Apache Hadoop, Apache ZooKeeper, and Apache Thrift. Accumulo is the third most popular NoSQL wide column store behind Apache Cassandra and Hbase. Here Are the Steps on How to Install Apache Accumulo on Ubuntu Running on Single Cloud Server Instance. You have … [Read more...]
How to Install Apache Kudu on Ubuntu Server
Apache Kudu is a column oriented data store of the Apache Hadoop system which is compatible with most of the data processing frameworks use in Hadoop environment. Apache Kudu provides Hadoop's storage layer to enable fast analytics on fast data. This project originally was of Cloudera. Kudu Has Official Kudu Quickstart VM and Cloudera Users Can Avoid This Guide. Kudu is now easier to install and … [Read more...]
How To Install SBT and Scala on Ubuntu Server
sbt is a build tool for Scala and Java projects Like Apache Maven, Apache Ant. In our previously published guides, we have shown the steps to install Apache Maven and Gradle. Here Are The Steps on How To Install SBT and Scala on Ubuntu Server. This tool also not uncommon need while installation of Big Data tools. Initially full form of sbt was Simple Build Tool, but it is now known simply as … [Read more...]
How To Install Apache Maven on Ubuntu Server
Apache Maven is a Build Automation Tool. Alternative technologies is Gradle and sbt as build tools. We published guide on how to install Gradle. Maven needs XML file to build. Gradle and sbt do not rely on XML, but basic concept is like Maven introduced. Maven Needed For Many Big Data Software. Here Are the Steps on How To Install Apache Maven on Ubuntu Server. Maven takes care of two aspects of … [Read more...]
How To Install Apache Beam
We have some series of articles on basics and essentials on Big Data touching ETL, batch and stream processing. That minimum theoretical idea is better to have to properly utilize Apache Beam. Apache Beam is a programming model to define and execute data processing. This article is On How To Install Apache Beam, it is for Whole Project. Beam SDKs available for Python, Java, Go. Their installation … [Read more...]
How to Install Apache Hama (HDFS Installation)
Apache Hama is a Distributed Computing Framework For Massive Scientific Computations. Hama consists of 3 major components - BSPMaster, GroomServers and Zookeeper. It is a framework for Big Data analytics which uses the Bulk Synchronous Parallel (BSP) computing model. It provides BSP programming model, vertex and neuron centric programming models. Hama can be installed in local/pseudo-distributed … [Read more...]
Configure Apache Tika With WordPress to Search, Get Meta of PDF/Doc Files
In our previously published article How to Install Apache Tika on Ubuntu Server, we learned basic about Apache Tika. Apache Tika Can Be Combined With PHP. Apache Tika can detect content, and extracts metadata and text from different file types – it can identify more than 1400 file types. Tika has relation with Apache Nutch codebase. Tika has fork in Python too. Tika has different way of … [Read more...]
How to Install Apache Tika on Ubuntu Server
Apache Tika is a Content Analysis Framework. Tika is like we right click on file and selecting properties option on desktop BUT for web. It also can detect content. Apache Tika detects and extracts metadata and text from different file types - it can identify more than 1400 file types. Tika has relation with Apache Nutch codebase. Tika has fork in Python too. Tika has different way of … [Read more...]