Apache Ambari helps with the management, monitoring, and provisioning of Hadoop clusters. Here is how to install Apache Ambari on Ubuntu 16.04 to manage a Hadoop cluster. Ambari Server uses an embedded PostgreSQL database by default. When we install Ambari Server, the PostgreSQL packages and their dependencies must be available for installation. We are showing an example with the repo from Hortonworks for many reasons, one of … [Read more...]
Brute Force SSH Test Own Server With ncrack, hydra, medusa
If we use these pure hacking tools, we can really check how our servers appear to a black hat hacker. It is normal for a relatively new sysadmin not to know how to use them. They will also give you an idea of why odd log entries appear. Here is a real-life way to brute-force test SSH on your own server with ncrack, hydra, and medusa. These tests come nowhere close to how real black hat hackers work. They will have many … [Read more...]
How To Process Server Log in Hadoop, Pig : Free IBM Analytics Demo Cloud
In our previously published guides, we talked about installing Hadoop and installing Pig on our own server. We also talked about the free IBM Analytics Demo Cloud, which comes with the needed software pre-installed and ready to use. Here is a basic example with commands to show how to process a server log in Hadoop and Pig using the free IBM Analytics Demo Cloud. This guide can be followed even by those who have never used Hadoop or … [Read more...]
Compatibility of Our Existing OpenStack Guides For IBM OpenStack
Recently we noticed that people are searching for things related to IBM OpenStack. We have many guides on HP Cloud and Rackspace OpenStack. Here is a disclaimer and a short note on the compatibility of our existing OpenStack guides with IBM OpenStack. The intended audience already knows what OpenStack is. We are talking about the IaaS and storage users of OpenStack installations by HP, IBM, Rackspace; not … [Read more...]
How To Install Apache Pig On Ubuntu 16.04
Apache Pig is intended for analyzing large data sets. Usually we combine Pig with Hadoop. The language of Pig is Pig Latin. Apache Pig can execute its Hadoop jobs in MapReduce, Apache Tez, or Apache Spark. Pig Latin has similarities with SQL for relational database management. Pig Latin can be extended with scripts written in Java, Python, JavaScript, Ruby, or Groovy. Here is How To Install Apache Pig On … [Read more...]
Docker Tutorial For Beginners : Part 4 (docker compose)
We have three previous guides: the first is on how to install Docker and run it, the second is on some management GUIs, and the third is Docker Part 3. A docker-compose.yml file is a YAML file that defines how the Docker containers will behave in production. In Part 4 of Docker Tutorial For Beginners, we will talk about Docker Compose, which people refer to as containers in production, docker-compose, and so on. Docker Tutorial … [Read more...]
What is Data Refining in Big Data?
Most commonly, new developers, particularly those who are interested in data analysis, face some terminology that has more to do with the theoretical and practical parts of engineering and analytical sciences. The developers can be from a variety of domains, and the phrases often confuse them. The question of what data refining is in big data is such an obvious question, and the answer is commonly written for those … [Read more...]
IBM Analytics Demo Cloud : Free Hadoop, Ambari With SSH
Normally we install Apache Hadoop and other big data tools on our servers. IBM Analytics Demo Cloud is intended for learning Hadoop, Ambari, and BigSQL free of cost, with SSH access and a web console. With various cloud offerings, many things these days have a free usage tier, as we have shown with unrelated things such as Heroku and OpenShift PaaS. Here is how to get started with non-root access to this system. In … [Read more...]
Docker Tutorial For Beginners : Part 3
In this series, the first part of the guide was on how to install Docker and use commands, and the second part was on installing some UI to visually check what is going on. In Part 3 of Docker Tutorial For Beginners, we will learn the basics of the Dockerfile, ports, containers, environment, registry, and tags. Normally, if we write a Python application, we install a Python runtime on the machine. But, with Docker, we … [Read more...]
Install Apache Mahout : Ubuntu 16.04 For Machine Learning Dev
Apache Mahout is a simple programming environment and also a framework for building algorithms for Scala, Apache Spark, H2O, Apache Flink, and so on. Samsara is part of Mahout, an experimentation environment with R-like syntax. Here is how to install Apache Mahout on Ubuntu 16.04 for machine learning development. This guide will show commands to give the correct idea, not exact commands to copy … [Read more...]
Docker Tutorial For Beginners : Part 1
Many users face issues getting started with Docker. Here is a Docker tutorial for beginners. Containers provide an easy, service-oriented design. The current trend is to use virtual machines to run software applications. That is traditional virtualisation. Inside virtual machines, applications run inside a guest operating system with virtual BIOS and virtual hardware powered by the server’s host OS and of course … [Read more...]