The Customize Windows

Technology Journal

By Abhishek Ghosh January 13, 2014 5:27 am Updated on January 13, 2014

Apache Hadoop Framework : Basics

Apache Hadoop is a free, open-source framework written in Java for scalable, distributed computing. It is based on the well-known MapReduce algorithm from Google Inc. and on ideas from the Google File System. Hadoop makes it possible to run compute-intensive processes over large amounts of data (Big Data in the petabyte range) on computer clusters.

 

Apache Hadoop Framework : Basics

 

Apache Hadoop was originally created by Doug Cutting and Mike Cafarella in 2005, growing out of the Nutch search engine project, and much of its early development took place at Yahoo. On 23 January 2008, it became a top-level project of the Apache Software Foundation. Users include Facebook, a9.com, AOL, Baidu, IBM and ImageShack.

Hadoop consists of the Hadoop Common package, which provides filesystem and OS-level abstractions, a MapReduce engine (either MapReduce/MR1 or YARN/MR2) and the Hadoop Distributed File System (HDFS). HDFS is a highly available, high-performance file system for storing very large amounts of data on the file systems of multiple computers (nodes). Files are divided into fixed-length data blocks, which are distributed redundantly across the participating nodes. HDFS follows a master-slave approach. A master node, called the NameNode, processes incoming data requests, organizes the storage of files on the slave nodes (DataNodes) and stores the resulting metadata. HDFS supports file systems with several hundred million files. Both the block length and the degree of redundancy are configurable.
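The block-and-replica idea described above can be illustrated with a small Python sketch. It is not how HDFS is implemented: the function names, the node names and the simple round-robin placement are assumptions made for illustration, and real HDFS uses a more elaborate, rack-aware placement policy.

```python
def split_into_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Cut data into fixed-length blocks; the last block may be shorter."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks: int, nodes: list[str], replication: int) -> dict[int, list[str]]:
    """Assign every block to `replication` distinct nodes, round-robin.

    Toy policy (an assumption for this sketch), not HDFS's real placement.
    """
    if replication > len(nodes):
        raise ValueError("replication factor exceeds number of nodes")
    return {
        block_id: [nodes[(block_id + r) % len(nodes)] for r in range(replication)]
        for block_id in range(num_blocks)
    }


# Hypothetical example: a 1000-byte "file", 256-byte blocks, 3 replicas.
data = b"x" * 1000
blocks = split_into_blocks(data, block_size=256)
placement = place_replicas(len(blocks), ["node1", "node2", "node3", "node4"], replication=3)

# The placement dict plays the role of the metadata a master node would keep.
for block_id, holders in placement.items():
    print(block_id, len(blocks[block_id]), holders)
```

Both `block_size` and `replication` are plain parameters here, mirroring the point that block length and degree of redundancy are configurable.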

---

 

Apache Hadoop Framework : Commercial Support and Commercial Forks

 

Hadoop implements the MapReduce algorithm with configurable classes for the Map, Reduce and Combine phases.

HBase is a simple, scalable database for managing very large amounts of data within a Hadoop cluster. It is a free implementation based on Google’s BigTable. This data structure is suitable for data that is rarely changed but very frequently appended to. With HBase, billions of rows can be distributed and managed efficiently.
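To make the Map, Combine and Reduce phases mentioned above more concrete, here is a minimal single-process word count in Python. It only imitates the data flow; it does not use Hadoop's Java API, and all function names and the sample input are assumptions made for this sketch.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in a line."""
    for word in line.lower().split():
        yield word, 1


def combine(pairs):
    """Combine: pre-aggregate one mapper's output locally to cut network traffic."""
    partial = defaultdict(int)
    for key, value in pairs:
        partial[key] += value
    return list(partial.items())


def shuffle(pairs):
    """Group all values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: sum the partial counts for one word."""
    return key, sum(values)


def word_count(splits):
    combined = []
    for split in splits:  # each split stands for one mapper's input
        mapped = [pair for line in split for pair in map_phase(line)]
        combined.extend(combine(mapped))
    return dict(reduce_phase(k, vs) for k, vs in shuffle(combined).items())


# Hypothetical input: two splits, as if handled by two mappers.
splits = [["big data big cluster"], ["data node", "big node"]]
result = word_count(splits)
print(result)
```

In a real cluster the mappers and reducers run on different nodes, and the shuffle step moves data across the network, which is why a combiner that shrinks the map output can pay off.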

Hive extends Hadoop with data warehouse functionality, namely the query language HiveQL and indexes. HiveQL is an SQL-based query language that lets developers use an SQL-like syntax. In the summer of 2008, Facebook, the original developer of Hive, made the project open source. The Hadoop database used by Facebook held a little more than 100 petabytes (as of August 2012), making it one of the largest in the world.
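As a rough illustration of what an SQL-like syntax buys the developer, the sketch below runs a declarative aggregation with Python's built-in `sqlite3` module. SQLite is only a stand-in here, not Hive, and the `page_views` table, its columns and its rows are hypothetical. A query of this general shape is the kind of statement HiveQL is designed to express, with Hive rather than the developer turning it into jobs on the cluster.

```python
import sqlite3

# In-memory SQLite database used purely as a stand-in for a Hive table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE page_views (user_id TEXT, url TEXT)")
conn.executemany(
    "INSERT INTO page_views VALUES (?, ?)",
    [
        ("u1", "/home"),
        ("u2", "/home"),
        ("u1", "/about"),
        ("u3", "/home"),
        ("u2", "/about"),
        ("u3", "/contact"),
    ],
)

# Declarative aggregation: count views per URL, most viewed first.
query = """
SELECT url, COUNT(*) AS views
FROM page_views
GROUP BY url
ORDER BY views DESC, url
"""
rows = conn.execute(query).fetchall()
for url, views in rows:
    print(url, views)
```

The same counting logic written directly as a MapReduce program would need explicit map and reduce functions, which is the gap a query language of this kind is meant to close.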

With Pig, MapReduce programs for Hadoop can be written in the high-level language Pig Latin. Chukwa enables real-time monitoring of very large distributed systems. ZooKeeper is used for the configuration and coordination of distributed systems.

Because Hadoop is particularly interesting for companies, a number of vendors offer commercial support for Hadoop or their own forks of it:

  1. Cloudera provides CDH, an “enterprise-ready” open source distribution of Hadoop.
  2. Microsoft is currently integrating Hadoop into Windows Azure and SQL Server.
  3. Google App Engine supports MapReduce programs.
  4. The IBM InfoSphere product BigInsights is based on Hadoop.
  5. EMC² offers Hadoop as part of a product package under the name Greenplum HD.
  6. Yahoo! offers Hadoop under the Hortonworks brand.
About Abhishek Ghosh

Abhishek Ghosh is a Businessman, Orthopaedic Surgeon, Author and Blogger. You can keep in touch with him on Twitter - @AbhishekCTRL.

About This Article

Cite this article as: Abhishek Ghosh, "Apache Hadoop Framework : Basics," in The Customize Windows, January 13, 2014, March 8, 2021, https://thecustomizewindows.com/2014/01/apache-hadoop-framework-basics/.

Source: The Customize Windows, JiMA.in

 
