Saturday, October 21, 2017

Hadoop Ecosystem

Hadoop Ecosystem = HDFS + MapReduce + Tools(Hive,Pig,HBase,Zookeeper,Flume,Sqoop,Oozie,Mahout)

Three modes
Local standalone mode - single JVM
pseudo distributed mode - separate JVM
fully distributed mode - multiple JVM

Hadoop Components
Namenode
Datanode
Secondary Namenode
JobTracker
TaskTracker

Configuration files
core-site.xml => location of namenode
hdfs-site.xml => replication factor
mapred-site.xml => location of jobtracker

hadoop dfs -ls
dfs -copyFromLocal
dfs put
dfs -cat
dfs -get

No comments:

Post a Comment

Web Development

Design Phase:- Below all these represent different stages of the UX/UI design flow:- Wireframes represent a very basic & visual repr...