Hadoop Ecosystem = HDFS + MapReduce + Tools(Hive,Pig,HBase,Zookeeper,Flume,Sqoop,Oozie,Mahout)
Three modes
Local standalone mode - single JVM
pseudo distributed mode - separate JVM
fully distributed mode - multiple JVM
Hadoop Components
Namenode
Datanode
Secondary Namenode
JobTracker
TaskTracker
Configuration files
core-site.xml => location of namenode
hdfs-site.xml => replication factor
mapred-site.xml => location of jobtracker
hadoop dfs -ls
dfs -copyFromLocal
dfs put
dfs -cat
dfs -get
Three modes
Local standalone mode - single JVM
pseudo distributed mode - separate JVM
fully distributed mode - multiple JVM
Hadoop Components
Namenode
Datanode
Secondary Namenode
JobTracker
TaskTracker
Configuration files
core-site.xml => location of namenode
hdfs-site.xml => replication factor
mapred-site.xml => location of jobtracker
hadoop dfs -ls
dfs -copyFromLocal
dfs put
dfs -cat
dfs -get
No comments:
Post a Comment