HadoopïŒããã¥ãŒãïŒ ãšã¯ïŒ
- HDFSïŒåæ£ãã¡ã€ã«ã·ã¹ãã ïŒã§å€§éã®ããŒã¿ãè€æ°ãµãŒããŒã«åæ£ä¿åããé«ãèé害æ§ãå®çŸãã
- MapReduceãšããåŠçã¢ãã«ã§ãããŒã¿ããåå²â䞊ååŠçâéçŽãã®3ã¹ãããã§å¹ççã«åŠçãã
- YARNããªãœãŒã¹ç®¡çãæ åœããã¯ã©ã¹ã¿å šäœã®CPUãã¡ã¢ãªãå¹ççã«å²ãæ¯ã
- SparkãHiveãªã©åšèŸºãšã³ã·ã¹ãã ãšçµã¿åãããŠäœ¿ãããããšãå€ã
Hadoopã£ãŠãªãã ãå€ãã£ãååã ãã©ãã©ããããã®ãªã®ïŒ
Hadoopã¯ã1å°ã®ãµãŒããŒã§ã¯åŠçããããªããããªå€§éã®ããŒã¿ããããããã®ãµãŒããŒã§æåãããŠä¿åã»åŠçããããã®ãã¬ãŒã ã¯ãŒã¯ã ããGoogleãè«æã§çºè¡šãã忣åŠçã®ä»çµã¿ããªãŒãã³ãœãŒã¹ã§å®çŸãããã®ãªãã
ã©ãããŠ1å°ãããã¡ãªã®ïŒ
ããšãã°SNSã®æçš¿ããŒã¿ã1æ¥ã§æ°ãã©ãã€ããçãŸãããããªäžçã ãšã1å°ã®ãµãŒããŒã®ãã£ã¹ã¯ã«ãåãŸããªãããåŠçé床ã远ãã€ããªããã ãHadoopãªãæ°çŸå°ãæ°åå°ã®ãµãŒããŒãæããŠäžã€ã®å·šå€§ãªã¹ãã¬ãŒãžïŒèšç®æ©ãšããŠäœ¿ãããã ã
ãããïŒäžèº«ã¯ã©ããªã£ãŠããã®ïŒ
倧ãã3ã€ã®ã³ã³ããŒãã³ããããããããŒã¿ã忣ä¿åããHDFSãããŒã¿ã䞊ååŠçããMapReduceããããŠã¯ã©ã¹ã¿ã®ãªãœãŒã¹ã管çããYARNãHDFSãããŒã¿ã3éã³ããŒã§ä¿åããããããµãŒããŒãå£ããŠãããŒã¿ã倱ããã«ãããã
MapReduceã£ãŠããã®ã¯ã©ããªåŠçãªã®ïŒ
ååã®éããMapïŒåå²ããŠåŠçïŒããšãReduceïŒçµæãéçŽïŒãã®2段éã§åãããããšãã°10åè¡ã®ãã°ããåèªã®åºçŸåæ°ãæ°ãããšããMapã§åãµãŒããŒãæ åœåãæ°ããŠãReduceã§å šãµãŒããŒã®çµæãåèšããã€ã¡ãŒãžã ã
ãªãã»ã©ïŒã§ãæè¿ã¯Sparkã®ã»ããæåãªæ°ããããã©âŠ
ãããšããã«æ°ã¥ããããMapReduceã¯ãã£ã¹ã¯ããŒã¹ã§åŠçããããé ããã ãSparkã¯ã¡ã¢ãªäžã§åŠçããããæ°ååéãå Žåããããã ããä»ã¯HDFSãšYARNã¯äœ¿ãã€ã€ãåŠçãšã³ãžã³ã¯Sparkã«çœ®ãæãããã¿ãŒã³ãäž»æµã ã
ãããHadoopã¯ããå€ãã®ïŒ
ã³ã¢æè¡ãšããŠã¯æçæã«å ¥ã£ãŠããããã¯ã©ãŠãæä»£ã«ã¯Amazon S3ã®ãããªãªããžã§ã¯ãã¹ãã¬ãŒãžãHDFSã®ä»£ããã«ãªãããšãå€ããã§ãHadoopãšã³ã·ã¹ãã ïŒHiveãHBaseãPigãªã©ïŒã¯ä»ã§ãå€ãã®äŒæ¥ã§åããŠããŠãããŒã¿åºç€ã®åå°ãšããŠéèŠãªååšã ã
ãšã³ã·ã¹ãã ããšçè§£ããªããšãããªããã ãïŒ
ããã ããHadoopã¯åäœããããHadoopãšã³ã·ã¹ãã ããšããŠæããã®ããã€ã³ãã ããããã°ããŒã¿ã®æŽå²ãèªãããã§é¿ããŠéããªãæè¡ã ãããä»çµã¿ã®åºæ¬ãæŒãããŠãããšåšèŸºããŒã«ã®çè§£ãã°ããšæ·±ãŸãã