- 论坛徽章:
- 16
|
本帖最后由 wenhq 于 2016-01-20 22:49 编辑
1、MapReduce的主要应用领域在哪里?在哪些场合被取代可能性不高?
a. offline computing. batch computing
b. query data with SQL?
c. it's very hard to be replace under batch processing.
2、对比YARN和Mesos的优势和劣势,以及YARN框架未来的发展方向?
a. Yarn support capacity/fair scheduler on memory/cpu which has fine-grained scheduler.
b. Mesos supprot coarse-grained scheduler which support yarn job also with non-yarn job.
3、HDFS缺少哪些你需要的特性,或者你比较喜欢其哪一个特性,也可以谈谈您比较看好哪个存储系统,为什么?
a.I like hdfs easily scaling. has default 3 replication with high availability. also it's take server down as common problems ,also build on commodity server.reduce server-farm cost.
b. compare to Glusterfs, Hdfs balance doesn't have high impact than GlusterFS.
c. compare to Fastdfs, I thought it's can commit data replication more accurate than Fastdfs, which it is very hard under high volume write situation.
d. but, hdfs sync between cluster/DC. we have to use distcp tools to make it, doesn't like NFS which need sync data easily.
e. hdfs doesn't like new tech ignite/tachyon which support memory-based storage will provide more faster access data, as it's data store on disk. you know, Disk I/O is always bottleneck of performance.
4、Hadoop从业者应该如何进行职业规划?
Hadoop is a big ecosystem include storage/database/processing/security. I thought it's better do some project/experience under some mentor if possible. also you have to strong java coding skill, as it's based on java. after you did some projects, then try to understand the principle of Hadoop.
try to fix some bugs under github/googlegroup. the most important part, you have to keep hungry till to understand the truth of Hadoop.
Just part of my opinion. |
|