论坛徽章:: 0

电梯直达

1楼 [收藏(0)] [报告]

发表于 2006-07-18 22:29 |只看该作者 |倒序浏览

在openmosix的主页看了一会，主要看了introduction和faq，发现一个问题，openmosix和pbs系统的区别在哪？openmosix无法将一个程序并行化，他只是将一堆计算机整合成一台逻辑上的smp的机器，让多个任务可以透明的在集群上运行，openmosix则负责资源的调度和任务的管理。试想这样的事情pbs不是也能很好的做到嘛？

文库|博客

nntp

版主

论坛徽章:: 0

2楼 [报告]

发表于 2006-07-18 23:36 |只看该作者

PBS is just only a scheduler, It do nothing except arranging the job in Q under user predefined policy. Other compute nodes in this hpc cluster "execute" the jobs, processing the parallel codes themselves and exchanging the computing message base upon PVM/MPI.

openmosix is sophisticated cluster system who "execute" the jobs, sharing computing load between different nodes and different processor(maybe core), openmosix's kernel is responsible for not only "schedule" but "executing". And in it's "schedule" is vary different from PBS's.

I suggest you to install both of them even in virtual environment, you'll get the "Exact" answer by yourself.

[ 本帖最后由 nntp 于 2006-7-18 23:39 编辑 ]

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

kartwall

稍有积蓄

论坛徽章:: 0

3楼 [报告]

发表于 2006-07-20 13:25 |只看该作者

回复 2楼 nntp 的帖子

谢谢NNTP，这里贴一些其他人给的意见，供大家参考，从你们的建议中，我学到了很多。

Not sure exactly what a PBS is but form what you describe it would
run on top of an openmosix cluster. OpenMosix creates what is called a
Single System Image. This wya your jobs do not have to know anything
about a cluster, they do not have to be submitted to a scheduler, you
just run and forget. The cluster will automatically shift the load
around to get best cpu per job usage. A scheduler on the other hand
requires either the jobs to understnad how to do parrallel work or for
the submitter to pre-split the job to make use of the cluster. That is
a bit of a simple explantation but Ithin it should give you a good
idea of the difference.

================================================================

While having similar aims, the way batch systems like the grid engine
achieve them is quite different from the openMosix approach. Roughly
speaking, batch systems start jobs on free nodes by "ssh-ing" to the
given node (or do something a bit more clever but still somehow
equivalent). This is also the reason why they have bring their own job
management tools (e.g. qstat, qsub etc.) -- jobs just have to be
processes on your local node. This does also limit the kind of jobs
you can use with batch systems, as they're usually unable to execute
interactive or X applications.

Contrary to that, openMosix engages in a much lower level. It is a
kernel patch that allows processes running locally on a node to be
transmitted to another node transparently during runtime. The last
part is quite important, as it has some interesting consequences:

* If there is a load inequality among the cluster nodes, it can be
equalized much smoother by simply migrating some jobs to the idling
machines within seconds. Batch systems can only equalize load by
starting new jobs which isn't as elegant and, more important, will
fail if the queue is empty.

* You don't have to use special job management tools, as openMosix
can migrated nearly every process on your node (ok, there are some
limitations: no multithreading, no shared memory, but for oM's use
case this is a weak limitation).

* oM does also work with interactive and X applications. For instance
if you have a graphical fractal generator which is creating high
load on your login machine, oM could easily migrate it to an idling
machine without you noticing it.

So, oM is, despite some limitations, way more elegant. Give it a try.

================================================================

By using the openMosix kernel along side other clustering apps, a more
generalised beowulf style cluster can be built to cater for all types of
use.

I have used PBS and found it tricky to set up jobs to run quickly across
nodes but that would NOT mean that you cannot use PBS along side openMosix.

If a job you schedule for a particular node is openMosix firiendly, then
openMosix could cause that particular job to migrate on to a faster free
node and if your particular job spawns sub processes that are openMosix
friendly, then each one of those processes could infact migrate in order to
get 100% CPU usage from all the openMosix nodes in your network.

ie
PBS spawns 10 openMosix friendly processes for 1 node on the network,
openMosix would migrate each of those processes to a different node.
If one node is then used for something else, then openMosix could migrate
the process again to find the maximum CPU use for that process.
Without openMosix, PBS would only allow you to set the same 10 processes to
run across 10 nodes and stay where they run.

Quote from Andreas Sch鋐er:
So, oM is, despite some limitations, way more elegant. Give it a try.

*Yes, and use it along side PBS and other clustering apps.*

I also use DSH from within cron...
My cron scheduler runs on a designated master node, jobs are set using
crontab -e from any node and DSH is used to run them.

eg:

0 * * * * dsh -c -m 192.168.1.20 -m 192.168.1.21 /home/mydir/myscript.sh
Would cause the script 'myscript.sh' to run hourly on nodes 192.168.1.20 and
21 concurrently.
(please note that /home and /var/spool/cron/ are available from NFS )
If myscript.sh contains oM friendly processes, then those too will migrate
across the network to other nodes.

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

返回列表

Chinaunix › 论坛 › IT运维 › 集群和高可用 › openmosix和pbs的区别体现在哪里？

openmosix和pbs的区别体现在哪里？ [复制链接]

回复 2楼 nntp 的帖子