shinelian posted on 2010-01-14 12:55

The Definitive MFS (MooseFS) Guide: a one-stop distributed file system solution (deployment and performance testing), continuously updated

#!/bin/tony

1. I ran into some problems during performance testing. My time is limited, so I hope everyone will test and solve them together. Please point out any issues promptly, since I am still exploring as well.
2. I will not introduce MFS itself in detail; see the MFS practice article on this board, http://bbs.chinaunix.net/thread-1643863-1-1.html , or search Baidu/Google for the keyword 田逸.
3. I hope readers can contribute better storage/file system test models so we can improve this document together (test scripts and test cases are warmly welcomed).
4. Please share real production cases: configuration, environment, scripts, monitoring mechanisms, and so on.
5. I hope those familiar with the code will look into how MFS is implemented internally.
6. Special thanks to 田逸 for his document: http://sery.blog.51cto.com/10037/263515 .
7. Special thanks to QQ group members tt, 灵犀, 流云风, and hzqbbc for sharing their valuable experience with everyone in the group.
8. Special thanks to the storage expert 冬瓜头, author of《大话存储》, for guiding me during the performance tests.
9. Special thanks to QQ group member 高性能架构 (CU ID: leo_ss_pku) for producing a more professional and polished PDF version; an online version is available on his blog: http://www.himysql.com/doc/mfs.html
   
MFS advantages:
-1. Free (GPL).
0. A general-purpose file system: upper-layer applications can use it without modification (DFSs that require a dedicated API are such a hassle!).
1. Online capacity expansion; the architecture is highly scalable (official cases have grown to 70 machines!).
2. Simple to deploy (sysadmins are delighted, managers are happy!).
3. Highly available architecture; no component is a single point of failure. (What are you waiting for?)
4. Highly available file objects: an arbitrary replication level can be set per file (higher redundancy than RAID 1+0), with absolutely no impact on read or write performance; it only speeds things up!
5. A Windows-style recycle bin (no more fear of accidental deletion; it offers instant rollback similar to Oracle Flashback and other advanced DBMS features, which Oracle charges for!).
6. Garbage collection similar to the Java GC.
7. Snapshot features like those of commercial storage from NetApp, EMC, IBM, etc.
8. A C implementation of the Google File System design (Google has paved the way!).
9. A web GUI monitoring interface.
10. Improved random read/write efficiency (to be further verified).
11. Improved read/write efficiency for massive numbers of small files (to be further verified).
Possible bottlenecks:
0. The performance of the master itself. (A rough analogy: like MySQL master-slave replication, the slaves scale out easily, the master does not.) (QQ group member: hzqbbc)
         Short-term workaround: partition by business/application.
1. A foreseeable upper limit on the total number of files the architecture can store.
       (MFS caches the whole file system structure in the master's RAM; the more files, the more memory the master consumes. Roughly, 8 GB corresponds to about 25 million files, so 200 million files would need about 64 GB of RAM.) (QQ group member: hzqbbc)
         Short-term workaround: partition by business/application.
2. The robustness of the single-point-of-failure (master failover) solution. (QQ group members: tt, hzqbbc)

Architecture diagram




——————————————————
index
1. mfs master
2. mfschunkserver
3. mfs client
4. System administration
5. Performance testing
6. References
6.1 Literature
6.2 Test data
                Test model 1
                Test model 2
7. Acknowledgements
8. Appendix
9. Hands-on cases
10. Production cases
11. web gui monitoring
12. Official introduction to the 1.6.x release (translated into Chinese by QQ group member Cuatre)
13. Official MooseFS FAQ in English (TC version) (provided by QQ group member 灵犀)
14. mfs master hot-standby solution
15. mfs nagios monitoring script (provided by QQ group member 流云风)
————————————————
Environment:
master          1 server
chunkserver     3 servers
client          1 server
OS:
CentOS 5.3 x64
1 mfs master
1.1 Install mfs master
wget http://ncu.dl.sourceforge.net/project/moosefs/moosefs/1.6.11/mfs-1.6.11.tar.gz
tar zxvf mfs-1.6.11.tar.gz
cd mfs-1.6.11
useradd mfs -s /sbin/nologin
./configure --prefix=/usr/local/mfs --with-default-user=mfs --with-default-group=mfs
make
make install
cd /usr/local/mfs/etc/
cp mfsmaster.cfg.dist mfsmaster.cfg
cp mfsexports.cfg.dist mfsexports.cfg
vim mfsmaster.cfg
vim mfsexports.cfg
cd ..
cd var/mfs/
cp metadata.mfs.empty metadata.mfs
cat metadata.mfs
/usr/local/mfs/sbin/mfsmaster start
ps axu | grep mfsmaster
lsof -i
tail -f /var/log/messages
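
One optional convenience (not part of the original steps): the chunkservers and mfsmount refer to the master by the name mfsmaster by default (see the commented defaults in the .cfg.dist files), so a hosts entry on every node lets you omit explicit addresses. A sketch, assuming the master is 192.168.28.242 as used later in this guide:
echo "192.168.28.242  mfsmaster" >> /etc/hosts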

1.2 Start the master service
/usr/local/mfs/sbin/mfsmaster start
working directory: /usr/local/mfs/var/mfs
lockfile created and locked
initializing mfsmaster modules ...
loading sessions ... ok
sessions file has been loaded
exports file has been loaded
loading metadata ...
create new empty filesystem
metadata file has been loaded
no charts data file - initializing empty charts
master <-> metaloggers module: listen on *:9419
master <-> chunkservers module: listen on *:9420
main master server module: listen on *:9421
mfsmaster daemon initialized properly

1.3 Stop the master service
/usr/local/mfs/sbin/mfsmaster -s

1.4 Start and stop the web GUI
Start: /usr/local/mfs/sbin/mfscgiserv
Stop: kill the mfscgiserv process (find its PID with ps and kill it)
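
Once mfscgiserv is running, the monitoring pages are served over HTTP. A minimal check, assuming the default CGI port 9425 and the master address 192.168.28.242 used in this setup (adjust both if yours differ):
ps axu | grep mfscgiserv                      # confirm the CGI server is running
curl -I http://192.168.28.242:9425/mfs.cgi    # or open this URL in a browser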

1.5 Related configuration files
vim mfsexports.cfg
192.168.28.0/24    .    rw
192.168.28.0/24    /    rw
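
Each mfsexports.cfg line has the form <client address range> <exported path> <options>. A slightly fuller sketch of the two entries above; the option names (rw, alldirs, maproot) are the common ones and should be checked against the comments in mfsexports.cfg.dist for your 1.6.x build:
# <client range>     <path>   <options>
192.168.28.0/24      /        rw,alldirs,maproot=0     # export the MooseFS root read-write
192.168.28.0/24      .        rw                       # export the meta filesystem (trash/reserved)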

2. mfschunkserver
2.1 Create a local file system on a block device
fdisk -l
mkfs.ext3 /dev/sdb
mkdir /data
chown mfs:mfs /data
mount -t ext3 /dev/sdb /data
df -ah
/dev/sdb              133G  188M  126G   1% /data

2.2 Create a 50 GB loop-device file
df -ah
dd if=/dev/zero of=/opt/mfs.img bs=1M count=50000
losetup /dev/loop0 /opt/mfs.img
mkfs.ext3 /dev/loop0
mkdir /data
chown mfs:mfs /data
mount /dev/loop0 /data
df -ah



2.3 Install chunkserver
wget http://ncu.dl.sourceforge.net/project/moosefs/moosefs/1.6.11/mfs-1.6.11.tar.gz
tar zxvf mfs-1.6.11.tar.gz
cd mfs-1.6.11
useradd mfs -s /sbin/nologin
./configure --prefix=/usr/local/mfs --with-default-user=mfs --with-default-group=mfs
make
make install
cd /usr/local/mfs/etc/
cp mfschunkserver.cfg.dist mfschunkserver.cfg
cp mfshdd.cfg.dist mfshdd.cfg
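
After copying the templates, the chunkserver still needs to know where the master is and which local paths to store chunks on. A minimal sketch, assuming the master at 192.168.28.242 and the /data mount prepared in 2.1/2.2 (directive names should be checked against the .dist comments for your version):
vim mfschunkserver.cfg            # set: MASTER_HOST = 192.168.28.242
vim mfshdd.cfg                    # one storage path per line, e.g. add the line: /data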


2.4 Start the chunkserver
/usr/local/mfs/sbin/mfschunkserver start
ps axu |grep mfs
tail -f /var/log/messages

2.5 Stop the chunkserver
/usr/local/mfs/sbin/mfschunkserver stop


3. mfs client
3.1 Install fuse
yum install kernel.x86_64 kernel-devel.x86_64 kernel-headers.x86_64
###reboot server####
yum install fuse.x86_64 fuse-devel.x86_64 fuse-libs.x86_64
modprobe fuse


3.2 Install mfsclient
wget http://ncu.dl.sourceforge.net/project/moosefs/moosefs/1.6.11/mfs-1.6.11.tar.gz
tar zxvf mfs-1.6.11.tar.gz
cd mfs-1.6.11
useradd mfs -s /sbin/nologin
./configure --prefix=/usr/local/mfs --with-default-user=mfs --with-default-group=mfs --enable-mfsmount
make
make install

3.3 Mount the file system
cd /mnt/
mkdir mfs
/usr/local/mfs/bin/mfsmount /mnt/mfs/ -H 192.168.28.242

mkdir mfsmeta
/usr/local/mfs/bin/mfsmount -m /mnt/mfsmeta/ -H 192.168.28.242

df -ah



4. System administration

4.1 Management commands

Set the number of replicas (goal); 3 copies are recommended:
/usr/local/mfs/bin/mfssetgoal -r 3 /mnt/mfs

Check the goal of a file or directory:
/usr/local/mfs/bin/mfsgetgoal /mnt/mfs

Show directory information:
/usr/local/mfs/bin/mfsdirinfo -H /mnt/mfs
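
To verify that a file has actually reached its goal, the client tools can report how many copies each chunk has. A quick sketch using a throwaway test file (tool paths as installed above; mfscheckfile prints a per-copy-count summary, mfsfileinfo lists the chunkservers holding each chunk):
cp /etc/hosts /mnt/mfs/hosts.test
/usr/local/mfs/bin/mfssetgoal 3 /mnt/mfs/hosts.test
/usr/local/mfs/bin/mfscheckfile /mnt/mfs/hosts.test
/usr/local/mfs/bin/mfsfileinfo /mnt/mfs/hosts.test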



5. 性能测试

5.1 mfs

1. Large file (block = 1 MB)
dd if=/dev/zero of=1.img bs=1M count=5000
5242880000 bytes (5.2 GB) copied, 48.8481 seconds, 107 MB/s

2. Small files (50 bytes × 1,000,000 files × 1 client) (1000 × 1000 layout), write
real    83m41.343s
user    4m17.993s
sys    16m58.939s
List
time find ./ -type f | nl | tail
999999  ./0/1
1000000 ./0/0
real    0m39.418s
user    0m0.721s
sys    0m0.225s
Delete
time rm -fr *
real    6m35.273s
user    0m0.394s
sys    0m23.546s

3. Small files (1 KB × 1,000,000 files × 100 clients) (1000 × 1000 layout)
Write (100 clients)
time ../../p_touch_file.sh
real    22m51.159s
user    4m42.850s
sys    18m41.437s
List (1 client)
time find ./ | nl | tail
real    0m35.910s
user    0m0.628s
sys    0m0.204s
Delete (1 client)
time rm -fr *
real    6m36.530s
user    0m0.697s
sys    0m21.682s

4. Small files (1 KB × 1,000,000 files × 200 clients) (1000 × 1000 layout), write
time ../../p_touch_file.sh
real    27m56.656s
user    5m12.195s
sys    20m52.079s


5. Small files (1 KB × 1,000,000 files × 1000 clients) (1000 × 1000 layout)
Write
time ../../p_touch_file.sh
real    30m30.336s
user    5m6.607s
sys    21m




5.2 Local disk
1. Large file (block = 1 MB)
dd if=/dev/zero of=1.img bs=1M count=5000
5242880000 bytes (5.2 GB) copied, 58.7371 seconds, 89.3 MB/s


2. Small files (50 bytes × 1,000,000 files × 1 client) (1000 × 1000 layout)
Write
time ../touch_file.sh
real    17m47.746s
user    4m54.068s
sys     12m54.425s
List
time find ./ -type f | nl | tail
1000000 ./875/582
1000001 ./875/875
real 0m9.120s
user 0m1.102s
sys 0m0.726s
Delete
time rm -fr *
real 0m37.201s
user 0m0.432s
sys 0m15.268s

5.3 Benchmark (1st run)
5.3.1 Random read


5.3.2 Random write


5.3.3 Sequential read

5.3.4 Sequential write



5.4 Benchmark (2nd run)
5.4.1 Random read



shinelian posted on 2010-01-14 14:15

Continued (1)

6. References:
6.1 Literature
http://sery.blog.51cto.com/10037/263515   (田逸)
http://bbs.chinaunix.net/thread-1643863-1-1.html   (ltgzs777)
http://www.moosefs.org/   (official site)
http://bbs.chinaunix.net/thread-1643015-1-2.html   (test tools)


6.2 Test data

Performance test model 1
Test results from a contributor whose name I do not know; I am posting them first. If you are that person, please message me.


Small-file performance test (real / user / sys as reported by time)

Two-level 100*100 directories:

Single 15k.5 disk, ext3, 1 client process
  Create:  real 0m0.762s    user 0m0.048s   sys 0m0.261s
  List:    real 0m0.179s    user 0m0.036s   sys 0m0.125s
  Delete:  real 0m0.492s    user 0m0.036s   sys 0m0.456s

Single 15k.5 disk, ext3, 10 concurrent client processes (longest time)
  Create:  real 0m0.724s    user 0m0.015s   sys 0m0.123s
  List:    real 0m0.057s    user 0m0.006s   sys 0m0.025s
  Delete:  real 0m0.226s    user 0m0.010s   sys 0m0.070s

6 chunkservers, cache, 1 client process
  Create:  real 0m2.084s    user 0m0.036s   sys 0m0.252s
  List:    real 0m4.964s    user 0m0.043s   sys 0m0.615s
  Delete:  real 0m6.661s    user 0m0.046s   sys 0m0.868s

6 chunkservers, cache, 10 concurrent client processes (longest time)
  Create:  real 0m1.422s    user 0m0.007s   sys 0m0.050s
  List:    real 0m2.022s    user 0m0.008s   sys 0m0.108s
  Delete:  real 0m2.318s    user 0m0.008s   sys 0m0.136s
Two-level 1000*1000 directories:

Single 15k.5 disk, ext3, 1 client process
  Create:  real 11m37.531s   user 0m4.363s    sys 0m37.362s
  List:    real 39m56.940s   user 0m9.277s    sys 0m48.261s
  Delete:  real 41m57.803s   user 0m10.453s   sys 3m11.808s

Single 15k.5 disk, ext3, 10 concurrent client processes (longest time)
  Create:  real 11m7.703s    user 0m0.519s    sys 0m10.616s
  List:    real 39m30.678s   user 0m1.031s    sys 0m4.962s
  Delete:  real 40m23.018s   user 0m1.043s    sys 0m19.618s

6 chunkservers, cache, 1 client process
  Create:  real 3m17.913s    user 0m3.268s    sys 0m30.192s
  List:    real 11m56.645s   user 0m3.810s    sys 1m10.387s
  Delete:  real 12m14.900s   user 0m3.799s    sys 1m26.632s

6 chunkservers, cache, 10 concurrent client processes (longest time)
  Create:  real 1m13.666s    user 0m0.328s    sys 0m3.295s
  List:    real 4m31.761s    user 0m0.531s    sys 0m10.235s
  Delete:  real 4m26.962s    user 0m0.663s    sys 0m13.117s
Three-level 100*100*100 directories:

Single 15k.5 disk, ext3, 1 client process
  Create:  real 9m51.331s    user 0m4.036s    sys 0m32.597s
  List:    real 27m24.615s   user 0m8.907s    sys 0m44.240s
  Delete:  real 28m17.194s   user 0m10.644s   sys 1m34.998s

Single 15k.5 disk, ext3, 10 client processes (longest time)
  Create:  real 10m22.170s   user 0m0.580s    sys 0m11.720s
  List:    real 33m32.386s   user 0m1.127s    sys 0m5.280s
  Delete:  real 33m7.808s    user 0m1.196s    sys 0m10.588s

6 chunkservers, cache, 1 client process
  Create:  real 3m21.720s    user 0m3.089s    sys 0m26.635s
  List:    real 9m26.535s    user 0m3.901s    sys 1m11.756s
  Delete:  real 10m51.558s   user 0m4.186s    sys 1m26.322s

6 chunkservers, cache, 10 concurrent client processes (longest time)
  Create:  real 1m23.023s    user 0m0.429s    sys 0m3.869s
  List:    real 4m10.617s    user 0m0.643s    sys 0m11.588s
  Delete:  real 4m20.137s    user 0m0.649s    sys 0m14.120s

6 chunkservers, cache, 50 concurrent client processes (longest time)
  Create:  real 1m26.388s    user 0m0.074s    sys 0m0.679s
  List:    real 4m37.102s    user 0m0.132s    sys 0m2.160s
  Delete:  real 4m37.392s    user 0m0.132s    sys 0m2.755s

6 chunkservers, cache, 100 concurrent client processes (longest time)
  Create:  real 1m29.338s    user 0m0.062s    sys 0m0.363s
  List:    real 4m54.925s    user 0m0.069s    sys 0m1.212s
  Delete:  real 4m35.845s    user 0m0.068s    sys 0m1.640s

6 chunkservers, cache, remote, 10 concurrent client processes (longest time)
  Create:  real 4m0.411s     user 0m2.985s    sys 0m12.287s
  List:    real 8m31.351s    user 0m4.223s    sys 0m29.800s
  Delete:  real 4m3.271s     user 0m3.206s    sys 0m11.922s
Three-level 100*100*100 directories, 5 consecutive runs:

Run                          1            2            3            4            5
Changelog/metadata size      ~55 MB       ~60 MB       ~60 MB       ~60 MB       ~60 MB
Consecutive create time
  real                       4m0.411s     4m12.309s    4m14.010s    4m14.214s    4m14.417s
  user                       0m2.985s     0m3.039s     0m3.418s     0m3.247s     0m3.170s
  sys                        0m12.287s    0m12.899s    0m12.831s    0m12.871s    0m12.948s

Notes:
On a single disk, running multiple processes gives no performance gain because they are all stuck in I/O wait; adding processes can even burn a lot of time in scheduling.
On MFS, performance does improve with multiple processes, and the main cost is CPU system time. For massive numbers of small files, real-world MFS performance is therefore much better than the local file system.

Performance test model 2 (thanks to QQ group member 痞子白)
Two clients running dd at the same time
Block size 1 MB, file size 20 GB
Client1   write: 68.4 MB/s   read: 25.3 MB/s
Client2   write: 67.5 MB/s   read: 24.7 MB/s
Total throughput   write: 135.9 MB/s   read: 50.0 MB/s

Write command: dd if=/dev/zero of=/mfs/test.1 bs=1M count=20000
Read command:  dd if=/mfs/test.1 of=/dev/null bs=1M


7. Acknowledgements
田逸
The contributor whose name I do not know (please contact me if you see this)



8. Appendix
8.1 1000 * 1000 * 1 client script
#!/bin/bash
# Create 1000 directories, each holding 1000 copies of the small source file /mnt/test
for ((i=0;i<1000;i++))
do
    mkdir ${i}
    cd ${i}
    for ((j=0;j<1000;j++))
    do
        cp /mnt/test ${j}
    done
    cd ..
done
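For reference, the 50-byte source file and the timed run can be prepared as follows; the exact invocation behind the numbers in section 5 is not shown in the post, so treat the paths as illustrative:
dd if=/dev/zero of=/mnt/test bs=50 count=1    # 50-byte source file copied by the scripts
cd /mnt/mfs/testdir                           # run inside the MooseFS mount (or a local dir for 5.2)
time /path/to/touch_file.sh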
8.2 1000 * 1000 * (100, 200, 1000 client) script
#!/bin/bash
# Spawn one background "client" per top-level directory: each creates one
# directory containing 1000 copies of /mnt/test. Lower the 1000 bound in the
# while loop to 100 or 200 for the 100- and 200-client runs.
cd `pwd`
function make_1000_dir_file {
    start=${1}
    stop=${2}
    for ((i=${start};i<${stop};i++))
    do
        mkdir ${i}
        for ((j=0;j<1000;j++))
        do
            cp /mnt/test ${i}/${j}
        done
    done
}
i=1
while [ ${i} -le 1000 ]
do
    ((n=${i}+1))
    make_1000_dir_file ${i} ${n} &
    ((i=${i}+1))
done
wait


shinelian posted on 2010-01-14 14:16

Continued (2)

9. Hands-on cases
9.1 The default trash (garbage collection) time is 86400 seconds, so it is possible for your storage to fill up before the trash has been emptied. (Case provided by shinelian)

Option 1: lower the trash time and monitor storage capacity aggressively.
         In my tests, setting the trash time to 300 seconds reclaimed the space correctly.
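         A minimal sketch of option 1 with the client tools installed above (-r applies the setting recursively; the value is in seconds):
         /usr/local/mfs/bin/mfssettrashtime -r 300 /mnt/mfs    # keep deleted files for 300 s instead of 86400 s
         /usr/local/mfs/bin/mfsgettrashtime /mnt/mfs           # verify the new trash time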

Option 2: periodically delete the files under the trash directory of the mounted meta filesystem (mfsmeta) by hand. (Its robustness still needs testing; the space does get reclaimed after deletion, but I do not know whether there are side effects.)
         In my tests there seem to be no side effects; if you hit any, please contact me in the QQ group.


9.2 I went through the MFS 1.6.x User Guides and FAQ and discussed the unclear points with 灵犀 until we reached a consistent understanding. Compared with 1.5.x, MFS 1.6.x changes as follows: (special thanks to QQ group members 流云风 and 灵犀)
   (1) Fixes the 1.5.x bug of too many open files during large batch operations. We hit this error in our own tests as well: it reported too many open files and caused connection errors on the chunkservers. Although we later tried repeatedly to reproduce the problem, we never could. Having it fixed in 1.6.x removes a major issue.

      (2) Adds the metalogger server. This does not exist in 1.5.x; it provides redundancy for the master and further strengthens its stability. In the MFS architecture the master is the component with the highest stability and performance requirements, so its stability must be guaranteed.

      (3) Improves the repair of bad chunks compared with 1.5.x. In MFS 1.5.x, chunk checksum errors often caused the master to automatically kick the affected chunkserver out of the cluster. This release adds a repair facility for bad chunks, which makes them easy to fix and simplifies their handling.

      (4) A corrected understanding of metadata and the changelog. We used to think the changelog recorded file operations and was periodically archived into the metadata like a database log. That turned out to be a misconception: the changelog records operations on files, while the metadata records file sizes and locations. The metadata is therefore the more important of the two, and recovery is performed from the metadata plus the most recent changelog.

      (5) The MFS documentation now states the memory and disk requirements explicitly: [In our environment (ca. 500 TiB, 25 million files, 2 million folders distributed on 26 million chunks on 70 machines) the usage of chunkserver CPU (by constant file transfer) is about 15-20% and chunkserver RAM usually consumes about 100MiB (independent of amount of data).
The master server consumes about 30% of CPU (ca. 1500 operations per second) and 8GiB RAM. CPU load depends on amount of operations and RAM on number of files and folders.]

      (6) It points out that in testing, adding more chunkservers does not affect write speed but does speed up reads. When a chunkserver is added, data is automatically synchronized onto the new node to balance and even out the data distribution.

9.3 MFS 1.5.x data recovery example (case shared by QQ group member Xufeng)
            It is actually quite simple: run mfsrestore. When you then start the service there is no error message but the process will not come up; the cause is that the directory holding the PID file (set in the config) has been deleted. Recreate that directory and give it the right permissions and it starts fine. I have also done a 1.5 to 1.6 upgrade this way and it worked.
            Details on Xufeng's blog: http://snipt.net/iamacnhero/tag/moosefs
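
            For 1.6.x, the metadata recovery tool shipped with the master is mfsmetarestore. A minimal sketch of rebuilding the metadata from the last saved metadata file plus the changelogs before restarting the master (paths follow the layout used above; check mfsmetarestore -h for the exact flags in your version):
            cd /usr/local/mfs/var/mfs
            /usr/local/mfs/sbin/mfsmetarestore -a    # automatic mode: merge metadata.mfs.back with the changelog files
            /usr/local/mfs/sbin/mfsmaster start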


10. Production cases (please contribute; continuously updated)
http://www.gaokaochina.com   (田逸)




shinelian posted on 2010-01-14 14:18

Continued (3)


11. web gui monitoring


shinelian posted on 2010-01-14 14:19

Continued (4)

12. Official introduction to the 1.6.x release (originally translated into Chinese by QQ group member Cuatre; English original reproduced below)


View on new features of next release v 1.6 of Moose File System

We are about to release a new version of MooseFS which would include a large number of new features and bug fixes. The new features are so significant that we decided to release it under 1.6 version. The newest beta files are in the GIT repository.

The key new features/changes of MooseFS 1.6 would include:

General:
- Removed duplicate source files.
- Strip whitespace at the end of configuration file lines.

Chunkserver:
- Rewritten in multi-threaded model.
- Added periodical chunk testing functionality (HDD_TEST_FREQ option).
- New -v option (prints version and exits).

Master:
- Added "noowner" objects flag (causes objects to belong to current user).
- Maintaining `mfsdirinfo` data online, so it doesn't need to be calculated on every request.
- Filesystem access authorization system (NFS-like mfsexports.cfg file, REJECT_OLD_CLIENTS option) with ro/rw, maproot, mapall and password functionality.
- New -v option (prints version and exits).

Mount:
- Rewritten options parsing in mount-like way, making possible to use standard FUSE mount utilities (see mfsmount(8) manual for new syntax). Note: old syntax is no longer accepted and mountpoint is mandatory now (there is no default).
- Updated for FUSE 2.6+.
- Added password, file data cache, attribute cache and entry cache options. By default attribute cache and directory entry cache are enabled, file data cache and file entry cache are disabled.
- opendir() no longer reads directory contents - it's done on first readdir() now; fixes "rm -r" on recent Linux/glibc/coreutils combo.
- Fixed mtime setting just before close() (by flushing file on mtime change); fixes mtime preserving on "cp -p".
- Added statistics accessible through MFSROOT/.stats pseudo-file.
- Changed master access method for mfstools (direct .master pseudo-file replaced by .masterinfo redirection); fixes possible mfstools race condition and allows to use mfstools on read-only filesystem.

Tools:
- Units cleanup in values display (exact values, IEC-60027/binary prefixes, SI/decimal prefixes); new options: -n, -h, -H and MFSHRFORMAT environment variable - refer to mfstools(8) manual for details.
- mfsrgetgoal, mfsrsetgoal, mfsrgettrashtime, mfsrsettrashtime have been deprecated in favour of the new "-r" option for the mfsgetgoal, mfssetgoal, mfsgettrashtime, mfssettrashtime tools (note that the old and new command names look similar but are not the same).
- mfssnapshot utility replaced by mfsappendchunks (direct descendant of old utility) and mfsmakesnapshot (which creates "real" recursive snapshots and behaves similar to "cp -r").
- New mfsfilerepair utility, which allows partial recovery of a file with some missing or broken chunks.

CGI scripts:
- First public version of CGI scripts allowing to monitor MFS installation from WWW browser.


13. Official MooseFS FAQ (TC version) (provided by QQ group member 灵犀)
What average write/read speeds can we expect?
The raw reading / writing speed obviously depends mainly on the performance of the used hard disk drives and the network capacity and its topology and varies from installation to installation. The better performance of hard drives used and better throughput of the net, the higher performance of the whole system.

In our in-house commodity servers (which additionally make lots of extra calculations) and simple gigabyte Ethernet network on a petabyte-class installation
on Linux (Debian) with goal=2 we have write speeds of about 20-30 MiB/s and reads of 30-50MiB/s. For smaller blocks the write speed decreases, but reading is not much affected.

Similar FreeBSD based network has got a bit better writes and worse reads, giving overall a slightly better performance.

Does the goal setting influence writing/reading speeds?
Generally speaking, it doesn't. The goal setting can influence the reading speed only under certain conditions. For example, reading the same file at the same time by more than one client would be faster when the file has goal set to 2 and not goal=1.

But the situation in the real world when several computers read the same file at the same moment is very rare; therefore, the goal setting has rather little influence on the reading speeds.

Similarly, the writing speed is not much affected by the goal setting.


How well concurrent read operations are supported?
All read processes are parallel - there is no problem with concurrent reading of the same data by several clients at the same moment.

How much CPU/RAM resources are used?
In our environment (ca. 500 TiB, 25 million files, 2 million folders distributed on 26 million chunks on 70 machines) the usage of chunkserver CPU (by constant file transfer) is about 15-20% and chunkserver RAM usually consumes about 100MiB (independent of amount of data).
The master server consumes about 30% of CPU (ca. 1500 operations per second) and 8GiB RAM. CPU load depends on amount of operations and RAM on number of files and folders.

Is it possible to add/remove chunkservers and disks on fly?
You can add / remove chunkservers on the fly. But mind that it is not wise to disconnect a chunkserver if there exists a chunk with only one copy (marked in orange in the CGI monitor).
You can also disconnect (change) an individual hard drive. The scenario for this operation would be:


1. Mark the disk(s) for removal
2. Restart the chunkserver process
3. Wait for the replication (there should be no "undergoal" or "missing" chunks marked in yellow, orange or red in CGI monitor)
4. Stop the chunkserver process
5. Delete entry(ies) of the disconnected disk(s) in 'mfshdd.cfg'
6. Stop the chunkserver machine
7. Remove hard drive(s)
8. Start the machine
9. Start the chunkserver process

If you have hotswap disk(s) after step 5 you should follow these:
1. Unmount disk(s)
2. Remove hard drive(s)
3. Start the chunkserver process
If you follow the above steps work of client computers would be not interrupted and the whole operation would not be noticed by MooseFS users.
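
In the setup described above, step 1 (marking a disk for removal) is done in mfshdd.cfg on the chunkserver; the usual convention is a leading asterisk on the path, but confirm it against the comments in mfshdd.cfg.dist for your version. A sketch with illustrative paths:
# contents of /usr/local/mfs/etc/mfshdd.cfg on the affected chunkserver:
/data1
*/data2        # the leading "*" marks this disk for removal
# then restart the chunkserver process:
/usr/local/mfs/sbin/mfschunkserver stop
/usr/local/mfs/sbin/mfschunkserver start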

My experience with clustered filesystems is that metadata operations are quite slow. How did you resolve this problem?
We have noticed the problem with slow metadata operations and we decided to cache file system structure in RAM in the metadata server. This is why metadata server has increased memory requirements.


When doing df -h on a filesystem the results are different from what I would expect taking into account actual sizes of written files.
Every chunkserver sends its own disk usage increased by 256MB for each used partition/hdd, and a sum of these master sends to the client as total disk usage. If you have 3 chunkservers with 7 hdd each, your disk usage will be increased by 3*7*256MB (about 5GB). Of course it's not important in real life, when you have for example 150TB of hdd space.

There is one other thing. If you use disks exclusively for MooseFS on chunkservers df will show correct disk usage, but if you have other data on your MooseFS disks df will count your own files too.

If you want to see usage of your MooseFS files use 'mfsdirinfo' command.


Do chunkservers and metadata server do their own checksumming?
Yes there is checksumming done by the system itself. We thought it would be CPU consuming but it is not really. Overhead is about 4B per a 64KiB block which is 4KiB per a 64MiB chunk (per goal).

What sort of sizing is required for the Masterserver?
The most important factor is RAM of mfsmaster machine, as the full file system structure is cached in RAM for speed. Besides RAM mfsmaster machine needs some space on HDD for main metadata file together with incremental logs.

The size of the metadata file is dependent on the number of files (not on their sizes). The size of incremental logs depends on the number of operations per hour, but length (in hours) of this incremental log is configurable.

1 million files takes approximately 300 MiB of RAM. Installation of 25 million files requires about 8GiB of RAM and 25GiB space on HDD.
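
A quick back-of-the-envelope check of that sizing rule, assuming it scales linearly at roughly 300 MiB of RAM per million files (the figure quoted above, not an official formula):
files_in_millions=25
echo "$(( files_in_millions * 300 )) MiB of RAM"    # 7500 MiB, roughly the 8 GiB quoted above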


When I delete files or directories the MooseFS size doesn’t change. Why?
MooseFS is not erasing files immediately to let you revert the delete operation.

You can configure for how long files are kept in trash and empty the trash manually (to release the space). There are more details here:
http://moosefs.com/pages/userguides.html#2 in section "Operations specific for MooseFS".

In short - the time of storing a deleted file can be verified by the mfsgettrashtime command and changed with mfssettrashtime.


When I added a third server as an extra chunkserver it looked like it started replicating data to the 3rd server even though the file goal was still set to 2.
Yes. Disk usage ballancer uses chunks independently, so one file could be redistributed across all of your chunkservers.

Is MooseFS 64bit compatible?
Yes!

Can I modify the chunk size?
File data is divided into fragments (chunks) with a maximum of 64MiB each. The value of 64 MiB is hard coded into system so you cannot modify its size. We based the chunk size on real-world data and it was a very good compromise between number of chunks and speed of rebalancing / updating the filesystem. Of course if a file is smaller than 64 MiB it occupies less space.

Please note systems we take care of enjoy files of size well exceeding 100GB and there is no chunk size penalty noticeable.

How do I know if a file has been successfully written in MooseFS?
First off, let's briefly discuss the way the writing process is done in file systems and what programming consequences this bears. Basically, files are written through a buffer (write cache) in all contemporary file systems. As a result, execution of the "write" command itself only transfers the data to a buffer (cache), with no actual writing taking place. Hence, a confirmed execution of the "write" command does not mean that the data has been correctly written on a disc. It is only with the correct performance of the "fsync" (or "close") command that all data kept in buffers (cache) gets physically written. If an error occurs while such buffer-kept data is being written, it could return an incorrect status for the "fsync" (or even "close"), not only "write" command.
The problem is that a vast majority of programmers do not test the "close" command status (which is generally a mistake, though a very common one). Consequently, a program writing data on a disc may "assume" that the data has been written correctly, while it has actually failed.
As far as MooseFS is concerned – first, its write buffers are larger than in classic file systems (an issue of efficiency); second, write errors may be more frequent than in case of a classic hard drive (the network nature of MooseFS provokes some additional error-inducing situations). As a consequence, the amount of data processed during execution of the "close" command is often significant and if an error occurs while the data is being written, this will be returned in no other way than as an error in execution of the "close" command only.
Hence, before executing "close", it is recommended (especially when using MooseFS) to perform "fsync" after writing in a file and then check the status of "fsync" and – just in case – the status of "close" as well.
NOTE! When "stdio" is used, the "fflush" function only executes the "write" command, so correct execution of "fflush" is not enough grounds to be sure that all data has been written successfully – you should also check the status of "fclose".
One frequent situation in which the above problem may occur is redirecting a standard output of a program to a file in "shell". Bash (and many other programs) does not check the status of "close" execution and so the syntax of the "application > outcome.txt" type may wrap up successfully in "shell", while in fact there has been an error in writing the "outcome.txt" file. You are strongly advised to avoid using the above syntax. If necessary, you can create a simple program reading the standard input and writing everything to a chosen file (but with an appropriate check with the "fsync" command) and then use "application | mysaver outcome.txt", where "mysaver" is the name of your writing program instead of "application > outcome.txt".
Please note that the problem discussed above is in no way exceptional and does not stem directly from the characteristics of MooseFS itself. It may affect any system of files – only that network type systems are more prone to such difficulties. Technically speaking, the above recommendations should be followed at all times (also in case of classic file systems).
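
A shell-level illustration of that advice: instead of a bare "application > outcome.txt" redirect, write through a command that forces a flush and check its exit status. This is only a sketch of the idea (dd's conv=fsync requests a physical flush before dd exits); it is not taken from the FAQ itself, and "some_application" is a placeholder:
set -o pipefail                                        # make the pipeline fail if either side fails
some_application | dd of=/mnt/mfs/outcome.txt bs=1M conv=fsync
if [ $? -ne 0 ]; then
    echo "writing /mnt/mfs/outcome.txt failed" >&2
    exit 1
fi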





liukaiyi posted on 2010-01-14 14:49

First reply!!!

A couple of questions from a beginner:

1. In MFS, what is the error-handling flow when a chunkserver (data storage server) becomes full? Could you describe it?

2. In distributed storage, can data structures such as trees or linked lists be supported for some of the data?

I have been using Hadoop for a few months and found it not ideal for data analysis.

shinelian posted on 2010-01-14 14:54

Reply to post #6 by liukaiyi

1. First, MFS supports online capacity expansion. Just monitor storage utilization and alert at, say, 80%; that is prevention. We can discuss the error-handling flow once more people have tested it.

2. MFS is a general-purpose file system (think of it as a local ext3); it does not provide data structures.

Also, you are welcome to join:
0. QQ group 102082446, dedicated to discussing distributed file systems; the passphrase is: i love cuer!

Some folks in the group seem to be working on hadoop.

liukaiyi posted on 2010-01-14 14:59

Reply to post #7 by shinelian

Thanks.
I will find some time to set up MFS myself and follow up...

joey.xiang posted on 2010-01-14 15:13

Very nice, really detailed. I will test it myself when I have time.

shinelian posted on 2010-01-15 12:40

Bumping my own thread; please keep following it, everyone.