论坛徽章:: 1

电梯直达

1楼 [收藏(0)] [报告]

发表于 2014-02-16 15:53 |只看该作者 |倒序浏览

RT，看了manual之后，还是不懂，比如下面的代码：

split --filter='cat > $FILE && echo $FILE' -l 1000000 b

复制代码

filter里面的内容是干嘛的？

文库|博客

樱落花开樱落花开当前离线禁止访问好友博客消息论坛徽章: 0	2楼 [报告] 发表于 2014-02-16 16:30 \|只看该作者提示: 作者被禁止或删除内容自动屏蔽
樱落花开樱落花开当前离线禁止访问好友博客消息论坛徽章: 0	实战分享：从技术角度谈机器学习入门\| 【大话IT】RadonDB低门槛向MySQL集群下战书 \| ChinaUnix打赏功能已上线！ \| 新一代分布式关系型数据库RadonDB知多少？

elu_ligao

腰缠万贯

论坛徽章:: 29

3楼 [报告]

发表于 2014-02-16 19:12 |只看该作者

lz什么版本呀，我的版本低了，看不到这个选项
[redhat@localhost 0213]$ split --version
split (GNU coreutils) 5.97

复制代码

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

Shell_HAT

版主

论坛徽章:: 33

4楼 [报告]

发表于 2014-02-17 09:21 |只看该作者

默认情况下，split分割之后的文件名是这个样子的：

[root]# seq 15 > test.txt
[root]# ll
total 4
-rw-r--r-- 1 root root 36 Feb 17 09:14 test.txt
[root]# split -l 5 test.txt
[root]# ll
total 16
-rw-r--r-- 1 root root 36 Feb 17 09:14 test.txt
-rw-r--r-- 1 root root 10 Feb 17 09:14 xaa
-rw-r--r-- 1 root root 11 Feb 17 09:14 xab
-rw-r--r-- 1 root root 15 Feb 17 09:14 xac

可以使用 --filter 来自己定义文件的扩展名：

split -l 5 test.txt --filter='cat > $FILE.txt'

复制代码

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

runintostar

家境小康

论坛徽章:: 0

5楼 [报告]

发表于 2014-02-17 09:45 |只看该作者

回复 4# Shell_HAT
哇，学习了，感谢版主，以前完全不太懂这些命令
是不是可以这样理解，split命令会把拆分的行扔给filter后面的COMMAND，就像是管道符的作用一样?COMMAND里可以直接设置一些用户需要的fileter命令，grep，sed等等？
不过刚才试了一下COMMAND里也可以嵌套管道符，但是如果使用grep的时候必须grep成功，不然split就会退出了

--filter=COMMAND
write to shell COMMAND; file name is $FILE

复制代码

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

Shell_HAT

版主

论坛徽章:: 33

6楼 [报告]

发表于 2014-02-17 10:26 |只看该作者

回复 5# runintostar

是的，基本上是这么个意思。

'--filter=command'
With this option, rather than simply writing to each output file, write through a pipe to the specified shell command for each output file. command should use the $FILE environment variable, which is set to a different output file name for each invocation of the command. For example, imagine that you have a 1TiB compressed file that, if uncompressed, would be too large to reside on disk, yet you must split it into individually-compressed pieces of a more manageable size. To do that, you might run this command:

xz -dc BIG.xz | split -b200G --filter='xz > $FILE.xz' - big-

Assuming a 10:1 compression ratio, that would create about fifty 20GiB files with names big-aa.xz, big-ab.xz, big-ac.xz, etc.

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

suanmeilizhi

稍有积蓄

论坛徽章:: 1

7楼 [报告]

发表于 2014-02-19 21:44 |只看该作者

回复 6# Shell_HAT

split --filter='cat > $FILE && echo $FILE' -l 1000000 b | \
xargs -P 3 -I {} bash -c 'sort {} > {}.$ && comm -1 -2 {}.$ a && rm {}*' | \
tee ret | \
sort | \
uniq -c > ret.uniq

复制代码

这段代码--filter里面的COMMAND要怎么理解啊？没有看懂

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

返回列表

Chinaunix › 论坛 › 程序设计 › Shell › split的--filter选项怎么用？

樱落花开樱落花开当前离线禁止访问好友博客消息论坛徽章: 0	2楼 [报告] 发表于 2014-02-16 16:30 \|只看该作者提示: 作者被禁止或删除内容自动屏蔽
樱落花开樱落花开当前离线禁止访问好友博客消息论坛徽章: 0	实战分享：从技术角度谈机器学习入门\| 【大话IT】RadonDB低门槛向MySQL集群下战书 \| ChinaUnix打赏功能已上线！ \| 新一代分布式关系型数据库RadonDB知多少？

[文本处理] split的--filter选项怎么用？ [复制链接]