论坛徽章:: 0

电梯直达

1楼 [收藏(0)] [报告]

发表于 2018-03-12 10:59 |只看该作者 |倒序浏览

本帖最后由龙城狂霸小屌丝于 2018-03-12 11:02 编辑

Active connections: 760
server accepts handled requests request_time
5006365693 5006365693 6791022492 116870885316
Reading: 0 Writing: 14 Waiting: 746

求最后生成KV形式
比如
Active connections: 760
tengine_accepts：5006365693
tengine_handled：5006365693
tengine_requests：6791022492
request_time：116870885316

求大神们指点下想不出来这些正则该怎么写

文库|博客

zxy877298415

小富即安

论坛徽章:: 30

2楼 [报告]

发表于 2018-03-13 11:07 |只看该作者

回复 1# 龙城狂霸小屌丝
举例不够全面，只能就题论题，py3

import re
with open('file.txt') as f:
for i in f:
if re.match(r'^Active',i):
print (i,end='')
elif re.match(r'^server',i):
l=i.strip('\n').split(' ')[1:]
elif re.match(r'\d+',i):
for a,b in zip(l,i.split(' ')):
print (a+":"+b)

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

bskay

丰衣足食

论坛徽章:: 11

3楼 [报告]

发表于 2018-03-14 13:21 |只看该作者

本帖最后由 bskay 于 2018-03-14 13:24 编辑

#首先，分析数据格式：
s='''
Active connections: 760
server accepts handled requests request_time
5006365693 5006365693 6791022492 116870885316
Reading: 0 Writing: 14 Waiting: 746
'''
# 这样,人工自然的按行，按字段划分
text =[
['Active', 'connections:', '760'],
['server', 'accepts', 'handled', 'requests', 'request_time'],
['5006365693', '5006365693', '6791022492', '116870885316'],
['Reading:', '0', 'Writing:', '14', 'Waiting:', '746 '],
]

# 然后把上面需要的数据字段单个替换

text =[
['Active', 'connections:', '(?P<connections>\d+)'],
['server', 'accepts', 'handled', 'requests', 'request_time'],
['(?P<tengine_requests>\d+)', '(?P<tengine_accepts>\d+)', '(?P<tengine_handled>\d+)', '(?P<request_time>\d+)'],
['Reading:', '\d+', 'Writing:', '\d+', 'Waiting:', '\d+'],
]

#然后用 '\s*'连接行，'\s+'连接字段，进行匹配
re.match('\s*'.join(['\s+'.join(l) for l in text]), s).groupdict()

#如果有多个，用
re.findall('\s*'.join(['\s+'.join(l) for l in text]), s)

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

bskay

丰衣足食

论坛徽章:: 11

4楼 [报告]

发表于 2018-03-14 13:23 |只看该作者

结果
>>> re.match('\s*'.join(['\s+'.join(l) for l in text]), s).groupdict()
{'connections': '760', 'tengine_accepts': '5006365693', 'request_time': '116870885316', 'tengine_handled': '6791022492', 'tengine_requests': '5006365693'}

实战分享：从技术角度谈机器学习入门| 【大话IT】RadonDB低门槛向MySQL集群下战书 | ChinaUnix打赏功能已上线！ | 新一代分布式关系型数据库RadonDB知多少？

返回列表

Chinaunix › 论坛 › 程序设计 › Python › re获取每个值

re获取每个值 [复制链接]

浏览过的版块