This post was last edited by gxl133133 on 2013-09-27 11:52
#encoding:utf-8
import jieba.analyse

text = open("b.txt")
#text = "四款python中文分词系统简单测试"
for i in text:
    i = i.encode('utf-8')
    tags = jieba.analyse.extract_tags(i, 4)
    a = []
    a = "/".join(i).split('/')
    for j in a:
        print j,
Traceback (most recent call last):
  File "F:\Python27\ciku\topk.py", line 7, in <module>
    i = i.encode('utf-8')
UnicodeDecodeError: 'ascii' codec can't decode byte 0xef in position 0: ordinal not in range(128)
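A note on why an encode call can end in a UnicodeDecodeError: in Python 2, a line read from a file opened with open() is a byte str, and calling .encode('utf-8') on a byte str first decodes it implicitly with the default ASCII codec. The sketch below mimics that implicit step explicitly so it also runs on Python 3. The sample bytes and the helper name encode_like_python2 are assumptions made for illustration. Byte 0xef at position 0 would be consistent with a UTF-8 BOM at the start of b.txt, but the post does not confirm that.

```python
# -*- coding: utf-8 -*-
# Sketch of the implicit step Python 2 performs for str.encode('utf-8').
# Assumed sample input: a UTF-8 BOM (EF BB BF) followed by UTF-8 encoded text.
raw_line = b"\xef\xbb\xbf" + u"中文分词".encode("utf-8")


def encode_like_python2(byte_string):
    """Mimic Python 2's str.encode: ASCII-decode first, then UTF-8-encode."""
    return byte_string.decode("ascii").encode("utf-8")


try:
    encode_like_python2(raw_line)
    error_message = None
except UnicodeDecodeError as exc:
    # The failure comes from the hidden decode step, not from the encode.
    error_message = str(exc)

print(error_message)
```

The point of the sketch is only that the bytes are already UTF-8, so encoding them again is unnecessary and the hidden ASCII decode is what fails.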
decode('utf-8') converts UTF-8 bytes to unicode.
Am I using it incorrectly?
Solved.
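The post does not say how the problem was solved. One common approach, sketched here under stated assumptions, is to decode once while reading the file and work with unicode text from then on, without calling .encode() on each line. The helper name read_unicode_lines and the temporary sample file standing in for b.txt are hypothetical. The "utf-8-sig" codec is used so that a leading BOM, if present, is dropped.

```python
# -*- coding: utf-8 -*-
import io
import os
import tempfile


def read_unicode_lines(path):
    """Return the file's lines as unicode text, dropping a leading BOM if present."""
    with io.open(path, encoding="utf-8-sig") as handle:
        return [line.rstrip(u"\r\n") for line in handle]


# Hypothetical stand-in for b.txt: UTF-8 with a BOM, one sentence per line.
sample = u"四款python中文分词系统简单测试\n结巴分词\n"
fd, sample_path = tempfile.mkstemp(suffix=".txt")
with os.fdopen(fd, "wb") as out:
    out.write(b"\xef\xbb\xbf" + sample.encode("utf-8"))

lines = read_unicode_lines(sample_path)
os.remove(sample_path)

for line in lines:
    # Each line is already unicode, so len() counts characters, not bytes.
    print(len(line))
```

io.open exists in both Python 2.6+ and Python 3, so the same reading code works on either. Each resulting line could then be passed to a function that accepts unicode text; jieba's README describes unicode input as supported, though that is worth checking against the installed version.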