电梯直达

1楼 [收藏(0)] [报告]

发表于 2014-05-23 14:01 |只看该作者 |倒序浏览

本帖最后由 fengidri 于 2014-05-23 14:06 编辑

复制代码

这里的对象b是一个unicode 但是保存的编码是ascii的。我们如何把这个对象转移成正确的编码。

谢谢

2楼 [报告]

发表于 2014-05-23 14:42 |只看该作者

这用法，自讨苦吃啊

3楼 [报告]

发表于 2014-05-23 14:58 |只看该作者

本帖最后由 timespace 于 2014-05-23 14:58 编辑

还好有bytearray，不是太复杂，完整尝试过程：

Python 2.7.5 (default, Mar 9 2014, 22:15:05)
[GCC 4.2.1 Compatible Apple LLVM 5.0 (clang-500.0.68)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> a = "帖子"
>>> a
'\xe5\xb8\x96\xe5\xad\x90'
>>> b = unicode(a)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe5 in position 0: ordinal not in range(128)
>>> b = u'\xe5\xb8\x96\xe5\xad\x90'
>>> b
u'\xe5\xb8\x96\xe5\xad\x90'
>>> print b
å¸–å-
>>> [ord(e) for e in b]
[229, 184, 150, 229, 173, 144]
>>> bytearray(ord(e) for e in b)
bytearray(b'\xe5\xb8\x96\xe5\xad\x90')
>>> bytearray(ord(e) for e in b).decode('utf-8')
u'\u5e16\u5b50'
>>> c = bytearray(ord(e) for e in b).decode('utf-8')
>>> c
u'\u5e16\u5b50'
>>> print c
帖子
>>> d = c.encode('utf-8')
>>> d
'\xe5\xb8\x96\xe5\xad\x90'
>>>

复制代码

unicode 强制类型转换 [复制链接]