- 论坛徽章:
- 0
|
回复 9# xxjjuu796
嗯,视频能看了。
我就是搜的www.so.com的,具体代码如下
#coding:UTF-8
import urllib
import urllib2
import re
leter=['a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z']
for item in leter:
rgjc=item
gjc=urllib.quote(rgjc)
url="http://sug.so.360.cn/suggest/word?callback=suggest_so&encodein=utf-8&encodeout=utf-8&word="+gjc
headers={
'GET':url,
'Host':'sug.so.360.cn',
'Referer':'http://www.so.com/'
}
req=urllib2.Request(url)
for key in headers:
req.add_header(key,headers[key])
html=urllib2.urlopen(req).read()
ss=re.findall("\"(.*?)\"",html)
print ss[1]
结果如下,发现部分中文是乱码:
angelababy
bigbang
cf瀹樼綉
dnf
exo
fx缁勫悎
google
hao123
itunes
jd
kfc
lol
mx3
nba
office2007免费版下载
pps
qq绌洪棿
running man
so.com
two weeks
u9
v神驾到
wow
xp绯荤粺涓嬭浇
yy语音官方下载
z级一班3 |
|