- 论坛徽章:
- 0
|
回复 2# darkn3ss
这个是我发帖的时候写错了,加上引号也没用的。完整代码如下:- #coding=utf-8
- import urllib2
- from BeautifulSoup import BeautifulSoup
- proxy = urllib2.ProxyHandler({'http': 'http://zhangyanan:mengxiang@192.168.16.189:8080'})
- opener = urllib2.build_opener(proxy)
- urllib2.install_opener(opener)
- req = urllib2.Request('http://www.baidu.com/s?wd=www.chinaunix.com')
- f = urllib2.urlopen(req)
- content = f.read()
- baiduencoding = 'utf-8'
- soup = BeautifulSoup(content, fromEncoding=baiduencoding)
- tabletag = soup.find('table', {'id': '1'})
- spantag = tabletag.find('span', {'class': 'g'})
- cache = spantag
- domain = 'www.chinaunix.com'
- print cache
- if domain in cache:
- print 'yes'
- else:
- print 'no'
复制代码 |
|