- # -*- coding: utf-8 -*-
- from bs4 import BeautifulSoup
- import urllib2
- import re,sys
- reload(sys)
- sys.setdefaultencoding( "utf-8" )
- link='http://movie.douban.com/'
- path=r'1.txt'
- f=open(path,'wb+')
- # print "it is download the %d page:" % i
- # new=link + str(i) + '.'+'shtml'
- req=urllib2.Request(link)
- fd=urllib2.urlopen(req)
- soup=BeautifulSoup(fd.read(),from_encoding="utf-8")
- ret=soup.find(id="top-nav-appintro")
- for r in ret.stripped_strings:
-     f.write(r)
-     print r
- # f.close()
To get rid of this warning, change this:
BeautifulSoup([your markup])
to this:
BeautifulSoup([your markup], "lxml")
豆瓣
3.0
和有趣的人做有趣的事
扫码直接下载
iPhone
·
Android
Why can I
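A minimal sketch of what the warning is asking for: pass the parser name as the second argument to BeautifulSoup. The inline HTML string is a made-up stand-in for the Douban page so that the example runs without network access, and it uses "html.parser" (bundled with Python) as an assumption; "lxml", the parser named in the warning, is passed the same way if it is installed.

```python
# -*- coding: utf-8 -*-
from bs4 import BeautifulSoup

# Hypothetical stand-in for the fetched page (not the real Douban markup).
html = u"""
<html><body>
  <div id="top-nav-appintro">
    <p> 豆瓣 </p>
    <p>iPhone</p><span>·</span><p>Android</p>
  </div>
</body></html>
"""

# Naming the parser explicitly is what silences the warning.
# "html.parser" is an assumption here; use "lxml" if it is installed.
soup = BeautifulSoup(html, "html.parser")

ret = soup.find(id="top-nav-appintro")
# stripped_strings yields each text node with surrounding whitespace
# removed and skips nodes that are whitespace only.
strings = list(ret.stripped_strings)
for s in strings:
    print(s)
```

In the script above, the equivalent change would be to add the parser name as the second argument on the BeautifulSoup(...) line, keeping from_encoding="utf-8" as a keyword argument.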