- 论坛徽章:
- 0
|
弄了个简陋的,那个网站的登录认证提交加密了,自动登录没弄成,直接手工登录后,把cookie文件的信息附加到perl里面了。
用的时候先登录网站,不要关闭浏览器。然后运行代码就可以了。就是速度有点慢。- use LWP::UserAgent;
- use LWP::ConnCache;
- #use HTTP::Cookies;
- $bro=LWP::UserAgent->new;
- $cache=LWP::ConnCache->new;
- #see the cookie file to change the ip address
- $ip="210.31.199.135";
- #$cookie=HTTP::Cookies->new(file=>'cookie.txt',autosave=>1);
- $bro->conn_cache($cache);
- $bro->cookie_jar($cookie);
- $url="http://www.gene-regulation.com/cgi-bin/pub/databases/transfac/getTF.cgi?AC=";
- #$bro->post("$login",["user"=>"chinaunix"],["password"=>"123456"]);
- for($i=1;$i<399;$i++){
- $page=sprintf("%03d",$i);
- $full_url=$url."M00".$page;
- $res=$bro->get($full_url,'cookie'=>"Ticket=hash&a01d53e7f09e71bd2a7e0dd8af166632&time&1282744299&ip&$ip&user&chinaunix&expires&10");
- if($res->is_success){
- open($page,">","M00".$page.".html");
- print $page $res->content;
- close $page;
- print "downloading"."M00".$page."\n";
- }
- else{
- print "can't get page"
- }
复制代码 |
|