码迷,mamicode.com
首页 > Web开发 > 详细

url下载网页的三种方法

时间:2016-11-30 01:46:10      阅读:251      评论:0      收藏:0      [点我收藏+]

标签:header   urlopen   processor   bsp   cookie   getc   response   res   opener   

# -*- coding: utf-8 -*-
import cookielib
import urllib2

url = "http://www.baidu.com"
print "第一种方法"
response1 = urllib2.urlopen(url)
print response1.getcode()
print len(response1.read())

print "第二种方法"
res = urllib2.Request(url)
res.add_header("user-agent","Mozilla-5.0")
response2 = urllib2.urlopen(res)
print response2.getcode()
print len(response2.read())

print "第三种方法"
cj = cookielib.CookieJar()
opener = urllib2.build_opener(urllib2.HTTPCookieProcessor)
urllib2.install_opener(opener)
response3 = urllib2.urlopen(url)
print response3.getcode()
print cj
#print response3.read()

 

url下载网页的三种方法

标签:header   urlopen   processor   bsp   cookie   getc   response   res   opener   

原文地址:http://www.cnblogs.com/php-linux/p/6115506.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!