2. Python标准库urllib.request模块_2(python3)

时间：2015-05-04 08:48:12 阅读：246 评论：0 收藏：0 [点我收藏+]

标签：

参考学习地址：http://www.iplaypython.com

# coding:utf-8

# 学习1
import urllib.request
# print(dir(html))
# 获取网页所在的header信息
url="http://www.iplaypython.com/"
html=urllib.request.urlopen(url)

# 获取网站返回的状态码
code = html.getcode()
print("返回的状态码: %s" % code)

if code == 200:
    print("网页正常")
    # 获取网页header信息，有网站编码格式
    print(html.info())
    # 获取访问的url
    print(html.geturl())
    # # 读取html内容 (decode/encode 网页编码转换 ‘ignore‘ 表示忽略一些无法转换的字符)
    # html_content=html.read().decode("utf-8", ‘ignore‘)
    # print(html_content)
else:
    print("网页错误的状态码： %s " % code)

# 学习2
def callback(a, b, c):
    """
    @a:到目前为止传递的数据块数量
    @b:每个数据块的大小，单位byte,字节
    @c:远程文件的大小（有时候返回-1）
    """
    down_progress = 100.0*a*b/c

if down_progress > 100:
down_progress = 100

    # %.2f保留两位小数位,结果不换行（python3好像不行勒）
    print("%.2f%%" % down_progress,)
    # %.2f保留两位小数位，%转义字符 %表示百分号
    # print("%.2f%%" % down_progress)

url1 = "http://www.iplaypython.com"
url2 = "http://www.python.org"
local = "e:/_python/other/iplaypython.html"

# 下载网页，并显示进度条
urllib.request.urlretrieve(url1,local,callback)

"""
1.传入网址，网址的类型一定是字符串。

2.传入的，本地的网页保存路径+文件名

3.一个函数的调用，我们可以任意来定义这个函数的行为，但是一定要保证这个函数有3个参数
（1）到目前为止传递的数据块数量。
（2）每个数据块的大小，单位byte,字节
（3）远程文件的大小。（有时候返回-1）

"""

2. Python标准库urllib.request模块_2(python3)

标签：

原文地址：http://my.oschina.net/dataRunner/blog/410207

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行