python 爬取煎蛋网图片

时间：2016-05-17 19:49:36 阅读：261 评论：0 收藏：0 [点我收藏+]

__author__ = mkdir(path):
    os
    path = path.strip()  path = path.rstrip()  mkfile = os.path.exists(path)
    mkfile:
        ()
    :
        os.makedirs(path)
        ()


urllib, urllib2, re


geturl(url):
    file_lists = []
    req = urllib2.Request(url)
    req.add_header(,
                   )
    data = urllib2.urlopen(req)
    f = data.read()
    img = re.compile(, re.S)
    file_list = re.findall(img, f)
    f_list file_list:
        :
            f_url = f_list.index()
            f_url:
                file_li = f_list[f_list.index():f_url + ]
                file_lists.append(file_li)
        e:
            (e.message)

    file_lists


save_file(path, url):
    mkdir(path)
    url_list = geturl(url)
    url_i url_list:
        req = urllib2.Request(url_i)
        req.add_header(, )
        data = urllib2.urlopen(req)
        data_picture = data.read()
        path_pic = path + + url_i[-:]
        (path_pic, ) f:
            f.write(data_picture)
            f.flush()


__name__ == :
    path = url = save_file(path, url)

本文出自 “时空旅行者” 博客，请务必保留此出处http://siweilai.blog.51cto.com/9233507/1774438

python 爬取煎蛋网图片

标签：浏览器爬虫 python 文件夹爬取图片

原文地址：http://siweilai.blog.51cto.com/9233507/1774438

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行