Python 基础学习网络小爬虫

时间：2014-07-04 08:09:50 阅读：244 评论：0 收藏：0 [点我收藏+]

<span style="font-size:18px;">#
# 百度贴吧图片网络小爬虫
#


import re
import urllib
 
def getHtml(url):
    page = urllib.urlopen(url)
    html = page.read()
    return html
 
def getImg(html):
    reg = r'src="(.+?\.jpg)" pic_ext'
    imgre = re.compile(reg)
    imglist = imgre.findall(html)
    x = 0
    l=len(imglist)
    print "总共有%d张图片"%(l)
    print "-------------------"
    for imgurl in imglist:
        print "第%d张图片" %(x+1)
        urllib.urlretrieve(imgurl,'E:\\Pythoncode\\picture\\%s.jpg' % x)
        x = x + 1       
    
html = getHtml("http://tieba.baidu.com/p/3093487131")
getImg(html)</span>

</pre><pre code_snippet_id="415913" snippet_file_name="blog_20140703_4_8970806" name="code" class="python">总共有38张图片
-------------------
第1张图片
第2张图片
第3张图片
第4张图片
第5张图片
第6张图片
第7张图片
第8张图片
第9张图片
第10张图片
第11张图片
第12张图片
第13张图片
第14张图片
第15张图片
第16张图片
第17张图片
第18张图片
第19张图片
第20张图片
第21张图片
第22张图片
第23张图片
第24张图片
第25张图片
第26张图片
第27张图片
第28张图片
第29张图片
第30张图片
第31张图片
第32张图片
第33张图片
第34张图片
第35张图片
第36张图片
第37张图片
第38张图片

Python 基础学习网络小爬虫,布布扣,bubuko.com

Python 基础学习网络小爬虫

标签：python 爬虫

原文地址：http://blog.csdn.net/u013476464/article/details/36698611

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行

Python 基础学习 网络小爬虫

Python 基础学习网络小爬虫