码迷,mamicode.com
首页 > 编程语言 > 详细

python爬runoob目录链接栏

时间:2019-12-19 13:12:48      阅读:86      评论:0      收藏:0      [点我收藏+]

标签:data   port   class   com   enc   def   pen   find   with   

import re
import requests
url=https://www.runoob.com/python3/python3.html
response=requests.get(url)
html=response.text
response.encoding=utf-8
dl=re.findall(r<div class="design" id="leftcolumn">.*?</div>,html,re.S)[0]
tree=re.findall(rtitle="(.*?)".*?href="(.*?)",dl)
lst=[]
def get_data(link):
    lst.append(link)
    ht=requests.get(link)
    print(已下载,len(lst),)
for tree_info in tree:
    url=https://www.runoob.com/python3{}\n.format(tree_info[1])
    with open(D:\Desktop\测试\html.txt,a) as f:
        f.write(url)
    get_data(url)

python爬runoob目录链接栏

标签:data   port   class   com   enc   def   pen   find   with   

原文地址:https://www.cnblogs.com/zhuyu139/p/12067020.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!