标签:port rom += request chrome div urllib user 网络
知识点:
1、爬取网站图片
import re from urllib import request import time,os headeers = {‘User-Agent‘:‘Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/65.0.3325.181 Safari/537.36‘} url = ‘http://699pic.com/food.html?sem=1&sem_kid=45712&sem_type=1‘ res = request.Request(url=url,headers=headeers) result = request.urlopen(res).read().decode(‘utf-8‘) a=r"(?<=data-original=\").+?(?=\")|(?<=data-original=\‘).+?(?=\‘)" urls = re.findall(a, result) def down_load(urls): name2 = ‘img‘ os.mkdir(name2) print(len(urls)) num = 1 for i in urls: jpg = request.urlopen(i).read() name1 =str(num) +‘.jpg‘ with open(name2+‘/‘+name1, ‘wb‘) as f: f.write(jpg) num += 1 if __name__ == ‘__main__‘: down_load(urls)
标签:port rom += request chrome div urllib user 网络
原文地址:https://www.cnblogs.com/lz-tester/p/9392299.html