Fetching the Top Ten Zhihu Hot Topics

Date: 2020-03-20 11:10:29


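import requests
from bs4 import BeautifulSoup
# time, os and urllib are imported in the original post but not used in this snippet
import time
import os
import urllib

# Target page to scrape
link = 'https://www.zhihu.com/hot'

# Fetch the page. The original snippet uses r without defining it, so this
# request is a reconstruction. The User-Agent value is an assumption, and
# Zhihu may also require a logged-in Cookie header to return the hot list.
headers = {'User-Agent': 'Mozilla/5.0'}
r = requests.get(link, headers=headers, timeout=10)

# Parse the page
soup = BeautifulSoup(r.text, 'lxml')

# Select the sections that make up the hot list
title_list = soup.find_all('section', class_='HotItem')

# Loop over the first ten entries and print them
for each in title_list[0:10]:
    index = each.find('div', class_='HotItem-rank').text  # rank
    title = each.find('h2', class_='HotItem-title').text  # title
    number = each.find('div', class_='HotItem-metrics').text[0:-3]  # heat value (drops the trailing three characters)
    print(index, title, number)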
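
The slice text[0:-3] in the loop depends on exactly which characters trail the heat value, so it can break if the page wording changes. One less position-dependent option is to pull the number out with a regular expression. This is only a minimal sketch: the helper name parse_heat is hypothetical, and the sample format "2345 万热度" is an assumption about what the metrics text looks like, not something confirmed by the original post.

```python
import re

# Assumption: the metrics text contains a number optionally followed by "万"
# (ten thousand), e.g. "2345 万热度". The real page text may differ.
_HEAT_RE = re.compile(r'(\d+(?:\.\d+)?)\s*(万)?')


def parse_heat(metrics_text):
    """Return the heat value as an int, or None if no number is found."""
    match = _HEAT_RE.search(metrics_text)
    if match is None:
        return None
    value = float(match.group(1))
    if match.group(2):
        # "万" is the unit for ten thousand
        value *= 10_000
    return int(round(value))
```

Inside the loop, number = parse_heat(each.find('div', class_='HotItem-metrics').text) could then stand in for the slice, with None signalling that the text did not contain a number.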


Original post: https://www.cnblogs.com/c---y/p/12529865.html
