Baidu Hot Search

Date: 2020-07-06 16:29:50

# -*- coding:utf-8 -*-
import requests
from bs4 import BeautifulSoup

url = "http://top.baidu.com/buzz?b=1&fr=topindex"
# Send a browser-like User-Agent with the request.
header = {
    "user-agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.106 Safari/537.36",
}
content = []
r = requests.get(url, headers=header, timeout=10)
respond = r.text
soup = BeautifulSoup(respond, "html.parser")
# Alternative selectors that target the keyword cells directly:
# HotSearchs = soup.find_all("td", class_="keyword")
# HotSearchs = soup.select("td[class='keyword']")
# Skip the first <tr> (presumably the table header row).
HotSearchs = soup.find_all("tr")[1:]
for HotSearch in HotSearchs:
    title_tag = HotSearch.find(class_="list-title")
    # Rows without a "list-title" element are not hot-search entries.
    if title_tag is not None:
        # r.text was apparently decoded as ISO-8859-1 while the page is GBK,
        # so re-encode to the raw bytes and decode them as GBK.
        title = title_tag.text.encode("iso-8859-1").decode("gbk")
        number = HotSearch.find(class_="last").text.strip()
        content.append([title, number])
print(content)
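
The `.encode("iso-8859-1").decode("gbk")` step in the script is easy to misread, so here is a minimal offline sketch of why it works. It assumes the page bytes are GBK and that they were decoded as ISO-8859-1; the helper name `fix_mojibake` and the sample string are illustrative only and are not part of the original script.

```python
# Simulate a GBK-encoded response body that was decoded with the wrong codec.
original = "百度热搜"
raw_bytes = original.encode("gbk")        # bytes as the server would send them (assumed GBK)
garbled = raw_bytes.decode("iso-8859-1")  # what the text looks like under the wrong guess


def fix_mojibake(text: str) -> str:
    """Recover GBK text that was mis-decoded as ISO-8859-1 (hypothetical helper)."""
    # ISO-8859-1 maps every byte to exactly one code point, so encoding back
    # restores the original bytes without loss; decoding as GBK then gives the real text.
    return text.encode("iso-8859-1").decode("gbk")


repaired = fix_mojibake(garbled)
print(repaired == original)
```

With requests, setting `r.encoding = "gbk"` before reading `r.text` should give the same result without the round trip, provided the page really is GBK.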

Original article: https://www.cnblogs.com/python-kp/p/13254943.html
