Scraping NetEase Finance

Date: 2019-12-19 17:40:10

Tags: tree headers use gecko mozilla webkit lis list gen

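import requests
from lxml import etree

url = 'http://quotes.money.163.com/old/'
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.79 Safari/537.36'
}

# Fetch the page with a browser-like User-Agent and take the response body as text
html = requests.get(url=url, headers=headers).text

# Parse the HTML into an element tree that supports XPath queries
tree = etree.HTML(html)

# First-level menu items under the node with qid="HS" and id="f0-f7"
content = tree.xpath('//li[@qid="HS"]//li[@id="f0-f7"]/ul/li')
for con in content:
    # Name of the first-level item
    one = con.xpath('./a/text()')[0]
    print(one)
    # Second-level items nested under this first-level item
    two_list = con.xpath('./ul/li')
    for t in two_list:
        qid = t.xpath('./@qid')[0]
        print(qid)
        two = t.xpath('./a/text()')[0]
        print(two)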
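
The script above indexes every XPath result with [0], which raises IndexError as soon as one node lacks a link or a qid attribute. The following is a minimal sketch of the same traversal that skips incomplete nodes and returns a dictionary instead of printing. The helper names first and parse_menu, and the SAMPLE_HTML markup, are hypothetical and only imitate the structure the XPath expressions expect; the real page layout may differ.

```python
from lxml import etree

# Hypothetical sample markup imitating the structure the XPath expects.
# It is an assumption for illustration, not a copy of the real page.
SAMPLE_HTML = """
<ul>
  <li qid="HS">
    <ul>
      <li id="f0-f7">
        <ul>
          <li><a>Category A</a>
            <ul>
              <li qid="hy001000"><a>Item 1</a></li>
              <li qid="hy002000"><a>Item 2</a></li>
            </ul>
          </li>
          <li><a>Category B</a>
            <ul>
              <li><a>No qid here</a></li>
            </ul>
          </li>
        </ul>
      </li>
    </ul>
  </li>
</ul>
"""


def first(nodes, default=None):
    """Return the first XPath result, or a default when nothing matched."""
    return nodes[0] if nodes else default


def parse_menu(html):
    """Collect {first_level_name: [(qid, second_level_name), ...]}."""
    tree = etree.HTML(html)
    menu = {}
    for con in tree.xpath('//li[@qid="HS"]//li[@id="f0-f7"]/ul/li'):
        one = first(con.xpath('./a/text()'))
        if one is None:
            continue  # first-level item without a link text
        entries = []
        for t in con.xpath('./ul/li'):
            qid = first(t.xpath('./@qid'))
            two = first(t.xpath('./a/text()'))
            # Keep only second-level items that have both a qid and a name
            if qid is not None and two is not None:
                entries.append((qid, two))
        menu[one] = entries
    return menu


if __name__ == "__main__":
    print(parse_menu(SAMPLE_HTML))
```

To run it against the live page, pass the html string fetched with requests to parse_menu in place of SAMPLE_HTML.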


Original article: https://www.cnblogs.com/Iceredtea/p/12069065.html
