Beautifulsoup关于find的测试

时间：2017-07-01 21:43:34 阅读：195 评论：0 收藏：0 [点我收藏+]

from bs4 import BeautifulSoup
import requests
url=‘https://book.douban.com/subject_search?search_text=golang&cat=1001‘
html=requests.get(url).text
# print(html)
soup=BeautifulSoup(html,‘lxml‘)
booknames=soup.findAll(‘li‘,{‘class‘:‘subject-item‘})   #查找标签
bookname=[]
chubanshe=[]
year=[]
pingjia_price=[]
for name in booknames:
    a=name.get_text().replace(‘ ‘,‘‘).replace(‘\n‘,‘‘).split(‘/‘)  #get_text方法
    # print(a)
    bookname.append(a[0])
    chubanshe.append(a[1])
    year.append(a[2])
    pingjia_price.append(a[-1])
print(bookname)
print(chubanshe)
print(year)
print(pingjia_price)

　　beautifulsoup中的find和findall参数

findAll(tag,attributes,recursive,text,limit,keywords)

findAll(tag,attributes,recursive,text,keywords)

分别代表，标签，传入字典形式的标签属性，递归开关，文本匹配数量，limitpi匹配前多少项目，关键字参数

一般来说，使用，第一个和最后的关键字参数便可，其他都是默认参数，

Beautifulsoup关于find的测试

标签：bsp soup 使用 url imp logs ext 文本 pre

原文地址：http://www.cnblogs.com/fengshuihuan/p/7103451.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行