Extracting HTML tag content with BeautifulSoup in Python

Posted: 2017-01-12 08:43:45


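At first I used BeautifulSoup's get_text() to extract the string, but the extraction kept failing with the error TypeError: 'NoneType' object is not callable

Something was returning None, possibly because of a problem extracting the content of the span tag. A likely cause is that the script imports the older BeautifulSoup 3 package, where get_text is not a defined method: an unknown attribute name appears to be treated as a search for a child tag of that name, which finds nothing and yields None, and calling None raises this error. I switched to name.string to extract the text, and that worked.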
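The error message can be reproduced without any third-party library. The sketch below is written for Python 3 and uses a hypothetical FakeTag class (not the real BeautifulSoup class) that assumes BeautifulSoup 3 resolves unknown attribute names by searching for a child tag with that name. Under that assumption, get_text resolves to None and the call fails, while the ordinary string attribute still works.

```python
class FakeTag:
    """Hypothetical stand-in for a BeautifulSoup 3 tag (not the real class)."""

    def __init__(self, name, children=(), string=None):
        self.name = name
        self.children = list(children)
        self.string = string

    def find(self, name):
        # Return the first child whose tag name matches, else None.
        for child in self.children:
            if child.name == name:
                return child
        return None

    def __getattr__(self, attr):
        # Called only when normal attribute lookup fails.
        if attr.startswith("__"):
            raise AttributeError(attr)
        # Assumed behaviour: treat the unknown name as a child-tag search.
        return self.find(attr)


span = FakeTag("span", string="Anna Pavlovna")

try:
    span.get_text()  # get_text resolves to None, then None gets called
    error_message = None
except TypeError as exc:
    error_message = str(exc)

print(error_message)  # 'NoneType' object is not callable
print(span.string)    # Anna Pavlovna
```

If this is indeed the cause, the call never reaches any text-extraction logic at all, which would explain why reading name.string succeeds on the same element.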

# -*- coding: utf-8 -*-
"""
Created on Wed Jan 11 17:21:54 2017

@author: PE-Monitor
"""
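# Python 2 script using the BeautifulSoup 3 package.
import urllib2
import BeautifulSoup
import sys

# Python 2 workaround: make utf-8 the default string encoding.
reload(sys)
sys.setdefaultencoding('utf-8')

response = urllib2.urlopen("http://www.pythonscraping.com/pages/warandpeace.html")
html = BeautifulSoup.BeautifulSoup(response)
# Find every <span class="green"> element and print its text.
nameList = html.findAll('span', {'class': 'green'})
for name in nameList:
    print(name.string)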
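The script above targets Python 2, where urllib2 is available. As a rough Python 3 sketch of the same idea, the block below swaps BeautifulSoup for the standard library's html.parser so that it runs with no extra packages. The GreenSpanCollector class and the sample markup are made up for illustration, and the real page is not fetched.

```python
from html.parser import HTMLParser


class GreenSpanCollector(HTMLParser):
    """Collect the text of every <span class="green"> element."""

    def __init__(self):
        super().__init__()
        self.names = []      # finished strings, one per matching span
        self._buffer = None  # text pieces of the span being read, or None

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "span" and "green" in classes:
            self._buffer = []

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "span" and self._buffer is not None:
            self.names.append("".join(self._buffer).strip())
            self._buffer = None


# Made-up sample markup standing in for the downloaded page.
SAMPLE_HTML = (
    '<p><span class="red">Well, Prince</span> said '
    '<span class="green">Anna Pavlovna</span> to '
    '<span class="green">Prince Vasili</span>.</p>'
)

collector = GreenSpanCollector()
collector.feed(SAMPLE_HTML)
collector.close()

for name in collector.names:
    print(name)
```

This sketch does not handle spans nested inside a green span; a full parser library is the better choice for real pages.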

Original article: http://www.cnblogs.com/Peit/p/6274531.html
