
urlopen and BeautifulSoup

Date: 2019-10-08 14:07:29


 

from urllib.request import urlopen

# Fetch the page and print the raw response body (a bytes object)
html = urlopen("http://pythonscraping.com/pages/page1.html")
print(html.read())

Output:

b'<html>\n<head>\n<title>A Useful Page</title>\n</head>\n<body>\n<h1>An Interesting Title</h1>\n<div>\nLorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.\n</div>\n</body>\n</html>\n'
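
The `b` prefix in the output indicates that `read()` returns bytes rather than a string. The following is a minimal sketch of turning such bytes into text. It assumes the page is UTF-8 encoded, which the original post does not state, and it uses a shortened, hard-coded copy of the output above so that it runs without network access.

```python
# A shortened copy of the bytes printed above (hard-coded for illustration).
raw = b"<html>\n<head>\n<title>A Useful Page</title>\n</head>\n</html>\n"

# Assumption: the page is UTF-8 encoded.
text = raw.decode("utf-8")

print(type(raw).__name__)   # the undecoded response body
print(type(text).__name__)  # the decoded text
print(text.splitlines()[2]) # the third line of the decoded document
```

Once decoded, the `\n` escapes become real line breaks, so the markup can be read or split line by line.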

 

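from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen("http://pythonscraping.com/pages/page1.html")
# Name the parser explicitly; without it bs4 emits a warning and picks one itself
bsObj = BeautifulSoup(html.read(), "html.parser")
# Print the first <h1> tag in the document
print(bsObj.h1)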

Output:

<h1>An Interesting Title</h1>
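
Attribute access such as `bsObj.h1` returns the first matching tag, or `None` when the document has no such tag. The sketch below illustrates this on an inline HTML string, so it runs offline. The string is a shortened stand-in for the page above, and the names `page`, `soup`, and `heading` are chosen here for illustration only.

```python
from bs4 import BeautifulSoup

# Shortened stand-in for the downloaded page (hard-coded for illustration).
page = (
    "<html><head><title>A Useful Page</title></head>"
    "<body><h1>An Interesting Title</h1><div>Lorem ipsum</div></body></html>"
)

soup = BeautifulSoup(page, "html.parser")

heading = soup.h1           # first <h1> tag in the document
print(heading)              # the whole tag, markup included
print(heading.get_text())   # only the text inside the tag
print(soup.h2)              # None, because the page has no <h2>
```

Checking for `None` before calling `get_text()` avoids an `AttributeError` on pages that lack the expected tag.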

 

2019-10-08 18:01:59


Original post: https://www.cnblogs.com/petitherisson/p/11634831.html
