Python3.x：BeautifulSoup()解决中文乱码问题

时间：2018-01-20 11:07:49 阅读：303 评论：0 收藏：0 [点我收藏+]

标签：requests headers log html 中文 referer utf8 host post

Python3.x：BeautifulSoup()解决中文乱码问题

问题：

　　BeautifulSoup获取网页内容，中文显示乱码；

解决方案：

import requests
from bs4 import BeautifulSoup

html = requests.get(url, headers=Hostreferer)
soup = BeautifulSoup(html.content, "html.parser", fromEncoding="gb18030")

　　如果中文页面编码是gb2312，gbk，在BeautifulSoup构造器中传入fromEncoding="gb18030"参数即可解决乱码问题，

　　即使分析的页面是utf8的页面使用gb18030也不会出现乱码问题；

Python3.x：BeautifulSoup()解决中文乱码问题

标签：requests headers log html 中文 referer utf8 host post

原文地址：https://www.cnblogs.com/lizm166/p/8319919.html

踩

(0)

评论一句话评论（0）