pdfplumber模块初始用

时间：2019-08-01 16:11:01 阅读：492 评论：0 收藏：0 [点我收藏+]

import pdfplumber 
import re
def pdf_read():
    pdf=pdfplumber.open(‘文件路径‘")#文件路径,读取文件
    page0=pdf.pages[11] #指定页数
    tables=page0.extract_tables()#获得该页的表格
    texts=page0.extract_text()#获得text文本值

pdfplumber 缺省通过表格线来区分行和列，所以下列情况是无法提取出表格的：
* 你的表格是图片，通过选择可以确定是否图片
* 你的表格不是用线来分隔，或者分隔不全，例如列用线，行没线
这种情况下，你就需要尝试：
page0.extract_tables(table_settings={})

pdfplumber模块初始用

标签：port 表格 pdf 取出 pen pre 指定路径分行

原文地址：https://www.cnblogs.com/98WDJ/p/11283012.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行