C# Instrument Data File Parsing: PDF Files

Date: 2017-10-11 21:57:07

Tags: c#, ++, fread, builder, common rules, extract, reading and parsing

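Many instrument workstations output their data reports as PDF files. PDF is designed for layout and printing, which makes it awkward to parse for data. Parsing a PDF report therefore takes two steps: first read the text content out of the PDF file, then extract the meaningful data from that text according to the rules of its content.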

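The libraries commonly used to parse PDF files from C# are PDFBox and iTextSharp. PDFBox is a Java library that is used from .NET through IKVM, so its methods and properties follow Java conventions and feel awkward to call from C#.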

Example of extracting PDF text with PDFBox:

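    // Assumes the IKVM-compiled PDFBox assemblies are referenced;
    // `input` is whatever PDDocument.load accepts in your PDFBox version (file or stream).
    PDDocument doc = PDDocument.load(input);
    string str;
    try { str = new PDFTextStripper().getText(doc); }  // text of all pages
    finally { doc.close(); }                           // release the document even on failure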

Example of extracting PDF text with iTextSharp:

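    // Needs: using System.Text; using iTextSharp.text.pdf; using iTextSharp.text.pdf.parser;
    StringBuilder stringBuilder = new StringBuilder();
    PdfReader reader = new PdfReader(fileName);
    try
    {
        for (int pageNumber = 1; pageNumber <= reader.NumberOfPages; ++pageNumber) // pages are 1-based
            stringBuilder.AppendLine(PdfTextExtractor.GetTextFromPage(reader, pageNumber));
    } finally { reader.Close(); }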
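
Both examples above cover only the first step, reading the text. The second step, pulling data out of that text by its content rules, depends entirely on the report layout. As a rough sketch, assuming each result line has the form `<name>: <number> <unit>` (an invented layout for illustration; `Measurement` and `ReportTextParser` are hypothetical names, not part of PDFBox or iTextSharp), a regular expression could do the parsing:

```csharp
using System.Collections.Generic;
using System.Globalization;
using System.Text.RegularExpressions;

// Hypothetical holder for one parsed result; not part of any library.
public sealed class Measurement
{
    public string Name;
    public double Value;
    public string Unit;
}

public static class ReportTextParser
{
    // Assumed rule: a result line looks like "<name>: <number> <unit>".
    // [ \t]* rather than \s* keeps a match from spilling onto the next line,
    // and \r?$ tolerates Windows line endings.
    private static readonly Regex ResultLine = new Regex(
        @"^[ \t]*(?<name>[^:\r\n]+?)[ \t]*:[ \t]*(?<value>-?\d+(?:\.\d+)?)[ \t]*(?<unit>\S*)[ \t]*\r?$",
        RegexOptions.Multiline);

    public static List<Measurement> Parse(string text)
    {
        var results = new List<Measurement>();
        foreach (Match m in ResultLine.Matches(text))
        {
            results.Add(new Measurement
            {
                Name = m.Groups["name"].Value,
                // Invariant culture so "3.42" parses the same on every machine.
                Value = double.Parse(m.Groups["value"].Value, CultureInfo.InvariantCulture),
                Unit = m.Groups["unit"].Value
            });
        }
        return results;
    }
}
```

With the iTextSharp example, this would be called as `ReportTextParser.Parse(stringBuilder.ToString())`. Lines that do not fit the assumed pattern (titles, headers, free text) are simply skipped. A real instrument report will usually need its own pattern, since extracted PDF text does not always keep the visual column layout.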


Original article: http://www.cnblogs.com/mahongbiao/p/7652788.html
