C#: Detecting a File's Character Encoding

Date: 2014-07-11 11:07:59

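            // Requires System.IO and System.Text, plus the namespace of the UniversalDetector port in use.
            string Text, str1;
            using (FileStream fs1 = new FileStream(folder + strPath, FileMode.Open, FileAccess.Read))
            {
                // Read the whole file; Stream.Read may return fewer bytes than requested, so loop.
                byte[] bytes = new byte[fs1.Length];
                int total = 0, n;
                while (total < bytes.Length && (n = fs1.Read(bytes, total, bytes.Length - total)) > 0)
                    total += n;

                // Feed the bytes to the detector and signal the end of the data.
                UniversalDetector Det = new UniversalDetector(null);
                Det.HandleData(bytes, 0, total);
                Det.DataEnd();

                // Name of the detected charset. Guard against a null result when detection fails;
                // the UTF-8 fallback is an assumption, not part of the original code.
                string cLX = Det.GetDetectedCharset() ?? "utf-8";

                // Option 1: decode the bytes already in memory.
                Text = Encoding.GetEncoding(cLX).GetString(bytes, 0, total);

                // Option 2: reset the stream position to the beginning and decode with a StreamReader.
                fs1.Seek(0, SeekOrigin.Begin);
                StreamReader sr = new StreamReader(fs1, Encoding.GetEncoding(cLX));
                str1 = sr.ReadToEnd();
            }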

UniversalDetector is a third-party character-encoding detector; its accuracy is reasonably good.

http://pan.baidu.com/s/1pJ5C6TX

The UniversalDetector source code is available at the link above.
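
Because statistical detection is only reasonably accurate, one option is to check for a byte order mark (BOM) before asking the detector. The sketch below is not part of the original post or of UniversalDetector: `BomSniffer` and `DetectByBom` are hypothetical names, and it relies only on `System.Text`. It returns null when no BOM is present, in which case the detector can take over.

```csharp
using System;
using System.Text;

// Hypothetical helper (not from the original post): maps a leading BOM to an Encoding.
public static class BomSniffer
{
    // Returns the encoding implied by a leading byte order mark, or null if none is found.
    public static Encoding DetectByBom(byte[] bytes)
    {
        if (bytes == null) throw new ArgumentNullException(nameof(bytes));

        // UTF-32 LE (FF FE 00 00) must be tested before UTF-16 LE (FF FE),
        // because the shorter BOM is a prefix of the longer one.
        // Assumption: FF FE 00 00 is treated as UTF-32 LE, although it could also be
        // UTF-16 LE text that starts with a NUL character.
        if (bytes.Length >= 4 && bytes[0] == 0xFF && bytes[1] == 0xFE && bytes[2] == 0x00 && bytes[3] == 0x00)
            return new UTF32Encoding(bigEndian: false, byteOrderMark: true);
        if (bytes.Length >= 4 && bytes[0] == 0x00 && bytes[1] == 0x00 && bytes[2] == 0xFE && bytes[3] == 0xFF)
            return new UTF32Encoding(bigEndian: true, byteOrderMark: true);
        if (bytes.Length >= 3 && bytes[0] == 0xEF && bytes[1] == 0xBB && bytes[2] == 0xBF)
            return new UTF8Encoding(encoderShouldEmitUTF8Identifier: true);
        if (bytes.Length >= 2 && bytes[0] == 0xFF && bytes[1] == 0xFE)
            return new UnicodeEncoding(bigEndian: false, byteOrderMark: true);
        if (bytes.Length >= 2 && bytes[0] == 0xFE && bytes[1] == 0xFF)
            return new UnicodeEncoding(bigEndian: true, byteOrderMark: true);

        // No BOM found: the caller should fall back to statistical detection.
        return null;
    }
}
```

A caller might try `BomSniffer.DetectByBom(bytes)` first and only run UniversalDetector when it returns null. Note that `Encoding.GetString` does not strip the BOM, so when decoding the byte array directly it is usually worth skipping the first `GetPreamble().Length` bytes.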


Original article: http://www.cnblogs.com/mxh691/p/3833960.html
