码迷,mamicode.com
首页 > Web开发 > 详细

asp.net C# 获取网页源码的几种方式

时间:2014-06-24 18:32:53      阅读:217      评论:0      收藏:0      [点我收藏+]

标签:webbrowser   源码   c#   asp.net   

1 方法
               System.Net.WebClient aWebClient = new System.Net.WebClient();
               aWebClient.Encoding = System.Text.Encoding.Default;
               Byte[] pageData = aWebClient.DownloadData(url);
           
               string nhtml = Encoding.GetEncoding("utf-8").GetString(pageData);

2方法

                System.Net.WebClient aWebClient = new System.Net.WebClient();
                aWebClient.Encoding = System.Text.Encoding.Default;
                string nhtml = aWebClient.DownloadString(goodstidurl);

3方法
               WebBrowser webbrowser = new WebBrowser();
                StreamReader sr = new StreamReader(this.webBTaobao.DocumentStream, Encoding.Default);
                html = sr.ReadToEnd();
                html = html.Replace("\r\n", "");
                html = html.Replace("\n", "");
                html = html.Replace("  ", "");
                html = html.Replace("(", "");
                html = html.Replace(")", "");
                string nurl = Regex.Match(html, "(?<=data-url=\").*?(?=\")").Value;
                //新建一个WebBrowser
                WebBrowser webAddress = new WebBrowser();
                webAddress.Navigate(nurl);
                //等待加载完成
                while (webAddress.ReadyState < WebBrowserReadyState.Complete) Application.DoEvents();
                StreamReader sraddress = new StreamReader(webAddress.DocumentStream, Encoding.Default);
                jsonaddress = sraddress.ReadToEnd();

4方法

            WebRequest hwr = WebRequest.Create(@"http://item.taobao.com/item.htm?id=" + row["urlId"].ToString());//向指定Url发出请求 
            HttpWebResponse hwp = hwr.GetResponse() as HttpWebResponse;//将hwr对HTTP的请求 
            string text;
            StreamReader sr;
            string code = hwp.ContentType;//请求响应得到的内容类型 
            //得到编码了 
            code = code.Split('=')[1];
            Stream rep = hwp.GetResponseStream();//将请求得到的内容以流的形式读出 
            sr = new StreamReader(rep, Encoding.GetEncoding(code));//用指定的字符编码为指定的流初始化 


asp.net C# 获取网页源码的几种方式,布布扣,bubuko.com

asp.net C# 获取网页源码的几种方式

标签:webbrowser   源码   c#   asp.net   

原文地址:http://blog.csdn.net/summer_dream_journey/article/details/34077649

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!