C# 使用c语言提取web文档#
我试图用c语言从网页上获取数据# 到目前为止,这是我的代码:C# 使用c语言提取web文档#,c#,browser,webpage,dom,C#,Browser,Webpage,Dom,我试图用c语言从网页上获取数据# 到目前为止,这是我的代码: WebBrowser wb = new WebBrowser(); wb.Url = new Uri("http://www.microsoft.com"); HtmlDocument doc = wb.Document; MessageBox.Show(doc.ToString()); 不幸的是,wb仍然为null,并且从未设置Url属性 有人能帮我吗 谢谢我想你忘记启动页面导航了。看 而且 导航是一个异步过程。您需要将web
WebBrowser wb = new WebBrowser();
wb.Url = new Uri("http://www.microsoft.com");
HtmlDocument doc = wb.Document;
MessageBox.Show(doc.ToString());
不幸的是,wb仍然为null,并且从未设置Url属性
有人能帮我吗
谢谢我想你忘记启动页面导航了。看 而且 导航是一个异步过程。您需要将web浏览器对象置于类作用域中,然后按如下方式处理导航事件:
private void webBrowser1_Navigated(object sender, WebBrowserNavigatedEventArgs e)
{
HtmlDocument doc = webBrowser1.Document;
doc = webBrowser1.Document;
}
您将看到处理程序中的doc将不为null。我认为您忘记启动页面导航。看 而且 导航是一个异步过程。您需要将web浏览器对象置于类作用域中,然后按如下方式处理导航事件:
private void webBrowser1_Navigated(object sender, WebBrowserNavigatedEventArgs e)
{
HtmlDocument doc = webBrowser1.Document;
doc = webBrowser1.Document;
}
您将看到该文档在处理程序中是非空的。尝试
HtmlDocument
的DocumentText
属性,而不是调用ToString()
尝试HtmlDocument
的DocumentText
属性,而不是调用ToString()
我会使用类而不是web浏览器。web浏览器类更适合与UI交互,而WebClient更适合与页面进行编程交互。下面是一些示例代码:
private void sendMessage(JaxtrSmsMessage message)
{
HttpWebRequest request;
HttpWebResponse response;
CookieContainer cookies;
string url = "http://www.jaxtr.com/user/login.jsp";
try
{
request = (HttpWebRequest)WebRequest.Create(url);
request.AllowAutoRedirect = true;
request.CookieContainer = new CookieContainer();
response = (HttpWebResponse)request.GetResponse();
if (response.StatusCode == HttpStatusCode.OK)
{
StringBuilder sb = new StringBuilder();
StreamReader reader = new StreamReader(response.GetResponseStream());
while (!reader.EndOfStream)
{
sb.AppendLine(reader.ReadLine());
}
//Get the hidden value out of the form.
String fp = Regex.Match(sb.ToString(), "\"__fp\"\\svalue=\"(([A-Za-z0-9+/=]){4}){1,19}\"", RegexOptions.None).Value;
fp = fp.Substring(14);
fp = fp.Replace("\"", String.Empty);
cookies = request.CookieContainer;
//response.Close();
String requestString = "http://www.jaxtr.com/user/Login.action?tzOffset=6&navigateURL=&refPage=&jaxtrId=" + HttpUtility.UrlEncode(credentials.Username) + "&password=" + HttpUtility.UrlEncode(credentials.Password) + "&Login=Login&_sourcePage=%2Flogin.jsp&__fp="+HttpUtility.UrlEncode(fp);
request = (HttpWebRequest)WebRequest.Create(requestString);
request.CookieContainer = cookies; //added by myself
response = (HttpWebResponse)request.GetResponse();
Console.WriteLine("Response from login:" + response.StatusCode);
String messageText = (message.TruncateMessage && message.MessageText.Length > JaxtrSmsMessage.MAX_MESSAGE_LENGTH ? message.MessageText.Substring(JaxtrSmsMessage.MAX_MESSAGE_LENGTH) : message.MessageText);
String messageURL = "http://www.jaxtr.com/user/sendsms?CountryName=" + HttpUtility.UrlEncode(message.CountryName) + "&phone=" + HttpUtility.UrlEncode(message.DestinationPhoneNumber) + "&message=" + HttpUtility.UrlEncode(messageText) + "&bySMS=" + HttpUtility.UrlEncode(message.BySMS.ToString().ToLower());
request = (HttpWebRequest)WebRequest.Create(messageURL);
request.CookieContainer = cookies;
response = (HttpWebResponse)request.GetResponse();
Console.WriteLine("Response from send SMS command=" + response.StatusCode);
StringBuilder output = new StringBuilder();
using (Stream s = response.GetResponseStream())
{
StreamReader sr = new StreamReader(s);
while (!sr.EndOfStream)
{
output.AppendLine(sr.ReadLine());
}
}
response.Close();
}
else
{
Console.WriteLine("Client was unable to connect!");
}
}
catch (System.Exception e)
{
throw new SMSDeliveryException("Unable to deliver SMS message because "+e.Message, e);
}
}
我会使用类而不是web浏览器。web浏览器类更适合与UI交互,而WebClient更适合与页面进行编程交互。下面是一些示例代码:
private void sendMessage(JaxtrSmsMessage message)
{
HttpWebRequest request;
HttpWebResponse response;
CookieContainer cookies;
string url = "http://www.jaxtr.com/user/login.jsp";
try
{
request = (HttpWebRequest)WebRequest.Create(url);
request.AllowAutoRedirect = true;
request.CookieContainer = new CookieContainer();
response = (HttpWebResponse)request.GetResponse();
if (response.StatusCode == HttpStatusCode.OK)
{
StringBuilder sb = new StringBuilder();
StreamReader reader = new StreamReader(response.GetResponseStream());
while (!reader.EndOfStream)
{
sb.AppendLine(reader.ReadLine());
}
//Get the hidden value out of the form.
String fp = Regex.Match(sb.ToString(), "\"__fp\"\\svalue=\"(([A-Za-z0-9+/=]){4}){1,19}\"", RegexOptions.None).Value;
fp = fp.Substring(14);
fp = fp.Replace("\"", String.Empty);
cookies = request.CookieContainer;
//response.Close();
String requestString = "http://www.jaxtr.com/user/Login.action?tzOffset=6&navigateURL=&refPage=&jaxtrId=" + HttpUtility.UrlEncode(credentials.Username) + "&password=" + HttpUtility.UrlEncode(credentials.Password) + "&Login=Login&_sourcePage=%2Flogin.jsp&__fp="+HttpUtility.UrlEncode(fp);
request = (HttpWebRequest)WebRequest.Create(requestString);
request.CookieContainer = cookies; //added by myself
response = (HttpWebResponse)request.GetResponse();
Console.WriteLine("Response from login:" + response.StatusCode);
String messageText = (message.TruncateMessage && message.MessageText.Length > JaxtrSmsMessage.MAX_MESSAGE_LENGTH ? message.MessageText.Substring(JaxtrSmsMessage.MAX_MESSAGE_LENGTH) : message.MessageText);
String messageURL = "http://www.jaxtr.com/user/sendsms?CountryName=" + HttpUtility.UrlEncode(message.CountryName) + "&phone=" + HttpUtility.UrlEncode(message.DestinationPhoneNumber) + "&message=" + HttpUtility.UrlEncode(messageText) + "&bySMS=" + HttpUtility.UrlEncode(message.BySMS.ToString().ToLower());
request = (HttpWebRequest)WebRequest.Create(messageURL);
request.CookieContainer = cookies;
response = (HttpWebResponse)request.GetResponse();
Console.WriteLine("Response from send SMS command=" + response.StatusCode);
StringBuilder output = new StringBuilder();
using (Stream s = response.GetResponseStream())
{
StreamReader sr = new StreamReader(s);
while (!sr.EndOfStream)
{
output.AppendLine(sr.ReadLine());
}
}
response.Close();
}
else
{
Console.WriteLine("Client was unable to connect!");
}
}
catch (System.Exception e)
{
throw new SMSDeliveryException("Unable to deliver SMS message because "+e.Message, e);
}
}
首先,必须使用URL路径调用navigate方法,并且必须从web浏览器控件捕获文档完成事件
webBrowser.DocumentCompleted += new WebBrowserDocumentCompletedEventHandler(webBrowser_DocumentCompleted);
webBrowser.Navigate("http:\\www.microsoft.com");
然后使用document complete事件获取加载的网页文档
void webBrowser_DocumentCompleted(object sender, WebBrowserDocumentCompletedEventArgs e)
{
MessageBox.Show(webBrowser.DocumentText.ToString());
}
希望这有帮助…首先,您必须使用URL路径调用navigate方法,并且必须从web浏览器控件捕获文档完成事件
webBrowser.DocumentCompleted += new WebBrowserDocumentCompletedEventHandler(webBrowser_DocumentCompleted);
webBrowser.Navigate("http:\\www.microsoft.com");
然后使用document complete事件获取加载的网页文档
void webBrowser_DocumentCompleted(object sender, WebBrowserDocumentCompletedEventArgs e)
{
MessageBox.Show(webBrowser.DocumentText.ToString());
}
希望这有帮助……no:/i在wb.Url部分之前添加了这一点,但仍然没有进展:/i添加了一些更多信息。我在本地测试prj中使用了它。它会解决你的问题。不:/i在wb.Url部分之前添加了它,但仍然没有进展:/i添加了更多信息。我在本地测试prj中使用了它。这会解决你的问题。