C# html Parser
If you came here by searching for C#.net or VB.net html parser, your search ends here.
While developing screen scrapping with C#.net, I came across very rich open source HtmlParser. Who is capable of parsing entire html document.
Here is a link to download and parse the required fields in html http://www.codeplex.com/htmlagilitypack.
It is very similar to librarys provided by .net to parse or iterate XMLDocuments.
code snippet:
It will give you list of all nodes of type Input. Here str is your actual HTML string that you received in HttpWebResponse object
HtmlAgilityPack.HtmlDocument hd = new HtmlAgilityPack.HtmlDocument();
hd.LoadHtml(str);
HtmlNodeCollection hc = hd.DocumentNode.SelectNodes("//input");
You can use Xpath query to iterate nodes and ChildNodes.
Satalaj
Currently rated 2.5 by 4 people
- Currently 2.5/5 Stars.
- 1
- 2
- 3
- 4
- 5
Tags: