官方网站:
http://html-agility-pack.net/
Nuget安装:
Install-Package HtmlAgilityPack
C# HTML Parser Examples:
// From File var doc = new HtmlDocument(); doc.Load(filePath); // From String var doc = new HtmlDocument(); doc.LoadHtml(html); // From Web var url = "http://html-agility-pack.net/"; var web = new HtmlWeb(); var doc = web.Load(url);
C# HTML Selectors Examples:
// With XPath var value = doc.DocumentNode .SelectNodes("//td/input") .First() .Attributes["value"].Value; // With LINQ var nodes = doc.DocumentNode.Descendants("input") .Select(y => y.Descendants() .Where(x => x.Attributes["class"].Value == "box")) .ToList();
示例代码:
HtmlWeb webClient = new HtmlWeb(); HtmlDocument doc = webClient.Load("http://www.cnsos.net/weburl/"); HtmlNodeCollection hrefList = doc.DocumentNode.SelectNodes(".//a[@href]"); if (hrefList != null) { foreach (HtmlNode href in hrefList) { HtmlAttribute att = href.Attributes["href"]; Console.WriteLine(att.Value); } }