我需要帮助解析HTML文件。我是新的C#和LINQ和一切我试图一直没有提取“链接”全成和“名称1”使用LINQ解析HTML
<tr class="Row">
<td width="80">
<div align="left"> <a href="link">details</a>
</div>
</td>
<td width="152">Name 1</td>
<td width="151">Name 2</td>
<td width="152">Name 3</td>
<td width="151">Name 4</td>
<td width="151">Name 5</td>
<td width="152">Name 6</td>
</tr>
<tr class="Row">
<td width="80">
<div align="left"> <a href="link">details</a>
</div>
</td>
<td width="152">Name 1</td>
<td width="151">Name 2</td>
<td width="152">Name 3</td>
<td width="151">Name 4</td>
<td width="151">Name 5</td>
<td width="152">Name 6</td>
</tr>
这是我的尝试:
var links = htmlDoc.DocumentNode.Descendants()
.Where(n => n.Name == "tr")
.Where(x => x.Attributes["class"] != null && x.Attributes["class"].Value == "Row")
.Select(x => x.Descendants()
.Where(s => s.Name == "href"));
foreach (var link in links)
{
Debug.WriteLine(link);
}
您是否使用Html Agility Pack? – igorushi 2015-02-08 10:36:54
我使用HtmlAgilityPack-PCL – Macaret 2015-02-08 10:38:02
检查答案,并告诉我,如果有什么不清楚 – mybirthname 2015-02-08 10:42:18