2012-07-11 69 views
<div class="logoDesc"> 


Gnb Road, Chandmari, Guwahati - 781003 




    | 
       <a href="http://www.justdial.com/Guwahati/Kiran-Mistanna-Bhandar-&lt;near&gt;-Chandmari/9999PX361-X361-1230284509G9V5B2-DC_R3V3YWhhdGkgQmFjaGVsb3IgQ2FrZQ==_BZDET/map"> 
        View Map</a><br> 
       <p> 
        <span class="Gray">Call: </span><span style="color: #424242; font-size: 12px;">+(91)-9954843180</span> 
        <span style="color: #424242;">|</span> <a href="http://contest.justdial.com/contest/register.php?utm_source=rsbnr&amp;utm_medium=banner&amp;cont_ref=rsbnr" 
         style="font-size: 12px; display: inline-block;" onclick="_ct('Win Ipad2','ltpg');" 
         target="_blank"><b>Win iPad2</b></a> 
       </p> 
       <p> 
        <span class="Gray">Also See :</span> <b>Cake Shops</b>, <a href="http://www.justdial.com/Guwahati/Bakeries/ct-10033880"> 
         Bakeries</a>, <a href="http://www.justdial.com/Guwahati/Confectionery-Retailers/ct-10127628"> 
          Confectionery Retailers</a> 
       </p> 
      </div> 

I am using the Html Agility Pack and I only want to extract the address (the part I marked between stars). What should the syntax be? Please help.

UPDATE: I am using the code below:

Protected Sub Button1_Click(ByVal sender As Object, ByVal e As System.EventArgs) Handles Button1.Click
    Dim webGet = New HtmlWeb()
    Dim document = webGet.Load("http://www.justdial.com/Guwahati/Bachelor-Cake/ct-10070075")

    Dim nodes1 = document.DocumentNode.SelectNodes("//*[@class='logoDesc']")

    For Each node In nodes1
        MsgBox(node.InnerText)
    Next node
End Sub

With this snippet I get every detail inside the div, but I only want the address.


Possible duplicate: http://stackoverflow.com/q/2875347/102937 – 2012-07-11 22:58:14


No, it isn't. There is no tag around the address, so I am not sure how to get at it. – user1150440 2012-07-11 23:00:20


I mean, how do I form the syntax? – user1150440 2012-07-11 23:00:38

Answers


I don't know the Agility Pack, but here is a straight-up screen scraper:

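// Requires: using System; using System.Net; using System.Text;
string page = Methods.GetPage("http://www.yoururl.com");

// Locate the opening and closing "***" markers around the address.
int firstStars = page.IndexOf("***");
int start = firstStars + 3; // skip over the opening three stars
int secondStars = page.IndexOf("***", start);

// Substring takes (startIndex, length), so pass the distance between the markers.
// Both IndexOf calls return -1 when a marker is missing; check for that in real code.
string address = page.Substring(start, secondStars - start);


// Place this method in a class named Methods.
public static string GetPage(string url)
{
    string page = string.Empty;
    try
    {
        // Download the raw bytes and decode them as UTF-8.
        using (WebClient webClient = new WebClient())
        {
            byte[] reqHTML = webClient.DownloadData(url);
            page = new UTF8Encoding().GetString(reqHTML);
        }
    }
    catch (Exception)
    {
        // Download failed: fall through and return an empty string.
    }
    return page;
}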

There are actually no stars in the page. I only used them to highlight the address. Sorry for the confusion :) – user1150440 2012-07-11 23:17:23


Try this (append "/text()" to the end of your XPath):

Protected Sub Button1_Click(ByVal sender As Object, ByVal e As System.EventArgs) Handles Button1.Click 
    Dim webGet = New HtmlWeb() 
    Dim document = webGet.Load("http://www.justdial.com/Guwahati/Bachelor-Cake/ct-10070075") 
    Dim nodes1 = document.DocumentNode.SelectNodes("//*[@class='logoDesc']/text()") 
    For Each node In nodes1 
     MsgBox(node.InnerText) 
    Next node 
End Sub
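
With the markup from the question, "/text()" matches every text node directly inside the div. That includes whitespace-only nodes, and the address node itself still ends with the "|" separator. Below is a minimal sketch of one way to narrow this down to the address alone. It assumes the address is always the first non-blank text directly inside each logoDesc div and is followed by "|". The names AddressExtractor and ExtractAddresses and the shortened sample string are made up for illustration and are not from the original post.

```vb
Imports System
Imports System.Collections.Generic
Imports HtmlAgilityPack

Module AddressExtractor

    ' Returns one address per logoDesc block found in the given HTML.
    Function ExtractAddresses(html As String) As List(Of String)
        Dim results As New List(Of String)()
        Dim doc As New HtmlDocument()
        doc.LoadHtml(html)

        Dim blocks = doc.DocumentNode.SelectNodes("//div[@class='logoDesc']")
        ' SelectNodes returns Nothing (not an empty collection) when nothing matches.
        If blocks Is Nothing Then Return results

        For Each block As HtmlNode In blocks
            For Each child As HtmlNode In block.ChildNodes
                ' Only look at text that sits directly inside the div.
                If child.NodeType <> HtmlNodeType.Text Then Continue For

                Dim raw = HtmlEntity.DeEntitize(child.InnerText).Trim()
                If raw.Length = 0 Then Continue For

                ' Assumption: the address is the first non-blank text and
                ' ends at the "|" separator shown in the question's markup.
                results.Add(raw.Split("|"c)(0).Trim())
                Exit For
            Next
        Next

        Return results
    End Function

    Sub Main()
        ' Hypothetical, shortened version of the markup from the question.
        Dim sample = "<div class=""logoDesc""> Gnb Road, Chandmari, Guwahati - 781003 | " &
                     "<a href=""#"">View Map</a><br><p>Call: +(91)-9954843180</p></div>"

        For Each address In ExtractAddresses(sample)
            Console.WriteLine(address)
        Next
    End Sub

End Module
```

To use it against the live page, pass the loaded document's markup (for example `document.DocumentNode.OuterHtml` from the handler above) to ExtractAddresses. If the site changes its layout, the "first non-blank text node" assumption is the part most likely to need adjusting.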