2015-10-14

Load multiple URLs and create a data frame of the values

For a single URL, I can read the values of the variables like this:

library("RSelenium")
startServer()
remDr <- remoteDriver(browserName = "chrome")
remDr$open()
remDr$navigate("http://www.adobe.com/")
res <- remDr$executeScript('return window.s_adobe.prop1;')
res2 <- remDr$executeScript('return window.s_adobe.prop13;')

If I have more than one URL:

http://www.adobe.com/ 
https://acrobat.adobe.com/us/en/?promoid=KLXMA 

how can I get a data frame in this format:

url,prop1,prop13 
http://www.adobe.com/,"Home Page Template","" 
https://acrobat.adobe.com/us/en/?promoid=KLXMA,"Full Width Content Page Template",85 

Answer

library("RSelenium")

url_list <- c("http://www.adobe.com/", "https://acrobat.adobe.com/us/en/?promoid=KLXMA")
startServer()
remDr <- remoteDriver(browserName = "chrome")
remDr$open()
df <- data.frame()

for (url in url_list) {
    remDr$navigate(url)
    Sys.sleep(15)  # a hacky fixed wait that just lets Chrome catch up
    res <- remDr$executeScript('return window.s_adobe.prop1;')
    res2 <- remDr$executeScript('return window.s_adobe.prop13;')
    # Append one row per URL; columns are named as in the requested output
    df <- rbind(df, data.frame(url = url, prop1 = res[[1]][1], prop13 = res2[[1]][1],
                               stringsAsFactors = FALSE))
}
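
If one of the properties is not set on a page, indexing the result with `res[[1]][1]` may fail. Below is a minimal sketch of a more defensive variant that also builds all rows first and binds them once. The helper names `first_or_na`, `collect_props`, and `get_prop` are hypothetical, and the assumption that a missing JavaScript value comes back as an empty list or `NULL` has not been checked against RSelenium. The browser is replaced by a stand-in that returns the values from the question, so the snippet runs without a Selenium server.

```r
# Turn the list returned by executeScript() into a single string.
# Assumption: a missing or null JavaScript value arrives as an empty list or NULL.
first_or_na <- function(res) {
  if (length(res) == 0 || length(res[[1]]) == 0) {
    return(NA_character_)
  }
  as.character(res[[1]][1])
}

# Build one data frame row per URL, then bind all rows in a single call.
# `get_prop` is a hypothetical function(url, name) that wraps the browser calls.
collect_props <- function(urls, get_prop) {
  rows <- lapply(urls, function(u) {
    data.frame(
      url = u,
      prop1 = first_or_na(get_prop(u, "prop1")),
      prop13 = first_or_na(get_prop(u, "prop13")),
      stringsAsFactors = FALSE
    )
  })
  do.call(rbind, rows)
}

# Stand-in for the browser, using the values given in the question.
fake_pages <- list(
  "http://www.adobe.com/" = list(prop1 = "Home Page Template", prop13 = ""),
  "https://acrobat.adobe.com/us/en/?promoid=KLXMA" =
    list(prop1 = "Full Width Content Page Template", prop13 = 85)
)
fake_get_prop <- function(url, name) list(fake_pages[[url]][[name]])

df <- collect_props(names(fake_pages), fake_get_prop)
print(df)
```

With a live session, `get_prop` would instead wrap the `remDr$executeScript` call made after navigating to the URL.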