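I'm trying to scrape Hockey Reference for a Data Science 101 project, and I'm running into trouble with one particular table. The page is: https://www.hockey-reference.com/boxscores/201611090BUF.html. The table I want is under "Advanced Stats Report (All Situations)". I have tried the following code, which uses rvest to scrape the HTML data: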
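library(rvest)  # also provides the %>% pipe

url <- "https://www.hockey-reference.com/boxscores/201611090BUF.html"
# Select every node whose class list contains "right" and return its text
ret <- url %>%
  read_html() %>%
  html_nodes(xpath = '//*[contains(concat(" ", @class, " "), concat(" ", "right", " "))]') %>%
  html_text()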
This code scrapes all the data from the tables above it, but stops before the advanced table. I also tried to be more granular with:
url <- "https://www.hockey-reference.com/boxscores/201611090BUF.html"
# Same selector, restricted to descendants of the element with id "OTT_adv"
ret <- url %>%
  read_html() %>%
  html_nodes(xpath = '//*[(@id = "OTT_adv")]//*[contains(concat(" ", @class, " "), concat(" ", "right", " "))]') %>%
  html_text()
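This second attempt returns "character(0)". Any and all help would be greatly appreciated. In case it isn't already clear, I'm fairly new to R. Thanks!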
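One possible explanation, offered as an assumption rather than something confirmed for this page: some sites ship secondary tables inside HTML comments, so they never become element nodes, and an XPath such as //*[@id = "OTT_adv"] matches nothing. The sketch below uses a small made-up HTML string (not the real page) to show how such a table could be recovered by re-parsing the comment text. The markup, the id "all_OTT_adv", and the values in it are hypothetical stand-ins.

```r
library(rvest)

# Hypothetical stand-in for the page: one ordinary table, plus a second
# table wrapped in an HTML comment (an assumption about the real markup).
html <- '<html><body>
<table id="OTT_skaters"><tr><th>Player</th><th class="right">G</th></tr>
<tr><td>A</td><td class="right">1</td></tr></table>
<div id="all_OTT_adv"><!--
<table id="OTT_adv"><tr><th>Player</th><th class="right">CF</th></tr>
<tr><td>A</td><td class="right">12</td></tr></table>
--></div>
</body></html>'

page <- read_html(html)

# A commented-out table is not an element node, so a direct lookup finds nothing.
n_direct <- length(html_nodes(page, xpath = '//*[@id = "OTT_adv"]'))

# Collect the text of every comment node and parse that text as HTML.
comment_html <- page %>%
  html_nodes(xpath = "//comment()") %>%
  html_text() %>%
  paste(collapse = "")

# The table is now reachable by its id in the re-parsed document.
adv <- read_html(comment_html) %>%
  html_node(xpath = '//*[@id = "OTT_adv"]') %>%
  html_table()

print(n_direct)
print(adv)
```

If the real page behaves the same way, replacing `html` with the URL in `read_html()` should be enough to try it. If `//comment()` returns nothing there, the table is probably being added by JavaScript instead, and this approach would not apply.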