1
我想从regsubsets获取重要的变量(列名)。我喜欢逐一获取我可以分析的重要变量。这里是程序列名不同于适合regsubsets以选择最佳变量
library(leaps)
library(ISLR)
data(Hitters)
reg_fit=regsubsets(Salary~., data = Hitters, nvmax = 10, method = "forward")
的问题是在reg_fit列名不相同数据的击球手。
这里是从原始数据的输出:
names(Hitters)
## [1] "AtBat" "Hits" "HmRun" "Runs" "RBI"
## [6] "Walks" "Years" "CAtBat" "CHits" "CHmRun"
## [11] "CRuns" "CRBI" "CWalks" "League" "Division"
## [16] "PutOuts" "Assists" "Errors" "Salary" "NewLeague"
下面是从reg_fit提取的输出:
colnames(summary(reg_fit)$which)
## [1] "(Intercept)" "AtBat" "Hits" "HmRun" "Runs"
## [6] "RBI" "Walks" "Years" "CAtBat" "CHits"
## [11] "CHmRun" "CRuns" "CRBI" "CWalks" "LeagueN"
## [16] "DivisionW" "PutOuts" "Assists" "Errors" "NewLeagueN"
注Legaue改变为LeagueN,司改变为DivisionW。任何想法,如果这是一个错误,或者有一种简单的方法从reg_fit获取列名称?
我会接受这个答案。您对将分类列进行预处理以获得相同列名的建议很好。不过,我宁愿使用model.matrix,因为它没有麻烦。感谢兄弟为解决方案。 –
@MaheshYadav乐于助人。是的'model.matrix'很棒。 –