我在这里发布是因为我无法在其他地方找到任何解决方案。基本上我们在学校学习使用python进行线性回归,教授希望我们根据csv表来估计三明治中每种成分的价格以及每种三明治的固定利润。到目前为止,我们只是混淆了一个X变量和一个Y变量,所以我很困惑我应该在这里做什么?谢谢。这里是表格:使用线性回归估算价格
tomato,lettuce,cheese,pickles,palmetto,burger,corn,ham,price
0.05,1,0.05,0,0.05,0.2,0.05,0,18.4
0.05,0,0.05,0.05,0,0.2,0.05,0.05,16.15
0.05,1,0.05,0,0.05,0.4,0,0,22.15
0.05,1,0.05,0,0.05,0.2,0.05,0.05,19.4
0.05,1,0,0,0,0.2,0.05,0.05,18.4
0,0,0.05,0,0,0,0.05,0.05,11.75
0.05,1,0,0,0,0.2,0,0.05,18.15
0.05,1,0.05,0.05,0.05,0.2,0.05,0,18.65
0,0,0.05,0,0,0.2,0.05,0.05,15.75
0.05,1,0.05,0,0.05,0,0.05,0.05,15.4
0.05,1,0,0,0,0.2,0,0,17.15
0.05,1,0,0,0.05,0.2,0.05,0.05,18.9
0,1,0.05,0,0,0.2,0.05,0.05,18.75
你在做原始python或统计软件包吗? –
我可以使用numpy –
如果可以,我建议使用熊猫。它几乎将numpy数组包装成可行的数据框。 从那里,你只需要应用多个线性回归。如果你已经编写了一个回归工具,那么你肯定可以扩展它来适应多个变量。 当我第一次开始学习Python(开发一个应用的计量经济学模型)时,我做了这样的事情,既然这是一个重要的学习经历,我不能真正分享代码,但我可以建议你想要的软件包是:Sci-kit Learn/OLS统计模型;用于处理数据的大熊猫。 –