R glm.fit不返回概率?
第一篇文章在这里,新手在R。所以请容忍我,如果我没有得到这篇文章的权利:) 我尝试使用glm()来拟合模型,然后在模型上使用predictR glm.fit不返回概率?,r,glm,lm,predict,R,Glm,Lm,Predict,第一篇文章在这里,新手在R。所以请容忍我,如果我没有得到这篇文章的权利:) 我尝试使用glm()来拟合模型,然后在模型上使用predict fit_GLM <- glm(y ~., data = traintemp, family = "binomial") pred_GLM <- predict(fit_GLM, newdata = testtemp) 我错过了什么明显的东西吗 另外,如果我改为运行lm(),结果非常相似,但它运行得更快,这是怎么回事 编辑:我的数据示例:
fit_GLM <- glm(y ~., data = traintemp, family = "binomial")
pred_GLM <- predict(fit_GLM, newdata = testtemp)
我错过了什么明显的东西吗
另外,如果我改为运行lm(),结果非常相似,但它运行得更快,这是怎么回事
编辑:我的数据示例:
TripType VisitNumber Weekday Upc ScanCount DepartmentDescription FinelineNumber
1 0 7 Friday 60538815980 1 SHOES 8931
2 0 7 Friday 7410811099 1 PERSONAL CARE 4504
3 0 8 Friday 2006613744 2 PAINT AND ACCESSORIES 1017
4 0 8 Friday 2006618783 2 PAINT AND ACCESSORIES 1017
5 0 8 Friday 7004802737 1 PAINT AND ACCESSORIES 2802
6 0 8 Friday 2238495318 1 PAINT AND ACCESSORIES 4501
谢谢你,感恩节快乐
编辑23列车:
TripType Weekday Upc ScanCount DepartmentDescription FinelineNumber
1 0 Friday 60538815980 1 SHOES 8931
2 0 Friday 7410811099 1 PERSONAL CARE 4504
3 0 Friday 2006613744 2 PAINT AND ACCESSORIES 1017
4 0 Friday 2006618783 2 PAINT AND ACCESSORIES 1017
5 0 Friday 7004802737 1 PAINT AND ACCESSORIES 2802
6 0 Friday 2238495318 1 PAINT AND ACCESSORIES 4501
7 0 Friday 5200010239 1 DSD GROCERY 4606
8 0 Friday 88679300501 2 PAINT AND ACCESSORIES 3504
9 0 Friday 2238400200 2 PAINT AND ACCESSORIES 3565
10 0 Friday 72450408840 1 PAINT AND ACCESSORIES 1028
11 0 Friday 25541500000 2 DAIRY 1305
12 0 Friday 72450403700 2 PAINT AND ACCESSORIES 1018
13 0 Friday 7874204967 1 HOUSEHOLD CHEMICALS/SUPP 707
14 0 Friday 3270011053 3 PETS AND SUPPLIES 1001
15 0 Friday 1070080727 1 IMPULSE MERCHANDISE 115
16 0 Friday 3107 1 PRODUCE 103
17 0 Friday 4011 1 PRODUCE 5501
18 0 Friday 6414410235 1 DSD GROCERY 2008
19 0 Friday 4178900743 1 GROCERY DRY GOODS 3114
20 0 Friday 7800002374 1 DSD GROCERY 3467
测试:
从
?predict.glm
:
所需的预测类型。默认值为线性预测量表;另一种“响应”是响应变量的尺度。因此,对于默认二项模型,默认预测的概率为对数概率(logit标度上的概率),并且type=“response”给出了预测概率。“术语”选项返回一个矩阵,给出线性预测标度上模型公式中每个术语的拟合值
因此,在你的情况下:
pred_GLM <- predict(fit_GLM, newdata = testtemp, type = "response")
pred\u GLM您能用dput
和head
为我们提供两个数据集的一小部分吗?一些有代表性的东西可能会让我们遇到与您相同的问题。嗨,艾蒂安,我编辑了我的帖子,这行吗?对于您的数据示例,如果您使用dput(head(trainttemp,20))
和dput(head(testtemp,20))
,我们可以快速将其粘贴到R中并使用它。此外,如果两个数据集中的y
都有0和1返回glm
,则if将是最好的。试着用它来选择你的部分数据。再次你好,非常感谢你对我的帮助。我正在尝试输出dput代码,但复制粘贴的方式太大了。另外,添加响应不会改变我的结果(我还以为响应是默认的?)
TripType Weekday Upc ScanCount DepartmentDescription FinelineNumber
1 0 Friday 68113152929 -1 FINANCIAL SERVICES 1000
2 0 Friday 2238403510 2 PAINT AND ACCESSORIES 3565
3 0 Friday 2006613743 1 PAINT AND ACCESSORIES 1017
4 0 Friday 2238400200 -1 PAINT AND ACCESSORIES 3565
5 0 Friday 22006000000 1 MEAT - FRESH & FROZEN 6009
6 0 Friday 2236760452 1 PAINT AND ACCESSORIES 7
7 0 Friday 88679300501 -1 PAINT AND ACCESSORIES 3504
8 0 Friday 3019294203 1 PAINT AND ACCESSORIES 2801
9 0 Friday 2310010776 1 PETS AND SUPPLIES 3300
10 0 Friday 5114139038 1 PAINT AND ACCESSORIES 4415
11 0 Friday 5114197561 1 PAINT AND ACCESSORIES 4415
12 0 Friday 2800053970 1 CANDY, TOBACCO, COOKIES 115
13 0 Friday 7794800902 1 DSD GROCERY 7950
14 0 Friday 7920018317 1 IMPULSE MERCHANDISE 110
15 0 Friday 3500076633 1 PERSONAL CARE 203
16 0 Friday 5460010568 1 HOUSEHOLD CHEMICALS/SUPP 52
17 0 Friday 2899521479 1 FABRICS AND CRAFTS 1059
18 0 Friday 2899521979 1 FABRICS AND CRAFTS 1062
19 0 Friday 1200004300 1 DSD GROCERY 9501
20 0 Friday 88743955560 1 MENS WEAR 144
pred_GLM <- predict(fit_GLM, newdata = testtemp, type = "response")