R glm.fit不返回概率?

R glm.fit不返回概率?,r,glm,lm,predict,R,Glm,Lm,Predict,第一篇文章在这里,新手在R。所以请容忍我,如果我没有得到这篇文章的权利:) 我尝试使用glm()来拟合模型,然后在模型上使用predict fit_GLM <- glm(y ~., data = traintemp, family = "binomial") pred_GLM <- predict(fit_GLM, newdata = testtemp) 我错过了什么明显的东西吗 另外,如果我改为运行lm(),结果非常相似,但它运行得更快,这是怎么回事 编辑:我的数据示例:

第一篇文章在这里,新手在R。所以请容忍我,如果我没有得到这篇文章的权利:)

我尝试使用glm()来拟合模型,然后在模型上使用predict

  fit_GLM <- glm(y ~., data = traintemp, family = "binomial")
  pred_GLM <- predict(fit_GLM, newdata = testtemp)
我错过了什么明显的东西吗

另外,如果我改为运行lm(),结果非常相似,但它运行得更快,这是怎么回事

编辑:我的数据示例:

TripType VisitNumber Weekday         Upc ScanCount DepartmentDescription FinelineNumber
1        0           7  Friday 60538815980         1                 SHOES           8931
2        0           7  Friday  7410811099         1         PERSONAL CARE           4504
3        0           8  Friday  2006613744         2 PAINT AND ACCESSORIES           1017
4        0           8  Friday  2006618783         2 PAINT AND ACCESSORIES           1017
5        0           8  Friday  7004802737         1 PAINT AND ACCESSORIES           2802
6        0           8  Friday  2238495318         1 PAINT AND ACCESSORIES           4501
谢谢你,感恩节快乐

编辑23列车:

TripType Weekday         Upc ScanCount    DepartmentDescription FinelineNumber
1         0  Friday 60538815980         1                    SHOES           8931
2         0  Friday  7410811099         1            PERSONAL CARE           4504
3         0  Friday  2006613744         2    PAINT AND ACCESSORIES           1017
4         0  Friday  2006618783         2    PAINT AND ACCESSORIES           1017
5         0  Friday  7004802737         1    PAINT AND ACCESSORIES           2802
6         0  Friday  2238495318         1    PAINT AND ACCESSORIES           4501
7         0  Friday  5200010239         1              DSD GROCERY           4606
8         0  Friday 88679300501         2    PAINT AND ACCESSORIES           3504
9         0  Friday  2238400200         2    PAINT AND ACCESSORIES           3565
10        0  Friday 72450408840         1    PAINT AND ACCESSORIES           1028
11        0  Friday 25541500000         2                    DAIRY           1305
12        0  Friday 72450403700         2    PAINT AND ACCESSORIES           1018
13        0  Friday  7874204967         1 HOUSEHOLD CHEMICALS/SUPP            707
14        0  Friday  3270011053         3        PETS AND SUPPLIES           1001
15        0  Friday  1070080727         1      IMPULSE MERCHANDISE            115
16        0  Friday        3107         1                  PRODUCE            103
17        0  Friday        4011         1                  PRODUCE           5501
18        0  Friday  6414410235         1              DSD GROCERY           2008
19        0  Friday  4178900743         1        GROCERY DRY GOODS           3114
20        0  Friday  7800002374         1              DSD GROCERY           3467
测试:


?predict.glm

所需的预测类型。默认值为线性预测量表;另一种“响应”是响应变量的尺度。因此,对于默认二项模型,默认预测的概率为对数概率(logit标度上的概率),并且type=“response”给出了预测概率。“术语”选项返回一个矩阵,给出线性预测标度上模型公式中每个术语的拟合值

因此,在你的情况下:

pred_GLM <- predict(fit_GLM, newdata = testtemp, type = "response")

pred\u GLM您能用
dput
head
为我们提供两个数据集的一小部分吗?一些有代表性的东西可能会让我们遇到与您相同的问题。嗨,艾蒂安,我编辑了我的帖子,这行吗?对于您的数据示例,如果您使用
dput(head(trainttemp,20))
dput(head(testtemp,20))
,我们可以快速将其粘贴到R中并使用它。此外,如果两个数据集中的
y
都有0和1返回
glm
,则if将是最好的。试着用它来选择你的部分数据。再次你好,非常感谢你对我的帮助。我正在尝试输出dput代码,但复制粘贴的方式太大了。另外,添加响应不会改变我的结果(我还以为响应是默认的?)
   TripType Weekday         Upc ScanCount    DepartmentDescription FinelineNumber
1         0  Friday 68113152929        -1       FINANCIAL SERVICES           1000
2         0  Friday  2238403510         2    PAINT AND ACCESSORIES           3565
3         0  Friday  2006613743         1    PAINT AND ACCESSORIES           1017
4         0  Friday  2238400200        -1    PAINT AND ACCESSORIES           3565
5         0  Friday 22006000000         1    MEAT - FRESH & FROZEN           6009
6         0  Friday  2236760452         1    PAINT AND ACCESSORIES              7
7         0  Friday 88679300501        -1    PAINT AND ACCESSORIES           3504
8         0  Friday  3019294203         1    PAINT AND ACCESSORIES           2801
9         0  Friday  2310010776         1        PETS AND SUPPLIES           3300
10        0  Friday  5114139038         1    PAINT AND ACCESSORIES           4415
11        0  Friday  5114197561         1    PAINT AND ACCESSORIES           4415
12        0  Friday  2800053970         1  CANDY, TOBACCO, COOKIES            115
13        0  Friday  7794800902         1              DSD GROCERY           7950
14        0  Friday  7920018317         1      IMPULSE MERCHANDISE            110
15        0  Friday  3500076633         1            PERSONAL CARE            203
16        0  Friday  5460010568         1 HOUSEHOLD CHEMICALS/SUPP             52
17        0  Friday  2899521479         1       FABRICS AND CRAFTS           1059
18        0  Friday  2899521979         1       FABRICS AND CRAFTS           1062
19        0  Friday  1200004300         1              DSD GROCERY           9501
20        0  Friday 88743955560         1                MENS WEAR            144
pred_GLM <- predict(fit_GLM, newdata = testtemp, type = "response")