R中的逐步回归误差_R_Logistic Regression

R中的逐步回归误差

R中的逐步回归误差,r,logistic-regression,R,Logistic Regression,我想在R中运行逐步回归以选择最佳拟合模型，我的代码附在此处： full.modelfixed <- glm(died_ed ~ age_1 + gender + race + insurance + injury + ais + blunt_pen + comorbid + iss +min_dist + pop_dens_new + age_mdn + male_pct + pop_wht_pct + pop_blk_pct

我想在R中运行逐步回归以选择最佳拟合模型，我的代码附在此处：

full.modelfixed <- glm(died_ed ~ age_1 + gender + race + insurance + injury + ais + blunt_pen + 
               comorbid + iss +min_dist + pop_dens_new + age_mdn + male_pct + 
               pop_wht_pct + pop_blk_pct + unemp_pct + pov_100x_npct +
               urban_pct, data = trauma, family = binomial (link = 'logit'), na.action = na.exclude)
reduced.modelfixed <- stepAIC(full.modelfixed, direction = "backward")

数据中几乎每个变量都有一些缺失值，因此我无法删除所有缺失值（data=na.omit（data））

你知道怎么解决这个问题吗

谢谢

这可能会出现在统计论坛（stats.stackexchange）中，但简单地说，这里有一些注意事项

主要的一点是，在比较两个模型时，它们需要拟合在同一个数据集上（即，您需要能够将模型嵌套在彼此之间）

例如

glm1 <- glm(Dependent~indep1+indep2+indep3, family = binomial, data = data)
glm2 <- glm(Dependent~indep2+indep2, family = binomial, data = data)

glm1您可以使用软件包进行插补，然后使用数据集不会给您带来错误看起来更像是一个统计问题而不是编程问题-可能更适合stats.stackexchange。也许你想插补缺失的数据？使用多重插补来填充缺失值，避免逐步回归，因为它有严重的统计问题，并且有更好的方法可用。
glm1 <- glm(Dependent~indep1+indep2+indep3, family = binomial, data = data)
glm2 <- glm(Dependent~indep2+indep2, family = binomial, data = data)