R simstudy:产生相同值的性别数据

R simstudy:产生相同值的性别数据,r,package,simulation,R,Package,Simulation,这是我第一次尝试模拟数据-我们希望模拟数据集,并选择使用simstudy,使用以下代码: def <- defData(varname='median_household_income',formula=mean( df$median_household_income)) def <- defData(def, varname='share_unemployed_seasonal',formula=mean( df$shar

这是我第一次尝试模拟数据-我们希望模拟数据集,并选择使用simstudy,使用以下代码:

def <- defData(varname='median_household_income',formula=mean(
               df$median_household_income))
def <- defData(def, varname='share_unemployed_seasonal',formula=mean(
               df$share_unemployed_seasonal))
def <- defData(def, varname='share_population_in_metro_areas',
               formula=mean(df$share_population_in_metro_areas))
def <- defData(def, varname='share_population_with_high_school_degree',
               formula=mean(df$share_population_with_high_school_degree))
def <- defData(def, varname='share_non_citizen',
               formula=mean(df$share_non_citizen))
def <- defData(def, varname='share_white_poverty',
               formula=mean(df$share_white_poverty))
def <- defData(def, varname='gini_index',formula=mean(df$gini_index))
def <- defData(def, varname='share_non_white',formula=mean(df$share_non_white))
def <- defData(def, varname='share_voters_voted_trump',
               formula=mean(df$share_voters_voted_trump))
#outcome
def <- defData(def, varname='avg_hatecrimes_per_100k_fbi',formula=
               ".0001*median_household_income + 44*share_unemployed_seasonal + 
               -2.8*share_population_in_metro_areas +
               24*share_population_with_high_school_degree + 22*share_non_citizen + 
               3.2*share_white_poverty + 55*gini_index + -4*share_non_white + 
               -2.6*share_voters_voted_trump")

#generate simulated data
df_sim <- genData(10000,def)

为什么所有生成的值都相同?我的理解是,默认情况下,变量是按照正态分布生成的。感谢您的帮助

我发现您指的是一个软件包
simstudy
。如果查看
defData
函数()的文档,您会发现
defData
函数默认为零的
variance
参数。如果希望有不相同的观测值,则需要将此值设置为大于0的数字

defData
函数的默认行为:

defData(dtDefs = NULL, varname, formula, variance = 0,
  dist = "normal", link = "identity", id = "id")
因此,您可能希望运行以下命令

def <- defData(varname='median_household_income',
               formula=mean(df$median_household_income),
               variance = 1)

def如果您指定defData和genData函数的来源,这会很有帮助。一般来说,最好遵循:和
def <- defData(varname='median_household_income',
               formula=mean(df$median_household_income),
               variance = 1)