Python: weighted random baseline for a class distribution

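For a text classification experiment, I am trying to compute the weighted random baseline of the class distribution. I have three labels. This is some code I found for two labels, "m" and "f":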

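def wrb(distribution):
    """Weighted random baseline: the sum of squared class proportions."""
    total = 0  # running sum of squared proportions
    # A single float is treated as the proportion of one of two classes.
    if isinstance(distribution, float):
        elem2 = 1 - distribution
        distribution = [distribution, elem2]
    for prop in distribution:
        total += prop ** 2
    return total

# Proportion of the label 'm' among all labels
distr = labels.count('m') / len(labels)
print('WRB', wrb(distr))
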
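My question is: which labels do I need to fill in in place of
distr = labels.count('m') / len(labels)
? Is there a rule? Or do I really just pick one of the three labels at random?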
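
For reference, here is a minimal sketch of how the same sum-of-squares formula could be applied to three (or more) labels, assuming the baseline is meant as the expected accuracy of guessing each label in proportion to its frequency. Under that reading no single label has to be chosen: every class proportion goes into the sum. The names wrb_from_labels and example_labels are hypothetical and not part of the code I found.

```python
from collections import Counter

def wrb_from_labels(labels):
    """Sum of squared class proportions over every label that occurs."""
    counts = Counter(labels)  # label -> number of occurrences
    n = len(labels)
    return sum((c / n) ** 2 for c in counts.values())

# Hypothetical three-label data, for illustration only
example_labels = ['m', 'f', 'x', 'm', 'f', 'm']
print('WRB', wrb_from_labels(example_labels))
```

Since wrb already loops over a list, passing a list with one proportion per class to it should give the same value.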