Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/347.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 数据帧行比较_Python_Pandas_Dataframe - Fatal编程技术网

Python 数据帧行比较

Python 数据帧行比较,python,pandas,dataframe,Python,Pandas,Dataframe,我有一个熊猫数据帧,如下所示 id label_x label_y 0 1 F R 1 2 F F 2 3 F F 3 4 F F 4 5 F F 现在我想计算一下label_x和label_y的出现次数是相等的还是不相等的。在这种情况下,只有一次不相等,4次相等 df = pd.DataFrame({'id' : ["1","2","3","4",

我有一个熊猫
数据帧
,如下所示

       id  label_x label_y
0        1    F    R
1        2    F    F
2        3    F    F
3        4    F    F
4        5    F    F
现在我想计算一下label_x和label_y的出现次数是相等的还是不相等的。在这种情况下,只有一次不相等,4次相等

df = pd.DataFrame({'id' : ["1","2","3","4","5"],
                'label_x'  : ["F","F","F","F","F"], 'label_y' : ["R","F","F","F","F"]})
有很多方法可以做到这一点,包括上面

In [43]: (df.label_x == df.label_y).value_counts()
Out[43]:
True     4
False    1
dtype: int64

我想出了这个解决办法。那是最好的吗

def compare(x):
    if x[1] == x[2]:
        return 'yes'
    else:
        return 'no'

df['result'] =  df.apply(compare, axis=1)

df2 = pd.DataFrame({'count' : df.groupby( ["result"] ).size()}).reset_index()

感谢您的简单解决方案。这并不能解释
np.nan==np.nan
equaling
False
他的示例数据:-)
def compare(x):
    if x[1] == x[2]:
        return 'yes'
    else:
        return 'no'

df['result'] =  df.apply(compare, axis=1)

df2 = pd.DataFrame({'count' : df.groupby( ["result"] ).size()}).reset_index()