Pandas 如何将工作代码转换为dask代码?

Pandas 如何将工作代码转换为dask代码?,pandas,dask,Pandas,Dask,我有两个YYYYMM格式的日期 date1 = 203201 date2 = 201204 我有一个数据框[testdf],有2.35亿行,其中包含一个日期变量“date_TO_COMPARE”,我需要将它与上述两个日期进行比较,以进行筛选 我需要按如下方式筛选此数据帧: # Step 1: Create two date variables in the dataframe for comparison purposes testdf['date1'] = pd.to_datetime(te

我有两个YYYYMM格式的日期

date1 = 203201
date2 = 201204
我有一个数据框[testdf],有2.35亿行,其中包含一个日期变量“date_TO_COMPARE”,我需要将它与上述两个日期进行比较,以进行筛选

我需要按如下方式筛选此数据帧:

# Step 1: Create two date variables in the dataframe for comparison purposes
testdf['date1'] = pd.to_datetime(testdf['date1'], format = '%Y%m', errors='ignore')
testdf['date2'] = pd.to_datetime(testdf['date2'], format = '%Y%m', errors='ignore')

# Step 2: Apply the fiter
testdf_filtered = testdf[(testdf['DATE_TO_COMPARE'] <= testdf['date1']) & \
                   (testdf['DATE_TO_COMPARE'] > testdf['date2'])]
# Daskize pandas dataframe
import dask as dd 
ddata = dd.from_pandas(testdf, npartitions=5)

# Step 1: Create two date variables in the dataframe for comparison purposes
ddata['date1'] = pd.to_datetime(ddata['date1'], format = '%Y%m', errors='ignore')
ddata['date2'] = pd.to_datetime(ddata['date2'], format = '%Y%m', errors='ignore')

# Step 2: Apply the fiter
ddata_filtered = ddata[(ddata['DATE_TO_COMPARE'] <= ddata['date1']) & \
                   (ddata['DATE_TO_COMPARE'] > ddata['date2'])]

# Re-Pandize Daskized dataframe
testdf_filtered = ddata_filtered.compute(scheduler='processes')
任何教育/建议/榜样都将不胜感激。谢谢

TypeError: 'DataFrame' object does not support item assignment etc.