监视dd.DataFrame.apply的进度
如何监视行Dask数据帧应用操作的进度 用监视dd.DataFrame.apply的进度,dataframe,parallel-processing,progress-bar,monitoring,dask,Dataframe,Parallel Processing,Progress Bar,Monitoring,Dask,如何监视行Dask数据帧应用操作的进度 用ProgressBar()包装行似乎没有任何作用,即控制台上没有打印任何内容 from dask.diagnostics import ProgressBar with ProgressBar(): df_calc = ddf.apply(myfunc, axis=1) 默认情况下,Dask操作是延迟的。只有在调用compute或persist时,才会进行计算 df = dd.read_csv(...) # This lazily
ProgressBar()
包装行似乎没有任何作用,即控制台上没有打印任何内容
from dask.diagnostics import ProgressBar
with ProgressBar():
df_calc = ddf.apply(myfunc, axis=1)
默认情况下,Dask操作是延迟的。只有在调用
compute
或persist
时,才会进行计算
df = dd.read_csv(...) # This lazily builds up a computation
df = df[df.name == 'alice'] # This lazily builds up a computation
result = df.amount.sum() # This lazily builds up a computation
result = result.compute() # This triggers actual work