How to optimize PySpark toPandas() with type hints

I have never seen this warning in PySpark before:


The conversion of DecimalType columns is inefficient and may take a long time. Column names: [PVPERUSER] If those columns are not necessary, you may consider dropping them or converting to primitive types before the conversion.


The warning appears when running this call:

df = data.toPandas()

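As the warning suggests, casting the DecimalType column to a primitive type before the conversion avoids it:

df = data.select(data.PVPERUSER.cast('float'), data.another_column).toPandas()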
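The cast above names one column by hand. When several decimal columns are involved, the same idea can be applied to all of them. The following is a minimal sketch, not a definitive implementation: the helper names `decimal_safe_exprs` and `to_pandas_without_decimals` are hypothetical, `data` is assumed to be a PySpark DataFrame, and the default target is `double` (64-bit) rather than `float` (32-bit) to lose less precision. It relies only on the DataFrame's `dtypes` attribute and its `selectExpr` method, and it assumes column names contain no backticks.

```python
def decimal_safe_exprs(dtypes, target="double"):
    """Build selectExpr strings that cast decimal columns to `target`.

    `dtypes` is a list of (column_name, type_string) pairs, the shape
    returned by a PySpark DataFrame's `.dtypes` attribute, for example
    [("PVPERUSER", "decimal(10,2)"), ("another_column", "string")].
    """
    exprs = []
    for name, type_string in dtypes:
        quoted = f"`{name}`"  # assumption: the name itself has no backticks
        if type_string.startswith("decimal"):
            # Cast the decimal column and keep its original name.
            exprs.append(f"CAST({quoted} AS {target}) AS {quoted}")
        else:
            # Non-decimal columns pass through unchanged.
            exprs.append(quoted)
    return exprs


def to_pandas_without_decimals(data, target="double"):
    """Hypothetical wrapper: cast every decimal column, then convert.

    `data` is assumed to be a pyspark.sql.DataFrame.
    """
    return data.selectExpr(*decimal_safe_exprs(data.dtypes, target)).toPandas()
```

With these helpers, `df = to_pandas_without_decimals(data)` would replace the hand-written `select`. Casting a decimal to a floating-point type can change values slightly, so this only fits columns where exact decimal arithmetic is not required.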