Apache spark PySpark:如何将无数据帧转换为LibSVM格式
我想将Apache spark PySpark:如何将无数据帧转换为LibSVM格式,apache-spark,pyspark,Apache Spark,Pyspark,我想将DataFrame转换为LibSVM格式 df = spark.createDataFrame([(1,None),(0,1),(1,3)],["label","feat"]) from pyspark.mllib.util import MLUtils from pyspark.mllib.regression import LabeledPoint rdd = df.rdd.map(lambda line: LabeledPoint(line[0
DataFrame
转换为LibSVM
格式
df = spark.createDataFrame([(1,None),(0,1),(1,3)],["label","feat"])
from pyspark.mllib.util import MLUtils
from pyspark.mllib.regression import LabeledPoint
rdd = df.rdd.map(lambda line: LabeledPoint(line[0], line[1:]))
MLUtils.saveAsLibSVMFile(rdd, path)
错误:
TypeError: float() argument must be a string or a number, not 'NoneType'
预期的LibSVM
格式:
1
0 1:1
1 1:3
你同时发现了吗?你同时发现了吗?