Python 如何修复SimpleImputer:“;ValueError:输入包含NaN、无穷大或对dtype太大的值;
我有一个数据集,如下所示:Python 如何修复SimpleImputer:“;ValueError:输入包含NaN、无穷大或对dtype太大的值;,python,scikit-learn,data-science,Python,Scikit Learn,Data Science,我有一个数据集,如下所示: MonthlyCharges TotalCharges 0 29.85 109.90 1 56.95 NaN 2 NaN 108.15 3 42.30 1840.75 4 70.70 NaN 5 NaN 820.50 6 89.10 1949.40 7 NaN
MonthlyCharges TotalCharges
0 29.85 109.90
1 56.95 NaN
2 NaN 108.15
3 42.30 1840.75
4 70.70 NaN
5 NaN 820.50
6 89.10 1949.40
7 NaN NaN
8 104.80 3046.05
9 54.10 354.95
import numpy as np
from sklearn.impute import SimpleImputer
imp = SimpleImputer(missing_values=np.nan, strategy='median', verbose=0)
imp.fit(tele2_sub)
我使用SimpleImputer
来完成缺少的值:
from sklearn.impute import SimpleImputer
imp = SimpleImputer(missing_values='NaN',strategy='median',verbose=0)
imp.fit(tele2_sub)
但是,我收到一个错误
Error: ValueError: Input contains NaN, infinity or a value too large for dtype('float64').
这里缺少什么?初始化调用应将
缺少的\u值设置为numpy.nan
,如下所示:
MonthlyCharges TotalCharges
0 29.85 109.90
1 56.95 NaN
2 NaN 108.15
3 42.30 1840.75
4 70.70 NaN
5 NaN 820.50
6 89.10 1949.40
7 NaN NaN
8 104.80 3046.05
9 54.10 354.95
import numpy as np
from sklearn.impute import SimpleImputer
imp = SimpleImputer(missing_values=np.nan, strategy='median', verbose=0)
imp.fit(tele2_sub)