Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/291.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/3/arrays/13.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
如何使Python LightGBM代码接受列表_Python_Arrays_List_Scikit Learn_Lightgbm - Fatal编程技术网

如何使Python LightGBM代码接受列表

如何使Python LightGBM代码接受列表,python,arrays,list,scikit-learn,lightgbm,Python,Arrays,List,Scikit Learn,Lightgbm,我使用以下代码: import numpy as np from sklearn.model_selection import train_test_split import pandas as pd from sklearn.preprocessing import StandardScaler from sklearn.ensemble import GradientBoostingClassifier from sklearn.metrics import mean_squared_e

我使用以下代码:

import numpy as np

from sklearn.model_selection import train_test_split

import pandas as pd
from sklearn.preprocessing import StandardScaler
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import mean_squared_error,roc_auc_score,precision_score
pd.options.display.max_columns = 999
import lightgbm as lgb

def load_csv(filepath):
    data =  []
    col = []
    checkcol = False
    with open(filepath) as f:
        for val in f.readlines():
            val = val.replace("\n","")
            val = val.split(',')
            if checkcol is False:
                col = val
                checkcol = True
            else:
                data.append(val)
    df = pd.DataFrame(data=data, columns=col)
    return df

heart=load_csv(r'C:\Users\PC\Documents\Essay\heart.csv')

df=heart[['chol','cp']]
Y=heart['sex']

sc=StandardScaler()
sc.fit(df)
X=pd.DataFrame(sc.fit_transform(df))

X_train,X_test,y_train,y_test=train_test_split(X,Y,test_size=0.3,random_state=0)


d_train=lgb.Dataset(X_train, label=y_train)

params={}
params['learning_rate']=0.03
params['boosting_type']='gbdt' #GradientBoostingDecisionTree
params['objective']='binary' #Binary target feature
params['metric']='binary_logloss' #metric for binary classification
params['max_depth']=10


clf=lgb.train(params,d_train,100)
仅获取错误消息:

ValueError: Series.dtypes must be int, float or bool

我知道这是因为我选择了Y,但我也尝试过使用数组和嵌套列表,但仍然失败。

使用labelencoder可以将列转换为预期格式:

from sklearn import preprocessing

encoder = preprocessing.LabelEncoder()
encoder.fit(df['sex'])
encoder.transform(df['sex']) 
这将产生一个0和1的列表,您可以将其输入到学习算法中:

array([1, 0, 0, 0, 1])
好像你的专栏“性”是字符串类型?例如,您可以使用Sklearns Labelencoder对其进行编码,并将其设置为0和1