Python Pandas、Jupyter笔记本中的规范化错误

Python Pandas、Jupyter笔记本中的规范化错误,python,pandas,dataframe,tensorflow,Python,Pandas,Dataframe,Tensorflow,我是Python和Tensorflow的新手。我已经创建了一个使用熊猫和现有CSV文件进行简单分类的示例。但是为了规范化CSV中的列,我得到以下错误 --------------------------------------------------------------------------- TypeError Traceback (most recent call last) ~/anaconda3/lib/python3

我是Python和Tensorflow的新手。我已经创建了一个使用熊猫和现有CSV文件进行简单分类的示例。但是为了规范化CSV中的列,我得到以下错误

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in na_op(x, y)
   1008         try:
-> 1009             result = expressions.evaluate(op, str_rep, x, y, **eval_kwargs)
   1010         except TypeError:

~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in evaluate(op, op_str, a, b, use_numexpr, **eval_kwargs)
    204     if use_numexpr:
--> 205         return _evaluate(op, op_str, a, b, **eval_kwargs)
    206     return _evaluate_standard(op, op_str, a, b)

~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in _evaluate_numexpr(op, op_str, a, b, truediv, reversed, **eval_kwargs)
    119     if result is None:
--> 120         result = _evaluate_standard(op, op_str, a, b)
    121 

~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in _evaluate_standard(op, op_str, a, b, **eval_kwargs)
     64     with np.errstate(all='ignore'):
---> 65         return op(a, b)
     66 

TypeError: unsupported operand type(s) for -: 'str' and 'str'

During handling of the above exception, another exception occurred:

TypeError                                 Traceback (most recent call last)
~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in safe_na_op(lvalues, rvalues)
   1029             with np.errstate(all='ignore'):
-> 1030                 return na_op(lvalues, rvalues)
   1031         except Exception:

~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in na_op(x, y)
   1019                 mask = notna(x)
-> 1020                 result[mask] = op(x[mask], y)
   1021 

TypeError: unsupported operand type(s) for -: 'str' and 'str'

During handling of the above exception, another exception occurred:

TypeError                                 Traceback (most recent call last)
<ipython-input-22-44ad2490d2ae> in <module>()
----> 1 patients[cols_to_norm] = patients[cols_to_norm].apply(lambda x: (x- x.min())/(x.max()-x.min()))

~/anaconda3/lib/python3.6/site-packages/pandas/core/frame.py in apply(self, func, axis, broadcast, raw, reduce, result_type, args, **kwds)
   6002                          args=args,
   6003                          kwds=kwds)
-> 6004         return op.get_result()
   6005 
   6006     def applymap(self, func):

~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py in get_result(self)
    316                                       *self.args, **self.kwds)
    317 
--> 318         return super(FrameRowApply, self).get_result()
    319 
    320     def apply_broadcast(self):

~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py in get_result(self)
    140             return self.apply_raw()
    141 
--> 142         return self.apply_standard()
    143 
    144     def apply_empty_result(self):

~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py in apply_standard(self)
    246 
    247         # compute the result using the series generator
--> 248         self.apply_series_generator()
    249 
    250         # wrap results

~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py in apply_series_generator(self)
    275             try:
    276                 for i, v in enumerate(series_gen):
--> 277                     results[i] = self.f(v)
    278                     keys.append(v.name)
    279             except Exception as e:

<ipython-input-22-44ad2490d2ae> in <lambda>(x)
----> 1 patients[cols_to_norm] = patients[cols_to_norm].apply(lambda x: (x- x.min())/(x.max()-x.min()))

~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in wrapper(left, right)
   1064             rvalues = rvalues.values
   1065 
-> 1066         result = safe_na_op(lvalues, rvalues)
   1067         return construct_result(left, result,
   1068                                 index=left.index, name=res_name, dtype=None)

~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in safe_na_op(lvalues, rvalues)
   1032             if is_object_dtype(lvalues):
   1033                 return libalgos.arrmap_object(lvalues,
-> 1034                                               lambda x: op(x, rvalues))
   1035             raise
   1036 

pandas/_libs/algos_common_helper.pxi in pandas._libs.algos.arrmap_object()

~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in <lambda>(x)
   1032             if is_object_dtype(lvalues):
   1033                 return libalgos.arrmap_object(lvalues,
-> 1034                                               lambda x: op(x, rvalues))
   1035             raise
   1036 

TypeError: ("unsupported operand type(s) for -: 'str' and 'str'", 'occurred at index Group')
调试IDE:Jupyter笔记本

import pandas as pd
patients = pd.read_csv("../npushpakaran/TENSORFLOW/Tensorflow-Bootcamp-master/02-TensorFlow-Basics/pima-indians-diabetes.csv")
patients.columns
Index(['Number_pregnant', 'Glucose_concentration', 'Blood_pressure', 'Triceps',
       'Insulin', 'BMI', 'Pedigree', 'Age', 'Class', 'Group'],
      dtype='object')
cols_to_norm =['Number_pregnant', 'Glucose_concentration', 'Blood_pressure', 'Triceps',
       'Insulin', 'BMI', 'Pedigree', 'Age', 'Class', 'Group'] 


patients[cols_to_norm] = patients[cols_to_norm].apply(lambda x: (x- x.min())/(x.max()-x.min()))
在最后一行,我得到了下面的错误

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in na_op(x, y)
   1008         try:
-> 1009             result = expressions.evaluate(op, str_rep, x, y, **eval_kwargs)
   1010         except TypeError:

~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in evaluate(op, op_str, a, b, use_numexpr, **eval_kwargs)
    204     if use_numexpr:
--> 205         return _evaluate(op, op_str, a, b, **eval_kwargs)
    206     return _evaluate_standard(op, op_str, a, b)

~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in _evaluate_numexpr(op, op_str, a, b, truediv, reversed, **eval_kwargs)
    119     if result is None:
--> 120         result = _evaluate_standard(op, op_str, a, b)
    121 

~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in _evaluate_standard(op, op_str, a, b, **eval_kwargs)
     64     with np.errstate(all='ignore'):
---> 65         return op(a, b)
     66 

TypeError: unsupported operand type(s) for -: 'str' and 'str'

During handling of the above exception, another exception occurred:

TypeError                                 Traceback (most recent call last)
~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in safe_na_op(lvalues, rvalues)
   1029             with np.errstate(all='ignore'):
-> 1030                 return na_op(lvalues, rvalues)
   1031         except Exception:

~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in na_op(x, y)
   1019                 mask = notna(x)
-> 1020                 result[mask] = op(x[mask], y)
   1021 

TypeError: unsupported operand type(s) for -: 'str' and 'str'

During handling of the above exception, another exception occurred:

TypeError                                 Traceback (most recent call last)
<ipython-input-22-44ad2490d2ae> in <module>()
----> 1 patients[cols_to_norm] = patients[cols_to_norm].apply(lambda x: (x- x.min())/(x.max()-x.min()))

~/anaconda3/lib/python3.6/site-packages/pandas/core/frame.py in apply(self, func, axis, broadcast, raw, reduce, result_type, args, **kwds)
   6002                          args=args,
   6003                          kwds=kwds)
-> 6004         return op.get_result()
   6005 
   6006     def applymap(self, func):

~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py in get_result(self)
    316                                       *self.args, **self.kwds)
    317 
--> 318         return super(FrameRowApply, self).get_result()
    319 
    320     def apply_broadcast(self):

~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py in get_result(self)
    140             return self.apply_raw()
    141 
--> 142         return self.apply_standard()
    143 
    144     def apply_empty_result(self):

~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py in apply_standard(self)
    246 
    247         # compute the result using the series generator
--> 248         self.apply_series_generator()
    249 
    250         # wrap results

~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py in apply_series_generator(self)
    275             try:
    276                 for i, v in enumerate(series_gen):
--> 277                     results[i] = self.f(v)
    278                     keys.append(v.name)
    279             except Exception as e:

<ipython-input-22-44ad2490d2ae> in <lambda>(x)
----> 1 patients[cols_to_norm] = patients[cols_to_norm].apply(lambda x: (x- x.min())/(x.max()-x.min()))

~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in wrapper(left, right)
   1064             rvalues = rvalues.values
   1065 
-> 1066         result = safe_na_op(lvalues, rvalues)
   1067         return construct_result(left, result,
   1068                                 index=left.index, name=res_name, dtype=None)

~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in safe_na_op(lvalues, rvalues)
   1032             if is_object_dtype(lvalues):
   1033                 return libalgos.arrmap_object(lvalues,
-> 1034                                               lambda x: op(x, rvalues))
   1035             raise
   1036 

pandas/_libs/algos_common_helper.pxi in pandas._libs.algos.arrmap_object()

~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in <lambda>(x)
   1032             if is_object_dtype(lvalues):
   1033                 return libalgos.arrmap_object(lvalues,
-> 1034                                               lambda x: op(x, rvalues))
   1035             raise
   1036 

TypeError: ("unsupported operand type(s) for -: 'str' and 'str'", 'occurred at index Group')
---------------------------------------------------------------------------
TypeError回溯(最近一次调用上次)
na_op(x,y)中的~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py
1008尝试:
->1009结果=表达式。求值(op,str_rep,x,y,**eval_kwargs)
1010除类型错误外:
~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in evaluate(op,op_str,a,b,use_numexpr,**eval_kwargs)
204如果使用\u numexpr:
-->205返回评估(op、op、str、a、b、**评估kwargs)
206返回评估标准(op,op str,a,b)
~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in\u evaluate\u numexpr(op,op\u str,a,b,truediv,reversed,**eval\u kwargs)
119如果结果为无:
-->120结果=_评估_标准(op,op str,a,b)
121
~/anaconda3/lib/python3.6/site-packages/pandas/core/computation/expressions.py in_evaluate_标准(op,op_str,a,b,**eval_kwargs)
64和np.errstate(all='ignore'):
--->65返回op(a、b)
66
TypeError:-:“str”和“str”的操作数类型不受支持
在处理上述异常期间,发生了另一个异常:
TypeError回溯(最近一次调用上次)
~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py在safe\u na\u op中(左值、右值)
1029,带有np.errstate(all='ignore'):
->1030返回na_op(左值、右值)
1031例外情况除外:
na_op(x,y)中的~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py
1019掩模=notna(x)
->1020结果[mask]=op(x[mask],y)
1021
TypeError:-:“str”和“str”的操作数类型不受支持
在处理上述异常期间,发生了另一个异常:
TypeError回溯(最近一次调用上次)
在()
---->1名患者[cols_to_norm]=患者[cols_to_norm]。应用(lambda x:(x-x.min())/(x.max()-x.min())
应用中的~/anaconda3/lib/python3.6/site-packages/pandas/core/frame.py(self、func、axis、broadcast、raw、reduce、result\u type、args、**kwds)
6002 args=args,
6003千瓦时=千瓦时)
->6004返回操作获取结果()
6005
6006 def应用映射(自身,功能):
获取结果(self)中的~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py
316*self.args,**self.kwds)
317
-->318返回super(FrameRowApply,self).get_result()
319
320 def应用_广播(自):
获取结果(self)中的~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py
140返回自我。应用_原始()
141
-->142返回自我。应用标准()
143
144 def应用\空\结果(自身):
应用标准中的~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py(self)
246
247#使用级数生成器计算结果
-->248自应用_系列_发生器()
249
250#包装结果
~/anaconda3/lib/python3.6/site-packages/pandas/core/apply.py-in-apply\u系列\u生成器(self)
275试试:
276用于枚举中的i、v(系列):
-->277结果[i]=自f(v)
278键。追加(v.name)
279例外情况除外,如e:
in(x)
---->1名患者[cols_to_norm]=患者[cols_to_norm]。应用(lambda x:(x-x.min())/(x.max()-x.min())
包装器中的~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py(左、右)
1064 rvalues=rvalues.values
1065
->1066结果=安全值(左值、右值)
1067返回构造结果(左,结果,
1068 index=left.index,name=res\u name,dtype=None)
~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py在safe\u na\u op中(左值、右值)
1032如果是对象类型(左值):
1033返回libalgos.arrmap_对象(左值,
->1034λx:op(x,右值))
1035提高
1036
pandas/_libs/algos_common_helper.pxi在pandas中。_libs.algos.arrmap_对象()
~/anaconda3/lib/python3.6/site-packages/pandas/core/ops.py in(x)
1032如果是对象类型(左值):
1033返回libalgos.arrmap_对象(左值,
->1034λx:op(x,右值))
1035提高
1036
TypeError:(-:'str'和'str''的不支持的操作数类型发生在索引组')

任何人有任何想法,请帮助。

您不需要使用
pd.DataFrame。使用自定义函数应用
。相反,使用熊猫中可用的矢量化方法:

cols = cols_to_norm
df_sub = df.loc[:, cols]

df.loc[:, cols] = (df_sub - df_sub.min()) / (df_sub.max() - df_sub.min())

您不需要使用带有自定义函数的
pd.DataFrame.apply
。相反,使用熊猫中可用的矢量化方法:

cols = cols_to_norm
df_sub = df.loc[:, cols]

df.loc[:, cols] = (df_sub - df_sub.min()) / (df_sub.max() - df_sub.min())

谢谢你的快速更新。我会尝试这个并更新你。谢谢你的快速更新。我会尝试这个并更新你。