Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/326.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
ValueError:第一个参数必须是序列-->;散点图Python_Python_Pandas_Numpy_Matplotlib - Fatal编程技术网

ValueError:第一个参数必须是序列-->;散点图Python

ValueError:第一个参数必须是序列-->;散点图Python,python,pandas,numpy,matplotlib,Python,Pandas,Numpy,Matplotlib,我目前正在努力绘制我的线性回归输出。我发现了一个类似的问题,建议确保将数据类型设置为int。我已经确保将其合并到我的代码中 我已经阅读了很多次代码,对我来说结构似乎很合理。我愿意接受任何和所有的反馈!非常感谢你的帮助 请注意,列(事故严重程度和伤亡人数)仅为数字。(即事故严重程度为3,涉及1人伤亡) -------------------步骤1------------------- import numpy as np import pandas as pd from sklearn.linea

我目前正在努力绘制我的线性回归输出。我发现了一个类似的问题,建议确保将数据类型设置为int。我已经确保将其合并到我的代码中

我已经阅读了很多次代码,对我来说结构似乎很合理。我愿意接受任何和所有的反馈!非常感谢你的帮助

请注意,列(事故严重程度和伤亡人数)仅为数字。(即事故严重程度为3,涉及1人伤亡)

-------------------步骤1-------------------

import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
%pylab inline
import matplotlib.pyplot as plt
raw_data = pd.read_csv("/Users/Maddco12/Desktop/1-6m-accidents-traffic-flow-over-16-years/accidents_2005_to_2007.csv")
dtype={'Number_of_Casualties': int,'Accident_Severity': int}
raw_data.head(4)
filtered_data = raw_data[~np.isnan(raw_data["Accident_Severity"])] #removes rows with NaN in them
filtered_data.head(4)

filtered_data = raw_data[~np.isnan(raw_data["Number_of_Casualties"])] #removes rows with NaN in them
filtered_data.head(4)
npMatrix = np.matrix(filtered_data)
X, Y = npMatrix[:,0], npMatrix[:,1]
mdl = LinearRegression().fit(filtered_data[['Number_of_Casualties']],
filtered_data.Accident_Severity)
m = mdl.coef_[0]
b = mdl.intercept_
print "formula: y = {0}x + {1}".format(m, b)
-------------------步骤2-------------------

import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
%pylab inline
import matplotlib.pyplot as plt
raw_data = pd.read_csv("/Users/Maddco12/Desktop/1-6m-accidents-traffic-flow-over-16-years/accidents_2005_to_2007.csv")
dtype={'Number_of_Casualties': int,'Accident_Severity': int}
raw_data.head(4)
filtered_data = raw_data[~np.isnan(raw_data["Accident_Severity"])] #removes rows with NaN in them
filtered_data.head(4)

filtered_data = raw_data[~np.isnan(raw_data["Number_of_Casualties"])] #removes rows with NaN in them
filtered_data.head(4)
npMatrix = np.matrix(filtered_data)
X, Y = npMatrix[:,0], npMatrix[:,1]
mdl = LinearRegression().fit(filtered_data[['Number_of_Casualties']],
filtered_data.Accident_Severity)
m = mdl.coef_[0]
b = mdl.intercept_
print "formula: y = {0}x + {1}".format(m, b)
-------------------步骤3-------------------

import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
%pylab inline
import matplotlib.pyplot as plt
raw_data = pd.read_csv("/Users/Maddco12/Desktop/1-6m-accidents-traffic-flow-over-16-years/accidents_2005_to_2007.csv")
dtype={'Number_of_Casualties': int,'Accident_Severity': int}
raw_data.head(4)
filtered_data = raw_data[~np.isnan(raw_data["Accident_Severity"])] #removes rows with NaN in them
filtered_data.head(4)

filtered_data = raw_data[~np.isnan(raw_data["Number_of_Casualties"])] #removes rows with NaN in them
filtered_data.head(4)
npMatrix = np.matrix(filtered_data)
X, Y = npMatrix[:,0], npMatrix[:,1]
mdl = LinearRegression().fit(filtered_data[['Number_of_Casualties']],
filtered_data.Accident_Severity)
m = mdl.coef_[0]
b = mdl.intercept_
print "formula: y = {0}x + {1}".format(m, b)
-------------------步骤4-------------------

import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
%pylab inline
import matplotlib.pyplot as plt
raw_data = pd.read_csv("/Users/Maddco12/Desktop/1-6m-accidents-traffic-flow-over-16-years/accidents_2005_to_2007.csv")
dtype={'Number_of_Casualties': int,'Accident_Severity': int}
raw_data.head(4)
filtered_data = raw_data[~np.isnan(raw_data["Accident_Severity"])] #removes rows with NaN in them
filtered_data.head(4)

filtered_data = raw_data[~np.isnan(raw_data["Number_of_Casualties"])] #removes rows with NaN in them
filtered_data.head(4)
npMatrix = np.matrix(filtered_data)
X, Y = npMatrix[:,0], npMatrix[:,1]
mdl = LinearRegression().fit(filtered_data[['Number_of_Casualties']],
filtered_data.Accident_Severity)
m = mdl.coef_[0]
b = mdl.intercept_
print "formula: y = {0}x + {1}".format(m, b)
-------------------步骤5---------------------(我在这里得到值错误)

错误如下---->

ValueError回溯(最近一次调用)
在()
---->1 plt.散射(X,Y,颜色=蓝色)
2 plt.绘图([0100],[b,m*100+b],'r')
3 plt.标题(“线性回归示例”,fontsize=20)
4 plt.xlabel('伤亡人数',fontsize=15)
5 plt.ylabel(事故严重程度),fontsize=15)
/用户/maddc12/Documents/Python/anaconda/lib/python2.7/site-packages/matplotlib/pyplot.pyc分散(x、y、s、c、marker、cmap、norm、vmin、vmax、alpha、线宽、顶点、边色、保持、数据、**kwargs)
3256 vmin=vmin,vmax=vmax,alpha=alpha,
3257线宽=线宽,顶点=顶点,
->3258 edgecolors=edgecolors,data=data,**kwargs)
3259最后:
3260斧头保持(洗旧)
/内部的Users/maddc12/Documents/Python/anaconda/lib/python2.7/site packages/matplotlib/__init__.pyc(ax,*args,**kwargs)
1817警告。警告(消息%(标签名称,功能名称),
1818运行时警告,堆栈级别=2)
->1819返回函数(ax,*args,**kwargs)
1820预存单据=内部单据__
1821如果pre_doc为无:
/Users/maddc12/Documents/Python/anaconda/lib/python2.7/site-packages/matplotlib/axes//u axes.pyc分散(self、x、y、s、c、marker、cmap、norm、vmin、vmax、alpha、线宽、顶点、边色、**kwargs)
3836
3837#c将保持不变,除非其长度与x相同:
->3838 x,y,s,c=cbook。删除屏蔽点(x,y,s,c)
3839
3840刻度=s#为便于阅读,重命名如下。
/删除隐藏点(*args)中的Users/maddc12/Documents/Python/anaconda/lib/python2.7/site-packages/matplotlib/cbook.pyc
1846返回()
1847如果(类似字符串(args[0])或不可编辑(args[0]):
->1848 raise VALUERROR(“第一个参数必须是序列”)
1849 nrecs=len(args[0])
1850马格斯=[]
ValueError:第一个参数必须是序列。

我建议在绘制X和Y值之前检查它们。代码的其余部分是直接向前看的,所以问题很可能就在那里

散点图期望X和Y的值数组

试试这个,看看是否有效

plt.scatter([X],[Y], color='blue')

也许你应该检查你的csv文件。如果使用旧的Excel版本生成,可能会出现这种错误。我解决了这个问题,将我的csv加载到Google电子表格中,然后再次将其导出为(更好的)csv文件。有些csv文件类型和python的某些版本似乎存在奇怪的不兼容性。在这里,您对这个问题进行了有价值的讨论:。希望能有帮助

不幸的是,这并没有产生预期的结果。下面是产生的以下错误。ValueError:float()的文本无效:200797UD71405