Java 为什么在WEKA中读取此ARFF文件会导致过早的EOF?

Java 为什么在WEKA中读取此ARFF文件会导致过早的EOF?,java,machine-learning,weka,arff,Java,Machine Learning,Weka,Arff,下面是我用于测试的简化版本,它会产生下面的错误 @RELATION documents @ATTRIBUTE rnClass {r,n} @ATTRIBUTE CONTAINS_EMAIL numeric @ATTRIBUTE CONTAINS_PHONE numeric @DATA n,0.0,0.0 n,0.0,0.0 r,0.0,0.0 r,0.0,0.0 n,0.0,0.0 r,0.0,0.0 r,0.0,0.0 Caused by: java.io.IOException: p

下面是我用于测试的简化版本,它会产生下面的错误

@RELATION documents

@ATTRIBUTE rnClass {r,n}
@ATTRIBUTE CONTAINS_EMAIL numeric
@ATTRIBUTE CONTAINS_PHONE numeric

@DATA
n,0.0,0.0
n,0.0,0.0
r,0.0,0.0
r,0.0,0.0
n,0.0,0.0
r,0.0,0.0
r,0.0,0.0


Caused by: java.io.IOException: premature end of file, read Token[EOF], line 1
    at weka.core.converters.ArffLoader$ArffReader.errorMessage(ArffLoader.java:258) ~[weka-dev-3.7.6.jar:na]
    at weka.core.converters.ArffLoader$ArffReader.readHeader(ArffLoader.java:678) ~[weka-dev-3.7.6.jar:na]
    at weka.core.converters.ArffLoader$ArffReader.<init>(ArffLoader.java:139) ~[weka-dev-3.7.6.jar:na]

我试着用你的输入创建一个arff文件,它对我有用。可能是您的arff文件已损坏。能否创建一个新的arff文件并重试

我在尝试使用记事本++生成一些文件进行数据挖掘实验时遇到了这个问题。我怀疑这可能是一个编码问题,以某种方式破坏了文件,使WEKA停止工作。您应该尝试更改文件编码配置:如果您使用UTF-8编码创建文件,则应更改为ANSI或其他编码。它成功了。

只是为了确定,你能试试我的arff文件吗?你可以从下面的链接下载。
@RELATION documents

@ATTRIBUTE rnClass {r,n}
@ATTRIBUTE text String

@DATA
r,'blah blah blah...'
r,'\n\n\n\n\t\n\nblah blah blah\n\n...'
n,'\n\n   \n\n\n \nblah blah blah \n\n\nblah blah \n \n\n\n \'