ANTLR3实现了Jaccard相似性来比较两个java';s文件
看这段代码,这是我混搭的JCCD APIANTLR3实现了Jaccard相似性来比较两个java';s文件,java,similarity,Java,Similarity,看这段代码,这是我混搭的JCCD API BufferedReader in = new BufferedReader(new FileReader(f.getFile())); String filePath = f.getNama(); // getName of file final Antlr3JavaLexer lexer = new Antlr3JavaLexer(); lexer.preserveWhitespacesAndComments = false
BufferedReader in = new BufferedReader(new FileReader(f.getFile()));
String filePath = f.getNama(); // getName of file
final Antlr3JavaLexer lexer = new Antlr3JavaLexer();
lexer.preserveWhitespacesAndComments = false;
try {
lexer.setCharStream(new ANTLRReaderStream(in));
} catch (IOException e) {
e.printStackTrace();
return false;
}
StringBuilder sbu = new StringBuilder();
while (true) {
org.antlr.runtime.Token token = lexer.nextToken();
if (token.getType() == lexer.EOF) {
break;
}
sbu.append(token.getType());
System.out.println(token.getType());
}
它为TestFileOne.java提供了如下输出
876116423877916429791644323742916418167432388167444266238816449164291643016743444242877916429791641179164432310329164351674323742916420164432316461643016444426623164616430164444242881644442879010116429164164224143234242[]
这个TestFileTwo.java
876116423877916429791644323742916418167432388167444266238816449164291643016743444242877916429791641179164432310329164351674323742916420164432316461643016444426623164616430164444242881644442879010116429164164224143234242[]
现在我的问题是,任何人都可以给我一个线索或建议,让我知道预期结果的相似性,比如两者的输出相似性百分比?
非常感谢您……您的问题不够具体(“有人能帮忙吗”或“提供线索”很少适合这样做)。你试过了吗:巴特先生,也许你能给我举个例子吗?