Java ANTLR-输入错误不匹配

Java ANTLR-输入错误不匹配,java,antlr,antlr4,Java,Antlr,Antlr4,我有一个语法,看起来像这样,由特定语言的注释和控制语句组成: 语法: grammar DD; ddlist: (ddstmt| jclcomment)+; ddstmt: dd1 | dd2 | dd3 | dd4 ; dd1: JCLBEGIN ddname DDWORD 'DUMMY'; dd2: JCLBEGIN ddname DDWORD 'DYNAM'; dd3: JCLBEGIN ddname DDWORD NAME'=' ('*'|NAM

我有一个语法,看起来像这样,由特定语言的注释和控制语句组成:

语法:

grammar DD;

ddlist: (ddstmt| jclcomment)+;

ddstmt:        dd1 | dd2 | dd3 | dd4 ;

dd1:    JCLBEGIN ddname  DDWORD 'DUMMY';
dd2:    JCLBEGIN ddname  DDWORD 'DYNAM';
dd3:    JCLBEGIN ddname  DDWORD NAME'=' ('*'|NAME);
dd4:    JCLBEGIN ddname  DDWORD '*' inlinerec INLINESTMTEND?;

inlinerec: (INLINEDATA )+ ;
fragment INLINEDATA: (~[\r\n])*;

ddname: NAME;

jclcomment: JCLCOMMENT+;
JCLCOMMENT: COMMENTBEGIN ~[\r\n]*;

DDWORD:     'DD';

JCLBEGIN:       '//'    ;
COMMENTBEGIN:   '//*'   ;
INLINESTMTEND:  '/*'    ;

NAME  : [A-Z#] (ALPHA | NUMBER | SPECIALCHARS)*;

NUMBER: [0-9];
ALPHA: [A-Z];
SPECIALCHARS:   '#' | '@' | '$';

STRING
 : '\'' (~[\r\n"])* '\''
 | '"' (~[\r\n"])* '"'
 ;

WS     : [ \r\n] -> channel(HIDDEN);
我的意见是:

//SYSIN    DD  *                                      
SORT FIELDS=COPY
INCLUDE COND
/*                                                    
//SYSPRINT DD  SYSOUT=*        
//* Comment line #1                       
//* Comment line #2
//SYSOUT   DD  SYSOUT=*                               
//CEEDUMP  DD  SYSOUT=*                               
//* Comment line #3
//SYSUDUMP DD  SYSOUT=A           
使用AntlrWorks对输入运行此语法时,出现以下错误:

line 2:0 mismatched input 'SORT' expecting INLINEDATA
如何解决此错误?

1)空白规则生成大量令牌:

$ echo $CLASSPATH
.:/usr/local/lib/antlr-4.6-complete.jar
$ alias grun
alias grun='java org.antlr.v4.gui.TestRig'
$ grun DD ddlist -tokens jcl.txt
[@6,11:12='DD',<'DD'>,1:11]
[@7,13:13=' ',<WS>,channel=1,1:13]
[@8,14:14=' ',<WS>,channel=1,1:14]
[@9,15:15='*',<'*'>,1:15]
[@10,16:16=' ',<WS>,channel=1,1:16]
[@11,17:17=' ',<WS>,channel=1,1:17]
[@12,18:18=' ',<WS>,channel=1,1:18]
[@13,19:19=' ',<WS>,channel=1,1:19]
[@14,20:20=' ',<WS>,channel=1,1:20]
[@15,21:21=' ',<WS>,channel=1,1:21]
[@16,22:22=' ',<WS>,channel=1,1:22]

2) 您没有提到编译语法时出现的重要错误消息:

warning(125): DD.g4:12:12: implicit definition of token INLINEDATA in parser
在解析器中使用未定义的标记就好像您有一个lexer规则:

INLINEDATA : 'INLINEDATA' ;
这是一个字符串常量。因此,解析器规则

dd4:    JCLBEGIN ddname  DDWORD '*' inlinerec INLINESTMTEND?;
意思是:我希望输入流是:

//{a name} DD * 'INLINEDATA'
但投入是:

//SYSIN    DD  *   SORT 
这就是信息

line 2:0 mismatched input 'SORT' expecting INLINEDATA

3) 我对此类作业控制语句的语法:

grammar JCL;

/* Parsing JCL, ignoring inline sysin. */

jcl
    :   jcl_card+   // good old punched cards :-)
    ;

jcl_card
    :   dd_statement
    |   COMMENT
    ;

dd_statement
    :   '//' NAME 'DD' file_type ( NL | EOF )
    ;

file_type
    :   'DUMMY'
    |   'DYNAM'
    |   NAME '=' ( '*' | NAME )
    |   '*' NL inline_sysin
    ;

inline_sysin
    :   NON_JCL_CARD* END_OF_FILE
    ;

NAME          : [A-Z#] ( LETTER | DIGIT | SPECIAL_CHARS )* ;
COMMENT       : '//*' .*? ( NL | EOF ) ;
END_OF_FILE   : '/'  {getCharPositionInLine() == 1}? '*' ;
NON_JCL_CARD  : ~'/' {getCharPositionInLine() == 1}? .*? ( NL | EOF ) ;
STRING        : '\'' .*? '\'' | '"' .*? '"' ;
NL  : [\r\n] ;
WS  : [ \t]+ -> skip ; // or -> channel(HIDDEN) to keep white space tokens

fragment DIGIT  : [0-9] ;
fragment LETTER : [A-Z] ;
fragment SPECIAL_CHARS : '#' | '@' | '$' ;
输入

//SYSIN    DD  *     
SORT FIELDS=COPY
INCLUDE COND
any other program input @ $ ! & %
/*                  
//SYSPRINT DD  SYSOUT=*
//* Comment line #1           
//* Comment line #2
//SYSOUT   DD  SYSOUT=*     
//SYSOUT   DD  DUMMY
//SYSIN    DD  *     
 /* not end of input    
/*
它给

$ grun JCL jcl -tokens jcl.txt
[@0,0:1='//',<'//'>,1:0]
[@1,2:6='SYSIN',<NAME>,1:2]
[@2,11:12='DD',<'DD'>,1:11]
[@3,15:15='*',<'*'>,1:15]
[@4,21:21='\n',<NL>,1:21]
[@5,22:38='SORT FIELDS=COPY\n',<NON_JCL_CARD>,2:0]
[@6,39:51='INCLUDE COND\n',<NON_JCL_CARD>,3:0]
[@7,52:85='any other program input @ $ ! & %\n',<NON_JCL_CARD>,4:0]
[@8,86:87='/*',<END_OF_FILE>,5:0]
[@9,106:106='\n',<NL>,5:20]
...
@17,131:161='//* Comment line #1           \n',<COMMENT>,7:0]
...
[@31,232:233='//',<'//'>,11:0]
[@32,234:238='SYSIN',<NAME>,11:2]
[@33,243:244='DD',<'DD'>,11:11]
[@34,247:247='*',<'*'>,11:15]
[@35,253:253='\n',<NL>,11:21]
[@36,254:278=' /* not end of input    \n',<NON_JCL_CARD>,12:0]
[@37,279:280='/*',<END_OF_FILE>,13:0]
[@38,281:280='<EOF>',<EOF>,13:2]

如果解析器规则dd4产生冲突,那么单独为dd4编写一个单独的语法,然后再编写主语法有意义吗?如果您有任何疑问,请与我们分享最佳实践。谢谢您的回答。让我尝试您的解决方案并更新您。当我使用Antlrworks2运行语法时,我收到了此错误
第1:22行无关输入'\n'预期{END_OF_FILE,NON_JCL_CARD}
。我更改了
NL:[\r\n]
NL:'\r'\n'而且它就像一个魔咒,如果你看看我的原始语法,我可以选择
INLINESTMTEND
。因此,我将把解决方案语法中的
END\u OF_FILE
设置为可选,并尝试在没有NL标记的情况下重写语法。语法必须与输入匹配,反之亦然。所以抱怨我们有太多的评论,并建议继续聊天,但我并不总是在线。我已经删除了一些评论,你可以这样做,我想未来的读者对此并不感兴趣。
$ grun JCL jcl -tokens jcl.txt
[@0,0:1='//',<'//'>,1:0]
[@1,2:6='SYSIN',<NAME>,1:2]
[@2,11:12='DD',<'DD'>,1:11]
[@3,15:15='*',<'*'>,1:15]
[@4,21:21='\n',<NL>,1:21]
[@5,22:38='SORT FIELDS=COPY\n',<NON_JCL_CARD>,2:0]
[@6,39:51='INCLUDE COND\n',<NON_JCL_CARD>,3:0]
[@7,52:85='any other program input @ $ ! & %\n',<NON_JCL_CARD>,4:0]
[@8,86:87='/*',<END_OF_FILE>,5:0]
[@9,106:106='\n',<NL>,5:20]
...
@17,131:161='//* Comment line #1           \n',<COMMENT>,7:0]
...
[@31,232:233='//',<'//'>,11:0]
[@32,234:238='SYSIN',<NAME>,11:2]
[@33,243:244='DD',<'DD'>,11:11]
[@34,247:247='*',<'*'>,11:15]
[@35,253:253='\n',<NL>,11:21]
[@36,254:278=' /* not end of input    \n',<NON_JCL_CARD>,12:0]
[@37,279:280='/*',<END_OF_FILE>,13:0]
[@38,281:280='<EOF>',<EOF>,13:2]
$ grun JCL jcl -gui jcl.txt