Is it possible to parse/lex UNICODE characters with ANTLR?
Created May 4, 2012
Terence Parr
Yes. See Scanning Unicode Characters for a full description.
I hope to support predefined Unicode sets such as LETTER, but I think that will have to wait for 2.7.3.
As of 2.7.2, you can use European characters (code points up to 255) in ANTLR grammars themselves, whereas the input to your generated parsers is unrestricted.
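As a rough illustration of the point above, a 2.7.x lexer grammar might widen its character vocabulary along these lines. This is only a sketch: the class name UnicodeLexer, the rule names, and the particular character ranges are made up for the example, and the non-ASCII ranges are written as \u escapes so the grammar file itself stays within the 255 limit.

```
// Hypothetical ANTLR 2.7.x lexer grammar; names and ranges are illustrative only.
class UnicodeLexer extends Lexer;

options {
    // Let the generated lexer accept any 16-bit character, not just ASCII.
    charVocabulary = '\u0000'..'\uFFFE';
}

// Identifier characters: ASCII letters plus an assumed range of
// non-ASCII characters, given as an escape range rather than literally.
ID  :   ( 'a'..'z' | 'A'..'Z' | '\u0080'..'\uFFFE' )+
    ;

// Skip whitespace.
WS  :   ( ' ' | '\t' | '\r' | '\n' )
        { $setType(Token.SKIP); }
    ;
```

On the Java side, the generated lexer would then typically be constructed from a Reader that decodes the input (for example, an InputStreamReader with an explicit encoding) rather than from a raw byte stream, so that it sees characters instead of bytes.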