Is it possible to parse/lex UNICODE characters with ANTLR?

Terence Parr

Yes. See "Scanning Unicode Characters" for a full description.

I hope to add support for predefined Unicode sets such as LETTER, but I think that will have to wait until 2.7.3.
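Until predefined sets exist, one possible workaround, assuming your generated lexer targets Java, is to classify characters in ordinary Java code. The sketch below is only an illustration: the class and method names are hypothetical and not part of ANTLR, and it relies on the JDK's own `Character.isLetter` rather than on anything ANTLR provides.

```java
public class UnicodeLetterSketch {
    /**
     * Counts the code points in s that the JDK classifies as letters.
     * Hypothetical helper for illustration; not an ANTLR API.
     */
    static int countLetters(String s) {
        int count = 0;
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (Character.isLetter(cp)) {
                count++;
            }
            // Advance by one or two chars depending on the code point.
            i += Character.charCount(cp);
        }
        return count;
    }

    public static void main(String[] args) {
        // "caf\u00e9 \u03b1\u03b2 42": Latin, accented Latin, and Greek letters, then digits.
        System.out.println(countLetters("caf\u00e9 \u03b1\u03b2 42")); // prints 6
    }
}
```

A check like this could, in principle, be called from an action or predicate in a grammar, but how well that fits depends on your lexer, so treat it as a starting point only.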

As of 2.7.2, ANTLR grammars themselves may use European characters (character codes up to 255), whereas the input to your generated parsers is unrestricted.