Thursday, August 15, 2002 02:10 PM
BreakIterator works fine with most encodings, even with Japanese. But seems it is not working correctly with Chinese. Is there some way to detect begin and end of the word in text in Chinese?
Another question, is there easy way using BreakIterator to switch from word detection to character detection for some particular charset? For example, if I use mixed text (chinese and english), to get word if it is in english, or single character if it is in chinese.