> 'nocompatible' mode, searching on [^\x00-\xFF] might do it: this
> regexp atom matches anything above U+00FF, i.e., any hanzi, but (this
> is the caveat) also any non-Latin letter, any non-Western Latin
> letter, and the above-mentioned œ Œ and Ÿ which are Western (French,
> at least, has a use for them) but not included in Latin 1.
Another caveat is that things such as spaces and punctuation can be used
in Chinese text, which are in that lower range (though there are
alternatives in the CJK Unicode regions too). This may or may not matter
for your text and application, of course.
Ben.
--
You received this message from the "vim_use" maillist.
Do not top-post! Type your reply below the text you are replying to.
For more information, visit http://www.vim.org/maillist.php
No comments:
Post a Comment