Tuesday, December 25, 2012

Re: matchpairs and Unicode

On 25/12/12 23:11, Christian Brabandt wrote:
>
> By the way, what kind of multibyte characters do you need?
>
> regards,
> Christian
>
Hm, let's see… Anything above 0x7F is 2 or more bytes in UTF-8…

For Latin, Cyrillic, etc.:
« (U+00AB LEFT-POINTING DOUBLE ANGLE QUOTATION MARK) and » (U+00BB
RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK)
“ (U+201C LEFT DOUBLE QUOTATION MARK) and either ” (U+201D RIGHT DOUBLE
QUOTATION MARK) or „ (U+201E DOUBLE LOW-9 QUOTATION MARK)
‘ (U+2018 LEFT SINGLE QUOTATION MARK) and either ’ (U+2019 RIGHT SINGLE
QUOTATION MARK) or ‚ (U+201A SINGLE LOW-9 QUOTATION MARK)
Depending on the language, any two characters from each of the three
groups above may be paired, in either order.

In addition, for Spanish:
¡ (U+00A1 INVERTED EXCLAMATION MARK) and ! (U+0021 EXCLAMATION MARK)
¿ (U+00BF INVERTED QUESTION MARK) and ? (U+003F QUESTION MARK)
In each of these two pairs, only the first member is multi-byte.
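
As a quick sanity check of those byte counts, here is a minimal Python
sketch. It is not anything Vim itself does, and utf8_lengths is a name
made up for illustration. It encodes a few of the pairs listed in this
message as UTF-8 and counts the bytes of each member:

```python
# Hypothetical helper for illustration only; not part of Vim.
def utf8_lengths(pair):
    """Return the UTF-8 byte length of each character in a pair."""
    return tuple(len(ch.encode("utf-8")) for ch in pair)

# A few sample pairs taken from this message.
pairs = ["«»", "¡!", "¿?", "「」"]

for pair in pairs:
    # A length of 1 means plain ASCII; 2 or more means multi-byte.
    print(pair, utf8_lengths(pair))
```

The Spanish pairs should come out as (2, 1), i.e. a multi-byte opener
with a single-byte closer, while the CJK brackets take 3 bytes each.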

For Arabic etc.:
﴿ (U+FD3F ORNATE RIGHT PARENTHESIS) and ﴾ (U+FD3E ORNATE LEFT PARENTHESIS)

For CJK:
〈 (U+3008 LEFT ANGLE BRACKET) and 〉 (U+3009 RIGHT ANGLE BRACKET)
《 (U+300A LEFT DOUBLE ANGLE BRACKET) and 》 (U+300B RIGHT DOUBLE ANGLE
BRACKET)
「 (U+300C LEFT CORNER BRACKET) and 」 (U+300D RIGHT CORNER BRACKET)
『 (U+300E LEFT WHITE CORNER BRACKET) and 』 (U+300F RIGHT WHITE CORNER
BRACKET)
【 (U+3010 LEFT BLACK LENTICULAR BRACKET) and 】 (U+3011 RIGHT BLACK
LENTICULAR BRACKET)
〔 (U+3014 LEFT TORTOISE SHELL BRACKET) and 〕 (U+3015 RIGHT TORTOISE
SHELL BRACKET)
〖 (U+3016 LEFT WHITE LENTICULAR BRACKET) and 〗 (U+3017 RIGHT WHITE
LENTICULAR BRACKET)
〘 (U+3018 LEFT WHITE TORTOISE SHELL BRACKET) and 〙 (U+3019 RIGHT WHITE
TORTOISE SHELL BRACKET)
〚 (U+301A LEFT WHITE SQUARE BRACKET) and 〛 (U+301B RIGHT WHITE SQUARE
BRACKET)
〝 (U+301D REVERSED DOUBLE PRIME QUOTATION MARK) with either 〞 (U+301E
DOUBLE PRIME QUOTATION MARK) or (preferred) 〟 (U+301F LOW DOUBLE PRIME
QUOTATION MARK)

…these are the ones I know about; there may be others.


Best regards,
Tony.
--
A triangle which has an angle of 135 degrees is called an obscene
triangle.


