Tuesday, July 27, 2021

Re: unicode: UTF / UCS

Johannes Köhler wrote:
> when my linux
> is already configured with unicode it should
> not use ascii anymore, and without the order that i have
> to stick deep into the science of charsets.

Unfortunately the only way to achieve that, a BOM (byte order mark) at
the start of every file, is particularly disputed on Linux, where it is
indeed more likely to cause problems.
By the way, many things "should".
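
To make the BOM point concrete, here is a minimal Python sketch (my own
illustration, not something Vim does internally). It shows that pure
ASCII text encoded as UTF-8 is byte-for-byte identical to ASCII, so
without a BOM nothing in the file marks it as "Unicode". The helper
name has_utf8_bom is hypothetical; "utf-8-sig" is Python's codec name
for UTF-8 with a leading BOM.

```python
import codecs

text = "abc"  # pure ASCII sample, chosen only for illustration

plain = text.encode("utf-8")         # UTF-8 without a BOM
with_bom = text.encode("utf-8-sig")  # UTF-8 with the BOM prepended

print(plain)     # b'abc'  (identical to the ASCII bytes)
print(with_bom)  # b'\xef\xbb\xbfabc'


def has_utf8_bom(data: bytes) -> bool:
    """Hypothetical helper: True if data starts with the UTF-8 BOM."""
    return data.startswith(codecs.BOM_UTF8)


print(has_utf8_bom(plain))     # False
print(has_utf8_bom(with_bom))  # True
```

Those three extra bytes are what tends to upset tools that expect a
file to begin with specific bytes, which is presumably why the practice
is disputed on Linux.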


> I spent some time with unicode pages. Therefore i thought to
> remember that ascii characters encoded in unicode using
> an different endian. But maybe i am wrong with this in
> mind.

The encodings with multi-byte code units (UTF-16, UTF-32 and the legacy
UCS-2 and UCS-4) do have endianness; the byte-order-specific versions
are called UTF-16LE, UTF-16BE, etc.
In Unicode these latter are called "Encoding Schemes" (Unicode Standard,
section 3.10), while the higher-level concept that doesn't concern
itself with endianness is called "Encoding Form" (section 3.9).
By the way, I'm actually quite rusty in all these things, so take my
words with a grain of salt.
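
A small Python sketch of the endianness point, again only as an
illustration: the same ASCII character gets different byte sequences
under the LE and BE encoding schemes, while UTF-8, whose code units are
single bytes, has no byte order at all. The variable names are mine;
the codec names are the ones Python's standard library provides.

```python
import codecs

sample = "A"  # U+0041, an ASCII character

utf8 = sample.encode("utf-8")          # one byte, no byte order involved
utf16_le = sample.encode("utf-16-le")  # low byte first
utf16_be = sample.encode("utf-16-be")  # high byte first
utf32_le = sample.encode("utf-32-le")
utf32_be = sample.encode("utf-32-be")

print(utf8)      # b'A'
print(utf16_le)  # b'A\x00'
print(utf16_be)  # b'\x00A'
print(utf32_le)  # b'A\x00\x00\x00'
print(utf32_be)  # b'\x00\x00\x00A'

# The plain "utf-16" codec uses the machine's native byte order when
# encoding and prepends a BOM so that a reader can tell which it was.
utf16_auto = sample.encode("utf-16")
print(utf16_auto.startswith(codecs.BOM_UTF16))  # True
```

So an ASCII character does look different in UTF-16LE and UTF-16BE,
but in UTF-8 it is just the ASCII byte.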

