Tuesday, August 12, 2014

an update for the romanian spell checker

The latest (utf8) spell check file for Vim for romanian language, from http://ftp.vim.org/vim/runtime/spell/ is still using the old (and wrong) diacritics (cedilla form of șțȘȚ and î instead of â inside of words).

The other method - using the 'spell-mkspell' method from vim's excellent help docs - does not yield by default a valid .spl file, using any recent (3 year or less) version of ro_RO.aff and ro_RO.dic (the newest being http://sourceforge.net/projects/rospell/files/Romanian%20dictionaries/dict-3.3.10/ro_RO.3.3.10.zip).

I took some time to look at the fawlty 'ro_RO.aff' format (previously total 'foreign' to me) and noticed that at some points, the authors forgot to place the 5th element in some lines (presumably a dot i.e. anything goes). So instead of this:

"SFX q 0 ă ."

it had this:

"SFX q 0 ă"

on several lines, which i can only suppose made Vim think that it met another suffix group which had 'ă' elements (obviously not a number) -> error, causing the rest of the file to be ignored.

I attached an updated utf8 romanian file from the latest (3.3.10) romanian spell files, if anyone close to home (in spirit, at least) is there to use it. Of course, with the above file corrected by the above addition, anybody can use the standard method and make his/hers own spell file.

Obviously this is something for the authors of the .aff files to take care of. On the other hand, i saw the exactly same error discussed on some other OpenOffice (or Mozilla) site. Since 1) it's very easy to forget the dot on such a line, 2) the authors of such files are not necessarily software developers, so the strictness should not be on their backs and 3) such a line is expected to be part of a 4-element prefix or suffix group defined above with a KNOWN / EXPECTED number of entries, perhaps something could be done to the vim src/spell.c so that such a "expected 5th element is missing" could be translated to a dot. But - what do i know - only if this happens to another languages as well.

--
--
You received this message from the "vim_use" maillist.
Do not top-post! Type your reply below the text you are replying to.
For more information, visit http://www.vim.org/maillist.php

---
You received this message because you are subscribed to the Google Groups "vim_use" group.
To unsubscribe from this group and stop receiving emails from it, send an email to vim_use+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

No comments: