Saturday, August 30, 2014

Re: an update for the romanian spell checker

Vanilla Ice wrote:

> > > miercuri, 13 august 2014, 15:51:25 UTC+3, Bram Moolenaar a scris:
> > > > Vanilla Ice wrote:
> > > >
> > > >> <snip>
> > > >
> > > > Thanks for figuring this out. Can you please send me the .dic and .aff
> > > >
> > > > files you used to generate this .spl file? Or better: the URL of the
> > > >
> > > > files to be downloaded and a diff on top of that.
> > > >
> > >
> > > By all means. I hope that there aren't other mistakes (I loaded a
> > > text-formatted translation of "Caves of Steel" by Asimov and it
> > > looked ok). In any case, it should be better than the old one,
> > > which is quite unusable for correct fonts/new grammar rules.
> > >
> > > 1. The URL is: http://sourceforge.net/projects/rospell/files/Romanian%20dictionaries/dict-3.3.10/ro_RO.3.3.10.zip
> > >
> > > and
> > > 2. The diff file is attached to this post.
> >
> > I tried using that zip file with the diff, but I get lots of errors.
> >
> > The first one is easy to fix: Comment-out the KEY line.
> >
> > I get lots of "Trailing text" errors.
> > I get a few "Duplicate character in MAP" errors.
> >
> > For the cp1250 encoded file I get lots of "Conversion failure" errors.
> >
> > Did you not see these errors or did you just ignore them?
>
> The "trailing text" errors (a lot of them!) are caused by the fact
> that the authors of the spell files didn't put a comment start after
> the 5th element. From what i've seen in the vim spell.c source, the
> error messages are actually warnings, i.e. they don't affect the
> parsing.

I think these are grammar annotations. The problem is that they could
be mistakes. I think we need to add a flag that tells the parser that
the fifth item is to be ignored. We could edit the file and put the
changes in the diff, but that means the diff gets outdated very quickly.

> I don't know what the effect of the "Duplicate character in MAP" error
> is, but i've seen no invalid spell behaviour with the new .spl.
>
> Also, i just generated the ro.cp1250.spl file in gvim/Windows using
>
> :set encoding=cp1250
> :mkspell ro ro_RO in the right folder, and had no other errors than the above ones (no "Conversion failure" errors)

I'm doing this on Ubuntu. The locale handling is a bit complicated and
iconv is quite strict. I hope someone can make it work on Ubuntu
without errors. The errors might be harmless, but they will cause other
errors to go unnoticed.

Generally, I would really appreciate it when someone takes upon him the
task to update the spell files. Once you get the hang of it when doing
one language, it should get easier for others.

--
From "know your smileys":
:q vi user saying, "How do I get out of this damn emacs editor?"

/// Bram Moolenaar -- Bram@Moolenaar.net -- http://www.Moolenaar.net \\\
/// sponsor Vim, vote for features -- http://www.Vim.org/sponsor/ \\\
\\\ an exciting new programming language -- http://www.Zimbu.org ///
\\\ help me help AIDS victims -- http://ICCF-Holland.org ///

--
--
You received this message from the "vim_use" maillist.
Do not top-post! Type your reply below the text you are replying to.
For more information, visit http://www.vim.org/maillist.php

---
You received this message because you are subscribed to the Google Groups "vim_use" group.
To unsubscribe from this group and stop receiving emails from it, send an email to vim_use+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

No comments: