<div dir="ltr">I think it should be done in the following order:<br>- If hspell doesn't already have it, record for each word whether it's a verb, an adjective, and so on.<br>- Grammatical analyzer - I saw a doc work about it that was released under the GPL long ago.<br>
- Grammatical fixer (maybe better spelling suggestions based on grammar)<br>- Independent of that, we need a list of words and their nikud (I also saw one in that doc work)<br>- Nikud checker<br>- Nakdan<br><br>Does anyone know a good place to start getting a word list with nikud?<br>
Or where to find the doc work that produced the grammatical analyzer?<br><br>Ely<br><div class="gmail_quote">On Fri, Jan 1, 2010 at 10:18 AM, Dan Kenigsberg <span dir="ltr"><<a href="mailto:danken@cs.technion.ac.il">danken@cs.technion.ac.il</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Who said anything about *few* rules? They are many, and are complex, and have<br>
a gazillion exceptions. But they exist, and putting them into effect in<br>
hspell's inflection scripts is doable, albeit requiring a lot of meticulous<br>
work. The classical references for niqqud are Luah HaShemot HaShalem and Luah<br>
HaP`alim HaShalem by Shaul Barkali. These tables include all the rules and all<br>
the exceptions needed to add the correct niqqud to Hebrew words.<br>
<div><div></div><div class="h5"><br>
On Fri, Jan 01, 2010 at 02:02:21AM +0200, Ely Levy wrote:<br>
> I can only speak from my own experience: I couldn't find any good source for<br>
> rules about nikud and grammar in a simple form.<br>
> I did find a GPLed word list with nikud, and I think I even talked to the<br>
> people at MILA.<br>
> But no one could provide those few rules you are talking about.<br>
> (And I'm still confused about the difference between old and modern<br>
> grammar/nikud...)<br>
><br>
> Ely<br>
><br>
> On Thu, Dec 31, 2009 at 4:11 PM, Nadav Har'El <<a href="mailto:nyh@math.technion.ac.il">nyh@math.technion.ac.il</a>> wrote:<br>
><br>
> > On Thu, Dec 31, 2009, E L wrote about "Re: Announce: Hspell 1.1":<br>
> > > I think the main problem is knowing what needs to be done, not the manpower to<br>
> > > program it.<br>
> > > If someone knows what rules grammar or nikud checkers should<br>
> > > follow, I'm sure it won't be a big<br>
> > > deal programming one.<br>
> ><br>
> > I beg to differ.<br>
> ><br>
> > First of all, most of the needed knowledge already exists, published in<br>
> > numerous papers and books, and demonstrated by several pieces of commercial<br>
> > software. One doesn't need to come with advanced knowledge of the topic,<br>
> > any more than I had to be some spell-checking expert before I started<br>
> > Hspell.<br>
> > All one needs is a willingness to learn, and of course the resourcefulness<br>
> > to put it to good use.<br>
> ><br>
> > Second, while the work on Hspell had a lot of very interesting theoretical<br>
> > sides and problems to solve (in linguistics, language, compression, etc.),<br>
> > most of the work was actually the mundane and almost endless task of making<br>
> > lists of words (a task which you can see, still isn't done 10 years after<br>
> > starting the project). For niqqud checking, there is also a lot of similar<br>
> > mundane work that needs to be done (writing the right niqqud for each word),<br>
> > and that takes a lot of time.<br>
> > For grammar checking, it depends what you call grammar: If you also want<br>
> > to include semantics, and not just grammar - like Prof. Uzzi Ornan did in<br>
> > his text-to-speech and niqqud research (and product) - there's also tons<br>
> > of work that needs to be done on creating classes of nouns, listing arguments<br>
> > of verbs, and so on. I guess you can start with just grammar, though, and<br>
> > in this case, you're right - it should be doable without too much data<br>
> > collection - so maybe this is indeed a good project to start with.<br>
> ><br>
> > This is all very interesting work. Unfortunately, I do not see myself<br>
> > starting it in the near future. If anyone is interested in taking a shot<br>
> > at it, I'd love to advise - please contact me and/or Dan privately.<br>
> ><br>
> > Nadav.<br>
> ><br>
> > --<br>
> > Nadav Har'El | Thursday, Dec 31 2009, 14 Tevet 5770<br>
> > <a href="mailto:nyh@math.technion.ac.il">nyh@math.technion.ac.il</a> |-----------------------------------------<br>
> > Phone +972-523-790466, ICQ 13349191 |I couldn't afford a cool signature, so I<br>
> > <a href="http://nadav.harel.org.il" target="_blank">http://nadav.harel.org.il</a> |just got this one.<br>
> ><br>
<br>
</div></div><div><div></div><div class="h5">
<br>
<br>
</div></div><font color="#888888">--<br>
Dan Kenigsberg <a href="http://www.cs.technion.ac.il/%7Edanken" target="_blank">http://www.cs.technion.ac.il/~danken</a> ICQ 162180901<br>
</font></blockquote></div><br></div>