Comments on A Little Bit of ME: Experimental Thaana Spell-checking with Hunspell
19 comments, newest first. Feed last updated 2023-10-02 23:12. All times are UTC+08:00.

Anonymous, 2013-10-11 05:15
font thai

SoE, 2010-09-15 13:19
not at all. go right ahead.

AyaBuddy, 2010-09-15 13:16
cannot send a message because you're not following me lol.
would it be considered creepy if i send you a message on facebook? haha

SoE, 2010-09-14 13:38
hit me up on twitter. link on page nav on top.

AyaBuddy, 2010-09-13 22:30
Hello,
I am working on a similar project and would be very happy to get some assistance from you :)
Please tell me a way to contact you :)

Anonymous, 2010-09-13 14:30
very interesting. Best of luck with your research.

Anonymous, 2010-09-12 19:05
semiotically speaking, dhivehi & thaana writing can be considered as something evolved beyond the organic rule-based languages, and there are many languages like that, with no spell checking necessary. languages and words die due to spell checkers and aids which limit the writer's thinking. but then again the society of google would prefer to google everything up. if it doesn't exist on google, it doesn't exist at all. i would still suggest doing a background study of this subject instead of trying to apply what you perceive as necessary just because ms word has that feature. Trying is ok. you can even try putting icing on rihaakuru, just because cakes got it, but would it do any good? it might. try it with more research i'd suggest.

good luck.

SoE, 2010-09-12 16:55
that was my intention as well jaa. Hunspell does provide for creating stemming rules (see the SFX in the affix file; I was experimenting with stemming the word "kievun" and managed to have "kievumakee" recognized as a stem)

it is however quite a challenge, mostly because, like you, my understanding of Dhivehi grammar is quite rudimentary.

as far as segmentation goes... well... let's just say I've personally never gotten the hang of it.

Anonymous, 2010-09-12 14:52
Interesting...

I've been experimenting with spell checking for Thaana for a while now. Feeding a database of words as above is one approach, and a simple one. My first approach was somewhat similar to yours above: I created a custom dictionary for MS Word using Radheef entries. That performs well at suggesting alternatives for misspelled words that are in the Radheef.

But IMO, the best approach to the task is to have a stemming engine/rules and a segmentation engine/rules, so that the spell checker can take base words and transform them into the various tenses etc., and be able to split up words/phrases (it is very common practice when writing Dhivehi for two words that should be written separately to be combined together, and vice versa). My understanding of Dhivehi grammar is rudimentary so work has slowed down to a crawl, but hopefully I can get it moving again soon when I have time.

Best of luck with your work!

SoE, 2010-09-12 06:26
I wouldn't call the guy a pessimist. I do value critique...

Anonymous, 2010-09-12 04:30
don't worry about the pessimist SoE, I think you are doing a wonderful job

SoE, 2010-09-12 02:30
@anonymous: The reason why it doesn't exist is because people have not bothered to try. Dhivehi, like any language, is based on a set of rules which, however flexible they may be, determine the "proper" way to write and speak it. If somebody types "phat" instead of "fat", or "becoz" instead of "because", it too would automatically be understood by anybody familiar with English. This does not however make it correct; nor does this make it entirely wrong. The complication only arises when people forget that spell checkers are "writing aids". They are by no definition the backbone of the language, as you seem to fear they might become. There is no law that says what the spell checker says HAS to be correct.

The necessity of such a system is quite evident in everyday life. Just look at the various online newspapers and magazines. The idea is not to hinder creativity, but to point out mistakes which may otherwise be lost on the proofreader.

In either case, the project is bound to produce valuable data for further applications. Do not be so narrow-minded as to think that research into developing a "spell checker" would stop at just that.

If we do not give the language this sort of attention, the true value of its heritage will surely be lost within a decade. A language will not "evolve" unless its people find new and more creative ways to use it... sitting back and simply saying "this is not needed so it should not be done" is not something I'm prepared to do.

Anonymous, 2010-09-11 23:37
during a conversation with a friend recently, he pointed out that the unavailability of a spell checker for thaana is a feature of thaana writing and the dhivehi language itself. english or arabic having a spell checker does not mean that all languages should follow that. sometimes it may make things complicated. in the case of thaana, a very scientific phonetic writing system, for example, any reader understands words written in different spellings, without being limited to a certain set. it also allows the user to create new words, or import new words from other languages hence defining it on the set of writing. it's more forgiving in that manner without the limitation of a spell checker.
you could do a quantitative (instead of a qualitative) survey with a null hypothesis just to see if it's really necessary. you should talk to thaana typists, writers, and some who are not exactly computer programmers or western educated ms office addicts. just a thought to consider.

Anonymous, 2010-09-11 20:47
sounds very interesting.
have been trialing Xiosis.
getting spell check for dhivehi would be a very challenging task.
can't wait to see how your project develops :)

Anonymous, 2010-09-11 19:57
At the very least, if you can implement correction of "thiki jehi thaana" I'll be willing to use it.

Anonymous, 2010-09-11 18:47
I want to see a comparison of this approach vs. Xiosis

Huxen, 2010-09-11 18:43
This is great dude! I'm impressed!

Anonymous, 2010-09-11 18:38
Good work, All the best

Shifau

Anonymous, 2010-09-11 18:26
man you are on a roll!
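
For readers following the stemming discussion in this thread: SoE's comment mentions using an SFX rule so that "kievumakee" is recognized from the stem "kievun". The sketch below only imitates the strip/add/condition idea behind such a suffix rule in plain Python. It does not call Hunspell, and the rule itself (strip "n", add "makee", applies to stems ending in "n") is an assumption reconstructed from the two transliterated words in the comment, not the rule from the original affix file. The names `SuffixRule`, `apply_rule` and `find_stems` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SuffixRule:
    strip: str      # characters removed from the end of the stem
    add: str        # characters appended after stripping
    condition: str  # required ending of the stem (a plain suffix here, not a pattern)


def apply_rule(stem, rule):
    """Return the suffixed form of `stem`, or None if the rule does not apply."""
    if not stem.endswith(rule.condition) or not stem.endswith(rule.strip):
        return None
    base = stem[:len(stem) - len(rule.strip)]
    return base + rule.add


def find_stems(word, dictionary, rules):
    """Return the dictionary stems from which `word` can be produced."""
    stems = set()
    if word in dictionary:
        stems.add(word)
    for rule in rules:
        if not word.endswith(rule.add):
            continue
        # Undo the rule: drop the added suffix, restore the stripped characters.
        candidate = word[:len(word) - len(rule.add)] + rule.strip
        if candidate in dictionary and apply_rule(candidate, rule) == word:
            stems.add(candidate)
    return sorted(stems)


# Assumed rule and a one-word dictionary, using the transliterations from the thread.
RULES = [SuffixRule(strip="n", add="makee", condition="n")]
DICTIONARY = {"kievun"}

if __name__ == "__main__":
    print(find_stems("kievumakee", DICTIONARY, RULES))
```

A word counts as correctly spelled when `find_stems` returns at least one stem. Real Dhivehi morphology would need many such rules, plus the segmentation step raised elsewhere in the thread, which this sketch does not attempt.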