[Hamara-devel] indian english spell checker in hamara ?

Jonas Smedegaard dr at jones.dk
Mon Nov 23 20:09:18 GMT 2015


Quoting Vikas Tara (2015-11-23 20:49:07)
> On 23/11/15 18:16, Jonas Smedegaard wrote:
>> Quoting shirish (2015-11-23 18:43:25)
>>> While there are various English spell-checkers in Debian and other 
>>> distros, there isn't an Indian English spell-checker for various 
>>> languages. It would be nice to have such a spell-checker in hamara 
>>> which will reduce the time it takes for a new user to start using 
>>> the system. The spell-checker would be useful in many ways :-
>> Hear, hear!!
>>
>> The approach in Denmark to grow a Free wordlist was to set up a public 
>> volunteer process of proof-reading words:
>>
>>   1) Harvest a pile of "possible" words from various (free!) sources
>>   2) Invite anyone to register as proof-reading volunteer
>>   3) Ask "is this a proper Danish word?" for random words in the pile
>>   4) Compile lists of words vetted by 4, 5, 6 or 7 volunteers.
>>
>> The resources for the Danish system are here: http://da.speling.org/
>>
>> It is free to reuse, but documentation is partly in Danish.  I would 
>> be happy to translate the parts you need, if that approach is of 
>> interest. Perhaps it could even be streamlined and packaged into 
>> Debian, for potential use by other wordlist communities worldwide - 
>> I'd be happy to participate in such a team (but won't do it alone).
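
For illustration, the four steps quoted above could be sketched roughly 
like this in Python. This is only a minimal sketch and not how the 
Danish system is actually implemented: the class and method names are 
made up, "vetted by N volunteers" is assumed to mean "approved by at 
least N distinct volunteers", and "no" answers are simply ignored.

```python
import random
from collections import defaultdict

# Thresholds from step 4 of the list above; reading them as
# "at least N approvals" is an assumption of this sketch.
THRESHOLDS = (4, 5, 6, 7)


class WordPile:
    """Hypothetical pile of candidate words and their approvals."""

    def __init__(self, candidates):
        self.candidates = sorted(set(candidates))  # step 1: harvested words
        self.volunteers = set()                    # step 2: registered proof-readers
        self.approvals = defaultdict(set)          # word -> volunteers who said "yes"

    def register(self, volunteer):
        """Step 2: anyone may register as a proof-reading volunteer."""
        self.volunteers.add(volunteer)

    def next_word(self, rng=random):
        """Step 3: pick a random word from the pile to ask about."""
        return rng.choice(self.candidates)

    def record(self, volunteer, word, is_proper):
        """Step 3: store the answer to "is this a proper word?"."""
        if volunteer not in self.volunteers:
            raise ValueError(f"unknown volunteer: {volunteer}")
        if is_proper:
            # A set means each volunteer counts at most once per word.
            self.approvals[word].add(volunteer)

    def compile_lists(self):
        """Step 4: one sorted word list per vetting threshold."""
        return {
            n: sorted(w for w, who in self.approvals.items() if len(who) >= n)
            for n in THRESHOLDS
        }
```

Usage would be along the lines of WordPile(words), a register() call 
per volunteer, record() for each answer, and compile_lists() to get the 
stricter and looser lists, which could then be turned into dictionaries 
for the individual spell-checkers.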
> Agree it's a good idea - and would benefit many users - is there any 
> resource that covers these Indian English words at the moment?

Beware of the licensing of any such source, as it taints the whole 
project (obviously).

A possibly relevant infopoint: Apart from the project referenced above 
(which covers ispell, aspell and myspell/hunspell), another free Danish 
wordlist project exists - smaller in corpus and covering only hunspell.

Why did they start over? The liberally minded Mozilla and (back then) 
OpenOffice projects did not want "infection" by the GPL license.
 

> I guess they are not in the Oxford Dictionary ;)

:-)


>>> A list of such Indian-English words, updation, support etc. would 
>>> make hamara a pretty unique offering.
>> Oh, if you want to be _unique_ then the Danish approach is not for 
>> you - you probably should instead go for a non-free closed (perhaps 
>> even patented?) process to ensure that your competitors (e.g. Debian) 
>> don't steal your advantage on the India market.  Or what else could 
>> possibly be implied by "unique offering"?
> If there isn't another one, then it would be unique, but only until 
> someone includes it in another distro (if they felt the need for 
> that).
>
> I don't think anyone here is very interested in closed / non-free, if 
> they are - they probably got off at the wrong stop.

Hehe, bounced right back on myself :-)

 - Jonas

-- 
 * Jonas Smedegaard - idealist & Internet-arkitekt
 * Tlf.: +45 40843136  Website: http://dr.jones.dk/

 [x] quote me freely  [ ] ask before reusing  [ ] keep private