[Hamara-devel] indian english spell checker in hamara ?
Jonas Smedegaard
dr at jones.dk
Mon Nov 23 20:09:18 GMT 2015
Quoting Vikas Tara (2015-11-23 20:49:07)
> On 23/11/15 18:16, Jonas Smedegaard wrote:
>> Quoting shirish (2015-11-23 18:43:25)
>>> While there are various English spell-checkers in Debian and other
>>> distros, there isn't an Indian English spell-checker for various
>>> languages. It would be nice to have such a spell-checker in hamara
>>> which will reduce the time it takes for a new user to start using
>>> the system. The spell-checker would be useful in many ways :-
>> Hear, hear!!
>>
>> The approach in Denmark to grow a Free wordlist was to set up a public
>> volunteer process of proof-reading words:
>>
>> 1) Harvest a pile of "possible" words from various (free!) sources
>> 2) Invite anyone to register as proof-reading volunteer
>> 3) Ask "is this a proper Danish word?" for random words in the pile
>> 4) Compile lists of words vetted by 4, 5, 6 or 7 volunteers.
>>
>> The resources for the Danish system are here: http://da.speling.org/
>>
>> It is free to reuse, but documentation is partly in Danish. I would
>> be happy translating the parts you need, if that approach is of
>> interest. Perhaps it could even be streamlined and packaged into
>> Debian, for potential use by other wordlist communities worldwide -
>> I'd be happy participating in such a team (but won't do it alone).
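
As a rough illustration of step 4 in the quoted process, here is a minimal Python sketch. It is not the code behind da.speling.org; the function name, the vote format, and the reading of "vetted by N volunteers" as "at least N distinct volunteers approved" are all assumptions made for the example.

```python
from collections import defaultdict

# Thresholds from step 4: one wordlist per required number of approvals.
THRESHOLDS = (4, 5, 6, 7)


def compile_wordlists(votes, thresholds=THRESHOLDS):
    """Build one sorted wordlist per approval threshold.

    votes: iterable of (volunteer, word, is_proper) tuples (hypothetical format).
    "No" answers are simply ignored here, which is an assumption.
    """
    approvals = defaultdict(set)
    for volunteer, word, is_proper in votes:
        if is_proper:
            # A set ensures each volunteer counts only once per word.
            approvals[word].add(volunteer)
    return {
        n: sorted(w for w, who in approvals.items() if len(who) >= n)
        for n in thresholds
    }
```

Calling compile_wordlists() with the collected votes returns a dict keyed by threshold, so a stricter list (say 7 approvals) can be shipped by default while the looser ones stay available.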
> Agree it's a good idea - and would benefit many users - is there any
> resource that covers these Indian English words at the moment?
Beware of the licensing of any such source, as it taints the whole
project (obviously).
A possibly relevant infopoint: Apart from the project referenced above
(which covers ispell, aspell and myspell/hunspell), another free
wordlist project exists - smaller in corpus and covering only hunspell.
Why did they start over? The liberally minded Mozilla and (back then)
OpenOffice projects would not want "infection" by the GPL license.
> I guess they are not in the Oxford Dictionary ;)
:-)
>>> A list of such Indian-English words, updation, support etc. would
>>> make hamara a pretty unique offering.
>> Oh, if you want to be _unique_ then the Danish approach is not for
>> you - you probably should instead go for a non-free closed (perhaps
>> even patented?) process to ensure that your competitors (e.g. Debian)
>> don't steal your advantage in the Indian market. Or what else could
>> possibly be implied by "unique offering"?
> If there isn't another one, then it would be unique, but only until
> someone includes it in another distro (if they felt the need for
> that).
>
> I don't think anyone here is very interested in closed / non-free, if
> they are - they probably got off at the wrong stop.
Hehe, bounced right back on myself :-)
- Jonas
--
* Jonas Smedegaard - idealist & Internet-arkitekt
* Tlf.: +45 40843136 Website: http://dr.jones.dk/
[x] quote me freely [ ] ask before reusing [ ] keep private