Hi Colleagues,

http://www.isoc-ke.org/wiki/Orodha_Ya_Msamiati

Best Regards


On Thu, May 30, 2013 at 2:38 PM, Martin Chiteri <martin.chiteri@gmail.com> wrote:
Hi Tony, 

As I mentioned before, we are taking a different approach: using both the Swahili and English entries on Wikipedia as training sets. Also, we are not using the data for any commercial purpose. In addition, content from the wiki is provided gratis, licensed under the Creative Commons Attribution license.
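
A minimal sketch of how such paired entries might be collected, assuming the MediaWiki API's langlinks property is used to find the Swahili counterpart of an English article. The thread does not say how the pairs are actually gathered, so the function names and the User-Agent string below are hypothetical and purely illustrative.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# English Wikipedia's MediaWiki API endpoint.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_langlinks_url(title, target_lang="sw"):
    """Build a query URL asking for the target-language link of one article."""
    params = {
        "action": "query",
        "prop": "langlinks",
        "titles": title,
        "lllang": target_lang,  # restrict results to one language code
        "format": "json",
    }
    return API_URL + "?" + urlencode(params)


def extract_pairs(response, target_lang="sw"):
    """Return (english_title, target_title) pairs from a decoded API response.

    Assumes the default JSON layout, where each langlink stores the
    foreign title under the "*" key.
    """
    pairs = []
    pages = response.get("query", {}).get("pages", {})
    for page in pages.values():
        # Pages without a counterpart simply have no "langlinks" entry.
        for link in page.get("langlinks", []):
            if link.get("lang") == target_lang:
                pairs.append((page["title"], link["*"]))
    return pairs


def fetch_pairs(title, target_lang="sw"):
    """Query the live API (needs network access) and return title pairs."""
    request = Request(
        build_langlinks_url(title, target_lang),
        headers={"User-Agent": "parallel-corpus-sketch/0.1"},  # hypothetical
    )
    with urlopen(request) as resp:
        return extract_pairs(json.load(resp), target_lang)
```

Calling fetch_pairs("Nairobi") would return the matching title pair if a Swahili entry exists; the article text for each side would still have to be downloaded and aligned separately before it could serve as training data.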

Martin.


On Thu, May 30, 2013 at 12:52 PM, Tony Likhanga <tlikhanga@gmail.com> wrote:
I don't know how legal or agreeable this may be, but how about pulling the text from a random website/blog that has equivalent transcripts in both languages?

I have http://bitcoins.co.ke/index.html in mind <--- Bitcoins, not BanglaPesa!!
You may first have to contact the owners of such sites for their consent.

Tony.

_______________________________________________
skunkworks mailing list
skunkworks@lists.my.co.ke
------------
List info, subscribe/unsubscribe
http://lists.my.co.ke/cgi-bin/mailman/listinfo/skunkworks
------------

Skunkworks Rules
http://my.co.ke/phpbb/viewtopic.php?f=24&t=94
------------
Other services @ http://my.co.ke



--
Barrack O. Otieno
+254721325277
+254-20-2498789
Skype: barrack.otieno
http://www.otienobarrack.me.ke/