[Ilugc] Tamil Corpus
- From: yogeshg1987@xxxxxxxxx (Yogesh Girikumar)
- Date: Sat, 7 Jan 2012 13:24:09 +0530
2012/1/6 JAGANADH G <jaganadhg at gmail.com>:
> @karthik
> You can use the Tamil Wikipedia dump as a corpus. Try it
IMHO, building a language model out of Tamil Wikipedia is a bad idea.
It has lots of colloquial terms and modern/mixed words, and its
sentences read like everyday conversation.
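
One rough way to check the "mixed words" part on a given dump is to count
how many tokens mix Tamil and Latin script. This is only a minimal sketch:
the function name script_profile and the whitespace tokenisation are my own
assumptions, not part of any existing tool, and it says nothing about
colloquial vocabulary, which would need a word list or manual review.

```python
import re

# Tamil Unicode block is U+0B80..U+0BFF; ASCII letters stand in for "foreign" script.
TAMIL = re.compile(r"[\u0B80-\u0BFF]")
LATIN = re.compile(r"[A-Za-z]")


def script_profile(text):
    """Return the share of whitespace-separated tokens per script class.

    Hypothetical helper for illustration: classes are 'tamil' (Tamil only),
    'latin' (ASCII letters only), 'mixed' (both in one token) and 'other'
    (digits, punctuation, anything else).
    """
    counts = {"tamil": 0, "latin": 0, "mixed": 0, "other": 0}
    tokens = text.split()
    for tok in tokens:
        has_tamil = bool(TAMIL.search(tok))
        has_latin = bool(LATIN.search(tok))
        if has_tamil and has_latin:
            counts["mixed"] += 1
        elif has_tamil:
            counts["tamil"] += 1
        elif has_latin:
            counts["latin"] += 1
        else:
            counts["other"] += 1
    total = len(tokens) or 1  # avoid division by zero on empty input
    return {name: n / total for name, n in counts.items()}
```

Running this over plain text extracted from the dump and comparing the
'mixed' and 'latin' shares against another Tamil text source would give a
first, crude indication of how much script mixing the corpus contains.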
--
Y