@karthik You can use the Tamil Wikipedia dump as a corpus. Try it -- ********************************** JAGANADH G http://jaganadhg.in *ILUGCBE* http://ilugcbe.org.in