[Ilugc] Tamil Corpus
- From: jaganadhg@xxxxxxxxx (JAGANADH G)
- Date: Fri, 6 Jan 2012 20:20:57 +0530
On Fri, Jan 6, 2012 at 8:14 PM, Kartik Raj <xiansy at gmail.com> wrote:
How to get those from wikimedia dump?
Find it from here
http://static.wikipedia.org/ . Also u can populate corpus
using RSS feeds of new papers. Have to bit programming
Also i need speech corpus since just
started recording audio for my research.
I think as of now nobody is providing Speech corpus in Open Domain. There
was something available in this site
http://www.nrcfosshelpline.in/. But is
it is not working now.
The better solution is Go to AIR website they may be publishing the daily
AIR Tamil news audio files. I am not sure . Just check . But there will be
lot of noise in those recording.
--
**********************************
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
Other related posts: