
Corpora Collections

by admin last modified 2008-06-09 05:19

Collections of Amharic corpora, cross-referenced to http://corpora.amharic.org/resources/

Walta Information Center - Tagged Amharic News Corpus

A corpus of 1,065 Amharic news articles (210,000 words) from the Walta Information Center (http://www.waltainfo.com/). The news articles span the period 1998 - 2002 and have been tagged for part of speech and punctuation (download). This is the corpus used in the 2006 research paper "Manual Annotation of Amharic News Items with Part-of-Speech Tags and its Challenges" (Girma A. Demeke & Mesfin Getachew).


Ethiopian News Agency - Amharic News Corpus

A corpus of 1,435 Amharic news articles (259,609 words) from the Ethiopian News Agency (http://www.ena.gov.et/).

(download)

