Corpora Collections
Collections of Amharic corpora — a cross referencing to http://corpora.amharic.org/resources/
Walta Information Center - Tagged Amharic News Corpus
A corpus of 1,065 Amharic news articles (210,000 words) from the Walta Information Center (http://www.waltainfo.com/). The news articles span the period 1998 - 2002 and have been tagged for part of speech and punctuation (download). This is the corpus used in the 2006 research paper "Manual Annotation of Amharic News Items with Part-of-Speech Tags and its Challenges" (Girma A. Demeke & Mesfin Getachew).
Ethiopian News Agency — Amharic News Corpus
A corpus of 1,435 Amharic news articles (259,609 words) from the Ethiopian News Agency (http://www.ena.gov.et/).
(download)