Back to Projects

Amharic Corpus Data

PythonScrapyWeb Scraping

Collected Amharic language corpus data from web sources, PDFs, and social media. Published as an open-source dataset on HuggingFace for NLP research.