Indonesian part-of-speech (POS) annotated corpus

Corpus of text documents that contain manually annotated sentences in Indonesian.

Corpus of text documents that contain manually annotated sentences in Indonesian.

Github repository: https://github.com/famrashel/idn-tagged-corpus.git

Publication

A. Dinakaramani, Fam Rashel, A. Luthfi, and Ruli Manurung. Designing an Indonesian Part of speech Tagset and Manually Tagged Indonesian Corpus. In Proceedings of the International Conference on Asian Language Processing (IALP 2014), pages 66-69, Kuching, Malaysia, October 2014. doi: 10.1109/IALP.2014.6973519