Majka is a fast tool which assigns a lemma (basic form) and all possible grammatical tags to each word form on the input. It can be used also for word forms generation or diacritics restoration. Although Majka builds on the previous system for morphological analysis Ajka, which gives roughly the same results, it is an entirely new and independent implementation completely based on finite automata and it is also much faster and more flexible than the previous system.
Free databases of multiple languages for Majka can be found at:
https://nlp.fi.muni.cz/ma/
When using majka for research purposes, please cite:
Pavel Šmerk. Fast Morphological Analysis of Czech. In Petr Sojka and Aleš Horák. Proceedings of Third Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2009. Brno : Masaryk University, 2007. p. 13–16. ISBN 978-80-210-5048-8. https://nlp.fi.muni.cz/raslan/2009/papers/13.pdf
Usage:
program expects one entry (word, lemma, or string lemma:tag, according the data file in use) per line on its standard input and prints the requested information on its standard output.
Binaries:
majka for Linux
majka.exe for Windows
Source code:
majkalinux.tgz for Linux
majkawin.tgz for Windows