Český stemmer v jazyce Snowball
Chmelař Petr, Ing. (DIFS FIT BUT)
Lemmatization, stemming, Snowball, Czexh language, grammar.
The product is a stemming algorithm for Czech language based on grammatical rules, in addition to methods of using vocabulary for searching and mining the Czech text. Snowball stemmer implementations of the Czech language is created on the basis of a complete set of all prefixes, suffixes and endings, which may occur in the Czech language.
See the Snowball web at http://snowball.tartarus.org/ and the thesis text at http://www/study/DP/rpfile.php?id=7988 (in Czech). The publication describing the product in Czech at www.fit.vutbr.cz/research/view_pub.php.en?id=9473.
Copyright (C) 2007-2008 Brno University of Technology
By downloading, copying, installing or using the software you agree to GNU General Public License (enclosed).
Security-Oriented Research in Information Technology (MSM0021630528)