Group :: Text tools
RPM: mguesser
Current version: 0.2-alt2
Build date: 20 October 2002, 18:16
Size: 123.13 Kb
Home page: http://mnogosearch.org/guesser/
License: GPL
Summary: mguesser guesses a text's charset and language
Description:
mguesser is a standalone part of libudmsearch (the core of the mnoGoSearch search engine,
http://mnogosearch.org) that guesses a text's charset and language.
Guessing uses the "N-Gram-Based Text Categorization" technique,
which is also implemented in TextCat, a language guesser written in Perl
(http://www.let.rug.nl/~vannoord/TextCat/). mguesser is significantly
faster than TextCat, especially on large texts.
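
To illustrate the general idea behind N-gram-based text categorization, here is a minimal Python sketch. It is not mguesser's or TextCat's actual code: the function names, the toy training sentences, and the limits (n-grams up to 5 characters, 300 entries per profile) are assumptions chosen for illustration. Each language is represented by a ranked list of its most frequent character n-grams, and a text is assigned to the language whose ranking differs least from its own ("out-of-place" distance). A real guesser would train on much larger samples and, to guess the charset as well, keep one profile per language and charset pair.

```python
from collections import Counter


def ngram_profile(text, max_n=5, top=300):
    """Return the most frequent character n-grams (1..max_n), most frequent first."""
    counts = Counter()
    for word in text.lower().split():
        padded = "_" + word + "_"  # mark word boundaries
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts, key=lambda g: (-counts[g], g))
    return ranked[:top]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams absent from lang_profile get a fixed penalty."""
    rank = {g: i for i, g in enumerate(lang_profile)}
    penalty = len(lang_profile)
    return sum(abs(i - rank[g]) if g in rank else penalty
               for i, g in enumerate(doc_profile))


def guess(text, profiles):
    """Return the name of the profile closest to the text's own profile."""
    doc = ngram_profile(text)
    return min(profiles, key=lambda name: out_of_place(doc, profiles[name]))


# Toy training samples (hypothetical, far too small for real use).
SAMPLES = {
    "english": "the quick brown fox jumps over the lazy dog and then the dog sleeps in the sun",
    "german": "der hund und die katze spielen im garten und dann gehen sie nach hause",
}
PROFILES = {name: ngram_profile(text) for name, text in SAMPLES.items()}

if __name__ == "__main__":
    print(guess("the dog and the fox", PROFILES))
```

Ranking n-grams instead of comparing raw counts keeps the comparison independent of text length, which is why the same profiles can be used for short and long inputs.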
Current maintainer: Michael Shigorin
List of rpms provided by this srpm:
- mguesser