Sisyphus repository
Last update: 1 October 2023 | Packages: 18631 | Visits: 37530549
ALT repositories
S:5.3.0-alt1
5.1: 2.04-alt1
4.1: 2.01-alt1

Group :: Graphics
Package: tesseract


#set_verify_elf_method none

Name: tesseract
Version: 5.3.0
Release: alt1

Summary: Tesseract Open Source OCR Engine
Summary(ru_RU.UTF-8): Движок распознавания текста с открытым исходным кодом

License: Apache-2.0
Group: Graphics
Url: https://github.com/tesseract-ocr

Packager: Andrey Cherepanov <cas at altlinux.org>

# Source-url: https://github.com/tesseract-ocr/tesseract/archive/refs/tags/%version.tar.gz

Source: %name-%version.tar

# installs language files into /usr/share/tesseract/tessdata
Patch: tesseract-5.1.0-alt-makefile.patch

BuildRequires: gcc-c++
BuildRequires: libtiff-devel
BuildRequires: libleptonica-devel >= 1.74
BuildRequires: autoconf-archive
BuildRequires: libpango-devel
BuildRequires: libcairo-devel
BuildRequires: libicu-devel
BuildRequires: doxygen

Requires: %name-langpack-en >= 4.1.0
Requires: %name-langpack-ru >= 4.1.0
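
# Usage sketch, not part of the original spec. Assumptions: input.png is a
# hypothetical image, and the two required language packs provide the "eng"
# and "rus" models. The trailing words select the output formats named in
# the package description (plain text, hOCR, PDF, TSV), so this should write
# out.txt, out.hocr, out.pdf and out.tsv:
#   tesseract input.png out -l eng+rus txt hocr pdf tsv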

%description
This package contains an OCR engine, libtesseract, and a command-line
program, tesseract. Tesseract supports Unicode (UTF-8) and can recognize
more than 100 languages "out of the box". It supports several output
formats: plain text, hOCR (HTML), PDF, and TSV. To improve recognition
results, improve the quality of the input image.

%description -l ru_RU.UTF-8
Этот пакет содержит движок распознавания текста - libtesseract и программу
командной строки - tesseract. Tesseract поддерживает юникод (UTF-8) и может
распознавать более 100 языков "из коробки". Tesseract поддерживает различные
форматы вывода: txt, hOCR (HTML), PDF, TSV. Чтобы улучшить распознавание текста,
необходимо улучшить качество анализируемого изображения.

%package devel
Summary: Development files for tesseract
Summary(ru_RU.UTF-8): Файлы разработки для tesseract
Group: Development/C
Requires: %name
Requires: libleptonica-devel >= 1.74

%description devel
The %name-devel package contains header files for
developing applications that use %name.

%description devel -l ru_RU.UTF-8
Пакет %name-devel содержит файлы заголовка для
разработки приложений, использующих %name.

%package doc
Summary: Tesseract OCR Tool Documentation
Summary(ru_RU.UTF-8): Документация по движку Tesseract OCR
Group: Documentation
BuildArch: noarch

%description doc
The documentation describes the library functions and the tesseract
utilities. The development section has examples of training language
models.

%description doc -l ru_RU.UTF-8
Документация содержит описание функций библиотеки и утилит %name. В разделе
разработки есть примеры обучения языковых моделей.

%prep
%setup
%patch -p2
%ifarch %e2k
# LCC autovectorization performs better than these brief SIMD snippets
sed -i "/CHECK_COMPILE_FLAG/{N;/_OPT/s/=true/=false/}" configure.ac
%add_optflags -mno-sse
%endif

%build
%autoreconf
%configure --disable-static
%make_build

# build the tools for training language models (dev)
%make_build training

doxygen doc/Doxyfile

%install
%makeinstall_std
%makeinstall_std training-install

# remove the libtool archive (.la): it links to a non-existent file
rm -I %buildroot%_libdir/*.la

%files
%doc AUTHORS ChangeLog README.md LICENSE
%_bindir/*
%_datadir/%name/tessdata/configs
%_datadir/%name/tessdata/tessconfigs
%_datadir/%name/tessdata/pdf.ttf
%_libdir/lib*.so.5*

%files devel
%_includedir/%name
%_libdir/lib*.so
%_pkgconfigdir/%name.pc

%files doc
%doc doc/html/*

%changelog
