Группа :: Графика
Пакет: tesseract
Главная Изменения Спек Патчи Sources Загрузить Gear Bugs and FR Repocop
#set_verify_elf_method none
Name: tesseract
Version: 5.3.0
Release: alt1
Summary: Tesseract Open Source OCR Engine
Summary(ru_RU.UTF-8): п■п╡п╦п╤п╬п╨ я─п╟я│п©п╬п╥п╫п╟п╡п╟п╫п╦я▐ я┌п╣п╨я│я┌п╟ я│ п╬я┌п╨я─я▀я┌я▀п╪ п╦я│я┘п╬п╢п╫я▀п╪ п╨п╬п╢п╬п╪
License: Apache-2.0
Group: Graphics
Url: https://github.com/tesseract-ocr
Packager: Andrey Cherepanov <cas at altlinux.org>
# Source-url: https://github.com/tesseract-ocr/tesseract/archive/refs/tags/%version.tar.gz
Source: %name-%version.tar
# installing language files /usr/share/tesseract/tessdata
Patch: tesseract-5.1.0-alt-makefile.patch
BuildRequires: gcc-c++
BuildRequires: libtiff-devel
BuildRequires: libleptonica-devel >= 1.74
BuildRequires: autoconf-archive
BuildRequires: libpango-devel
BuildRequires: libcairo-devel
BuildRequires: libicu-devel
BuildRequires: doxygen
Requires: %name-langpack-en >= 4.1.0
Requires: %name-langpack-ru >= 4.1.0
%description
This package contains an OCR engine - libtesseract and a command line
program - tesseract. Tesseract has unicode (UTF-8) support, and can recognize
more than 100 languages "out of the box". Tesseract supports various output
formats: plain text, hOCR (HTML), PDF, TSV. To improve OCR, you need to improve
the quality of the analyzed image.
%description -l ru_RU.UTF-8
п╜я┌п╬я┌ п©п╟п╨п╣я┌ я│п╬п╢п╣я─п╤п╦я┌ п╢п╡п╦п╤п╬п╨ я─п╟я│п©п╬п╥п╫п╟п╡п╟п╫п╦я▐ я┌п╣п╨я│я┌п╟ - libtesseract п╦ п©я─п╬пЁя─п╟п╪п╪я┐
п╨п╬п╪п╟п╫п╢п╫п╬п╧ я│я┌я─п╬п╨п╦ - tesseract. Tesseract п©п╬п╢п╢п╣я─п╤п╦п╡п╟п╣я┌ я▌п╫п╦п╨п╬п╢ (UTF-8) п╦ п╪п╬п╤п╣я┌
я─п╟я│п©п╬п╥п╫п╟п╡п╟я┌я▄ п╠п╬п╩п╣п╣ 100 я▐п╥я▀п╨п╬п╡ "п╦п╥ п╨п╬я─п╬п╠п╨п╦". Tesseract п©п╬п╢п╢п╣я─п╤п╦п╡п╟п╣я┌ я─п╟п╥п╩п╦я┤п╫я▀п╣
я└п╬я─п╪п╟я┌я▀ п╡я▀п╡п╬п╢п╟: txt, ocr (HTML), PDF, TSV. п╖я┌п╬п╠я▀ я┐п╩я┐я┤я┬п╦я┌я▄ я─п╟я│п©п╬п╥п╫п╟п╡п╟п╫п╦п╣ я┌п╣п╨я│я┌п╟,
п╫п╣п╬п╠я┘п╬п╢п╦п╪п╬ я┐п╩я┐я┤я┬п╦я┌я▄ п╨п╟я┤п╣я│я┌п╡п╬ п╟п╫п╟п╩п╦п╥п╦я─я┐п╣п╪п╬пЁп╬ п╦п╥п╬п╠я─п╟п╤п╣п╫п╦я▐.
%package devel
Summary: Development files for tesseract
Summary(ru_RU.UTF-8): п╓п╟п╧п╩я▀ я─п╟п╥я─п╟п╠п╬я┌п╨п╦ п╢п╩я▐ tesseract
Group: Development/C
Requires: %name
Requires: libleptonica-devel >= 1.74
%description devel
The %name-devel package contains header file for
developing applications that use %name.
%description devel -l ru_RU.UTF-8
п÷п╟п╨п╣я┌ %name-devel я│п╬п╢п╣я─п╤п╦я┌ я└п╟п╧п╩я▀ п╥п╟пЁп╬п╩п╬п╡п╨п╟ п╢п╩я▐
я─п╟п╥я─п╟п╠п╬я┌п╨п╦ п©я─п╦п╩п╬п╤п╣п╫п╦п╧, п╦я│п©п╬п╩я▄п╥я┐я▌я┴п╦я┘ %name.
%package doc
Summary: Tesseract OCR Tool Documentation
Summary(ru_RU.UTF-8): п■п╬п╨я┐п╪п╣п╫я┌п╟я├п╦я▐ п©п╬ п╢п╡п╦п╤п╨я┐ Tesseract OCR
Group: Documentation
BuildArch: noarch
%description doc
The documentation contains a description of the library functions and the
tesseract utilities. The development section has examples of teaching language
models.
%description doc -l ru_RU.UTF-8
п■п╬п╨я┐п╪п╣п╫я┌п╟я├п╦я▐ я│п╬п╢п╣я─п╤п╦я┌ п╬п©п╦я│п╟п╫п╦п╣ я└я┐п╫п╨я├п╦п╧ п╠п╦п╠п╩п╦п╬я┌п╣п╨п╦ п╦ я┐я┌п╦п╩п╦я┌ %name п▓ я─п╟п╥п╢п╣п╩п╣
я─п╟п╥я─п╟п╠п╬я┌п╨п╦ п╣я│я┌я▄ п©я─п╦п╪п╣я─я▀ п╬п╠я┐я┤п╣п╫п╦я▐ я▐п╥я▀п╨п╬п╡я▀я┘ п╪п╬п╢п╣п╩п╣п╧.
%prep
%setup
%patch -p2
%ifarch %e2k
# LCC autovectorization perform better than these brief SIMD snippets
sed -i "/CHECK_COMPILE_FLAG/{N;/_OPT/s/=true/=false/}" configure.ac
%add_optflags -mno-sse
%endif
%build
%autoreconf
%configure --disable-static
%make_build
# for teaching language models (dev)
%make_build training
doxygen doc/Doxyfile
%install
%makeinstall_std
%makeinstall_std training-install
# link to a non-existent file
rm -I %buildroot%_libdir/*.la
%files
%doc AUTHORS ChangeLog README.md LICENSE
%_bindir/*
%_datadir/%name/tessdata/configs
%_datadir/%name/tessdata/tessconfigs
%_datadir/%name/tessdata/pdf.ttf
%_libdir/lib*.so.5*
%files devel
%_includedir/%name
%_libdir/lib*.so
%_pkgconfigdir/%name.pc
%files doc
%doc doc/html/*
%changelog
…
Полный changelog можно просмотреть здесь
Name: tesseract
Version: 5.3.0
Release: alt1
Summary: Tesseract Open Source OCR Engine
Summary(ru_RU.UTF-8): п■п╡п╦п╤п╬п╨ я─п╟я│п©п╬п╥п╫п╟п╡п╟п╫п╦я▐ я┌п╣п╨я│я┌п╟ я│ п╬я┌п╨я─я▀я┌я▀п╪ п╦я│я┘п╬п╢п╫я▀п╪ п╨п╬п╢п╬п╪
License: Apache-2.0
Group: Graphics
Url: https://github.com/tesseract-ocr
Packager: Andrey Cherepanov <cas at altlinux.org>
# Source-url: https://github.com/tesseract-ocr/tesseract/archive/refs/tags/%version.tar.gz
Source: %name-%version.tar
# installing language files /usr/share/tesseract/tessdata
Patch: tesseract-5.1.0-alt-makefile.patch
BuildRequires: gcc-c++
BuildRequires: libtiff-devel
BuildRequires: libleptonica-devel >= 1.74
BuildRequires: autoconf-archive
BuildRequires: libpango-devel
BuildRequires: libcairo-devel
BuildRequires: libicu-devel
BuildRequires: doxygen
Requires: %name-langpack-en >= 4.1.0
Requires: %name-langpack-ru >= 4.1.0
%description
This package contains an OCR engine - libtesseract and a command line
program - tesseract. Tesseract has unicode (UTF-8) support, and can recognize
more than 100 languages "out of the box". Tesseract supports various output
formats: plain text, hOCR (HTML), PDF, TSV. To improve OCR, you need to improve
the quality of the analyzed image.
%description -l ru_RU.UTF-8
п╜я┌п╬я┌ п©п╟п╨п╣я┌ я│п╬п╢п╣я─п╤п╦я┌ п╢п╡п╦п╤п╬п╨ я─п╟я│п©п╬п╥п╫п╟п╡п╟п╫п╦я▐ я┌п╣п╨я│я┌п╟ - libtesseract п╦ п©я─п╬пЁя─п╟п╪п╪я┐
п╨п╬п╪п╟п╫п╢п╫п╬п╧ я│я┌я─п╬п╨п╦ - tesseract. Tesseract п©п╬п╢п╢п╣я─п╤п╦п╡п╟п╣я┌ я▌п╫п╦п╨п╬п╢ (UTF-8) п╦ п╪п╬п╤п╣я┌
я─п╟я│п©п╬п╥п╫п╟п╡п╟я┌я▄ п╠п╬п╩п╣п╣ 100 я▐п╥я▀п╨п╬п╡ "п╦п╥ п╨п╬я─п╬п╠п╨п╦". Tesseract п©п╬п╢п╢п╣я─п╤п╦п╡п╟п╣я┌ я─п╟п╥п╩п╦я┤п╫я▀п╣
я└п╬я─п╪п╟я┌я▀ п╡я▀п╡п╬п╢п╟: txt, ocr (HTML), PDF, TSV. п╖я┌п╬п╠я▀ я┐п╩я┐я┤я┬п╦я┌я▄ я─п╟я│п©п╬п╥п╫п╟п╡п╟п╫п╦п╣ я┌п╣п╨я│я┌п╟,
п╫п╣п╬п╠я┘п╬п╢п╦п╪п╬ я┐п╩я┐я┤я┬п╦я┌я▄ п╨п╟я┤п╣я│я┌п╡п╬ п╟п╫п╟п╩п╦п╥п╦я─я┐п╣п╪п╬пЁп╬ п╦п╥п╬п╠я─п╟п╤п╣п╫п╦я▐.
%package devel
Summary: Development files for tesseract
Summary(ru_RU.UTF-8): п╓п╟п╧п╩я▀ я─п╟п╥я─п╟п╠п╬я┌п╨п╦ п╢п╩я▐ tesseract
Group: Development/C
Requires: %name
Requires: libleptonica-devel >= 1.74
%description devel
The %name-devel package contains header file for
developing applications that use %name.
%description devel -l ru_RU.UTF-8
п÷п╟п╨п╣я┌ %name-devel я│п╬п╢п╣я─п╤п╦я┌ я└п╟п╧п╩я▀ п╥п╟пЁп╬п╩п╬п╡п╨п╟ п╢п╩я▐
я─п╟п╥я─п╟п╠п╬я┌п╨п╦ п©я─п╦п╩п╬п╤п╣п╫п╦п╧, п╦я│п©п╬п╩я▄п╥я┐я▌я┴п╦я┘ %name.
%package doc
Summary: Tesseract OCR Tool Documentation
Summary(ru_RU.UTF-8): п■п╬п╨я┐п╪п╣п╫я┌п╟я├п╦я▐ п©п╬ п╢п╡п╦п╤п╨я┐ Tesseract OCR
Group: Documentation
BuildArch: noarch
%description doc
The documentation contains a description of the library functions and the
tesseract utilities. The development section has examples of teaching language
models.
%description doc -l ru_RU.UTF-8
п■п╬п╨я┐п╪п╣п╫я┌п╟я├п╦я▐ я│п╬п╢п╣я─п╤п╦я┌ п╬п©п╦я│п╟п╫п╦п╣ я└я┐п╫п╨я├п╦п╧ п╠п╦п╠п╩п╦п╬я┌п╣п╨п╦ п╦ я┐я┌п╦п╩п╦я┌ %name п▓ я─п╟п╥п╢п╣п╩п╣
я─п╟п╥я─п╟п╠п╬я┌п╨п╦ п╣я│я┌я▄ п©я─п╦п╪п╣я─я▀ п╬п╠я┐я┤п╣п╫п╦я▐ я▐п╥я▀п╨п╬п╡я▀я┘ п╪п╬п╢п╣п╩п╣п╧.
%prep
%setup
%patch -p2
%ifarch %e2k
# LCC autovectorization perform better than these brief SIMD snippets
sed -i "/CHECK_COMPILE_FLAG/{N;/_OPT/s/=true/=false/}" configure.ac
%add_optflags -mno-sse
%endif
%build
%autoreconf
%configure --disable-static
%make_build
# for teaching language models (dev)
%make_build training
doxygen doc/Doxyfile
%install
%makeinstall_std
%makeinstall_std training-install
# link to a non-existent file
rm -I %buildroot%_libdir/*.la
%files
%doc AUTHORS ChangeLog README.md LICENSE
%_bindir/*
%_datadir/%name/tessdata/configs
%_datadir/%name/tessdata/tessconfigs
%_datadir/%name/tessdata/pdf.ttf
%_libdir/lib*.so.5*
%files devel
%_includedir/%name
%_libdir/lib*.so
%_pkgconfigdir/%name.pc
%files doc
%doc doc/html/*
%changelog
…
Полный changelog можно просмотреть здесь