
tesseract-ocr-3.05.01-lp150.1.2 RPM for ppc64le

From openSUSE Ports Leap 15.0 for ppc64le

Name: tesseract-ocr
Distribution: openSUSE Leap 15.0
Version: 3.05.01
Vendor: openSUSE
Release: lp150.1.2
Build date: Wed May 9 23:07:53 2018
Group: Productivity/Graphics/Other
Build host: obs-power8-02
Size: 1323046
Source RPM: tesseract-ocr-3.05.01-lp150.1.2.src.rpm
Packager: https://bugs.opensuse.org
Url: https://github.com/tesseract-ocr/tesseract
Summary: Open Source OCR Engine
A commercial quality OCR engine originally developed at HP between 1985 and
1995. In 1995, this engine was among the top 3 evaluated by UNLV. It was
open-sourced by HP and UNLV in 2005. Since 2007 it has been developed by Google.

Provides

Requires

License

Apache-2.0 AND GPL-2.0+

Changelog

* Tue Feb 20 2018 jweberhofer@weberhofer.at
  - Update to 3.05.01
    * Fixed several build issues
    * Fixed C-API
    * Backport pdfrenderer changes
    * Code clean up
  - Spec file cleaned up
* Fri Feb 17 2017 idonmez@suse.com
  - Update to 3.05.00
    * Made some fine tuning to the hOCR output.
    * Added TSV as another optional output format.
    * Fixed ABI break introduced in 3.04.00 with the AnalyseLayout()
      method.
    * text2image tool - Enable all OpenType ligatures available in
      a font. This feature requires Pango 1.38 or newer.
    * Training tools - Replaced asserts with tprintf() and exit(1).
    * Improved multipage tiff processing.
    * Improved the embedded pdf font (pdf.ttf).
    * Enable selection of OCR engine mode from command line.
    * Changed tesseract command line parameter '-psm' to '--psm'.
    * Added new C API for orientation and script detection, removed
      the old one.
    * Fixed many compiler warnings.
    * Fixed memory and resource leaks.
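
The switch from '-psm' to '--psm' listed above affects any script that still passes the old spelling, and the same release adds engine-mode selection and the TSV output config. The following is a minimal sketch of how a wrapper script might cope, not part of this package: the helper names `modernize_args` and `build_command` are hypothetical, the engine mode is assumed to be passed as '--oem', and the old flag is assumed to appear as a standalone '-psm' token followed by its value.

```python
def modernize_args(args):
    """Return a copy of args with the pre-3.05 '-psm' flag spelled '--psm'.

    Hypothetical helper for illustration; not shipped with tesseract-ocr.
    """
    return ["--psm" if arg == "-psm" else arg for arg in args]


def build_command(image, output_base, psm=None, oem=None, configs=()):
    """Assemble a tesseract 3.05-style command line as a list of strings.

    psm: page segmentation mode, passed as '--psm N'.
    oem: OCR engine mode, passed as '--oem N' (assumed flag spelling).
    configs: names of config files such as 'tsv', 'hocr' or 'pdf'.
    """
    cmd = ["tesseract", image, output_base]
    if oem is not None:
        cmd += ["--oem", str(oem)]
    if psm is not None:
        cmd += ["--psm", str(psm)]
    cmd += list(configs)
    return cmd
```

For example, `build_command("page.png", "out", psm=6, configs=["tsv"])` returns `['tesseract', 'page.png', 'out', '--psm', '6', 'tsv']`; the file names here are placeholders. Returning a list rather than a shell string keeps the result safe to hand to `subprocess.run` without quoting.
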
* Fri Feb 19 2016 idonmez@suse.com
  - Update to 3.04.01
    * No changelog upstream
* Fri Oct 02 2015 asterios.dramis@gmail.com
  - Update to version 3.04.00:
    * Added OpenCL support (experimental).
    * Many bug fixes.
    From version 3.03.00:
    * Added new training tool text2image to generate box/tif file
      pairs from text and truetype fonts.
    * Added support for PDF output with searchable text.
    * Removed entire IMAGE class and all code in image directory.
    * Tesseract executable: support for output to stdout; limited
      support for one page images from stdin  (especially on Windows)
    * Added Renderer to API to allow document-level processing and
      output of document formats, like hOCR, PDF.
    * Major refactor of word-level recognition, beam search,
      eliminating dead code.
    * Refactored classifier to make it easier to add new ones.
    * Generalized feature extractor to allow feature extraction from
      greyscale.
    * Improved sub/superscript treatment.
    * Improved baseline fit.
    * Added set_unicharset_properties to training tools.
    * Many bug fixes.
    * More training source data included.
  - Added new build requirements cairo-devel, doxygen, libicu-devel
    and pango-devel.
  - Recommend tesseract-ocr-traineddata-english instead of
    tesseract-ocr-traineddata-american (based on new (3.04.00)
    tesseract-ocr traineddata files).
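
The stdout support listed under 3.03.00 lets a caller capture recognized text without a temporary output file. The following is a minimal sketch, assuming the tesseract binary from this package is on PATH and that the literal output base `stdout` selects standard output as described in the tesseract man page. The function names, the image path and the language code are hypothetical.

```python
import shutil
import subprocess


def stdout_command(image, lang=None):
    """Build a command that makes tesseract print its result to stdout."""
    cmd = ["tesseract", image, "stdout"]
    if lang:
        # '-l' selects the language; the matching traineddata must be installed.
        cmd += ["-l", lang]
    return cmd


def ocr_to_text(image, lang=None):
    """Run tesseract and return the recognized text.

    Returns None when no tesseract executable is found on PATH.
    Raises subprocess.CalledProcessError if tesseract exits with an error.
    """
    if shutil.which("tesseract") is None:
        return None
    result = subprocess.run(
        stdout_command(image, lang),
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,
        check=True,
    )
    return result.stdout
```

A call such as `ocr_to_text("scan.png", lang="eng")` would then return the page text as a string, provided the image exists and the requested language data is installed.
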
* Mon Sep 14 2015 asterios.dramis@gmail.com
  - Fix Recommends: entry to tesseract-ocr-traineddata-american.
* Sat Jun 20 2015 mailaender@opensuse.org
  - rename to match upstream tarball and fix boo#900303
* Sat Jun 22 2013 asterios.dramis@gmail.com
  - Split library into separate package (libtesseract3).
  - Removed debuginfo package (not needed).
  - There is no need anymore to regenerate the build system (removed automake and
    libtool build requirements).
  - Added pkg-config build requirement (fix for rpmlint error
    "no-pkg-config-provides"). Also removed the unneeded
    "Provides: pkgconfig(%{name})" entry.
* Mon May 06 2013 idonmez@suse.com
  - Update license, some files are GPL-2.0+ licensed
* Mon Oct 29 2012 jw@suse.com
  - Update to version 3.02.02
    * untested
  - Notable features:
    * Hebrew with BiDi support.
    * More languages.
  - removed upstreamed patch0
* Mon Jun 25 2012 asterios.dramis@gmail.com
  - Update to version 3.01:
    * Removed old/dead serialise/deserialise methods on *LISTIZED classes.
    * Total rewrite of DENORM to better encapsulate operation and make
      for potential to extract features from images.
    * Thread-safety! Moved all critical globals and statics to
      members of the appropriate class. Tesseract is now
      thread-safe (multiple instances can be used in parallel
      in multiple threads), with the minor exception that some
      control parameters are still global and affect all threads.
    * Added Cube, a new recognizer for Arabic. Cube can also be
      used in combination with normal Tesseract for other languages
      with an improvement in accuracy at the cost of (much) lower speed.
      There is no training module for Cube yet.
    * OcrEngineMode in Init replaces AccuracyVSpeed to control cube.
    * Greatly improved segmentation search with consequent accuracy and
      speed improvements, especially for Chinese.
    * Added PageIterator and ResultIterator as cleaner ways to get the
      full results out of Tesseract, that are not currently provided
      by any of the TessBaseAPI::Get* methods.
      All other methods, ETEXT_STRUCT in particular, are
      deprecated and will be deleted in the future.
    * ApplyBoxes totally rewritten to make training easier.
      It can now cope with touching/overlapping training characters,
      and a new boxfile format allows word boxes instead of character
      boxes, BUT to use that you have to have already bootstrapped the
      language with character boxes. "Cyclic dependency" on traineddata.
    * Auto orientation and script detection added to page layout analysis.
    * Deleted *lots* of dead code.
    * Fixxht module replaced with scalable data-driven module.
    * Output font characteristics accuracy improved.
    * Removed the double conversion at each classification.
    * Upgraded oldest structs to be classes and deprecated PBLOB.
    * Removed non-deterministic baseline fit.
    * Added fixed length dawgs for Chinese.
    * Handling of vertical text improved.
    * Handling of leader dots improved.
    * Table detection greatly improved.
  - Removed the various languages traineddata subpackages (to be included in a
    separate package "tesseract-traineddata").
  - Changed License to Apache-2.0 (SPDX style).
  - Removed libtiff-devel build dependency (not needed anymore).
  - Added new build dependency liblept-devel, required now by the package.
  - Added automake and libtool build dependencies in order to regenerate the
    build system because of missing Makefile.in.
  - Removed tesseract-traineddata-deu from recommended entries.
  - Removed nonvoid.patch (fixed upstream).
  - Added a patch (svutil.cpp_fix.patch) to fix compilation due to missing
    includes (taken from upstream).
  - Disabled compilation of static libraries.
* Mon Oct 25 2010 prusnak@opensuse.org
  - fixed missing returns in nonvoid functions (nonvoid.patch)
  - added missing post/postun scripts calling ldconfig
* Thu Sep 23 2010 michal.smrz@opensuse.cz
  - update to tesseract-3.00
  - added plenty of new supported languages
  - created tesseract-package-creator.py which will, hopefully, make future
    updates easier
* Fri Jul 10 2009 puzel@novell.com
  - update to tesseract-2.04
    * Integrated bug fixes and patches and misc changes for portability.
    * Integrated a patch to remove some of the "access" macros.
    * Removed dependence on lua from the viewer, speeding it up
      dramatically.
    * Fixed the viewer so it compiles and runs properly!

Files

/usr/bin/ambiguous_words
/usr/bin/classifier_tester
/usr/bin/cntraining
/usr/bin/combine_tessdata
/usr/bin/dawg2wordlist
/usr/bin/mftraining
/usr/bin/set_unicharset_properties
/usr/bin/shapeclustering
/usr/bin/tesseract
/usr/bin/text2image
/usr/bin/unicharset_extractor
/usr/bin/wordlist2dawg
/usr/share/doc/packages/tesseract-ocr
/usr/share/doc/packages/tesseract-ocr/AUTHORS
/usr/share/doc/packages/tesseract-ocr/COPYING
/usr/share/doc/packages/tesseract-ocr/ChangeLog
/usr/share/doc/packages/tesseract-ocr/README.md
/usr/share/man/man1/ambiguous_words.1.gz
/usr/share/man/man1/cntraining.1.gz
/usr/share/man/man1/combine_tessdata.1.gz
/usr/share/man/man1/dawg2wordlist.1.gz
/usr/share/man/man1/mftraining.1.gz
/usr/share/man/man1/shapeclustering.1.gz
/usr/share/man/man1/tesseract.1.gz
/usr/share/man/man1/unicharset_extractor.1.gz
/usr/share/man/man1/wordlist2dawg.1.gz
/usr/share/man/man5/unicharambigs.5.gz
/usr/share/man/man5/unicharset.5.gz
/usr/share/tessdata
/usr/share/tessdata/configs
/usr/share/tessdata/configs/ambigs.train
/usr/share/tessdata/configs/api_config
/usr/share/tessdata/configs/bigram
/usr/share/tessdata/configs/box.train
/usr/share/tessdata/configs/box.train.stderr
/usr/share/tessdata/configs/digits
/usr/share/tessdata/configs/hocr
/usr/share/tessdata/configs/inter
/usr/share/tessdata/configs/kannada
/usr/share/tessdata/configs/linebox
/usr/share/tessdata/configs/logfile
/usr/share/tessdata/configs/makebox
/usr/share/tessdata/configs/pdf
/usr/share/tessdata/configs/quiet
/usr/share/tessdata/configs/rebox
/usr/share/tessdata/configs/strokewidth
/usr/share/tessdata/configs/tsv
/usr/share/tessdata/configs/txt
/usr/share/tessdata/configs/unlv
/usr/share/tessdata/pdf.ttf
/usr/share/tessdata/tessconfigs
/usr/share/tessdata/tessconfigs/batch
/usr/share/tessdata/tessconfigs/batch.nochop
/usr/share/tessdata/tessconfigs/matdemo
/usr/share/tessdata/tessconfigs/msdemo
/usr/share/tessdata/tessconfigs/nobatch
/usr/share/tessdata/tessconfigs/segdemo


Generated by rpm2html 1.8.1

Fabrice Bellet, Tue Nov 9 10:26:55 2021