Commit Graph

  • cd910fbf21 Improve "PDF/A conversion failed" message James R. Barlow 2024-08-10 01:34:02 -07:00
  • 1225269a4b Clarify opporutnities available with OCR_JSON_SETTINGS James R. Barlow 2024-08-10 01:02:05 -07:00
  • 3a75b20740 v16.4.3 release notes v16.4.3 James R. Barlow 2024-07-31 02:14:12 -07:00
  • d35d008806 Increase pdfminer's bufsiz to mitigate token splitting issue James R. Barlow 2024-07-31 02:11:47 -07:00
  • f5662d5eb0 Consider Masks and stencil masks when calculating DPI James R. Barlow 2024-07-29 15:46:30 -07:00
  • 39010dd255 Handle incompatible jbig2.exe from TeX Live James R. Barlow 2024-07-27 01:09:19 -07:00
  • fbaad570c7 v16.4.2 release notes v16.4.2 James R. Barlow 2024-07-22 15:02:53 -07:00
  • f974e3b3c1 ghostscript: change input filename order for 10.03.1 James R. Barlow 2024-07-22 14:56:03 -07:00
  • 46b49cc176 Suppress missing jbig2dec warning message James R. Barlow 2024-07-19 15:23:20 -07:00
  • 5256e74d0c Update installation.rst "python -m venv .venv" (#1355) Johannes Kalliauer 2024-07-18 15:28:07 +02:00
  • 621d6a0b89 Fix image size calculation when SMask dimensions do not match image James R. Barlow 2024-07-16 13:36:48 -07:00
  • 08be7c8bbe update arch base-devel install command (#1354) Iris 2024-07-15 13:28:36 -07:00
  • 980a5472b6 Fix test failures due to 4dde378 James R. Barlow 2024-07-09 15:54:53 -07:00
  • 51c618e357 Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-07-09 14:46:58 -07:00
  • 4dde3786c2 Fix KeyError '/Subtype' James R. Barlow 2024-07-09 14:46:47 -07:00
  • d544342602 Merge branch 'main' of https://github.com/ocrmypdf/OCRmyPDF James Barlow 2024-07-04 22:59:33 -07:00
  • fac91fca2a v16.4.1 release notes v16.4.1 James R. Barlow 2024-06-30 00:11:27 -07:00
  • 6edf756849 optimize: trap Hifi..Error James R. Barlow 2024-06-30 00:08:51 -07:00
  • 4fb1bb4de6 pipeline: fix typo in message James R. Barlow 2024-06-30 00:08:31 -07:00
  • 6a8eb7daaa docs: page seg mode James Barlow 2024-06-26 01:16:31 -07:00
  • 0544d06c3d Fix calculation of image printed area (used in finding weighted DPI for OCR) James R. Barlow 2024-06-21 15:13:51 -07:00
  • 34c285c9ac v16.4.0 release notes (3) v16.4.0 James R. Barlow 2024-06-17 14:40:22 -07:00
  • 2f53b27651 Disable progbar for linearizing when --no-progress-bar set James R. Barlow 2024-06-14 14:18:22 -07:00
  • 772677746b Update issue templates to improve data collection for 3rd party apps James R. Barlow 2024-06-13 15:25:50 -07:00
  • f0bad87ea6 Restore choco since winget isn't supported (still) James R. Barlow 2024-06-13 00:40:05 -07:00
  • 44e71f8c14 Attempt to deal with jbig2dec warnings James R. Barlow 2024-06-13 00:27:33 -07:00
  • 964b30ca26 v16.4.0 release notes (2) James R. Barlow 2024-06-11 16:55:33 -07:00
  • 214a333e2d Block Tesseract 5.4.0 James R. Barlow 2024-06-11 14:44:54 -07:00
  • ec6401ab57 Merge branch 'pr/helkaluin/1300' James R. Barlow 2024-06-09 15:34:19 -07:00
  • cbc5e8ce8d Revert "Delete and de-list snap because it no longer works" James R. Barlow 2024-06-09 15:32:34 -07:00
  • a1c4cfe8f1 Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-06-08 01:37:19 -07:00
  • 3a721e6578 Delete and de-list snap because it no longer works James R. Barlow 2024-06-08 01:23:57 -07:00
  • e6b716cdde Update docker.rst (#1327) Omid Raha 2024-06-08 01:20:51 -07:00
  • 02c39998b8 Note that alpine is now available for arm James R. Barlow 2024-06-08 01:20:25 -07:00
  • 0774bc7f14 v16.4.0 release notes James R. Barlow 2024-06-01 02:01:11 -07:00
  • c6a98b3d0b Merge branch 'feature/alpine-arm' James R. Barlow 2024-06-01 01:55:26 -07:00
  • 981bbf1105 optimize: add a recursion guard to avoid chasing cyclic form xobjects James R. Barlow 2024-06-01 01:52:21 -07:00
  • 2b0c6cfd40 v16.4.0 release ntoes James R. Barlow 2024-06-01 00:26:07 -07:00
  • 59f6bc8306 More Tesseract-specific language checks to its plugin James R. Barlow 2024-06-01 00:15:50 -07:00
  • 653c4ffb45 hocr: accept multiple spaces in bounding boxes James R. Barlow 2024-05-31 02:43:01 -07:00
  • d947ca258e Prevent issuing equ and osd as languages James R. Barlow 2024-05-25 01:17:57 -07:00
  • d5ff7f7db9 batch: fix issues flagged by ruff James R. Barlow 2024-05-21 01:52:57 -07:00
  • 579cef3649 watcher: Ensure output files are .pdf James R. Barlow 2024-05-21 01:51:30 -07:00
  • cb2f090c60 v16.3.1 release notes v16.3.1 James R. Barlow 2024-05-21 01:39:30 -07:00
  • f3d6387bca Fix "OCR" progress bar not matching actual progress James R. Barlow 2024-05-21 01:35:14 -07:00
  • abf9729c61 Semfree test: accept pdfa conversion failed as a valid return code James R. Barlow 2024-05-21 01:26:11 -07:00
  • 442e9c9f0d Add missing codecov token where missed & drop unneeded brew openssl James R. Barlow 2024-05-19 01:07:38 -07:00
  • 397fad249d v16.3.0 release notes v16.3.0 James R. Barlow 2024-05-19 00:50:59 -07:00
  • 9a3c5a3f7c Add progressbar for metadata_fixup James R. Barlow 2024-05-19 00:46:50 -07:00
  • 950c700274 Fix Ghostscript PDF/A progressbar not displaying James R. Barlow 2024-05-19 00:44:21 -07:00
  • 26432c38a9 Raise exception if rotate pages threshold adjusted without --rotate-pages James R. Barlow 2024-05-18 23:49:27 -07:00
  • 28be50136c hocr: If a line box's coords are invalid, log and error and don't render James R. Barlow 2024-05-18 23:32:18 -07:00
  • 0c62f2de5d Issue template: check for EOL OS James R. Barlow 2024-05-17 19:51:15 -07:00
  • 5caf654f22 Add new codecov token James R. Barlow 2024-05-11 01:03:41 -07:00
  • 205593445e Change test to run on macos x64 and arm64 James R. Barlow 2024-05-11 00:13:08 -07:00
  • f25fb8c63a Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-05-08 00:39:27 -07:00
  • 99c78650b6 Add better error message for PDFs with invalid CTMs James R. Barlow 2024-05-07 14:00:30 -07:00
  • 69355886a8 Fix wrong env var for GS path in Snap helkaluin 2024-04-26 16:45:04 +08:00
  • 08e89e2dbe Adding language install docs for archlinux (#1296) Ahmed Abdou 2024-04-24 23:46:05 +02:00
  • 0e013df161 v16.2.0 release notes v16.2.0 James R. Barlow 2024-04-16 00:37:03 -07:00
  • 9ba4e3ab46 Log unusual exceptions when trying to obtain a version James R. Barlow 2024-04-07 14:38:10 -07:00
  • 5fdcb7602b Make downsampling large images that Tesseract would otherwise error on into default behavior James R. Barlow 2024-04-07 13:43:12 -07:00
  • b4db1b741f optimize: fix handling of [/FlateDecode none] - type images James R. Barlow 2024-04-07 01:44:08 -07:00
  • 7a8cc21e31 Add support for sidecar output to io.BytesIO James R. Barlow 2024-04-07 01:38:55 -07:00
  • 0674829d8f Remove tool.black config James R. Barlow 2024-04-07 00:36:52 -07:00
  • 315aa0474b Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-04-07 00:34:51 -07:00
  • df3451e779 Update the typer[all] dependency to typer-slim[standard] (#1287) Ben Beasley 2024-04-07 03:34:34 -04:00
  • 3ba42802d1 added Macports install information (#1286) akierig 2024-04-07 07:33:57 +00:00
  • d6342cb8c2 Add heif/heic input image support James R. Barlow 2024-04-07 00:33:13 -07:00
  • 065bddbc6c Reformat with ruff format James R. Barlow 2024-04-07 00:25:32 -07:00
  • 067f429dde Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-03-26 15:34:00 -07:00
  • 6895c2d70f Fix Broken Documentation Links (#1275) Daniel Lovegrove 2024-03-22 16:38:52 -05:00
  • 686481982a Fix naming of hOCR rendered files James R. Barlow 2024-03-22 13:27:20 -07:00
  • a9e1d19b78 v16.1.2 release notes v16.1.2 James R. Barlow 2024-03-20 12:56:13 -07:00
  • f95aa63718 Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-03-20 12:26:02 -07:00
  • 855de287b2 Fix test suite failure with Ghostscript >= 10.3 James Barlow 2024-03-19 17:20:33 -07:00
  • feeb9f213f batch example: added archive, small corrections and optimizations (#1277) NilsRo 2024-03-18 21:22:24 +01:00
  • e7eb8fa805 Update Dockerfile.alpine (#1268) Emiel Molenaar 2024-03-13 22:49:42 +01:00
  • 8a747f005a pixels -> megapixels James R. Barlow 2024-02-29 15:31:07 -08:00
  • 16ab4a8b4e Fix error message about missing Python exec James R. Barlow 2024-02-21 23:54:41 -08:00
  • 8d30cff4ef Undo future annotations from watcher.py till Typer fixes its issue James R. Barlow 2024-02-20 19:14:24 -08:00
  • 59d5b0d1bd v16.1.1 release notes v16.1.1 James R. Barlow 2024-02-15 16:56:25 -08:00
  • 9ec0745ab8 Try pypy3.10 James R. Barlow 2024-02-14 14:25:13 -08:00
  • 3a3635f7f9 Python 3.10 cleanup, manual fixes James R. Barlow 2024-02-14 12:48:17 -08:00
  • 6a746a1cbb ruff linting/Python 3.10 cleanup James R. Barlow 2024-02-14 12:41:51 -08:00
  • 906c130f96 Update rust toml settings James R. Barlow 2024-02-14 12:32:26 -08:00
  • 4a78458821 v16.1.0 release notes v16.1.0 James R. Barlow 2024-02-12 01:46:21 -08:00
  • fddf3ce2f4 Clarify warnings filter James R. Barlow 2024-02-12 01:43:47 -08:00
  • 353b34e695 Merge branch 'feature/pageboxes' James R. Barlow 2024-02-12 01:41:56 -08:00
  • 7d63355c3c Use hocr renderer for LTR languages James R. Barlow 2024-02-12 01:41:41 -08:00
  • 42ff7fc842 Fix handling of pages that are restored to correct orientation with /Rotate James R. Barlow 2024-02-12 01:32:26 -08:00
  • 26470fe16a Suppress reportlab deprecation warning James R. Barlow 2024-02-12 01:17:08 -08:00
  • 3b9d4b7f0a Attempt to deal with oddball mediaboxes James R. Barlow 2023-10-31 00:33:10 -07:00
  • 11f53fe9a9 First cut at propagating page boxes James R. Barlow 2023-10-31 00:12:15 -07:00
  • 123c0c766f Mention pipx, install --user --upgrade James R. Barlow 2024-02-08 09:42:00 -08:00
  • 6a9be2142e Advise Homebrew on Linux for Ubuntu 20.04 James R. Barlow 2024-02-07 19:52:50 -08:00
  • 0bc350f55e Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2024-02-06 01:28:10 -08:00
  • 7a6edf62ba Bump codecov/codecov-action from 3 to 4 (#1247) dependabot[bot] 2024-02-05 03:55:13 -08:00
  • 07b6f06f11 optimize: log images with unclear decode tables James R. Barlow 2024-02-01 15:42:40 -08:00
  • 2005f622bb Update gs dependency & instructions for RHEL (#1228) nisbet-hubbard 2024-01-25 02:07:58 +08:00