Commit Graph

  • 5cb5d7a682 Merge remote-tracking branch 'origin/dependabot/github_actions/codecov/codecov-action-7' main James R. Barlow 2026-06-09 01:12:35 -07:00
  • 37e71dece6 Merge branch 'feature/page-box-repair' James R. Barlow 2026-06-09 01:10:58 -07:00
  • df84945773 feat: validate and repair malformed page boxes James R. Barlow 2026-06-09 01:10:26 -07:00
  • b5a6a9f9f1 feat: discard stale structure tree when re-OCRing tagged PDFs James R. Barlow 2026-06-08 15:47:53 -07:00
  • ed36aefe48 Bump codecov/codecov-action from 6 to 7 dependabot[bot] 2026-06-08 10:42:38 +00:00
  • 32013f4294 Merge branch 'feature/discard-obsolete-pdf-features' James R. Barlow 2026-06-07 02:09:00 -07:00
  • 8f2bcc2c64 feat: discard stale embedded page thumbnails when rewriting PDF James R. Barlow 2026-06-07 00:30:34 -07:00
  • 015b53ae30 feat: discard stale embedded text search index when rewriting PDF James R. Barlow 2026-06-07 00:02:28 -07:00
  • 164cf2dc8a test: use bundled font in gray-mask test for macOS/Windows portability James R. Barlow 2026-06-05 12:48:33 -07:00
  • 98d6d02704 Merge branch 'fix/1688-mask-fill-color-device' James R. Barlow 2026-06-05 11:41:18 -07:00
  • 5efb98931d fix: inherit fill color into Form XObjects; reset fill color on cs (#1688) James R. Barlow 2026-06-05 01:25:43 -07:00
  • 2f4e47213f style: ruff format operator whitelist line (#1688) James R. Barlow 2026-06-05 01:16:08 -07:00
  • 94c8123bd7 docs: release note for image mask fill-color device promotion (#1688) James R. Barlow 2026-06-05 01:14:45 -07:00
  • 0db130e1c3 test: end-to-end gray image-mask OCR across rasterizers (#1688) James R. Barlow 2026-06-05 01:13:22 -07:00
  • 91b6a818f5 feat: promote raster device for color/gray image masks; default pngmonod (#1688) James R. Barlow 2026-06-05 01:07:27 -07:00
  • 6bc9499e68 feat: recognize pngmonod device in pypdfium rasterizer (#1688) James R. Barlow 2026-06-05 01:07:27 -07:00
  • 09f2d6c386 feat: add pngmonod raster device to enum (#1688) James R. Barlow 2026-06-05 01:01:50 -07:00
  • 87f918f58c feat: expose fill-color ink classification on ImageInfo (#1688) James R. Barlow 2026-06-05 00:57:08 -07:00
  • 80e77fb021 feat: track image mask fill color during content stream interpretation (#1688) James R. Barlow 2026-06-05 00:42:38 -07:00
  • fa9c5b3fae feat: add fill color -> Ink classification helper (#1688) James R. Barlow 2026-06-05 00:34:30 -07:00
  • 3d17a60a54 feat: add Ink classification type for image mask fill colors (#1688) James R. Barlow 2026-06-05 00:30:35 -07:00
  • c33f073d4f Improve DeviceN color conversion guidance (#1623) (#1694) jbarlow 2026-06-04 14:48:24 -07:00
  • 5d7b5742e4 Bump version: v17.5.0 v17.5.0 James R. Barlow 2026-05-27 13:36:30 -07:00
  • c391b2b7d0 Draft release notes for v17.5.0 James R. Barlow 2026-05-27 13:35:45 -07:00
  • 0250929150 Update uv.lock James R. Barlow 2026-05-26 13:11:12 -07:00
  • 9748208e68 Support 'end' alias for last page in --pages James R. Barlow 2026-05-26 12:18:09 -07:00
  • e4b0c04be4 Fix pypdfium2 MediaBox rendering when CropBox is smaller James R. Barlow 2026-05-25 23:17:17 -07:00
  • efb83ad64f Add --ghostscript-jpeg-quality and --ghostscript-jpeg-maxdpi James R. Barlow 2026-05-25 10:20:54 -07:00
  • 08e40f96e8 Surface Tesseract config errors instead of FileNotFoundError James R. Barlow 2026-05-25 01:45:45 -07:00
  • 3f6feb1dcc Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF James R. Barlow 2026-05-25 01:38:46 -07:00
  • ab6553f4ff Merge pull request #1677 from ocrmypdf/dependabot/uv/gitpython-3.1.50 jbarlow 2026-05-25 01:36:02 -07:00
  • cedca9fa1f Merge pull request #1679 from ocrmypdf/dependabot/uv/urllib3-2.7.0 jbarlow 2026-05-25 01:35:45 -07:00
  • 3f40118022 Merge pull request #1686 from ocrmypdf/dependabot/uv/idna-3.15 jbarlow 2026-05-25 01:35:31 -07:00
  • b18b1da6d0 Bump idna from 3.11 to 3.15 dependabot[bot] 2026-05-19 21:26:42 +00:00
  • 14fb9f56e8 Add explanatory note about Ghostscript -dJPEG=95 James R. Barlow 2026-05-16 12:18:34 -07:00
  • 8709cf506b Update uv.lock James R. Barlow 2026-05-12 10:14:47 -07:00
  • 9a92eb40df Merge pull request #1680 from cislunarspace/docs/refresh-chinese-readme jbarlow 2026-05-12 15:52:47 +02:00
  • 0a59c210f9 docs: refresh Chinese README translation ouyangjiahong 2026-05-12 09:16:32 +08:00
  • 0b370fdd15 Bump urllib3 from 2.6.3 to 2.7.0 dependabot[bot] 2026-05-11 17:48:53 +00:00
  • 1c16dd26f7 Bump gitpython from 3.1.47 to 3.1.50 dependabot[bot] 2026-05-09 04:47:56 +00:00
  • c355d927ba Bump gitpython from 3.1.46 to 3.1.47 dependabot[bot] 2026-04-26 01:22:53 +00:00
  • c993857752 Fix Form XObject cycle detection in image xref scan (#1321) James R. Barlow 2026-04-25 00:48:25 -07:00
  • 84f5fe9ee0 Separate probing from execution in _exec and subprocess modules James R. Barlow 2026-04-24 13:33:34 -07:00
  • 3336d67e77 Fix CJK test broken by fpdf2 2.8.7 CFF font encoding change v17.4.2 James R. Barlow 2026-04-19 23:26:30 -07:00
  • 73e16e7821 Merge remote-tracking branch 'origin/dependabot/github_actions/codecov/codecov-action-6' James R. Barlow 2026-04-19 13:59:42 -07:00
  • 6f1d37d78f Merge remote-tracking branch 'origin/dependabot/github_actions/sigstore/gh-action-sigstore-python-3.3.0' James R. Barlow 2026-04-19 13:59:30 -07:00
  • 2ed82de2e0 Update uv.lock again - pygithub James R. Barlow 2026-04-19 13:58:46 -07:00
  • c43903fa14 Bump version: v17.4.2 James R. Barlow 2026-04-19 13:45:34 -07:00
  • 1c89cacfef Respect host-set PIL.Image.MAX_IMAGE_PIXELS in Python API James R. Barlow 2026-04-19 13:44:08 -07:00
  • 75714fe43e Update uv.lock James R. Barlow 2026-04-19 13:06:22 -07:00
  • e371ce95ca Bump sigstore/gh-action-sigstore-python from 3.2.0 to 3.3.0 dependabot[bot] 2026-04-06 10:50:26 +00:00
  • 716a2e22c3 Bump codecov/codecov-action from 5 to 6 dependabot[bot] 2026-04-06 10:50:22 +00:00
  • 10e6019ada Bump version: v17.4.1 v17.4.1 James R. Barlow 2026-04-06 00:34:08 -07:00
  • 89c76b5145 v17.4.1 release notes James R. Barlow 2026-04-05 00:23:07 -07:00
  • 83c04e6399 Update GS JPEG corruption warning for 10.7.0+ James R. Barlow 2026-04-04 01:59:57 -07:00
  • 7fdeeb3635 Refactor word_render_data tuple into WordRenderData dataclass James R. Barlow 2026-04-04 01:43:25 -07:00
  • 5be368fe75 Fix RTL text extraction order in fpdf2 renderer (#1655) James R. Barlow 2026-04-04 01:40:38 -07:00
  • 91c5b1e480 Merge pull request #1613 from bluebox-steven:add-options.work_folder-to-pdfcontext jbarlow 2026-04-03 01:28:18 -07:00
  • 73154b97ba Merge pull request #1643 from ocrmypdf:dependabot/github_actions/actions/upload-artifact-7 jbarlow 2026-04-03 01:13:07 -07:00
  • 76a40759ae Merge pull request #1644 from ocrmypdf:dependabot/github_actions/actions/download-artifact-8 jbarlow 2026-04-03 01:12:43 -07:00
  • 12ce565e98 Merge pull request #1646 from ocrmypdf:dependabot/github_actions/docker/setup-qemu-action-4 jbarlow 2026-04-03 01:12:02 -07:00
  • 9f46126859 Merge pull request #1647 from ocrmypdf:dependabot/github_actions/docker/login-action-4 jbarlow 2026-04-03 01:11:36 -07:00
  • 11849e5a70 Merge pull request #1648 from ocrmypdf:dependabot/github_actions/docker/setup-buildx-action-4 jbarlow 2026-04-03 01:08:58 -07:00
  • e30c00cc26 Merge pull request #1649 from ocrmypdf:dependabot/uv/tornado-6.5.5 jbarlow 2026-04-03 01:07:59 -07:00
  • 001b403657 Bump tornado from 6.5.4 to 6.5.5 dependabot[bot] 2026-04-03 08:06:38 +00:00
  • 851c61ee85 Merge pull request #1657 from ocrmypdf:dependabot/uv/cryptography-46.0.6 jbarlow 2026-04-03 01:06:25 -07:00
  • f5ebd23b8f Merge pull request #1653 from ocrmypdf:dependabot/uv/requests-2.33.0 jbarlow 2026-04-03 01:05:58 -07:00
  • 81118c6195 Merge pull request #1658 from ocrmypdf:dependabot/uv/pygments-2.20.0 jbarlow 2026-04-03 01:05:22 -07:00
  • 834b60a02a Bump pygments from 2.19.2 to 2.20.0 dependabot[bot] 2026-03-30 20:07:51 +00:00
  • 47e3b5b4d2 Bump cryptography from 46.0.5 to 46.0.6 dependabot[bot] 2026-03-29 02:02:03 +00:00
  • d9346cc3d8 Bump requests from 2.32.5 to 2.33.0 dependabot[bot] 2026-03-26 17:26:22 +00:00
  • 4e974ebd46 Bump version: v17.4.0 v17.4.0 James R. Barlow 2026-03-21 01:43:13 -07:00
  • 6f2b8408c1 v17.4.0 release notes James R. Barlow 2026-03-21 01:43:03 -07:00
  • 1dba941261 Add cyclopts for dev James R. Barlow 2026-03-21 01:37:48 -07:00
  • ef76625abb Fix text stretching in fpdf2 renderer for widely-spaced words James R. Barlow 2026-03-16 16:00:00 -07:00
  • 57bb554a70 Fix verapdf NotADirectoryError crash on some platforms James R. Barlow 2026-03-10 02:08:59 -07:00
  • 5b9d6f979e Add --no-overwrite / -n option to prevent overwriting output files James R. Barlow 2026-03-10 01:58:57 -07:00
  • b588e3bfd7 Fix optimize=2/3 crash when using Python API James R. Barlow 2026-03-10 01:51:07 -07:00
  • a35dd1f9ee Bump docker/setup-buildx-action from 3 to 4 dependabot[bot] 2026-03-09 11:18:20 +00:00
  • bf46f4fe35 Bump docker/login-action from 3 to 4 dependabot[bot] 2026-03-09 11:18:16 +00:00
  • 55b76338a8 Bump docker/setup-qemu-action from 3 to 4 dependabot[bot] 2026-03-09 11:18:10 +00:00
  • 2af7b1c179 Bump actions/download-artifact from 7 to 8 dependabot[bot] 2026-03-02 11:32:46 +00:00
  • 69f4cca9b6 Bump actions/upload-artifact from 6 to 7 dependabot[bot] 2026-03-02 11:32:40 +00:00
  • 59190ef643 Bump version: v17.3.0 v17.3.0 James R. Barlow 2026-02-21 00:00:26 -08:00
  • 910ccccc7d Fix bump-version James R. Barlow 2026-02-21 00:00:14 -08:00
  • 0c15ff594c v17.3.0 release notes James R. Barlow 2026-02-20 23:52:48 -08:00
  • e19ea653aa Switch to static versioning and two-workflow release model James R. Barlow 2026-02-20 23:34:03 -08:00
  • a899f0d59a Split release_notes into parts for each major release James R. Barlow 2026-02-20 18:19:31 -08:00
  • b4e8e9dac9 Fix Python API ignoring language parameter (fixes #1640) James R. Barlow 2026-02-20 17:03:16 -08:00
  • aca5eb626b Docker: increase alpine version to 3.23 James R. Barlow 2026-02-20 11:06:33 -08:00
  • bd4a74de0e Restore image rendering for hocrtransform James R. Barlow 2026-02-18 18:00:34 -08:00
  • 10b71937c4 Fix OCR text displacement on PDFs with non-zero MediaBox origins James R. Barlow 2026-02-17 23:34:33 -08:00
  • 5890d1855e Fix Python API producing empty OCR due to tesseract_timeout defaulting to 0 James R. Barlow 2026-02-17 21:55:49 -08:00
  • 3da952a23d Fix garbled Arabic/Devanagari text by using HarfBuzz text shaping v17.2.0 James R. Barlow 2026-02-11 01:30:15 -08:00
  • 716ce6324c Update dependencies James R. Barlow 2026-02-11 00:43:01 -08:00
  • 76fe2f7e28 Merge remote-tracking branch 'origin/dependabot/uv/cryptography-46.0.5' James R. Barlow 2026-02-11 00:42:21 -08:00
  • c85c8941d3 Fix pdftotext word spacing by emitting single BT block per line James R. Barlow 2026-02-11 00:38:38 -08:00
  • 9a0dadbd4c Bump cryptography from 46.0.4 to 46.0.5 dependabot[bot] 2026-02-11 02:57:37 +00:00
  • 4d7e398c4b Suppress rendering of text lines with improbable aspect ratios James R. Barlow 2026-02-10 17:42:33 -08:00
  • 56c0b41f97 Fix extreme font sizes for rotated text in fpdf2 renderer James R. Barlow 2026-02-10 17:02:25 -08:00