Next | Query returned 29 messages, browsing 1 to 10 | Previous

History of commit frequency

CVS Commit History:


   2025-01-27 15:00:11 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-pdf: updated to 5.2.0

Version 5.2.0, 2025-01-26

Deprecations (DEP)
- Deprecate with replacement CCITParameters
- Correct deprecation of interiour_color

New Features (ENH)
- Support alternative (U)F names for embedded file retrieval
- Adding support for reading .metadata.keywords

Bug Fixes (BUG)
- Handle further Tf operators in text extraction layout mode
- Ensure `add_metadata` can deal with `_info = None`
- Handle IndirectObject in CCITTFaxDecode filter
- Handle chained colorspace for inline images when no filter is set
- Avoid extracting inline images twice and dropping other operators
- Fixed reference of value with `str.__new__` in TextStringObject
- Handle indirect objects in font width calculations
- Title sometimes is bytes and not str
- Fix undefined variable for text extraction (regression)
- Don't close stream passed to PdfWriter.write()

Robustness (ROB)
- Handle zero height fonts when extracting text
- Deal with content streams not containing streams
- Gracefully handle some text operators when the operands are missing
- Fall back to non-Adobe Ascii85 format for missing end markers
- Ignore odd-length strings when processing cmap lines
- Skip annotation destination being NullObject in PdfWriter
- Skip destination page being None in PdfWriter
- Fix infinite loop case when reading null objects within an Array
- Fixing infinite loop in ArrayObject read_from_stream

Documentation (DOC)
- Add note about default line colors

Developer Experience (DEV)
- Remove ignoring Ruff rule PGH004
- Tidy ignore array in tool.ruff.lint
- Move Windows CI to Python 3.13
- Move to Ubuntu 22.04

Maintenance (MAINT)
- Fix formatting of warning message and include exception message
- Narrow return type for `ContentStream.operations`

Testing (TST)
- Fix image similarity for upcoming Ubuntu 24.04
- Replace broken Apache Tika Corpora urls

Code Style (STY)
- Add form feed to WHITESPACES
- Lots of small internal changes
   2024-11-04 18:58:39 by Thomas Klausner | Files touched by this commit (3) | Package updated
Log message:
py-pdf: update to 5.1.0.

## Version 5.1.0, 2024-10-27

### New Features (ENH)
- Add `layout_mode_font_height_weight` argument to `PageObject.extract_text()` \ 
(#2920)

### Bug Fixes (BUG)
- Fix font specificier for FreeText annotation (#2893)
- Line breaks are not generated due to incorrect calculation of text leading (#2890)
- Improve handling of spaces in text extraction (#2882)

### Robustness (ROB)
- Soft failure for flate encode image mode 1 with wrong LUT size (#2900)

### Documentation (DOC)
- Use latest package versions (#2907)
- Correct example of reading FileAttachment annotation (#2906)

### Developer Experience (DEV)
- Update pinned requirements (#2918)
- Make make_release.py compatible with Windows environment (#2894)

### Maintenance (MAINT)
- Remove references to outdated Python versions (#2919)
- Generalize the method of obtaining space_code (#2891)
- Unnecessary character mapping process (#2888)
- New LZW decoding implementation (#2887)

### Testing (TST)
- Add LzwCodec for encoding (#2883)

### Code Style (STY)
- Capitalize error messages (#2903)
- Modify error messages in PdfWriter (#2902)
   2024-10-11 14:43:58 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-pdf: updated to 5.0.1

Version 5.0.1, 2024-09-29

New Features (ENH)
- Add `full` parameter to PdfWriter constructor

Bug Fixes (BUG)
- Update pyproject.toml with minimum Python version of 3.8
- Cope with unbalanced delimiters in dictionary object
- Cope with encoding with too many differences
- Missing spaces in extract_text() method
- Tolerate truncated files and no warning when jumping startxref

Robustness (ROB)
- Repair PDF with invalid Root object
- Continue parsing dictionary object when error is detected
- Merge documents with invalid pages in named destinations
- Tolerate comments in arrays

Developer Experience (DEV)
- Use latest Python version for benchmarking

Maintenance (MAINT)
- Add tests to source distributions
- Refactor _update_field_annotation
   2024-09-22 18:19:49 by Thomas Klausner | Files touched by this commit (2) | Package updated
Log message:
py-pdf: update to 5.0.0.

## Version 5.0.0, 2024-09-15

This version drops support for Python 3.7 (not maintained since July 2023), \ 
PdfMerger (use PdfWriter instead) and AnnotationBuilder (use annotations \ 
instead).

### Deprecations (DEP)
- Remove the deprecated PfdMerger and AnnotationBuilder classes and other \ 
deprecations cleanup (#2813)
- Drop Python 3.7 support (#2793)

### New Features (ENH)
- Add capability to remove /Info from PDF (#2820)
- Add incremental capability to PdfWriter (#2811)
- Add UniGB-UTF16 encodings (#2819)
- Accept utf strings for metadata (#2802)
- Report PdfReadError instead of RecursionError (#2800)
- Compress PDF files merging identical objects (#2795)

### Bug Fixes (BUG)
- Fix sheared image (#2801)

### Robustness (ROB)
- Robustify .set_data() (#2821)
- Raise PdfReadError when missing /Root in trailer (#2808)
- Fix extract_text() issues on damaged PDFs (#2760)
- Handle images with empty data when processing an image from bytes (#2786)

### Developer Experience (DEV)
- Fix coverage uploads (#2832)
- Test against Python 3.13 (#2776)
   2024-07-17 05:50:23 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-pdf: updated to 4.3.0

Version 4.3.0, 2024-06-23

New Features (ENH)
- Accept ETen-B5 and UniCNS-UTF16 encodings
- Add decode_as_image() to ContentStreams
- context manager for PdfReader
- Add capability to set font and size in fields
- Allow to pass input file without named argument

Bug Fixes (BUG)
- Fix deprecation for Ressources when using old constants
- Fix images issue 4 bits encoding and LUT starting with UTF16_BOM
- Reading large compressed images takes huge time to process
- Highlighted Text Cannot Be Printed
- Fix UnboundLocalError on malformed pdf

Robustness (ROB)
- Cope with missing Standard 14 fonts in fields
- Improve inline image extraction
- Cope with loops in Fields tree
- Discard /I in choice fields for compatibility with Acrobat
- Cope with some issues in pillow
- Cope with some image extraction issues

Documentation (DOC)
- Various improvements on docstrings and examples

Maintenance (MAINT)
- Deprecate interiour_color with replacement interior_color
- Add deprecate_with_replacement to PdfWriter.find_bookmark

Code Style (STY)
- Change Link to be a non-markup annotation
   2024-04-14 10:58:05 by Thomas Klausner | Files touched by this commit (1)
Log message:
py-pdf: fix depends for Python 3.10

Also needs py-typing-extensions there.

Bump PKGREVISION.
   2024-04-10 13:19:09 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-pdf: updated to 4.2.0

Version 4.2.0, 2024-04-07

New Features (ENH)
- Allow multiple charsets for NameObject.read_from_stream
- Add support for /Kids in page labels
- Allow to update fields on many pages
- Tolerate PDF with invalid xref pointed objects
- Add Enforce from PDF2.0 in viewer_preferences
- Add += and -= operators to ArrayObject

Bug Fixes (BUG)
- Fix merge_page sometimes generating unknown operator 'QQ'
- Fix fields update where annotations are kids of field
- Process CMYK images without a filter correctly
- Extract text in layout mode without finding resources
- Prevent recursive loop in some PDF files

Robustness (ROB)
- Tolerate "truncated" xref
- Replace error by warning for EOD in RunLengthDecode/ASCIIHexDecode
- Rebuild xref table if one entry is invalid
- Robustify stream extraction

Documentation (DOC)
- Update release process for latest changes
- Encryption/decryption: Clone document instead of copying all pages
- Minor improvements
- Update annotation list
- Update references and formatting
- Correct threads reference, plus minor changes
- Minor readability increases
- Simplify PaperSize examples
- Minor improvements

Developer Experience (DEV)
- Remove unused dependencies
- Remove page labels PR link from message
- Fix changelog generator regarding whitespace and handling of "Other" \ 
group
- Add REL to known PR prefixes
- Release using the REL commit instead of git tag
- Unify code between PdfReader and PdfWriter
- Bump softprops/action-gh-release from 1 to 2

Maintenance (MAINT)
- Ressources → Resources (and internal name childs)
- Fix typos found by codespell
- Update Read the Docs configuration
- Add root_object, _info and _ID to PdfReader

Testing (TST)
- Allow loading truncated images if required
- Fix download issues from
- Improve test_get_contents_from_nullobject to show real use-case
- Add missing test annotations
   2024-03-11 14:18:00 by Thomas Klausner | Files touched by this commit (2) | Package updated
Log message:
py-pdf: update to 4.1.0.

## Version 4.1.0, 2024-03-03

Generating name objects (`NameObject`) without a leading slash
is considered deprecated now. Previously, just a plain warning
would be logged, leading to possibly invalid PDF files. According
to our deprecation policy, this will log a *DeprecationWarning*
for now.

### New Features (ENH)
- Add get_pages_from_field  (#2494)
- Add reattach_fields function (#2480)
- Automatic access to pointed object for IndirectObject (#2464)

### Bug Fixes (BUG)
- Missing error on name without leading / (#2387)
- encode_pdfdocencoding() always returns bytes (#2440)
- BI in text content identified as image tag (#2459)

### Robustness (ROB)
- Missing basefont entry in type 3 font (#2469)

### Documentation (DOC)
- Improve lossless compression example (#2488)
- Amend robustness documentation (#2479)

### Developer Experience (DEV)
- Fix changelog for UTF-8 characters (#2462)

### Maintenance (MAINT)
- Add _get_page_number_from_indirect in writer (#2493)
- Remove user assignment for feature requests (#2483)
- Remove reference to old 2.0.0 branch (#2482)

### Testing (TST)
- Fix benchmark failures (#2481)
- Broken test due to expired test file URL (#2468)
- Resolve file naming conflict in test_iss1767 (#2445)
   2024-02-19 07:09:44 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-pdf: updated to 4.0.2

Version 4.0.2, 2024-02-18

Bug Fixes (BUG)
-  Use NumberObject for /Border elements of annotations
   2024-01-28 18:33:44 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-pdf: updated to 4.0.1

Version 4.0.1, 2024-01-28

Bug Fixes (BUG)
- layout mode text extraction ZeroDivisionError

Testing (TST)
- Skip tests using fpdf2 if it's not installed

Next | Query returned 29 messages, browsing 1 to 10 | Previous