./devel/py-bitstring, Simple construction, analysis and modification of binary data

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 4.2.3, Package name: py312-bitstring-4.2.3, Maintainer: pkgsrc-users

bitstring is a pure Python module that makes the creation, manipulation
and analysis of binary data as simple and natural as possible.


Master sites:

Filesize: 244.665 KB

Version history: (Expand)


CVS history: (Expand)


   2024-11-11 08:29:31 by Thomas Klausner | Files touched by this commit (862)
Log message:
py-*: remove unused tool dependency

py-setuptools includes the py-wheel functionality nowadays
   2024-05-27 07:05:48 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-bitstring: updated to 4.2.3

4.2.3

More small bug fixes related to some of the new (beta) float formats.

* Some codes representing exotic float negative zero converted to positive zero.
* Auto-scaling rounding the wrong way on occasion.
   2024-05-11 07:06:04 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-bitstring: updated to 4.2.2

4.2.2

A couple more minor bug fixes.

* Sometimes a ValueError was being raised instead of a ReadError.
* Initialising a bitstring from None now raises a TypeError rather than generating
  an empty bitstring.
* Fixed performance regression for find/findall in some situations.
* Fix for AttributeError bug when combining Bits with BitStream.
   2024-05-08 14:00:21 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-bitstring: updated to 4.2.1

April 2024: version 4.2.1
-------------------------
Fixing a few regressions introduced in 4.2.0.

* Module crashes on import with 32-bit Python.
* Lists of integers not converted to bytes when using the bytes constructor.
* Empty comma separated tokens not handled correctly.
* Crash on import when docstrings not present due to optimize flag.

April 2024: version 4.2.0
-------------------------

This release contains a fairly large refactor of how different types are \ 
managed. This
shouldn't affect the end user, and the main noticeable change should be the new Dtype
class, which is optional to use.

Support for 8-bit and smaller floats has been reworked and expanded. These are still
a 'beta' feature.

Backwardly incompatible changes:

* Dropped support for Python 3.7. Minimum version is now 3.8.
* For tokens that use a non-numeric length, a ':' is now compulsory rather than
  recommended. For example use 'uint:foo' instead of 'uintfoo'.
* The previous e4m3float and e5m2float formats have become the slightly modified
  p4binary8 and p3binary8 formats.
* Some parameters are now enforced as positional only, such as `auto` in \ 
constructors.

Other changes:

* The Array class is no longer 'beta'.

* A new Dtype class can be optionally used to specify types.

* The bitstring.options object is now the preferred method for changing module \ 
options.
  The `bitstring.lsb0` and `bitstring.bytealigned` variables are now deprecated, use
  `bitstring.options.lsb0` and `bitstring.options.bytealigned` instead.

* New fromstring method as another way to create bitstrings from formatted strings.
  Instead of relying on the `auto` parameter you can now optionally use `fromstring`.
  >>> s1 = BitArray('u24=1000')  # This is still fine
  >>> s2 = BitArray.fromstring('u24=1000')  # This may be clearer and \ 
more efficient.

* More types can now be pretty printed. For example integer and float formats \ 
can be used.
  >>> s.pp('u15, bin')

* Pretty printing is now prettier - optional terminal colours added.

* A range of 8-bit, 6-bit and even 4-bit float formats added (beta):
  p3binary8: IEEE 8-bit floating point with 3 bit precision.
  p4binary8: IEEE 8-bit floating point with 4 bit precision.
  e5m2mxfp: OCP 8-bit floating point with 3 bit precision.
  e4m3mxfp: OCP 8-bit floating point with 4 bit precision.
  e2m3mxfp: OCP 6-bit floating point with 4 bit precision.
  e3m2mxfp: OCP 6-bit floating point with 3 bit precision.
  e2m1mxfp: OCP 4-bit floating point with 2 bit precision.
  e8m0mxfp: OCP 8-bit unsigned floating point designed to scale the other formats.
  mxint: OCP 8-bit floating point that is a scaled integer representation.

* Performance improvements.
   2023-12-07 13:37:38 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-bitstring: updated to 4.1.4

version 4.1.4
Fixing a regression introduced in 4.1.3
* 'bytes' token can't be used without explicit length.
   2023-11-23 09:56:51 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-bitstring: updated to 4.1.3

version 4.1.3
A maintenance release, with some changes to the beta features introduced in 4.1.

* Removed a couple of files that accidentally got included in the previous release.
* The 8-bit float formats have been renamed 'e4m3float' and 'e5m2float'.
* Some refactoring and performance optimizations.
   2023-11-13 11:59:21 by Thomas Klausner | Files touched by this commit (3) | Package updated
Log message:
py-bitstring: update to 4.1.2.

--------------------------------------
September 2023: version 4.1.2 released
--------------------------------------
Another maintenance release. Once again some small changes to the 'beta' Array class,
plus new Array functionality.

* Fix for the module command-line usage. Bug #290.
* Fix for when creating bitstrings from memoryview objects.
* Renamed the 'fmt' parameter for Arrays to 'dtype'.
* More Array operator coverage.
* Added operators that act on two Arrays of the same size.
* Added comparison operators for Arrays that return an Array of bools.
* Added Array.equals method as == will now return an Array (see above item).
* Added astype() method for Arrays to easily cast to a new dtype.

-----------------------------------
August 2023: version 4.1.1 released
-----------------------------------
A maintenance release, with some changes to the Array class which is still in 'beta'.

* bitarray dependency now pinned to ">=2.8.0, <3.0.0" rather \ 
than a specific version. Bug #283.
* Fix for using numpy integers as integer parameters. Bug #286.
* Removed ability to extend an Array with the '+' operator. Use the 'extend' \ 
method instead.
* Improvements when pretty-printing the Array.
* Array.count() can now count 'nan' values for floating point types.

-----------------------------------
August 2023: version 4.1.0 released
-----------------------------------

This has turned into a suprisingly big release, with a major refactor and a brand new
class (the first for 12 years!) There are also a couple of small possibly \ 
breaking changes
detailed below, in particular 'auto' initialising bitstrings from integers is \ 
now disallowed.

* Speed increased with bitarray dependency.

The major weakness of bitstring has been its poor performance for computationally
intensive tasks relative to lower level alternatives. This was principally due to
relying on pure Python code to achieve things that the base language often \ 
didn't have
fast ways of doing.

This release starts to address that problem with a fairly extensive rewrite to \ 
replace
much of the pure Python low-level bit operations with methods from the bitarray \ 
package.
This is a package that does many of the same things as bitstring, and the two \ 
packages
have co-existed for a long time. While bitarray doesn't have all of the options and
facilities of bitstring it has the advantage of being very fast as it is \ 
implemented in C.
By replacing the internal datatypes I can speed up bitstring's operations while \ 
keeping
the same API.

Huge kudos to Ilan Schnell for all his work on bitarray.

* New Array class for homogeneous data (beta)

If your data is all of the same type you can make use of the new Array class, which
mirrors much of the functionality of the standard array.array type, but doesn't \ 
restrict
you to just a dozen formats.

  >>> from bitstring import Array
  >>> a = Array('uint7', [9, 100, 3, 1])
  >>> a.data
  BitArray('0x1390181')
  >>> b = Array('float16', a.tolist())
  >>> b.append(0.25)
  >>> b.tobytes()
  b'H\x80V@B\x00<\x004\x00'
  >>> b.tolist()
  [9.0, 100.0, 3.0, 1.0, 0.25]

The data is stored efficiently in a BitArray object, and you can manipulate both the
data and the Array format freely. See the main documentation for more details. \ 
Note that
this feature carries the 'beta' flag so may change in future point versions.

Other changes:

* Added two new floating point interpretations: float8_143 and float8_152. These \ 
are 8-bit
  floating point formats, with very limited range and precision, but useful in \ 
some fields,
  particularly machine learning. This is an experimental feature - the formats \ 
haven't
  even been standardised yet.

  >>> a = Bits(float8_143=16.5)
  >>> a.bin
  '01100000'
  >>> a.float8_143
  16.0

* Auto initialistion from ints has been removed and now raises a TypeError. \ 
Creating a
  bitstring from an int still creates a zeroed bitstring of that length but ints \ 
won't
  be promoted to bitstrings as that has been a constant source of errors and \ 
confusion.

  >>> a = BitArray(100)  # Fine - create with 100 zeroed bits
  >>> a += 0xff   # TypeError - previously this would have appended \ 
0xff (=255) zero bits.
  >>> a += '0xff'  # Probably what was meant - append eight '1' bits.
  >>> a += Bits(255)  # Fine, append 255 zero bits.

  This is a breaking change, but it breaks loudly with an exception, it is \ 
easily recoded,
  and it removes a confusing wrinkle.

* Explicitly specifying the 'auto' parameter is now disallowed rather than \ 
discouraged.
  It was always meant to be a positional-only parameter (and will be once I can drop
  Python 3.7 support) but for now it's renamed to '__auto'. In the unlikely event
  this breaks code, the fix should be just to delete the 'auto=' if it's already the
  first parameter.

  >>> s = Bits(auto='0xff')  # Now raises a CreationError
  >>> s = Bits('0xff')  # Fine, as always

* Deleting, replacing or inserting into a bitstring resets the bit position to 0 \ 
if the
  bitstring's length has been changed. Previously the bit position was adjusted but
  this was not well defined.

* Only empty bitstring are now considered False in a boolean sense. Previously s was
  False is no bits in s were set to 1, but this goes against what it means to be a
  container in Python so I consider this to be a bug, even if it was documented. I'm
  guessing it's related to __nonzero__ in Python 2 becoming __bool__ in Python 3, and
  it's never been fixed before now.

* Casting to bytes now behaves as expected, so that bytes(s) gives the same result as
  s.tobytes(). Previously it created a byte per bit.

* Pretty printing with the 'bytes' format now uses characters from the 'Latin \ 
Extended-A'
  unicode block for non-ASCII and unprintable characters instead of replacing \ 
them with '.'

* When using struct-like codes you can now use '=' instead of '@' to signify native-
  endianness. They behave identically, but the new '=' is now preferred.

* More fixes for LSB0 mode. There are now no known issues with this feature.

----------------------------------
April 2023: version 4.0.2 released
----------------------------------
A maintenance release.

* Added py.typed file and converted the module to a package to let mypy find type
  annotations. Bug 248.
* Fix to shifting operations when using LSB0 mode. Bug 251.
* A few more fixes for LSB0 mode.
* Improved LSB0 documentation.
* Added build-system section to pyproject.toml. Bug 243.
* Rewrote the walkthrough documentation as a jupyter notebook.
* Updated the project's logo.

-------------------------------------
November 2022: version 4.0.1 released
-------------------------------------

This is a major release which drops support for Python 2.7 and has a new minimum
requirement of Python 3.7. Around 95% of downloads satisfy this - users of
older versions can continue to use bitstring 3.1, which will still be supported
with fixes, but no new features.

Other breaking changes are minimal, and there are a few cool features added.

Breaking changes:

* Minimum supported Python version is now Python 3.7.
* Removed ConstBitArray and BitString class aliases. Use Bits and BitStream instead.
* The cut() method will now also yield the final bits of a bitstring, even if they
  are shorter than the requested cut size.
* Removed default uint interpretation. This wasn't being applied uniformly - default
  is now always to return a bitstring object of the given length and not to interpret
  it as a uint. Bug 220.
* If an overwrite goes beyond the end of the bitstring it will now extend the \ 
bitstring
  rather than raise an exception. Bug 148.

New features and improvements:

* Type hints added throughout the code.
* Underscores are now allowed in strings representing number literals.
* The copy() method now works on Bits as well as BitArray objects.
* The experimental command-line feature is now official. Command-line
  parameters are concatenated and a bitstring created from them. If
  the final parameter is either an interpretation string or ends with
  a '.' followed by an interpretation string then that interpretation
  of the bitstring will be used when printing it.

  $ python -m bitstring int:16=-400
  0xfe70
  $ python -m bitstring float:32=0.2 bin
  00111110010011001100110011001101

* New pp() method that pretty-prints the bitstring in various formats - useful
  especially in interactive sessions. Thanks to Omer Barak for the suggestion
  and discussion.

  >>> s.pp()
    0: 10001000 01110110 10001110 01110110 11111000 01110110 10000111 00101000
   64: 01110010 11111001 10000111 10011000 11110111 10011110 10000111 11111101
  128: 11111001 10001100 01111111 10111100 10111111 11011011 11101011 11111011
  192: 1100
  >>> s.pp('bin, hex')
    0: 10001000 01110110 10001110 01110110 11111000 01110110   88 76 8e 76 f8 76
   48: 10000111 00101000 01110010 11111001 10000111 10011000   87 28 72 f9 87 98
   96: 11110111 10011110 10000111 11111101 11111001 10001100   f7 9e 87 fd f9 8c
  144: 01111111 10111100 10111111 11011011 11101011 11111011   7f bc bf db eb fb
  192: 1100                                                    c

* Shorter and more versatile properties. The bin, oct, hex, float, uint and int
  properties can now be shortened to just their first letter. They can also have
  a length in bits after them - allowing Rust-like data types. ::

      >>> s = BitArray('0x44961000')
      >>> s.h
      '44961000'
      >>> s.f32
      1200.5
      >>> s.u
      1150685184
      >>> s.i7 = -60
      >>> s.b
      '1000100'
      >>> t = Bits('u12=160, u12=120, b=100')

* Other types with bit lengths can also be used as properties ::

      >>> s.floatle64 = 10.511

* A colon is no longer required in format strings before a bit length. So for
  example `Bits('int:15=-101')` could be written as `Bits('int15=-101')`. This is
  now the preferred usage in the documentation except where the colon improves
  readability.

* Support for IEEE 16 bit floats. Floating point types can now be 16 bits long \ 
as well
  as 32 and 64 bits. This is using the 'e' format from the struct module.

* Support for bfloats. This is a specialised 16-bit floating point format
  mostly used in machine learning. It's essentially a truncated IEEE 32-bit
  format that keeps its range but only has a couple of signficant figures of
  accuracy.
   2022-01-04 21:55:40 by Thomas Klausner | Files touched by this commit (1595)
Log message:
*: bump PKGREVISION for egg.mk users

They now have a tool dependency on py-setuptools instead of a DEPENDS