2023-09-28 18:01:24 by Adam Ciarcinski | Files touched by this commit (3) | |
Log message:
py-pandas: updated to 2.1.1
What’s new in 2.1.1 (September 20, 2023)
These are the changes in pandas 2.1.1. See Release notes for a full changelog \
including other versions of pandas.
Fixed regressions
Fixed regression in concat() when DataFrames have two different extension \
dtypes (GH 54848)
Fixed regression in merge() when merging over a PyArrow string index (GH 54894)
Fixed regression in read_csv() when usecols is given and dtype is a dict for \
engine="python" (GH 54868)
Fixed regression in read_csv() when delim_whitespace is True (GH 54918, GH 54931)
Fixed regression in GroupBy.get_group() raising for axis=1 (GH 54858)
Fixed regression in DataFrame.__setitem__() raising AssertionError when setting \
a Series with a partial MultiIndex (GH 54875)
Fixed regression in DataFrame.filter() not respecting the order of elements for \
filter (GH 54980)
Fixed regression in DataFrame.to_sql() not roundtripping datetime columns \
correctly for sqlite (GH 54877)
Fixed regression in DataFrameGroupBy.agg() when aggregating a DataFrame with \
duplicate column names using a dictionary (GH 55006)
Fixed regression in MultiIndex.append() raising when appending overlapping \
IntervalIndex levels (GH 54934)
Fixed regression in Series.drop_duplicates() for PyArrow strings (GH 54904)
Fixed regression in Series.interpolate() raising when fill_value was given (GH 54920)
Fixed regression in Series.value_counts() raising for numeric data if bins was \
specified (GH 54857)
Fixed regression in comparison operations for PyArrow backed columns not \
propagating exceptions correctly (GH 54944)
Fixed regression when comparing a Series with datetime64 dtype with None (GH 54870)
Bug fixes
Fixed bug for ArrowDtype raising NotImplementedError for fixed-size list (GH 55000)
Fixed bug in DataFrame.stack() with future_stack=True and columns a \
non-MultiIndex consisting of tuples (GH 54948)
Fixed bug in Series.dt.tz() with ArrowDtype where a string was returned instead \
of a tzinfo object (GH 55003)
Fixed bug in Series.pct_change() and DataFrame.pct_change() showing unnecessary \
FutureWarning (GH 54981)
Other
Reverted the deprecation that disallowed Series.apply() returning a DataFrame \
when the passed-in callable returns a Series object (GH 52116)
|
2023-09-02 09:19:56 by Adam Ciarcinski | Files touched by this commit (3) | |
Log message:
py-pandas: updated to 2.1.0
https://pandas.pydata.org/docs/whatsnew/v2.1.0.html
|
2023-08-28 12:34:02 by Adam Ciarcinski | Files touched by this commit (4) | |
Log message:
py-pandas: updated to 2.0.3
2.0.3
Fixed regressions
Bug in Timestamp.weekday() was returning incorrect results before '0000-02-29' \
(GH53738)
Fixed performance regression in merging on datetime-like columns (GH53231)
Fixed regression when DataFrame.to_string() creates extra space for string \
dtypes (GH52690)
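The weekday entry above concerns dates in the proleptic Gregorian calendar, in which year 0000 is a leap year. As an illustration only (this is not pandas' implementation), weekday arithmetic that stays valid for such early dates can be sketched in pure Python. The helper names days_from_civil and weekday are hypothetical; the day-count formula is the commonly used civil-calendar algorithm, and Monday is 0 as in Python's datetime.

```python
import datetime


def days_from_civil(year, month, day):
    """Days since 1970-01-01 in the proleptic Gregorian calendar."""
    # Treat January and February as the last two months of the previous
    # year, so that a leap day falls at the very end of the shifted year.
    year -= month <= 2
    era = year // 400                      # floor division also handles year <= 0
    year_of_era = year - era * 400         # always in [0, 399]
    shifted_month = month - 3 if month > 2 else month + 9
    day_of_year = (153 * shifted_month + 2) // 5 + day - 1
    day_of_era = (year_of_era * 365 + year_of_era // 4
                  - year_of_era // 100 + day_of_year)
    return era * 146097 + day_of_era - 719468


def weekday(year, month, day):
    """Weekday with Monday == 0; 1970-01-01 was a Thursday (3)."""
    return (days_from_civil(year, month, day) + 3) % 7


# Cross-check against the standard library where it is defined (year >= 1).
assert weekday(2023, 6, 28) == datetime.date(2023, 6, 28).weekday()
# Year 0 is outside datetime's range, but the same arithmetic still applies.
print(weekday(0, 2, 28), weekday(0, 2, 29), weekday(0, 3, 1))
```

Python's floor division keeps the era computation correct for year 0 and earlier, which is the range where the standard library cannot be used as a reference.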
Bug fixes
Bug in DataFrame.convert_dtypes() and Series.convert_dtypes() when trying to \
convert ArrowDtype with dtype_backend="numpy_nullable" (GH53648)
Bug in RangeIndex.union() when using sort=True with another RangeIndex (GH53490)
Bug in Series.reindex() when expanding a non-nanosecond datetime or timedelta \
Series would not fill with NaT correctly (GH53497)
Bug in read_csv() when defining dtype with bool[pyarrow] for the "c" \
and "python" engines (GH53390)
Bug in Series.str.split() and Series.str.rsplit() with expand=True for \
ArrowDtype with pyarrow.string (GH53532)
Bug in indexing methods (e.g. DataFrame.__getitem__()) where taking the entire \
DataFrame/Series would raise an OverflowError when Copy on Write was enabled and \
the length of the array was over the maximum size a 32-bit integer can hold \
(GH53616)
Bug when constructing a DataFrame with columns of an ArrowDtype with a \
pyarrow.dictionary type that reindexes the data (GH53617)
Bug when indexing a DataFrame or Series with an Index with a timestamp \
ArrowDtype would raise an AttributeError (GH53644)
2.0.2
Fixed regressions
Fixed performance regression in GroupBy.apply() (GH53195)
Fixed regression in merge() on Windows when dtype is np.intc (GH52451)
Fixed regression in read_sql() dropping columns with duplicated column names \
(GH53117)
Fixed regression in DataFrame.loc() losing MultiIndex name when enlarging object \
(GH53053)
Fixed regression in DataFrame.to_string() printing a backslash at the end of the \
first row of data, instead of headers, when the DataFrame doesn’t fit the line \
width (GH53054)
Fixed regression in MultiIndex.join() returning levels in wrong order (GH53093)
Bug fixes
Bug in arrays.ArrowExtensionArray incorrectly assigning dict instead of list for \
.type with pyarrow.map_ and raising a NotImplementedError with pyarrow.struct \
(GH53328)
Bug in api.interchange.from_dataframe() was raising IndexError on empty \
categorical data (GH53077)
Bug in api.interchange.from_dataframe() was returning DataFrames of incorrect \
sizes when called on slices (GH52824)
Bug in api.interchange.from_dataframe() was unnecessarily raising on bitmasks \
(GH49888)
Bug in merge() when merging on datetime columns on different resolutions (GH53200)
Bug in read_csv() raising OverflowError for engine="pyarrow" and \
parse_dates set (GH53295)
Bug in to_datetime() was inferring format to contain "%H" instead of \
"%I" if date contained “AM” / “PM” tokens (GH53147)
Bug in DataFrame.convert_dtypes() ignoring convert_* keywords when set to False \
with dtype_backend="pyarrow" (GH52872)
Bug in DataFrame.convert_dtypes() losing timezone for tz-aware dtypes and \
dtype_backend="pyarrow" (GH53382)
Bug in DataFrame.sort_values() raising for PyArrow dictionary dtype (GH53232)
Bug in Series.describe() treating pyarrow-backed timestamps and timedeltas as \
categorical data (GH53001)
Bug in Series.rename() not making a lazy copy when Copy-on-Write is enabled when \
a scalar is passed to it (GH52450)
Bug in pd.array() raising for NumPy array and pa.large_string or pa.large_binary \
(GH52590)
Bug in DataFrame.__getitem__() not preserving dtypes for MultiIndex partial keys \
(GH51895)
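The to_datetime() entry above (GH53147) turns on the difference between the "%H" and "%I" format codes. A minimal sketch using only the standard library's datetime.strptime (not pandas' format inference, and with a made-up sample string) shows why inferring "%H" for a value containing an AM/PM token gives the wrong hour:

```python
from datetime import datetime

stamp = "2023-05-17 01:30 PM"  # made-up sample value

# "%I" is the 12-hour clock, so the "%p" token shifts the hour to the afternoon.
with_12_hour = datetime.strptime(stamp, "%Y-%m-%d %I:%M %p")

# "%H" is the 24-hour clock; strptime still accepts "%p", but it does not
# change the parsed hour, so the afternoon time is read as an early-morning one.
with_24_hour = datetime.strptime(stamp, "%Y-%m-%d %H:%M %p")

print(with_12_hour.hour, with_24_hour.hour)
```

The two parsed hours differ by twelve even though both format strings accept the input without error, which is why this kind of mistake is easy to miss.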
2.0.1
Fixed regressions
Fixed regression for subclassed Series when constructing from a dictionary (GH52445)
Fixed regression in SeriesGroupBy.agg() failing when grouping with categorical \
data, multiple groupings, as_index=False, and a list of aggregations (GH52760)
Fixed regression in DataFrame.pivot() changing Index name of input object (GH52629)
Fixed regression in DataFrame.resample() raising on a DataFrame with no columns \
(GH52484)
Fixed regression in DataFrame.sort_values() not resetting index when DataFrame \
is already sorted and ignore_index=True (GH52553)
Fixed regression in MultiIndex.isin() raising TypeError for Generator (GH52568)
Fixed regression in Series.describe() showing RuntimeWarning for extension dtype \
Series with one element (GH52515)
Fixed regression when adding a new column to a DataFrame when the \
DataFrame.columns was a RangeIndex and the new key was hashable but not a scalar \
(GH52652)
Bug fixes
Bug in Series.dt.days that would overflow int32 number of days (GH52391)
Bug in arrays.DatetimeArray constructor returning an incorrect unit when passed \
a non-nanosecond numpy datetime array (GH52555)
Bug in ArrowExtensionArray with duration dtype overflowing when constructed from \
data containing numpy NaT (GH52843)
Bug in Series.dt.round() when passing a freq of equal or higher resolution \
compared to the Series would raise a ZeroDivisionError (GH52761)
Bug in Series.median() with ArrowDtype returning an approximate median (GH52679)
Bug in api.interchange.from_dataframe() was unnecessarily raising on categorical \
dtypes (GH49889)
Bug in api.interchange.from_dataframe() was unnecessarily raising on large \
string dtypes (GH52795)
Bug in pandas.testing.assert_series_equal() where check_dtype=False would still \
raise for datetime or timedelta types with different resolutions (GH52449)
Bug in read_csv() casting PyArrow datetimes to NumPy when \
dtype_backend="pyarrow" and parse_dates is set causing a performance \
bottleneck in the process (GH52546)
Bug in to_datetime() and to_timedelta() when trying to convert numeric data with \
an ArrowDtype (GH52425)
Bug in to_numeric() with errors='coerce' and dtype_backend='pyarrow' with \
ArrowDtype data (GH52588)
Bug in ArrowDtype.__from_arrow__() not respecting if dtype is explicitly given \
(GH52533)
Bug in DataFrame.describe() not respecting ArrowDtype in include and exclude \
(GH52570)
Bug in DataFrame.max() and related casting different Timestamp resolutions \
always to nanoseconds (GH52524)
Bug in Series.describe() not returning ArrowDtype with pyarrow.float64 type with \
numeric data (GH52427)
Bug in Series.dt.tz_localize() incorrectly localizing timestamps with ArrowDtype \
(GH52677)
Bug in arithmetic between np.datetime64 and np.timedelta64 NaT scalars with \
units always returning nanosecond resolution (GH52295)
Bug in logical and comparison operations between ArrowDtype and numpy masked \
types (e.g. "boolean") (GH52625)
Fixed bug in merge() when merging with ArrowDtype on one side and a NumPy dtype on \
the other side (GH52406)
Fixed segfault in Series.to_numpy() with null[pyarrow] dtype (GH52443)
Other
DataFrame created from empty dicts had columns of dtype object. It is now a \
RangeIndex (GH52404)
Series created from empty dicts had index of dtype object. It is now a \
RangeIndex (GH52404)
Implemented Series.str.split() and Series.str.rsplit() for ArrowDtype with \
pyarrow.string (GH52401)
Implemented most str accessor methods for ArrowDtype with pyarrow.string (GH52401)
Supplying a non-integer hashable key that tests False in api.types.is_scalar() \
now raises a KeyError for RangeIndex.get_loc(), like it does for \
Index.get_loc(). Previously it raised an InvalidIndexError (GH52652).
|
2023-08-02 01:20:57 by Thomas Klausner | Files touched by this commit (158) |
Log message:
*: remove more references to Python 3.7
|
2023-07-01 10:37:47 by Thomas Klausner | Files touched by this commit (105) | |
Log message:
*: restrict py-numpy users to 3.9+ in preparation for update
|
2023-04-25 15:51:50 by Jonathan Perkin | Files touched by this commit (9) |
Log message:
*: GCC_REQD must always be appended to.
|
2023-01-28 20:47:55 by Havard Eidnes | Files touched by this commit (3) |
Log message:
math/py-pandas: note upstream pull request, and remove .orig file.
The .orig file would otherwise be installed, causing a PLIST
mismatch.
|
2023-01-28 18:58:05 by Havard Eidnes | Files touched by this commit (2) |
Log message:
math/py-pandas: fix aggregations.pyx so that this builds on NetBSD.
On NetBSD, signbit and sqrt cannot be imported from "libc", because
even though <math.h> is included, Python.h later includes
<cmath> which in turn #undef's all the relevant definitions
from <math.h>, forcing the use of the std:: variants of those
functions. So ... re-do how we import signbit() and sqrt().
|
2023-01-25 15:05:16 by Adam Ciarcinski | Files touched by this commit (2) | |
Log message:
py-pandas: updated to 1.5.3
What's new in 1.5.3 (January 18, 2023)
--------------------------------------
These are the changes in pandas 1.5.3. See :ref:`release` for a full changelog
including other versions of pandas.
Fixed regressions
~~~~~~~~~~~~~~~~~
- Fixed performance regression in :meth:`Series.isin` when ``values`` is empty \
(:issue:`49839`)
- Fixed regression in :meth:`DataFrame.memory_usage` showing unnecessary \
``FutureWarning`` when :class:`DataFrame` is empty (:issue:`50066`)
- Fixed regression in :meth:`.DataFrameGroupBy.transform` when used with \
``as_index=False`` (:issue:`49834`)
- Enforced reversion of ``color`` as an alias for ``c`` and ``size`` as an alias \
for ``s`` in function :meth:`DataFrame.plot.scatter` (:issue:`49732`)
- Fixed regression in :meth:`.SeriesGroupBy.apply` setting a ``name`` attribute \
on the result if the result was a :class:`DataFrame` (:issue:`49907`)
- Fixed performance regression in setting with the :meth:`~DataFrame.at` indexer \
(:issue:`49771`)
- Fixed regression in the methods ``apply``, ``agg``, and ``transform`` when \
used with NumPy functions that informed users to supply ``numeric_only=True`` if \
the operation failed on non-numeric dtypes; such columns must be dropped prior \
to using these methods (:issue:`50538`)
- Fixed regression in :func:`to_datetime` raising ``ValueError`` when parsing \
array of ``float`` containing ``np.nan`` (:issue:`50237`)
Bug fixes
~~~~~~~~~
- Bug in the Copy-on-Write implementation losing track of views when indexing a \
:class:`DataFrame` with another :class:`DataFrame` (:issue:`50630`)
- Bug in :meth:`.Styler.to_excel` leading to error when unrecognized \
``border-style`` (e.g. ``"hair"``) provided to Excel writers \
(:issue:`48649`)
- Bug in :meth:`Series.quantile` emitting warning from NumPy when \
:class:`Series` has only ``NA`` values (:issue:`50681`)
- Bug when chaining several :meth:`.Styler.concat` calls, only the last styler \
was concatenated (:issue:`49207`)
- Fixed bug when instantiating a :class:`DataFrame` subclass inheriting from \
``typing.Generic`` that triggered a ``UserWarning`` on Python 3.11 \
(:issue:`49649`)
- Bug in :func:`pivot_table` with NumPy 1.24 or greater when the \
:class:`DataFrame` columns have nested elements (:issue:`50342`)
- Bug in :func:`pandas.testing.assert_series_equal` (and equivalent ``assert_`` \
functions) when having nested data and using numpy >= 1.25 (:issue:`50360`)
Other
~~~~~
If you are using :meth:`DataFrame.to_sql`, :func:`read_sql`, \
:func:`read_sql_table`, or :func:`read_sql_query` with SQLAlchemy 1.4.46 or \
greater,
you may see a ``sqlalchemy.exc.RemovedIn20Warning``. These warnings can be \
safely ignored for the SQLAlchemy 1.4.x releases
as pandas works toward compatibility with SQLAlchemy 2.0.
- Reverted deprecation (:issue:`45324`) of behavior of \
:meth:`Series.__getitem__` and :meth:`Series.__setitem__` slicing with an \
integer :class:`Index`; this will remain positional (:issue:`49612`)
- A ``FutureWarning`` raised when attempting to set values inplace with \
:meth:`DataFrame.loc` or :meth:`DataFrame.iloc` has been changed to a \
``DeprecationWarning`` (:issue:`48673`)
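The notes above mention one warning that can be ignored and another that was downgraded from ``FutureWarning`` to ``DeprecationWarning``. Python's default filters show ``FutureWarning`` but hide ``DeprecationWarning`` unless it is triggered by code in ``__main__``, so the downgraded warning is easy to miss. A minimal sketch with the standard ``warnings`` module shows one way to silence a harmless category while escalating deprecations. ``StandInRemovedWarning`` and ``legacy_call`` are hypothetical stand-ins, not pandas or SQLAlchemy names.

```python
import warnings


class StandInRemovedWarning(Warning):
    """Hypothetical stand-in for a third-party warning category."""


def legacy_call():
    # Hypothetical function that emits both kinds of warning.
    warnings.warn("stand-in third-party notice", StandInRemovedWarning)
    warnings.warn("stand-in in-place set notice", DeprecationWarning)
    return "done"


with warnings.catch_warnings():
    # Silence one category that is known to be harmless.
    warnings.simplefilter("ignore", StandInRemovedWarning)
    # Turn DeprecationWarning into an exception so it cannot go unnoticed.
    warnings.simplefilter("error", DeprecationWarning)
    try:
        legacy_call()
        outcome = "no error"
    except DeprecationWarning as exc:
        outcome = f"raised: {exc}"

print(outcome)
```

Scoping the filters with ``catch_warnings()`` keeps the change local, so the rest of the program keeps its normal warning behaviour.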
|
2022-12-05 23:42:54 by Adam Ciarcinski | Files touched by this commit (1) |
Log message:
py-pandas: needs C++ and GCC >= 8
|