pkgsrc.se | The NetBSD package collection

Subject: CVS commit: pkgsrc/textproc/R-haven
From: Makoto Fujiwara
Date: 2023-06-17 15:29:43
Message id: 20230617132943.A26ECFA89@cvs.NetBSD.org
Log Message:
(textproc/R-haven) Updated 2.4.3 to 2.5.2

# haven 2.5.2

* Updated to ReadStat 1.1.9.

  * Fix various SAS catalog file reading bugs (#529, #653, #680, #696, #705).
  * Increase maximum SAS page file size to 16MiB (#697).
  * Ignore invalid SAV timestamp strings (#683).
  * Fix compiler warnings (#707).

* The experimental `write_sas()` function has been deprecated (#224). The
  sas7bdat file format is complex and undocumented, and as such writing SAS
  files is not officially supported by ReadStat. `write_xpt()` should be used
  instead - it produces files in the SAS transport format, which has
  limitations but will be reliably read by SAS.

* `write_*()` functions gain a new `adjust_tz` argument to allow more control
  over time zone conversion for date-time variables (#702). Thanks to @jmobrien
  for the detailed issue and feedback.

  Stata, SPSS and SAS do not have a concept of time zone. Since haven 2.4.0
  date-time values in non-UTC time zones are implicitly converted when writing
  to ensure the time displayed in Stata/SPSS/SAS will match the time displayed
  to the user in R (see #555). This is the behaviour when `adjust_tz = TRUE`
  (the default). Although this is in line with general user expectations it can
  cause issues when the time zone is important, for e.g. when looking at
  differences between time points, since the underlying numeric data is changed
  to preserve the displayed time. Use `adjust_tz = FALSE` to write the time as
  the corresponding UTC value, which will appear different to the user but
  preserves the underlying numeric data.

* `write_*()` functions previously returned the data frame with minor
  alterations made to date-time variables. These functions now invisibly return
  the original input data frame unchanged (@jmobrien, #702).

* Fix bug in string variable width calculation that treated `NA` values as width
  2. `NA` values are now treated as blanks for width calculations (#699).

# haven 2.5.1

* All `labelled()` vectors now have left-aligned column headers when printing
  in tibbles for better alignment with labels (#676).

* `write_*()` now accept functions as well as strings in the
  `.name_repair` argument in line with the documentation. Previously they only
  supported string values (#684).

* `write_sav()` variable name validation no longer treats all non-ASCII
  characters as invalid (#689).

# haven 2.5.0

## New author

* @gorcha is now a haven author in recognition of his significant and sustained
  contributions.

## File writing improvements

* All `write_` functions can now write custom variable widths by setting the
  `width` attribute (#650).

* When writing files, the minimum width for character variables is now 1. This
  fixes issues with statistical software reading blank character variables with
  width 0 (#650).

* `write_dta()` now uses strL when strings are too long to be stored in an str#
  variable (#437). strL is used when strings are longer than 2045 characters by
  default, which matches Stata's behaviour, but this can be reduced with the
  `strl_threshold` argument.

* `write_xpt()` can now write dataset labels with the `label` argument,  which
  defaults to the `label` attribute of the input data frame, if present (#562).

* `write_sav()` now checks for case-insensitive duplicate variable names
  (@juansebastianl, #641) and verifies that variable names are valid SPSS
  variables.

* The `compress` argument for `write_sav()` now supports all 3 SPSS compression
  modes specified as a character string - "byte", "none" and \ 
"zsav" (#614).
  `TRUE` and `FALSE` can be used for backwards compatibility, and correspond to
  the "zsav" and "none" options respectively.

* `write_sav()` successfully writes user missing values and ranges for
  `labelled()` integer vectors (#596).

* POSIXct and POSIXlt values with no time component (e.g. \ 
"2010-01-01") were
  being converted to `NA` when attempting to convert the output timezone to UTC.
  These now output successfully (#634).

* Fix bug in output timezone conversion that was causing variable labels and
  other variable attributes to disappear (#624).

## Other improvements and fixes

* Updated to ReadStat 1.1.8 RC.

  * Fix bug when writing formats to XPT files (#650).
  * Fix off by one error in indexing for strL variables (#437).

* `labelled()` vectors now throw a warning when combining two vectors with
  conflicting labels (#667).

* `zap_labels()` gains a `user_na` argument to control whether user-defined
  missing values are converted to `NA` or left as is (#638).

* vctrs casting and coercion generics now do less work when working with two
  identical `labelled()` vectors. This significantly improves performance when
  working with `labelled()` vectors in grouped data frames (#658).

* Errors and warnings now use `cli_abort()` and `cli_warning()` (#661).

## Dependency changes

* R 3.4 is now the minimum supported version, in line with [tidyverse  \ 
policy](https://www.tidyverse.org/blog/2019/04/r-version-support/).

* cli >= 3.0.0 has been added to Imports to support new error messaging.

* lifecycle has been added to Imports, and is now used to manage deprecations.
Files:
Revision	Action	file
1.4	modify	pkgsrc/textproc/R-haven/Makefile
1.5	modify	pkgsrc/textproc/R-haven/distinfo