./textproc/R-stringr, Simple, consistent wrappers for common string operations



Branch: CURRENT, Version: 1.5.0, Package name: R-stringr-1.5.0, Maintainer: pkgsrc-users

stringr is a set of simple wrappers that make R's string functions
more consistent, simpler and easier to use. It does this by ensuring
that: function and argument names (and positions) are consistent, all
functions deal with NAs and zero-length character vectors appropriately,
and the output data structures from each function match the input data
structures of other functions.


Required to run:
[math/R] [textproc/R-stringi] [devel/R-magrittr] [devel/R-glue]

Required to build:
[pkgtools/cwrappers]


CVS history:


   2023-03-12 09:54:41 by Makoto Fujiwara | Files touched by this commit (2)
Log message:
(textproc/R-stringr) Updated 1.4.0 to 1.5.0

# stringr 1.5.0

## Breaking changes

* stringr functions now consistently implement the tidyverse recycling rules
  (#372). There are two main changes:

    *  Only vectors of length 1 are recycled. Previously, (e.g.)
       `str_detect(letters, c("x", "y"))` worked, but it now errors.

    *  `str_c()` ignores `NULL`s, rather than treating them as length 0
        vectors.

    Additionally, many more arguments now throw errors, rather than warnings,
    if supplied the wrong type of input.

* `regex()` and friends now generate class names with `stringr_` prefix (#384).

* `str_detect()`, `str_starts()`, `str_ends()` and `str_subset()` now error
  when used with either an empty string (`""`) or a `boundary()`. These
  operations didn't really make sense (`str_detect(x, "")` returned `TRUE`
  for all non-empty strings) and made it easy to make mistakes when programming.
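
A minimal sketch of the breaking changes above, assuming stringr 1.5.0 or later is installed. The input strings and variable names are illustrative and do not come from the release notes; `tryCatch()` is used only to turn the new errors into flags that can be inspected.

```r
library(stringr)

# A length-1 pattern is still recycled against every element of `string`.
hits <- str_detect(c("apple", "banana", "cherry"), "an")

# A length-2 pattern against the length-26 `letters` vector is no longer
# recycled; the call errors, and tryCatch() records that as TRUE.
recycle_failed <- tryCatch(
  {
    str_detect(letters, c("x", "y"))
    FALSE
  },
  error = function(e) TRUE
)

# str_c() ignores NULL arguments.
joined <- str_c("a", NULL, "b")

# An empty pattern is now an error for str_detect().
empty_failed <- tryCatch(
  {
    str_detect("abc", "")
    FALSE
  },
  error = function(e) TRUE
)
```

Code that relied on the old recycling of longer patterns has to be made explicit, for example by looping over the patterns.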

## New features

* Many tweaks to the documentation to make it more useful and consistent.

* New `vignette("from-base")` by @sastoudt provides a comprehensive comparison
  between base R functions and their stringr equivalents. It's designed to
  help you move to stringr if you're already familiar with base R string
  functions (#266).

* New `str_escape()` escapes regular expression metacharacters, providing
  an alternative to `fixed()` if you want to compose a pattern from user
  supplied strings (#408).

* New `str_equal()` compares two character vectors using Unicode rules,
  optionally ignoring case (#381).

* `str_extract()` can now optionally extract a capturing group instead of
  the complete match (#420).

* New `str_flatten_comma()` is a special case of `str_flatten()` designed for
  comma-separated flattening; it handles the Oxford comma correctly
  when there are only two elements (#444).

* New `str_split_1()` is tailored for the special case of splitting up a single
  string (#409).

* New `str_split_i()` extracts a single piece from a string (#278, @bfgray3).

* New `str_like()` allows the use of SQL wildcards (#280, @rjpat).

* New `str_rank()` to complete the set of order/rank/sort functions (#353).

* New `str_sub_all()` to extract multiple substrings from each string.

* New `str_unique()` is a wrapper around `stri_unique()` and returns unique
  string values in a character vector (#249, @seasmith).

* `str_view()` uses ANSI colouring rather than an HTML widget (#370). This
  works in more places and requires fewer dependencies. It includes a number
  of other small improvements:

    * It no longer requires a pattern so you can use it to display strings with
      special characters.
    * It highlights unusual whitespace characters.
    * It's vectorised over both `string` and `pattern` (#407).
    * It defaults to displaying all matches, making `str_view_all()` redundant
      (and hence deprecated) (#455).

* New `str_width()` returns the display width of a string (#380).

* stringr is now licensed as MIT (#351).
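
A short sketch of several of the new functions listed above, assuming stringr 1.5.0 or later. All inputs and variable names here are made up for illustration; only the function names and the `group`, `last` and `ignore_case` arguments come from the stringr API.

```r
library(stringr)

# str_escape() backslash-escapes regex metacharacters, so the "." in the
# pattern only matches a literal dot.
literal <- str_escape("a.b")
escaped_hits <- str_detect(c("a.b", "axb"), literal)

# group = 2 returns the second capturing group instead of the whole match.
value <- str_extract("key=value", "(\\w+)=(\\w+)", group = 2)

# With an Oxford-comma style `last`, the comma is dropped when there are
# only two elements.
three <- str_flatten_comma(c("a", "b", "c"), last = ", and ")
two <- str_flatten_comma(c("a", "b"), last = ", and ")

# str_split_1() returns a plain character vector for a single string;
# str_split_i() returns only the i-th piece.
pieces <- str_split_1("a-b-c", "-")
second <- str_split_i("a-b-c", "-", 2)

# SQL-style wildcard: % matches any run of characters.
like_hit <- str_like("apple", "app%")

# Case-insensitive comparison and de-duplication.
same <- str_equal("A", "a", ignore_case = TRUE)
uniq <- str_unique(c("a", "b", "a"))
```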

## Minor improvements and bug fixes

* Better error message if you supply a non-string pattern (#378).

* A new data source for `sentences` has fixed many small errors.

* `str_extract()` and `str_extract_all()` now work correctly when `pattern`
  is a `boundary()`.

* `str_flatten()` gains a `last` argument that optionally overrides the
  final separator (#377). It gains a `na.rm` argument to remove missing
  values (since it's a summary function) (#439).

* `str_pad()` gains a `use_width` argument to control whether to use the total
  code point width or the number of code points as the "width" of a string
  (#190).

* `str_replace()` and `str_replace_all()` can use standard tidyverse formula
  shorthand for `replacement` function (#331).

* `str_starts()` and `str_ends()` now correctly respect regex operator
  precedence (@carlganz).

* `str_wrap()` breaks only at whitespace by default; set
  `whitespace_only = FALSE` to return to the previous behaviour (#335, @rjpat).

* `word()` now returns the whole sentence when using a negative `start`
  parameter that is, in absolute value, greater than or equal to the number
  of words (@pdelboca, #245).
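
A brief sketch of two of the improvements above, assuming stringr 1.5.0 or later: the `last` and `na.rm` arguments of `str_flatten()`, and the formula shorthand for `replacement`. The inputs are illustrative, and `.x` is the usual tidyverse placeholder for the matched text.

```r
library(stringr)

# `last` replaces the separator before the final element.
listed <- str_flatten(c("a", "b", "c"), collapse = ", ", last = " and ")

# na.rm = TRUE drops missing values before flattening.
no_na <- str_flatten(c("a", NA, "b"), na.rm = TRUE)

# Formula shorthand for a replacement function: each match is upper-cased.
upper <- str_replace_all("abc", "[ab]", ~ toupper(.x))
```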

# stringr 1.4.1

Hot patch release to resolve R CMD check failures.
   2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161)
Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip
   2021-10-07 17:02:49 by Nia Alarie | Files touched by this commit (1162)
Log message:
textproc: Remove SHA1 hashes for distfiles
   2021-06-05 09:32:59 by Makoto Fujiwara | Files touched by this commit (1)
Log message:
(textproc/R-stringr) Add TEST_DEPENDS+= lines
   2019-08-08 21:53:58 by Brook Milligan | Files touched by this commit (189) | Package updated
Log message:
Update all R packages to canonical form.

The canonical form [1] of an R package Makefile includes the
following:

- The first stanza includes R_PKGNAME, R_PKGVER, PKGREVISION (as
  needed), and CATEGORIES.

- HOMEPAGE is not present but defined in math/R/Makefile.extension to
  refer to the CRAN web page describing the package.  Other relevant
  web pages are often linked from there via the URL field.

This updates all current R packages to this form, which will make
regular updates _much_ easier, especially using pkgtools/R2pkg.

[1] http://mail-index.netbsd.org/tech-pkg/2019/08/02/msg021711.html
   2018-07-28 16:40:53 by Brook Milligan | Files touched by this commit (126)
Log message:
Remove MASTER_SITES= from individual R package Makefiles.

Each R package should include ../../math/R/Makefile.extension, which also
defines MASTER_SITES.  Consequently, it is redundant for the individual
packages to do the same.  Package-specific definitions also prevent
redefining MASTER_SITES in a single common place.
   2018-02-05 19:58:57 by Min Sik Kim | Files touched by this commit (2) | Package updated
Log message:
textproc/R-stringr: Update to 1.2.0

Notable changes:
- str_match_all() now returns NA if an optional group doesn't match
  (previously it returned ""). This is more consistent with
  str_match() and other match failures.
- In str_replace(), replacement can now be a function that is called
  once for each match and whose return value is used to replace the
  match.
- New str_which() mimics grep().
- A new vignette (vignette("regular-expressions")) describes the
  details of the regular expressions supported by stringr. The main
  vignette (vignette("stringr")) has been updated to give a high-level
  overview of the package.
   2016-12-18 03:33:18 by Wen Heping | Files touched by this commit (2)
Log message:
Update to 1.1.0

Upstream changes:
stringr 1.1.0

    Add sample datasets: fruit, words and sentences.

    fixed(), regex(), and coll() now throw an error if you use them with
    anything other than a plain string (#60). I've clarified that the
    replacement for perl() is regex() not regexp() (#61). boundary() has
    improved defaults when splitting on non-word boundaries (#58, @lmullen).

    str_detect() now can detect boundaries (by checking for a
    str_count() > 0) (#120). str_subset() works similarly.

    str_extract() and str_extract_all() now work with boundary(). This is
    particularly useful if you want to extract logical constructs like words
    or sentences. str_extract_all() respects the simplify argument when used
    with fixed() matches.

    str_subset() now respects custom options for fixed() patterns (#79, @gagolews).

    str_replace() and str_replace_all() now behave correctly when a
    replacement string contains $s, \\\\1, etc. (#83, #99).

    str_split() gains a simplify argument to match str_extract_all() etc.

    str_view() and str_view_all() create HTML widgets that display regular
    expression matches (#96).

    word() returns NA for indexes greater than number of words (#112).