more consistent, simpler and easier to use. It does this by ensuring
2023-03-12 09:54:41 by Makoto Fujiwara | Files touched by this commit (2) |
Log message:
(textproc/R-stringr) Updated 1.4.0 to 1.5.0
# stringr 1.5.0
## Breaking changes
* stringr functions now consistently implement the tidyverse recycling rules
  (#372). There are two main changes:

    * Only vectors of length 1 are recycled. Previously, (e.g.)
      `str_detect(letters, c("x", "y"))` worked, but it now errors.

    * `str_c()` ignores `NULL`s, rather than treating them as length 0
      vectors.

  Additionally, many more arguments now throw errors, rather than warnings,
  if supplied the wrong type of input.
* `regex()` and friends now generate class names with `stringr_` prefix (#384).
* `str_detect()`, `str_starts()`, `str_ends()` and `str_subset()` now error
when used with either an empty string (`""`) or a `boundary()`. These
operations didn't really make sense (`str_detect(x, "")` returned `TRUE`
for all non-empty strings) and made it easy to make mistakes when programming.
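
The breaking changes above can be checked with a short R session. This is a minimal sketch, assuming stringr 1.5.0 or later is installed; the variable names are illustrative only, and `tryCatch()` is used just to turn each expected error into a flag.

```r
library(stringr)

# A length-1 pattern is still recycled across every element of the input.
single <- str_detect(c("apple", "box"), "x")

# A length-2 pattern against the 26 letters is no longer recycled.
# tryCatch() converts the resulting error into a flag we can inspect.
mismatch_errors <- tryCatch(
  {
    str_detect(letters, c("x", "y"))
    FALSE
  },
  error = function(e) TRUE
)

# str_c() skips NULL inputs instead of treating them as zero-length vectors.
joined <- str_c("a", NULL, "b")

# An empty pattern is now rejected by str_detect().
empty_errors <- tryCatch(
  {
    str_detect("abc", "")
    FALSE
  },
  error = function(e) TRUE
)

print(list(single, mismatch_errors, joined, empty_errors))
```

Code that relied on the old recycling can usually be adapted by making the lengths match explicitly or by looping over the patterns.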
## New features
* Many tweaks to the documentation to make it more useful and consistent.
* New `vignette("from-base")` by @sastoudt provides a comprehensive comparison
between base R functions and their stringr equivalents. It's designed to
help you move to stringr if you're already familiar with base R string
functions (#266).
* New `str_escape()` escapes regular expression metacharacters, providing
an alternative to `fixed()` if you want to compose a pattern from user
supplied strings (#408).
* New `str_equal()` compares two character vectors using unicode rules,
optionally ignoring case (#381).
* `str_extract()` can now optionally extract a capturing group instead of
the complete match (#420).
* New `str_flatten_comma()` is a special case of `str_flatten()` designed for
comma separated flattening and can correctly apply the Oxford commas
when there are only two elements (#444).
* New `str_split_1()` is tailored for the special case of splitting up a single
string (#409).
* New `str_split_i()` extracts a single piece from a string (#278, @bfgray3).
* New `str_like()` allows the use of SQL wildcards (#280, @rjpat).
* New `str_rank()` to complete the set of order/rank/sort functions (#353).
* New `str_sub_all()` to extract multiple substrings from each string.
* New `str_unique()` is a wrapper around `stri_unique()` and returns unique
string values in a character vector (#249, @seasmith).
* `str_view()` uses ANSI colouring rather than an HTML widget (#370). This
  works in more places and requires fewer dependencies. It includes a number
  of other small improvements:

    * It no longer requires a pattern so you can use it to display strings with
      special characters.
    * It highlights unusual whitespace characters.
    * It's vectorised over both `string` and `pattern` (#407).
    * It defaults to displaying all matches, making `str_view_all()` redundant
      (and hence deprecated) (#455).

* New `str_width()` returns the display width of a string (#380).
* stringr is now licensed as MIT (#351).
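
A few of the new functions listed above, shown in a short R session. This is a minimal sketch, assuming stringr 1.5.0 or later is installed; the input strings and variable names are made up for illustration.

```r
library(stringr)

# str_escape() makes regex metacharacters literal, so "." no longer matches "x".
literal_hit  <- str_detect("a.b", str_escape("a.b"))
literal_miss <- str_detect("axb", str_escape("a.b"))

# str_equal() compares two character vectors, optionally ignoring case.
same_ignoring_case <- str_equal("a", "A", ignore_case = TRUE)

# str_extract() can return one capturing group instead of the whole match.
value <- str_extract("key=value", "(\\w+)=(\\w+)", group = 2)

# str_flatten_comma() drops the comma before the final separator when there
# are only two elements.
three <- str_flatten_comma(c("a", "b", "c"), last = ", and ")
two   <- str_flatten_comma(c("a", "b"), last = ", and ")

# str_split_1() splits a single string and returns a plain character vector;
# str_split_i() returns only the i-th piece.
pieces <- str_split_1("a-b-c", "-")
second <- str_split_i("a-b-c", "-", 2)

print(list(value, three, two, pieces, second))
```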
## Minor improvements and bug fixes
* Better error message if you supply a non-string pattern (#378).
* A new data source for `sentences` has fixed many small errors.
* `str_extract()` and `str_extract_all()` now work correctly when `pattern`
is a `boundary()`.
* `str_flatten()` gains a `last` argument that optionally overrides the
final separator (#377). It also gains an `na.rm` argument to remove missing
values (since it's a summary function) (#439).
* `str_pad()` gains a `use_width` argument to control whether to use the total
code point width or the number of code points as the "width" of a string (#190).
* `str_replace()` and `str_replace_all()` can use standard tidyverse formula
shorthand for `replacement` function (#331).
* `str_starts()` and `str_ends()` now correctly respect regex operator
precedence (@carlganz).
* `str_wrap()` breaks only at whitespace by default; set
`whitespace_only = FALSE` to return to the previous behaviour (#335, @rjpat).
* `word()` now returns the whole sentence when using a negative `start` parameter
that is greater than or equal to the number of words (@pdelboca, #245).
# stringr 1.4.1
Hot patch release to resolve R CMD check failures.
|
2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161) |
Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums
All checksums have been double-checked against existing RMD160 and
SHA512 hashes
Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip
|
2021-10-07 17:02:49 by Nia Alarie | Files touched by this commit (1162) |
Log message:
textproc: Remove SHA1 hashes for distfiles
|
2021-06-05 09:32:59 by Makoto Fujiwara | Files touched by this commit (1) |
Log message:
(textproc/R-stringr) Add TEST_DEPENDS+= lines
|
2019-08-08 21:53:58 by Brook Milligan | Files touched by this commit (189) |
Log message:
Update all R packages to canonical form.
The canonical form [1] of an R package Makefile includes the
following:
- The first stanza includes R_PKGNAME, R_PKGVER, PKGREVISION (as
needed), and CATEGORIES.
- HOMEPAGE is not present but defined in math/R/Makefile.extension to
refer to the CRAN web page describing the package. Other relevant
web pages are often linked from there via the URL field.
This updates all current R packages to this form, which will make
regular updates _much_ easier, especially using pkgtools/R2pkg.
[1] http://mail-index.netbsd.org/tech-pkg/2019/08/02/msg021711.html
|
2018-07-28 16:40:53 by Brook Milligan | Files touched by this commit (126) |
Log message:
Remove MASTER_SITES= from individual R package Makefiles.
Each R package should include ../../math/R/Makefile.extension, which also
defines MASTER_SITES. Consequently, it is redundant for the individual
packages to do the same. Package-specific definitions also prevent
redefining MASTER_SITES in a single common place.
|
2018-02-05 19:58:57 by Min Sik Kim | Files touched by this commit (2) |
Log message:
textproc/R-stringr: Update to 1.2.0
Notable changes:
- str_match_all() now returns NA if an optional group doesn't match
(previously it returned ""). This is more consistent with
str_match() and other match failures.
- In str_replace(), replacement can now be a function that is called
  once for each match and whose return value is used to replace the
match.
- New str_which() mimics grep().
- A new vignette (vignette("regular-expressions")) describes the
details of the regular expressions supported by stringr. The main
vignette (vignette("stringr")) has been updated to give a high-level
overview of the package.
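
Two of the changes above, sketched in R. This assumes a stringr version that includes these changes is installed; the inputs and variable names are illustrative only.

```r
library(stringr)

# The replacement can be a function; here each match is passed to toupper().
shouted <- str_replace_all("abc", "[ab]", toupper)

# str_which() returns the positions of matching elements, like grep().
hits <- str_which(c("a", "b", "ab"), "b")

print(list(shouted, hits))
```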
|
2016-12-18 03:33:18 by Wen Heping | Files touched by this commit (2) |
Log message:
Update to 1.1.0
Upstream changes:
stringr 1.1.0
* Add sample datasets: fruit, words and sentences.
* fixed(), regex(), and coll() now throw an error if you use them with
  anything other than a plain string (#60). I've clarified that the
  replacement for perl() is regex() not regexp() (#61). boundary() has
  improved defaults when splitting on non-word boundaries (#58, @lmullen).
* str_detect() now can detect boundaries (by checking for a str_count() > 0)
  (#120). str_subset() works similarly.
* str_extract() and str_extract_all() now work with boundary(). This is
  particularly useful if you want to extract logical constructs like words or
  sentences. str_extract_all() respects the simplify argument when used with
  fixed() matches.
* str_subset() now respects custom options for fixed() patterns (#79, @gagolews).
* str_replace() and str_replace_all() now behave correctly when a replacement
  string contains $s, \\\\1, etc. (#83, #99).
* str_split() gains a simplify argument to match str_extract_all() etc.
* str_view() and str_view_all() create HTML widgets that display regular
  expression matches (#96).
* word() returns NA for indexes greater than number of words (#112).
|