./www/py-soupsieve, CSS4 selector implementation for Beautiful Soup

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 2.5nb1, Package name: py311-soupsieve-2.5nb1, Maintainer: pkgsrc-users

Soup Sieve is a CSS selector library designed to be used with Beautiful Soup 4.
It aims to provide selecting, matching, and filtering using modern CSS
selectors. Soup Sieve currently provides selectors from the CSS level 1
specifications up through the latest CSS level 4 drafts (though some are not
yet implemented).

Soup Sieve was written with the intent to replace Beautiful Soup's builtin
select feature, and as of Beautiful Soup version 4.7.0, it now is. Soup Sieve
can also be imported in order to use its API directly for more controlled,
specialized parsing.

Soup Sieve has implemented most of the CSS selectors up through the level 4
drafts, though there are a number that don't make sense in a non-browser
environment. Selectors that cannot provide meaningful functionality simply do
not match anything.


Required to run:
[devel/py-setuptools] [lang/python37]

Required to build:
[pkgtools/cwrappers]

Master sites:

Filesize: 98.577 KB

Version history: (Expand)


CVS history: (Expand)


   2023-10-28 21:57:26 by Thomas Klausner | Files touched by this commit (516) | Package updated
Log message:
python/wheel.mk: simplify a lot, and switch to 'installer' for installation

This follows the recommended bootstrap method (flit_core, build, installer).

However, installer installs different files than pip, so update PLISTs
for all packages using wheel.mk and bump their PKGREVISIONs.
   2023-09-09 23:35:02 by Thomas Klausner | Files touched by this commit (2) | Package updated
Log message:
py-soupsieve: update to 2.5.

NEW: Update to support Python 3.12.
NEW: Drop support for Python 3.7.
   2023-04-17 07:29:56 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-soupsieve: updated to 2.4.1

2.4.1

- **FIX**: Attribute syntax for case insensitive flag optionally allows a space, \ 
it does not require one.
   2023-04-02 14:40:27 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-soupsieve: updated to 2.4

2.4

- **NEW**: Update to support changes related to `:lang()` in the official CSS \ 
spec. `:lang("")` should match unspecified
  languages, e.g. `lang=""`, but not `lang=und`.
- **NEW**: Only `:is()` and `:where()` should allow forgiving selector lists \ 
according to latest CSS (as far as Soup
  Sieve supports "forgiving" which is limited to empty selectors).
- **NEW**: Formally drop Python 3.6.
- **NEW**: Formally declare support for Python 3.11.
   2022-11-30 17:59:36 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-soupsieve: updated to 2.3.2.1

2.3.2.post1

- **FIX**: Documentation for installation from source is outdated.

2.3.2

- **FIX**: Fix some typos in error messages.

2.3.1

- **FIX**: Ensure attribute selectors match tags that have new line characters \ 
in attributes.

2.3

- **NEW**: Officially support Python 3.10.
- **NEW**: Add static typing.
- **NEW**: `:has()`, `:is()`, and `:where()` now use use a forgiving selector \ 
list. While not as forgiving as CSS might
 be, it will forgive such things as empty sets and empty slots due to multiple \ 
consecutive commas, leading commas, or
 trailing commas. Essentially, these pseudo-classes will match all non-empty \ 
selectors and ignore empty ones. As the
 scraping environment is different than a browser environment, it was chosen not \ 
to aggressively forgive bad syntax and
 invalid features to ensure the user is alerted that their program may not \ 
perform as expected.
- **NEW**: Add support to output a pretty print format of a compiled \ 
`SelectorList` for debug purposes.
- **FIX**: Some small corner cases discovered with static typing.

2.2.1

- **FIX**: Fix an issue with namespaces when one of the keys is `self`.

2.2

- **NEW**: `:link` and `:any-link` no longer include `#!html <link>` due \ 
to a change in the level 4 selector
 specification. This actually yields more sane results.
- **FIX**: BeautifulSoup, when using `find`, is quite forgiving of odd types \ 
that a user may place in an element's
 attribute value. Soup Sieve will also now be more forgiving and attempt to \ 
match these unexpected values in a sane
 manner by normalizing them before compare.

2.1

- **NEW**: Officially support Python 3.9.
- **NEW**: Drop official support for Python 3.5.
- **NEW**: In order to avoid conflicts with future CSS specification changes, \ 
non-standard pseudo classes will now start
 with the `:-soup-` prefix. As a consequence, `:contains()` will now be known as \ 
`:-soup-contains()`, though for a time
 the deprecated form of `:contains()` will still be allowed with a warning that \ 
users should migrate over to
 `:-soup-contains()`.
- **NEW**: Added new non-standard pseudo class `:-soup-contains-own()` which \ 
operates similar to `:-soup-contains()`
 except that it only looks at text nodes directly associated with the currently \ 
scoped element and not its descendants.
- **FIX**: Import `bs4` globally instead of in local functions as it appears \ 
there are no adverse affects due to
 circular imports as `bs4` does not immediately reference `soupsieve` functions \ 
and `soupsieve` does not immediately
 reference `bs4` functions. This should give a performance boost to functions \ 
that had previously included `bs4`
 locally.

2.0.1

- **FIX**: Remove unused code.

2.0

- **NEW**: `SelectorSyntaxError` is derived from `Exception` not `SyntaxError`.
- **NEW**: Remove deprecated `comments` and `icomments` from the API.
- **NEW**: Drop support for EOL Python versions (Python 2 and Python < 3.5).
- **FIX**: Corner case with splitting namespace and tag name that that have an \ 
escaped `|`.
   2022-01-04 21:55:40 by Thomas Klausner | Files touched by this commit (1595)
Log message:
*: bump PKGREVISION for egg.mk users

They now have a tool dependency on py-setuptools instead of a DEPENDS
   2021-10-26 13:31:15 by Nia Alarie | Files touched by this commit (1030)
Log message:
www: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Not committed (merge conflicts):
www/nghttp2/distinfo

Unfetchable distfiles (almost certainly fetched conditionally...):
./www/nginx-devel/distinfo array-var-nginx-module-0.05.tar.gz
./www/nginx-devel/distinfo echo-nginx-module-0.62.tar.gz
./www/nginx-devel/distinfo encrypted-session-nginx-module-0.08.tar.gz
./www/nginx-devel/distinfo form-input-nginx-module-0.12.tar.gz
./www/nginx-devel/distinfo headers-more-nginx-module-0.33.tar.gz
./www/nginx-devel/distinfo lua-nginx-module-0.10.19.tar.gz
./www/nginx-devel/distinfo naxsi-1.3.tar.gz
./www/nginx-devel/distinfo nginx-dav-ext-module-3.0.0.tar.gz
./www/nginx-devel/distinfo nginx-rtmp-module-1.2.2.tar.gz
./www/nginx-devel/distinfo nginx_http_push_module-1.2.10.tar.gz
./www/nginx-devel/distinfo ngx_cache_purge-2.5.1.tar.gz
./www/nginx-devel/distinfo ngx_devel_kit-0.3.1.tar.gz
./www/nginx-devel/distinfo ngx_http_geoip2_module-3.3.tar.gz
./www/nginx-devel/distinfo njs-0.5.0.tar.gz
./www/nginx-devel/distinfo set-misc-nginx-module-0.32.tar.gz
./www/nginx/distinfo array-var-nginx-module-0.05.tar.gz
./www/nginx/distinfo echo-nginx-module-0.62.tar.gz
./www/nginx/distinfo encrypted-session-nginx-module-0.08.tar.gz
./www/nginx/distinfo form-input-nginx-module-0.12.tar.gz
./www/nginx/distinfo headers-more-nginx-module-0.33.tar.gz
./www/nginx/distinfo lua-nginx-module-0.10.19.tar.gz
./www/nginx/distinfo naxsi-1.3.tar.gz
./www/nginx/distinfo nginx-dav-ext-module-3.0.0.tar.gz
./www/nginx/distinfo nginx-rtmp-module-1.2.2.tar.gz
./www/nginx/distinfo nginx_http_push_module-1.2.10.tar.gz
./www/nginx/distinfo ngx_cache_purge-2.5.1.tar.gz
./www/nginx/distinfo ngx_devel_kit-0.3.1.tar.gz
./www/nginx/distinfo ngx_http_geoip2_module-3.3.tar.gz
./www/nginx/distinfo njs-0.5.0.tar.gz
./www/nginx/distinfo set-misc-nginx-module-0.32.tar.gz
   2021-10-07 17:09:00 by Nia Alarie | Files touched by this commit (1033)
Log message:
www: Remove SHA1 hashes for distfiles