./biology/py-biopython, Python libraries for computational molecular biology

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 1.85nb1, Package name: py312-biopython-1.85nb1, Maintainer: pkgsrc-users

The Biopython package contains high-quality, reusable modules and
scripts written in Python to make it as easy as possible to use Python
for bioinformatics. The Biopython includes the follwing: the ability
to parse bioinformatics files into python utilizable data structures,
including support for the formats such as Blast output, Clustalw,
FASTA, GenBank, PubMed and Medicine, various Expasy files, SCOP,
Rebase, UniGene, and SwissProt.


Required to run:
[devel/py-setuptools] [math/py-numpy] [lang/python37]

Required to build:
[pkgtools/cwrappers]

Master sites:

Filesize: 19443.264 KB

Version history: (Expand)


CVS history: (Expand)


   2025-04-15 15:03:39 by Adam Ciarcinski | Files touched by this commit (10) | Package updated
Log message:
Fix PLIST after py-setuptools update; bump depends and revision
   2025-02-10 14:08:29 by Adam Ciarcinski | Files touched by this commit (3) | Package updated
Log message:
py-biopython: updated to 1.85

Biopython 1.85

This release of Biopython supports Python 3.9, 3.10, 3.11, 3.12 and 3.13. It
has also been tested on PyPy3.9 v7.3.13. Python 3.9 is approaching end of
life, our support for it is now deprecated.

Some optimisation work was done for ``Bio.SeqIO`` including avoiding nested
iterators, and speeding up both FASTA and FASTQ parsing.

``Bio.motifs`` now supports reading PFM from Cys2His2 Zinc Finger Proteins PWM
Predictor and reading motifs in ``pfm-four-columns`` format will set motif name
to "" instead of None, when no motif name was found.

Tests that use assertAlmostEqual calls now use ``places`` parameter with enough
presision when comparing very small numbers in scientific notation.

``Bio.motifs`` now supports reverse complementing RNA motifs and correctly
generating degenerate consensus sequences for RNA motifs.

``Bio.motifs.minimal`` now supports parsing RNA motifs and parsing motifs
for which not all statistics are provided (e.g. missing E-values or nsites).

``Bio.motifs.clusterbuster`` now supports parsing GAP and WEIGHT parameters
and can optionally write Cluster Buster motif files with floats instead of
integers, by specifying the ``precision=<int>`` parameter when writing:
e.g. motifs.write(motifs, "clusterbuster", precision=2)

Additionally, a number of small bugs and typos have been fixed with additions
to the test suite and type annotations.
   2024-11-11 08:29:31 by Thomas Klausner | Files touched by this commit (862)
Log message:
py-*: remove unused tool dependency

py-setuptools includes the py-wheel functionality nowadays
   2024-10-14 08:46:10 by Thomas Klausner | Files touched by this commit (325)
Log message:
*: clean-up after python38 removal
   2023-11-15 09:58:06 by Thomas Klausner | Files touched by this commit (1)
Log message:
py-biopython: add missing setuptools tool
   2023-11-06 00:52:20 by Thomas Klausner | Files touched by this commit (3) | Package updated
Log message:
py-biopython: update to 1.81.

12 February 2023: Biopython 1.81
===============================================

This release of Biopython supports Python 3.7, 3.8, 3.9, 3.10, 3.11. It has
also been tested on PyPy3.7 v7.3.5. We intend to drop Python 3.7 support.

The API documentation and the `Biopython Tutorial and Cookbook` have
been updated to better annotate use and application of the
``Bio.PDB.internal_coords`` module.

``Bio.Phylo`` now supports ``Alignment`` and ``MultipleSeqAlignment``
objects as input.

Several improvements and bug fixes to the snapgene parser contributes by
Damien Goutte-Gattat.

Additionally, a number of small bugs and typos have been fixed with additions
to the test suite.

18 November 2022: Biopython 1.80
================================

This release of Biopython supports Python 3.7, 3.8, 3.9, 3.10, 3.11. It has
also been tested on PyPy3.7 v7.3.5.

Functions ``read``, ``parse``, and ``write`` were added to ``Bio.Align`` to
read and write ``Alignment`` objects.  String formatting and printing output
of ``Alignment`` objects from ``Bio.Align`` were changed to support these new
functions. To obtain a string showing the aligned sequence with the appropriate
gap characters (as previously shown when calling ``format`` on an alignment),
use ``alignment[i]``, where ``alignment`` is an ``Alignment`` object and ``i``
is the index of the aligned sequence.

Because dict retains the item order by default since Python3.6, all instances
of ``collections.OrderedDict`` have been replaced by either standard ``dict``
or where appropriate by ``collections.defaultsdict``.

Robert Miller has updated the ``Bio.PDB.internal_coords`` module  to
make better use of Numpy for lossless structure assembly from dihedral
angles and related internal coordinates.  In addition to speeding the
assembly step by ~30%, this adds distance plot support (including
re-generating structures from distance plot data), coordinate space
transforms for superimposing residues and their environments, a
per-chain all-atom array for Atom coordinates, and optional default
values for all internal coordinates.  The internal coordinates module
continues to support extracting dihedral angle, bond angle and bond
length (internal coordinates) data, reading/writing structure files of
internal coordinates, and OpenSCAD output of structures for 3D CAD/3D
printing work.

The ``Bio.motifs.jaspar.db`` now returns ``tf_family`` and ``tf_class`` as a
string array since the JASPAR 2018 release.

The Local Composition Complexity functions from ``Bio.SeqUtils`` now uses
base 4 log instead of 2 as stated in the original reference Konopka (2005),
Sequence Complexity and Composition. https://doi.org/10.1038/npg.els.0005260

Append mode is now supported in ``Bio.bgzf`` (and a bug parsing blocked GZIP
files with an internal empty block fixed).

The experimental warning was dropped from ``Bio.phenotype`` (which was new in
Biopython 1.67).

Sequences now have a ``defined`` attribute that returns a boolean indicating
if the underlying data is defined or not.

The ``Bio.PDB`` module now includes a structural alignment module, using the
combinatorial extension algorithm of Shindyalov and Bourne, commonly known as
CEAlign. The module allows for two structures to be aligned based solely on
their 3D conformation, ie. in a sequence-independent manner. The method is
particularly powerful when the structures shared a very low degree of sequence
similarity. The new module is available in ``Bio.PDB.CEAligner`` with an
interface similar to other 3D superimposition modules.

A new module ``Bio.PDB.qcprot`` implements the QCP superposition algorithm in
pure Python, deprecating the existing C implementation. This leads to a slight
performance improvement and to much better maintainability. The refactored
``qcprot.QCPSuperimposer`` class has small changes to its API, to better mirror
that of ``Bio.PDB.Superimposer``.

The ``Bio.PDB.PDBList`` module now allows downloading biological assemblies,
for one or more entries of the wwPDB.

In the ``Bio.Restriction`` module, each restriction enzyme now includes an `id`
property giving the numerical identifier for the REBASE database identifier
from which the enzyme object was created, and a `uri` property with a canonical
`identifiers.org` link to the database, for use in linked-data representations.

Add new ``gc_fraction`` function in ``SeqUtils`` and marks ``GC`` for future
deprecation.

Support for the old format (dating back to 2004) of the GN line in SwissProt
files was dropped in ``Bio.SwissProt``.

Additionally, a number of small bugs and typos have been fixed with additions
to the test suite.
   2023-08-02 01:20:57 by Thomas Klausner | Files touched by this commit (158)
Log message:
*: remove more references to Python 3.7
   2023-07-01 10:37:47 by Thomas Klausner | Files touched by this commit (105) | Package updated
Log message:
*: restrict py-numpy users to 3.9+ in preparation for update