./devel/polly, Polyhedral optimizations for LLVM


Branch: CURRENT, Version: 7.0.1, Package name: polly-7.0.1, Maintainer: pkgsrc-users

Polly is a high-level loop and data-locality optimizer and optimization
infrastructure for LLVM. It uses an abstract mathematical representation based
on integer polyhedra to analyze and optimize the memory access pattern of a
program. We currently perform classical loop transformations, especially tiling
and loop fusion, to improve data-locality. Polly can also exploit OpenMP-level
parallelism and expose SIMDization opportunities. Work has also been done in the
area of automatic GPU code generation.
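
As a rough illustration of what tiling means, the sketch below tiles a matrix
multiplication by hand. It is not Polly output and does not use any Polly or
LLVM API; the names Matrix, matmul_naive, matmul_tiled and the tile parameter
are hypothetical and chosen only for this example.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Row-major n x n matrix stored in a flat vector (illustrative type alias).
using Matrix = std::vector<double>;

// Plain triple loop: C += A * B.
void matmul_naive(const Matrix& a, const Matrix& b, Matrix& c, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i)
        for (std::size_t j = 0; j < n; ++j)
            for (std::size_t k = 0; k < n; ++k)
                c[i * n + j] += a[i * n + k] * b[k * n + j];
}

// Same computation with all three loops split into tile-sized blocks, so that
// each block of A, B and C is reused while it is likely still in cache.
void matmul_tiled(const Matrix& a, const Matrix& b, Matrix& c,
                  std::size_t n, std::size_t tile) {
    assert(tile > 0);
    for (std::size_t ii = 0; ii < n; ii += tile)
        for (std::size_t jj = 0; jj < n; jj += tile)
            for (std::size_t kk = 0; kk < n; kk += tile)
                // std::min handles a final partial tile when tile does not divide n.
                for (std::size_t i = ii; i < std::min(ii + tile, n); ++i)
                    for (std::size_t j = jj; j < std::min(jj + tile, n); ++j)
                        for (std::size_t k = kk; k < std::min(kk + tile, n); ++k)
                            c[i * n + j] += a[i * n + k] * b[k * n + j];
}
```

Both functions add the products for a given output element in the same k
order, so they produce the same result; only the memory access pattern
differs. A polyhedral optimizer derives this kind of loop restructuring
automatically instead of requiring it to be written by hand.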

For many users, however, it's not the existing optimizations in Polly that are
of most interest, but the new analyses and optimizations enabled by the Polly
infrastructure. At polyhedral.info you can get an idea of what has already been
done and what is possible in the context of polyhedral compilation.

SHA1: 7c750261090168a4a219ab823cfbe76f0938850f
RMD160: f1968870c6069bdc22b660eac4a1de4ca580cddd
Filesize: 8549.969 KB

CVS history:

   2018-12-23 01:11:40 by Adam Ciarcinski | Files touched by this commit (17) | Package updated
Log message:
llvm and friends: updated to 7.0.1

This release contains bug-fixes for the LLVM 7.0.0 release. This
release is API and ABI compatible with 7.0.0.
   2018-12-09 21:04:40 by Adam Ciarcinski | Files touched by this commit (44) | Package updated
Log message:
llvm: updated to 7.0.0

LLVM 7.0.0 Release

The release contains the work on trunk up to SVN revision 338536 plus
work on the release branch. It is the result of the community's work
over the past six months, including: function multiversioning in Clang
with the 'target' attribute for ELF-based x86/x86_64 targets, improved
PCH support in clang-cl, preliminary DWARF v5 support, basic support
for OpenMP 4.5 offloading to NVPTX, OpenCL C++ support, MSan, X-Ray
and libFuzzer support for FreeBSD, early UBSan, X-Ray and libFuzzer
support for OpenBSD, UBSan checks for implicit conversions, many
long-tail compatibility issues fixed in lld which is now production
ready for ELF, COFF and MinGW, new tools llvm-exegesis, llvm-mca and
diagtool. And as usual, many optimizations, improved diagnostics, and
bug fixes.

For more details, see the release notes:
https://llvm.org/releases/7.0.0/tools/c … Notes.html
https://llvm.org/releases/7.0.0/tools/c … Notes.html
https://llvm.org/releases/7.0.0/tools/l … Notes.html
   2018-08-07 12:44:50 by Adam Ciarcinski | Files touched by this commit (16) | Package updated
Log message:
llvm: updated to 6.0.1


Non-comprehensive list of changes in this release

Support for retpolines was added to help mitigate “branch target injection” \ 
(variant 2) of the “Spectre” speculative side channels described by Project \ 
Zero and the Spectre paper.
The Redirects argument of llvm::sys::ExecuteAndWait and llvm::sys::ExecuteNoWait \ 
was changed to an ArrayRef of optional StringRefs to make it safer and more \ 
convenient to use.
The backend name was added to the Target Registry to allow run-time information \ 
to be fed back into TableGen. Out-of-tree targets will need to add the name used \ 
in the def X : Target definition to the call to RegisterTarget.
The Debugify pass was added to opt to facilitate testing of debug info \ 
preservation. This pass attaches synthetic DILocations and DIVariables to the \ 
instructions in a Module. The CheckDebugify pass determines how much of the \ 
metadata is lost.
Significantly improved quality of CodeView debug info for Windows.
Preliminary support for Sanitizers and sibling features on X86(_64) NetBSD \ 
(ASan, UBsan, TSan, MSan, SafeStack, libFuzzer).

Changes to the LLVM IR
The fast-math-flags (FMF) have been updated. Previously, the ‘fast’ flag \ 
indicated that floating-point reassociation was allowed and all other flags were \ 
set too. The ‘fast’ flag still exists, but there is a new flag called \ 
‘reassoc’ to indicate specifically that reassociation is allowed. A new bit \ 
called ‘afn’ was also added to selectively allow approximations for common \ 
mathlib functions like square-root. The new flags provide more flexibility to \ 
enable/disable specific floating-point optimizations. Making the optimizer \ 
respond appropriately to these flags is an ongoing effort.
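
To see why reassociation needs its own opt-in flag, the sketch below shows
that floating-point addition is not associative. It is plain C++ rather than
LLVM IR, the function names are hypothetical, and it assumes IEEE 754 doubles
compiled without fast-math options.

```cpp
#include <cassert>

// Sum grouped as (a + b) + c.
double sum_left(double a, double b, double c) { return (a + b) + c; }

// Sum grouped as a + (b + c), i.e. the reassociated form.
double sum_right(double a, double b, double c) { return a + (b + c); }

// Self-check with values where the grouping changes the result.
void check_reassociation_example() {
    // (1e20 + -1e20) + 1.0: the large terms cancel first, so the 1.0 survives.
    assert(sum_left(1e20, -1e20, 1.0) == 1.0);
    // 1e20 + (-1e20 + 1.0): the 1.0 is rounded away before the cancellation.
    assert(sum_right(1e20, -1e20, 1.0) == 0.0);
}
```

An optimizer that is allowed to reassociate may turn one grouping into the
other, which is why the permission is expressed as a separate flag rather
than applied unconditionally.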

Changes to the AArch64 Target
Enabled the new GlobalISel instruction selection framework by default at -O0.

Changes to the ARM Target
Support for enabling SjLj exception handling on platforms where it isn’t the \ 
default.

Changes to the Hexagon Target
The Hexagon backend now supports V65 ISA.
The -mhvx option now takes an optional value that specifies the ISA version of \ 
the HVX coprocessor. The available values are v60, v62 and v65. By default, the \ 
value is set to be the same as the CPU version.
The compiler option -mhvx-double is deprecated and will be removed in the next \ 
release of the compiler. Programmers should use the -mhvx-length option to \ 
specify the desired vector length: -mhvx-length=64b for 64-byte vectors and \ 
-mhvx-length=128b for 128-byte vectors. While the current default vector length \ 
is 64 bytes, users should always specify the length explicitly, since the \ 
default value may change in the future.
The target feature hvx-double is deprecated and will be removed in the next \ 
release. LLVM IR generators should use target features hvx-length64b and \ 
hvx-length128b to indicate the vector length. The length should always be \ 
specified when HVX code generation is enabled.

Changes to the MIPS Target
Fixed numerous bugs:

fpowi on MIPS64 giving incorrect results when used with a negative integer.
Usage of the asm ‘c’ constraint with the wrong datatype causing an assert/crash.
Fixed a conversion bug when using the DSP ASE.
Fixed an inconsistency where objects were not marked as using microMIPS \ 
when the micromips function attribute or the “.set micromips” directive was \ 
used.
Reordered the MIPSR6 specific hazard scheduler pass to after the delay slot \ 
filler, fixing a class of rare edge case bugs where the delay slot filler would \ 
violate ISA restrictions.
Fixed a crash when using a type of unknown size with gp relative addressing.
Corrected the j macro for microMIPS.
Corrected the encoding of movep for microMIPS32r6.
Fixed an issue with the usage of insert instructions having an invalid set of \ 
Fixed an issue where TLS symbols were not marked as such.
Enabled the usage of register scavenging with MSA, due to its shorter offsets \ 
for loads and stores.
Corrected the ELF headers when using the DSP ASE.

New features:

The long branch pass now generates some R6 specific instructions when targeting \ 
The delay slot filler now performs more branch conversions if delay slots cannot \ 
be filled.
The MIPS MT ASE is now fully supported.
Added support for the lapc pseudo instruction.
Improved the selection of multiple instructions (dext, nmadd, nmsub).
Further improved microMIPS codesize reduction.

Deprecation notices:

microMIPS64R6 support has been deprecated since 5.0, and has now been completely \ 
removed.

Changes to the SystemZ Target
During this release the SystemZ target has:

Added support for 128-bit atomic operations.
Added support for the “o” constraint for inline asm statements.

Changes to the X86 Target
During this release the X86 target has:

Added support for enabling SjLj exception handling on platforms where it isn’t \ 
the default.
Added intrinsics for Intel Extensions: VAES, GFNI, VPCLMULQDQ, AVX512VBMI2, \ 
Added support for Intel Icelake CPU.
Fixed some X87 codegen bugs.
Added instruction scheduling information for Intel Sandy Bridge, Ivy Bridge, \ 
Haswell, Broadwell, and Skylake CPUs.
Improved scheduler model for AMD Jaguar CPUs.
Improved llvm-mc’s disassembler for some EVEX encoded instructions.
Added support for i8 and i16 vector signed/unsigned min/max horizontal reductions.
Improved codegen for memory comparisons.
Improved codegen for i32 vector multiplies.
Improved codegen for scalar integer absolute values.
Improved codegen for vector integer rotations (XOP and AVX512).
Improved codegen of data being transferred between GPRs and K-registers.
Improved codegen for vector truncations.
Improved folding of address computations into gather/scatter instructions.
Gained initial support for recognizing variable shuffles from vector element \ 
extracts and inserts.
Improved documentation for SSE/AVX intrinsics in intrin.h header files.
Gained support for emitting retpolines, including automatic insertion of the \ 
necessary thunks or using external thunks.
   2018-05-22 14:39:23 by Adam Ciarcinski | Files touched by this commit (10) | Package updated
Log message:
llvm: updated to 5.0.2

Bug fix release.
   2017-12-30 11:39:25 by Adam Ciarcinski | Files touched by this commit (14) | Package updated
Log message:
llvm: updated to 5.0.1

LLVM 5.0.1:
This release contains bug-fixes for the LLVM 5.0.0 release. This
release is API and ABI compatible with 5.0.0.
   2017-12-01 20:22:13 by Adam Ciarcinski | Files touched by this commit (34) | Package updated
Log message:
llvm: updated to 5.0.0

Non-comprehensive list of changes in this release
* LLVM’s WeakVH has been renamed to WeakTrackingVH and a new WeakVH has been \ 
introduced. The new WeakVH nulls itself out on deletion, but does not track \ 
values across RAUW.
* A new library named BinaryFormat has been created which holds a collection of \ 
code which previously lived in Support. This includes the file_magic structure \ 
and identify_magic functions, as well as all the structure and type definitions \ 
for DWARF, ELF, COFF, WASM, and MachO file formats.
* The tool llvm-pdbdump has been renamed llvm-pdbutil to better reflect its \ 
nature as a general purpose PDB manipulation / diagnostics tool that does more \ 
than just dumping contents.
* The BBVectorize pass has been removed. It was fully replaced and no longer \ 
used back in 2014 but we didn’t get around to removing it. Now it is gone. The \ 
SLP vectorizer is the suggested non-loop vectorization pass.
* A new tool opt-viewer.py has been added to visualize optimization remarks in \ 
HTML. The tool processes the YAML files produced by clang with the \ 
-fsave-optimization-record option.
* A new CMake macro LLVM_REVERSE_ITERATION has been added. If enabled, all \ 
supported unordered LLVM containers would be iterated in reverse order. This is \ 
useful for uncovering non-determinism caused by iteration of unordered \ 
containers. Currently, it supports reverse iteration of SmallPtrSet and \ 
* A new tool llvm-dlltool has been added to create short import libraries from \ 
GNU style definition files. The tool utilizes the PE COFF SPEC Import Library \ 
Format and PE COFF Auxiliary Weak Externals Format to achieve compatibility with \ 
   2017-07-11 21:47:38 by Adam Ciarcinski | Files touched by this commit (15)
Log message:
Changes 4.0.1:
This is a bug-fix release.
   2017-05-06 03:38:43 by Kamil Rytarowski | Files touched by this commit (3)
Log message:
Mark c++11 requirement