2024-12-22 17:50:10 by Nia Alarie | Files touched by this commit (2) |
Log message:
I forgot about compiler.mk...
|
2024-12-22 17:44:30 by Nia Alarie | Files touched by this commit (2) |
Log message:
svt-av1, x265: These use ARM NEON intrinsics only supported in GCC>=10
|
2024-11-01 09:05:44 by Thomas Klausner | Files touched by this commit (2) | |
Log message:
svt-av1: update to 2.3.0.
## [2.3.0] - 2024-10-28
API updates
- Preset shift: M12/M13 mapped to M11, M7-M11 shifted one position down. API \
does not change, all presets from MR-M13 will still be accepted
- svt_av1_enc_get_packet API is now a blocking call for low-delay enforcing a \
picture in, picture out model
- --fast-decode range changed from 0-1 to 0-2 available on all presets
- Introducing a new definition of --lp being levels of parallelism with a new \
field in the config structure level_of_parallelism
- logical_processors will be deprecated in the 3.0 release
Encoder
- NEW FAST DECODE MODE - (!2280)
- New fast-decode (2) to allow for an average AV1 software cycle reduction of \
25-50% vs fast-decode 0 with a 1-3% BD-Rate loss across the presets
- Improved fast-decode (1) option to increase its AV1 software cycle reduction \
by ~10% while maintaining the same quality levels
- Improved --lp settings for high resolutions, with CRF gaining a ~4% \
improvement in speed and VBR gaining ~15% (!2323)
- Further Arm-based optimizations improving the efficiency of previously written \
Arm-neon implementations by an average of 30%. See below for more information on \
specific presets
- Address speed regressions for high resolutions first pass encode by tuning the \
threading parameters, with 1080p showing the biggest gains
- Enabled AVX512 by default in cmake allowing for ~2-4% speedup
- Enabled LTO by default if using a new enough compiler (!2288, !2305)
- If LTO is a problem or causes one, it can be disabled by adding \
-DSVT_AV1_LTO=OFF to cmake to force it off.
- Please report any issues that occur when using it.
Cleanup Build and bug fixes and documentation
- third_party: update safestringlib with applicable upstream changes
- Improved the unit test coverage for Arm-neon code
- Updated documentation
Arm Improvements
- Speed comparison was done against v2.2 on AWS Graviton4 instances with Clang 19.1.1
- `--lp 1` was used for all tests
Standard bitdepth (Bosphorus 1080p):
| Preset | Uplift |
|-------------------------------|--------|
| 0 | 1.15x |
| 1 | 1.24x |
| 2 | 1.29x |
| 3 | 1.17x |
| 4 | 1.22x |
| 5 | 1.31x |
| 6 | 1.40x |
| 7 (against 2.2 preset 8) | 1.50x |
| 8 (against 2.2 preset 9) | 1.61x |
| 9 (against 2.2 preset 10 | 1.31x |
| 10 (against 2.2 preset 11 ) | 1.29x |
| 11 (against 2.2 preset 12/13) | 1.24x |
High bitdepth (Bosphorus 2160p):
| Preset | Uplift |
|-------------------------------|--------|
| 0 | 1.18x |
| 1 | 1.19x |
| 2 | 1.16x |
| 3 | 1.27x |
| 4 | 1.33x |
| 5 | 1.27x |
| 6 | 1.33x |
| 7 (against 2.2 preset 8) | 1.35x |
| 8 (against 2.2 preset 9) | 1.82x |
| 9 (against 2.2 preset 10) | 1.95x |
| 10 (against 2.2 preset 11) | 1.40x |
| 11 (against 2.2 preset 12/13) | 1.35x |
|
2024-09-04 13:59:05 by Jonathan Perkin | Files touched by this commit (3) |
Log message:
svt-av1: Fix FOPEN conflict with system headers.
|
2024-08-21 19:21:44 by Thomas Klausner | Files touched by this commit (3) | |
Log message:
svt-av1: update to 2.2.0.
Removes decoder - use dav1d or aomdec.
https://gitlab.com/AOMediaCodec/SVT-AV1/-/issues/2182
## [2.2.0] - 2024-08-19
API updates
- No API changes on this release
Encoder
- Improve the tradeoffs for the random access mode across presets:
- Speedup of ~15% across presets M0 - M8 while maintaining similar quality \
levels (!2253)
- Improve the tradeoffs for the low-delay mode across presets (!2260)
- Increased temporal resolution setting to 6L for 4k resolutions by default
- Added ARM optimizations for functions with c_only equivalent yielding an \
average speedup of ~13% for 4k10bit
Cleanup Build and bug fixes and documentation
- Profile-guided-optimized helper build overhaul
- Major cleanup and fixing of Neon unit test suite
- Address stylecheck dependence on public repositories
## [2.1.2] - 2024-06-27
Cleanup, bug fixes:
- Fixed profile-guided-optimization build by removing the remaining decoder path
## [2.1.1] - 2024-06-25
Cleanup, bug fixes, and documentation improvements:
- Removed the SVT-AV1 Decoder portion of the project. The last version \
containing the decoder is SVT-AV1 v2.1.0.
- Updated the folder structure and library build order to reflect the removal of \
the decoder.
- Renamed all files (except for API files) to remove the "Eb" prefix \
and changed them to camel_case format.
- Updated the gtest version to v1.12.1.
- Added CI support for ARM-based macOS machines.
- Improved documentation for accuracy and completeness.
|
2024-08-21 19:00:42 by Thomas Klausner | Files touched by this commit (3) |
Log message:
*: revbump for svt-av1 shlib bump
|
2024-08-21 18:59:43 by Thomas Klausner | Files touched by this commit (5) | |
Log message:
svt-av1: update to 2.1.0.
2.2.0 is out, but 2.1.1 removed the decoder. Not sure what to do about that.
## [2.1.0] - 2024-05-17
API updates
- One config parameter added within the padding size. Config param structure \
size remains unchanged
- Presets 6 and 12 are now pointing to presets 7 and 13 respectively due to the \
lack of spacing between the presets
- Further preset shuffling is being discussed in #2152
Encoder
- Added variance boost support to improve visual quality for the tune vq mode
- Improve the tradeoffs for the random access mode across presets:
- Speedup of 12-40% presets M0, M3, M5 and M6 while maintaining similar \
quality levels
- Improved the compression efficiency of presets M11-M13 by 1-2% (!2213)
- Added ARM optimizations for functions with c_only equivalent
Cleanup Build and bug fixes and documentation
- Use nasm as a default assembler and yasm as a fallback
- Fix performance regression for systems with multiple processor groups
- Enable building SvtAv1ApiTests and SvtAv1E2ETests for arm
- Added variance boost documentation
- Added a mailmap file to map duplicate git generated emails to the appropriate \
author
## [2.0.0] - 2024-03-13
Major API updates
- Changed the API signaling the End Of Stream (EOS) with the last frame vs with \
an empty frame
- OPT_LD_LATENCY2 making the change above is kept in the code to help devs with \
integration
- Removed the 3-pass VBR mode which changed the calling mechanism of multi-pass VBR
- Moved to a new versioning scheme where the project major version will be \
updated everytime API/ABI is changed
Encoder
- Improve the tradeoffs for the random access mode across presets:
- Speedup presets MR by ~100% and improved quality along with tradeoff \
improvements across the higher quality presets (!2179)
- Improved the compression efficiency of presets M9-M13 by 1-4% (!2179)
- Simplified VBR multi-pass to use 2 passes to allow integration with ffmpeg
- Continued adding ARM optimizations for functions with c_only equivalent
- Replaced the 3-pass VBR with a 2-pass VBR to ease the multi-pass integration \
with ffmpeg
- Memory savings of 20-35% for LP 8 mode in preset M6 and below and 1-5% in \
other modes / presets
Cleanup and bug fixes and documentation
- Various cleanups and functional bug fixes
- Update the documentation to reflect the rate control changes
|
2024-08-19 19:33:37 by Thomas Klausner | Files touched by this commit (1) |
Log message:
svt-av1: convert to cmake/build.mk
|
2023-12-17 08:14:56 by Thomas Klausner | Files touched by this commit (2) | |
Log message:
svt-av1: update to 1.8.0.
## [1.8.0] - 2023-12-11
Encoder
- Improve the tradeoffs for the random access mode across presets:
- Speedup CRF presets M6 to M0 by 17-53% while maintaining similar quality levels
- Re-adjust CRF presets M7 to M13 for better quality with BD-rate gains ranging \
from 1-4%
- Improve the quality and speed of the 1-pass VBR mode
- More details on the per preset improvements can be found in MR !2143
- Add API allowing to update bitrate / CRF and Key_frame placement during the \
encoding session for CBR lowdelay mode and CRF Random Access mode
- ARM Neon SIMD optimizations for most critical kernels allowing for a 4.5-8x \
fps speedup vs the c implementation
Cleanup and bug fixes and documentation
- Various cleanups and functional bug fixes
- Update the documentation for preset options and individual features
|
2023-09-09 22:19:00 by Thomas Klausner | Files touched by this commit (2) | |
Log message:
svt-av1: update to 1.7.0.
## [1.7.0] - 2023-08-23
Encoder
- Improve the tradeoffs for the random access mode across presets MR-M13:
- Quality improvements across all presets and metrics ranging from 0.3% to 4.5% \
in BD-rate (!2129)
- Spacing between presets [M1-M6] has been adjusted to account for the tradeoff \
improvements achieved
- As a user guidance when comparing v1.7 vs v1.6 in a convexhull encoding setup:
- v1.7.0 M2 is now at similar quality levels as v1.6.0 M1 while being ~50% faster
- v1.7.0 M3 is now at similar quality levels as v1.6.0 M2 while being ~50% faster
- v1.7.0 M4 is now at similar quality levels as v1.6.0 M3 while being ~40% faster
- v1.7.0 M5 is now at similar quality levels as v1.6.0 M4 while being ~30% faster
- v1.7.0 M6 is now at similar quality levels as v1.6.0 M5 while being ~25% faster
- Added an experimental tune SSIM mode yielding ~3-4% additional SSIM BD-rate \
gains (!2109)
Build, cleanup and bug fixes
- Various cleanups and functional bug fixes
- Fix build conflict with libaom
## [1.6.0] - 2023-06-18
Encoder
- Improve the tradeoffs for the random access mode across presets M1-M13:
- Speeding up the higher quality presets by 30-40%
- Improving the BD-rate by 1-2% for the faster presets
- Improve the tradeoffs for the low delay mode for both scrren content and \
non-screen content encoding modes
- Add a toggle to remove the legacy one-frame buffer at the input of the \
pipeline alowing the low delay mode to operate at sub-frame processing latencies
- Add a new API allowing the user to specify quantization offsets for a region \
of interest per frame
Build, cleanup and bug fixes
- Various cleanups and functional bug fixes
- Fix the startup minigop size BD-rate loss
- Add ability to run the ci-testing offline
|