The harder part of Armv9 to understand is SVE2!
# SVE2 lets developers write and build a program once, then run it on different AArch64 hardware.
[Repost] Why use SVE2?
The SVE2 design concept enables developers to write and build software once, then run the same binaries on different AArch64 hardware with various SVE2 vector length implementations, as the name suggests. Since some laptop and mobile devices have different vector lengths, SVE2 can reduce the cost of cross-platform support by sharing code. Removing the requirement to rebuild binaries allows software to be ported more easily. The scalability and portability of the binaries means that developers do not have to know and care about the vector length for their target devices. This particular benefit of SVE2 is more effective when the software is shared across platforms or used over an extended period of time.
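To make the "build once, run on any vector length" idea concrete, here is a minimal Python sketch that only simulates a vector-length-agnostic loop. It is not SVE2 code. The names `whilelt` and `vla_add` are hypothetical helpers invented for this illustration (`whilelt` is loosely modelled on the SVE WHILELT predicate instruction), and 32-bit elements are an assumption. The point is that the loop body never hard-codes the lane count, so the same code gives the same result whatever vector length the "hardware" reports.

```python
def whilelt(i, n, lanes):
    """Hypothetical helper: lane k is active only while i + k < n."""
    return [i + k < n for k in range(lanes)]


def vla_add(a, b, vector_bits):
    """Element-wise add written without assuming a fixed lane count."""
    # Assumption for this sketch: 32-bit elements, so lanes = bits / 32.
    lanes = vector_bits // 32
    n = len(a)
    out = [0.0] * n
    i = 0
    while i < n:
        # The predicate masks off lanes past the end of the array,
        # so no separate scalar tail loop is needed.
        pred = whilelt(i, n, lanes)
        for k, active in enumerate(pred):
            if active:
                out[i + k] = a[i + k] + b[i + k]
        i += lanes  # step by however many lanes this "hardware" has
    return out


a = [float(x) for x in range(10)]
b = [10.0 * x for x in range(10)]

# The same function, unchanged, for several simulated vector lengths.
results = {bits: vla_add(a, b, bits) for bits in (128, 256, 512, 2048)}
assert all(r == results[128] for r in results.values())
```

In real SVE2 code the predicate and the lane count come from the hardware at run time, which is why one binary can serve implementations with different vector lengths.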
# Four ways to use SVE2
How to use SVE2
So, how can you make the most of SVE2? There are several ways to write or generate SVE2 code:
1) A library that uses SVE2
2) SVE2-enabled Compiler
3) SVE2 Intrinsics in C/C++
4) SVE2 Assembly
https://community.arm.com/arm-commu...blog/posts/sve2
# Arm partners can design the vector length anywhere from 128 to 2048 bits, in 128-bit increments
[Repost] Introducing SVE2
SVE2 is a superset of SVE and Neon. SVE2 allows for more function domains in data-level parallelism. SVE2 inherits the concept, vector registers, and operation principles of SVE. SVE and SVE2 define 32 scalable vector registers.
Silicon partners can choose a suitable vector length design implementation for hardware that varies between 128 bits and 2048 bits, at 128-bit increments. The advantage of SVE and SVE2 is that only one vector instruction set uses the scalable variables.
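The range quoted above can be worked out with simple arithmetic. The sketch below lists the vector lengths implied by "128 bits to 2048 bits, at 128-bit increments" and the number of elements (lanes) each one holds. The function names `valid_vector_lengths` and `lanes` are hypothetical, made up for this illustration, and the list simply follows the quoted text.

```python
def valid_vector_lengths(min_bits=128, max_bits=2048, step=128):
    """Vector lengths allowed by the quoted text: 128..2048 in steps of 128."""
    return list(range(min_bits, max_bits + 1, step))


def lanes(vector_bits, element_bits):
    """How many elements of a given width fit in one vector register."""
    return vector_bits // element_bits


lengths = valid_vector_lengths()

# Print lanes per register for common element widths at each length.
for bits in lengths:
    per_width = {w: lanes(bits, w) for w in (8, 16, 32, 64)}
    print(bits, per_width)
```

For example, a 128-bit implementation holds four 32-bit elements per register, while a 2048-bit implementation holds sixty-four, which is why code that hard-codes a lane count cannot be shared between them.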
# SVE2 lets developers write and build a program once, then run it on different AArch64 hardware.
The SVE design concept enables developers to write and build software once, then run the same binaries on different AArch64 hardware with various SVE vector length implementations. The portability of the binaries means that developers do not have to know the vector length implementation for their system. Removing the requirement to rebuild binaries allows software to be ported more easily.
# The SVE2 instruction set can also accelerate the following applications
The main difference between SVE2 and SVE is the functional coverage of the instruction set. SVE was designed for HPC and ML applications. SVE2 extends the SVE instruction set to enable data-processing domains beyond HPC and ML. The SVE2 instruction set can also accelerate the common algorithms that are used in the following applications:
Computer vision
Multimedia
Long-Term Evolution (LTE) baseband processing
Genomics
In-memory database
Web serving
General-purpose software
https://developer.arm.com/documenta...ntroducing-SVE2