Compute Units
The compute units offers robust computing power, equivalent to the powerhouse of AI Core. Table 2-4 lists the compute units of AI Core.
Compute Unit |
Description |
---|---|
Cube |
Performs matrix computations.
The Cube Unit of different versions of Ascend AI Processors supported different data types, including fp16, fp32, uint8, int8, and int32. |
Vector |
Performs vector computations. Compared with the Cube, the Vector offers less robust computing power but more flexible computations (such as the reciprocal and square root in mathematics). All source data and target data of Vector computations must be stored in the Unified Buffer and be 32-byte aligned. The Vector Unit of different versions of Ascend AI Processors supported different data types, including fp16, fp32, int32, and int8. |
Scalar |
Computes scalar data of various types and controls the program flow. It can be regarded as a small CPU, which implements loop control, branch judgment, address and parameter computations of Cube/Vector instructions, and basic arithmetic operations for a program. |