Date: Jun 22, 2020
MLP stands for Machine Learning Processing unit. It consists of an array of up to 32 multipliers, followed by an adder tree, an accumulator, and a rounding/saturation/normalization block. It also includes two local memory blocks, one BRAM72k and one LRAM2k, which can be used on their own or together with the multipliers.
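The datapath described above (multipliers, adder tree, accumulator, saturation) can be sketched in software. This is only a behavioral illustration, not the MLP's actual implementation: the function names and the 48-bit accumulator width are assumptions made for this example.

```python
def saturate(value, bits):
    """Clamp a signed integer to the range of a `bits`-wide two's-complement word."""
    lo = -(1 << (bits - 1))
    hi = (1 << (bits - 1)) - 1
    return max(lo, min(hi, value))


def adder_tree(values):
    """Sum a list pairwise, level by level, as a hardware adder tree would."""
    level = list(values)
    while len(level) > 1:
        if len(level) % 2:
            level.append(0)  # pad odd levels so every adder has two inputs
        level = [level[i] + level[i + 1] for i in range(0, len(level), 2)]
    return level[0] if level else 0


def mac_step(acc, a, b, acc_bits=48):
    """One multiply-accumulate step: multiply lanes, reduce, accumulate, saturate.

    acc_bits=48 is an assumed accumulator width chosen for illustration only.
    """
    products = [x * y for x, y in zip(a, b)]  # the multiplier array
    return saturate(acc + adder_tree(products), acc_bits)
```

For example, `mac_step(0, [1, 2, 3], [4, 5, 6])` accumulates the dot product of the two vectors, and `saturate` keeps the running sum from wrapping around when it exceeds the accumulator range.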
Given the trade-off between energy per operation and accuracy, the number formats most commonly used in machine learning engines are FP16 and INT8. BF16, which TensorFlow supports, gives up precision in exchange for a wider range of representable values. Table 1 below lists the widest floating-point formats supported by the MLP, and Table 2 shows the value range of each.
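The range-versus-precision trade-off between FP16 and BF16 follows directly from how each splits its 16 bits. The sketch below computes the limits from the standard layouts (FP16: 5 exponent bits, 10 mantissa bits; BF16: 8 exponent bits, 7 mantissa bits). It is not a reproduction of Table 1 or Table 2, and the helper name is made up for this example.

```python
def float_format_range(exp_bits, man_bits):
    """Return (largest finite value, smallest positive normal value, epsilon)
    for an IEEE-style binary format with the given field widths."""
    bias = (1 << (exp_bits - 1)) - 1
    # The all-ones exponent is reserved for infinity and NaN.
    max_exp = (1 << exp_bits) - 2 - bias
    max_val = (2 - 2.0 ** -man_bits) * 2.0 ** max_exp
    min_normal = 2.0 ** (1 - bias)
    epsilon = 2.0 ** -man_bits  # spacing between 1.0 and the next value
    return max_val, min_normal, epsilon


fp16 = float_format_range(exp_bits=5, man_bits=10)
bf16 = float_format_range(exp_bits=8, man_bits=7)
```

With these layouts, `fp16[0]` is 65504.0, while `bf16[0]` is on the order of 10^38. In return, the BF16 epsilon (2^-7) is eight times coarser than the FP16 epsilon (2^-10).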
This move toward lower precision looks set to continue. Many studies have shown that floating-point or integer types with smaller bit widths can preserve model accuracy while greatly reducing the amount of computation. To follow this trend, the MLP also supports splitting a wide multiplier into several narrower multiplications, for both integer and floating-point formats. See Table 3 below for details.
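To see why one wide multiplier can stand in for several narrow ones, consider a generic packing trick: two unsigned 8-bit values that share a multiplicand are placed in separate bit fields of one wide operand, and a single wide multiplication yields both products. This is an illustrative sketch only. The source does not say how the MLP splits its multipliers internally, and the function name is hypothetical.

```python
def packed_dual_multiply(a1, a2, b):
    """Compute a1*b and a2*b with a single wide multiplication.

    All inputs must be unsigned 8-bit values (0..255). Each product is at
    most 255 * 255 = 65025, which fits in 16 bits, so the two results
    occupy separate 16-bit fields and cannot overlap.
    """
    for v in (a1, a2, b):
        if not 0 <= v <= 255:
            raise ValueError("operands must be unsigned 8-bit values")
    packed = (a1 << 16) | a2          # two narrow operands in one wide word
    wide_product = packed * b         # the single wide multiplication
    low = wide_product & 0xFFFF       # a2 * b
    high = wide_product >> 16         # a1 * b
    return high, low
```

Signed operands need extra correction terms, which are left out here to keep the idea visible.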
Note that bfloat16 here is the Brain Float format, whereas block float refers to block floating point, in which a block of values shares one exponent. In Block Float16 and narrower block floating-point formats, the exponent width is unchanged while the mantissa is reduced to fewer than 16 bits. The additions therefore operate on narrower values, and no floating-point multiplier is needed: integer multipliers and an adder tree are sufficient. The MLP architecture can double its computing power in these formats.
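The reason integer multipliers and an adder tree suffice for block floating point can be shown with a small sketch. Each vector is quantized to integer mantissas that share one exponent, the dot product is computed entirely in integers, and the two exponents are applied once at the end. The function names, the 8-bit mantissa width, and the rounding choice are assumptions for illustration, not the MLP's actual behavior.

```python
import math


def to_block_float(values, man_bits=8):
    """Quantize floats to signed integer mantissas sharing a single exponent."""
    max_abs = max(abs(v) for v in values)
    if max_abs == 0:
        return [0] * len(values), 0
    # frexp gives max_abs = m * 2**e with 0.5 <= m < 1; pick the shared
    # exponent so the largest value uses the full mantissa magnitude.
    _, e = math.frexp(max_abs)
    shared_exp = e - (man_bits - 1)
    limit = (1 << (man_bits - 1)) - 1
    mantissas = [
        max(-limit, min(limit, round(v / 2.0 ** shared_exp))) for v in values
    ]
    return mantissas, shared_exp


def block_dot(a, b, man_bits=8):
    """Dot product using only integer multiplies and adds on the mantissas."""
    ma, ea = to_block_float(a, man_bits)
    mb, eb = to_block_float(b, man_bits)
    int_sum = sum(x * y for x, y in zip(ma, mb))  # integer multiply-add only
    return int_sum * 2.0 ** (ea + eb)             # one exponent fix-up at the end
```

For inputs that are exactly representable, such as `block_dot([1.0, 0.5, -0.25], [2.0, 4.0, 8.0])`, the result matches the floating-point dot product. In general, small values in a block lose precision because they share the exponent of the largest value.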