SPRAD90 February   2023 AM62A3 , AM62A3-Q1 , AM62A7 , AM62A7-Q1

 

  1.   Abstract
  2.   Trademarks
  3. 1Introduction
    1. 1.1 Change Cortex-A53 Clock Frequency
  4. 2Processor Core Benchmarks
    1. 2.1 Dhrystone
  5. 3Compute and Memory System Benchmarks
    1. 3.1 Memory Bandwidth and Latency
      1. 3.1.1 LMBench
      2. 3.1.2 STREAM
      3. 3.1.3 Critical Memory Access Latency
    2. 3.2 CoreMark-Pro
    3. 3.3 Fast Fourier Transform
    4. 3.4 Cryptographic Benchmarks
  6. 4Application Benchmarks
    1. 4.1 Machine Learning Inference
  7. 5References

Fast Fourier Transform

Fast Fourier Transform (FFT) is on of the most common signal processing algorithms. This section shows FFT performance results for both Arm-Cortex-A53 and C7x processors.#GUID-FDA244DE-FEFD-4276-96A7-E5B72B5E2408/TABLE_FYY_HBR_L4B shows a 1024-point single precision floating point complex FFT execution time on Arm-Cortex-A53 and C7x DSP. The benchmark on Arm-Cortex-A53 uses the implementation from Ne10 library, which leverages the Advanced SIMD or NEON acceleration of Cortex-A53. This library is not included in the SDK but it can be downloaded from the official Ne10 repository. On the C7x, the FFT library available within the SDK was employed to benchmark the performance. The code executed on bare-metal with warm L2 cache.

Table 3-6 NE10 CFFT Benchmark
Arm-Cortex-A53
at 1.25 GHz (single thread / core)
Arm-Cortex-A53 at 1.4 GHz (single thread / core)C7x at 1.0 GHz
1024-point Complex FFT Execution Time21.7 µs19.4 µs1.8 µs