SPRADF0A November 2023 – November 2023 AM263P4 , AM263P4-Q1
A direct comparison of embedded flash MCU devices to OptiFlash devices is not relevant as overall architectures are different. Yet, as mentioned previously, both require application developers to execute time-critical code from on-chip memory to meet necessary processing timelines. To show how this balance of flash and on-chip memory performance can be achieved, TI has developed a set of system KPIs that measure OptiFlash performance and its constituent accelerators and tools. Following KPIs are being measured using application-1 that emulates a poorly cached AutoSAR application, and Application-2, which is a real-world networking example with Lwip client-server + Mbed TLS use case.
Test | KPI | Without OptiFlash | OptiFlash Enabled | App. Use case |
---|---|---|---|---|
XIP | Basic (without Safety Security) | CPU DMIPs loss of 2-3x observed | DMIPs degradation limited to 1.1x with 128kB RL2. | App-1 and App-2 |
W/ Safety and Security | DMIPs degradation limited to 1.4x with Hardware accelerators for in-line ECC and OTFA. | App-1 and App-2 |
Table 2-2 show the impact of the configurable RL2 cache. A cache size larger than 128KB did not show further improvement in XIP performance. The optimal RL2 cache size also eliminated the difference in processing timelines with and without security and safety. Note that degradation is in comparison of internal RAM. For example, when L2 cache was disabled, application performed 2.4 times worst when run from external flash in comparison to internal RAM.
Test/ Cache Size Used (kB) | Performance Degradation With Safety and Security | Performance Degradation Without Safety and Security | App. Use Case | |
---|---|---|---|---|
RL2 access size | 0 | 2.4x | 2.2x | App-1 |
16 | 2.2x | 1.9x | ||
32 | 1.9x | 1.7x | ||
128 | 1.1x | 1.1x |
The Smart Placement tool was used to analyze the application and place time-sensitive code or data in TCM, OCRAM, or flash. Table 2-3 showed that the Smart Placement Tool enabled 19% application execution time improvement when utilized for both code and data.
Test/ TCM Size Used (kB) | Data vs. Code | Execution Time (μS) | % Improvement With Smart Placement | App Use Case | |
---|---|---|---|---|---|
Execution time improved with Smart Placement | 0 | n/a | 27,583 | N/A | App 1 |
64 | code | 25,342 | 9% | ||
64 | code + data | 22,537 | 19% |
In another OptiFlash XIP test, an EtherNet/IP protocol application was implemented with XIP mode and then with XIP using the Smart Placement tool. As can be seen, the CPU loading was reduced and the worst-case jitter was notably improved with Smart Placement.
Test | Max. CPU Loading (%) | Worst case jitter | App. Use case |
---|---|---|---|
XIP | 98.91 | 115.7 | EtherNet/IP protocol application |
XIP + Smart Placement | 85.97 (13% better) | 68 (40% better) |
The OptiShare technology was used to optimize code sharing among R5F cores for an IPC application on the MCU+ SDK. When using OptiShare, the code size was reduced by 10%.
Test | Code Size (kB) | Memory Footprint Optimized (%) | App. Use Case |
---|---|---|---|
Code size reduction with OptiShare | 73 | ~10 (lower code size) | SDK Out-of-box IPC application |