SPRUIG8 User guide

SPRUIG8J January 2018 – March 2024

1
Read This First
1. About This Manual
2. Notational Conventions
3. Related Documentation
4. Related Documentation From Texas Instruments
5. Trademarks
1 Introduction to the Software Development Tools
1. 1.1 Software Development Tools Overview
2. 1.2 Compiler Interface
3. 1.3 ANSI/ISO Standard
4. 1.4 Output Files
2 Getting Started with the Code Generation Tools
1. 2.1 How Code Composer Studio Projects Use the Compiler
2. 2.2 Compiling from the Command Line
3 Using the C/C++ Compiler
1. 3.1 About the Compiler
2. 3.2 Invoking the C/C++ Compiler
3. 3.3 Changing the Compiler's Behavior with Options
4. 3.4 Controlling the Compiler Through Environment Variables
  1. 3.4.1 Setting Default Compiler Options (C7X_C_OPTION)
  2. 3.4.2 Naming One or More Alternate Directories (C7X_C_DIR)
5. 3.5 Controlling the Preprocessor
6. 3.6 Passing Arguments to main()
7. 3.7 Understanding Diagnostic Messages
  1. 3.7.1 Controlling Diagnostic Messages
  2. 3.7.2 How You Can Use Diagnostic Suppression Options
8. 3.8 Other Messages
9. 3.9 Generating a Raw Listing File (--gen_preprocessor_listing Option)
10. 3.10 Using Inline Function Expansion
11. 3.11 Using Interlist
12. 3.12 About the Application Binary Interface
13. 3.13 Enabling Entry Hook and Exit Hook Functions
4 Optimizing Your Code
1. 4.1 Invoking Optimization
2. 4.2 Controlling Code Size Versus Speed
3. 4.3 Performing File-Level Optimization (--opt_level=3 option)
  1. 4.3.1 Creating an Optimization Information File (--gen_opt_info Option)
4. 4.4 Program-Level Optimization (--program_level_compile and --opt_level=3 options)
  1. 4.4.1 Controlling Program-Level Optimization (--call_assumptions Option)
5. 4.5 Automatic Inline Expansion (--auto_inline Option)
6. 4.6 Link-Time Optimization (--opt_level=4 Option)
  1. 4.6.1 Option Handling
  2. 4.6.2 Incompatible Types
7. 4.7 Optimizing Software Pipelining
8. 4.8 Redundant Loops
9. 4.9 Indicating Whether Certain Aliasing Techniques Are Used
  1. 4.9.1 Use the --aliased_variables Option When Certain Aliases are Used
10. 4.10 Prevent Reordering of Associative Floating-Point Operations
11. 4.11 Using Performance Advice to Optimize Code
  1. 4.11.1 Advice #35000: Use restrict to improve loop performance
12. 4.12 Using the Interlist Feature With Optimization
13. 4.13 Debugging and Profiling Optimized Code
  1. 4.13.1 Profiling Optimized Code
14. 4.14 What Kind of Optimization Is Being Performed?
15. 4.15 Streaming Engine and Streaming Address Generator
16. 4.16 Nested Loop Controller (NLC)
  1. 4.16.1 Obstacles That May Inhibit Use of NLC
5 C/C++ Language Implementation
1. 5.1 Characteristics of C7000 C
  1. 5.1.1 Implementation-Defined Behavior
2. 5.2 Characteristics of C7000 C++
3. 5.3 Data Types
  1. 5.3.1 Size of Enum Types
  2. 5.3.2 Vector Data Types
4. 5.4 File Encodings and Character Sets
5. 5.5 Keywords
6. 5.6 C++ Exception Handling
7. 5.7 Register Variables and Parameters
8. 5.8 Pragma Directives
9. 5.9 The _Pragma Operator
10. 5.10 Application Binary Interface
11. 5.11 Object File Symbol Naming Conventions (Linknames)
12. 5.12 Changing the ANSI/ISO C/C++ Language Mode
13. 5.13 GNU and Clang Language Extensions
14. 5.14 Operations and Functions for Vector Data Types
15. 5.15 C7000 Intrinsics
16. 5.16 C7000 Scalable Vector Programming
6 Run-Time Environment
1. 6.1 Memory
2. 6.2 Object Representation
3. 6.3 Register Conventions
4. 6.4 Function Structure and Calling Conventions
5. 6.5 Accessing Linker Symbols in C and C++
6. 6.6 Run-Time-Support Arithmetic Routines
7. 6.7 System Initialization
  1. 6.7.1 Boot Hook Functions for System Pre-Initialization
  2. 6.7.2 Automatic Initialization of Variables
7 Using Run-Time-Support Functions and Building Libraries
1. 7.1 C and C++ Run-Time Support Libraries
2. 7.2 The C I/O Functions
  1. 7.2.1 High-Level I/O Functions
    1. 7.2.1.1 Formatting and the Format Conversion Buffer
  2. 7.2.2 Overview of Low-Level I/O Implementation
    1. open
    2. close
    3. read
    4. write
    5. lseek
    6. unlink
    7. rename
  3. 7.2.3 Device-Driver Level I/O Functions
    1. DEV_open
    2. DEV_close
    3. DEV_read
    4. DEV_write
    5. DEV_lseek
    6. DEV_unlink
    7. DEV_rename
  4. 7.2.4 Adding a User-Defined Device Driver for C I/O
    1. 7.2.4.1 Mapping Default Streams to Device
  5. 7.2.5 The device Prefix
3. 7.3 Handling Reentrancy (_register_lock() and _register_unlock() Functions)
4. 7.4 Library-Build Process
8 Introduction to Object Modules
1. 8.1 Object File Format Specifications
2. 8.2 Executable Object Files
3. 8.3 Introduction to Sections
  1. 8.3.1 Special Section Names
4. 8.4 How the Linker Handles Sections
  1. 8.4.1 Combining Input Sections
  2. 8.4.2 Placing Sections
5. 8.5 Symbols
  1. 8.5.1 Local Symbols
  2. 8.5.2 Weak Symbols
6. 8.6 Loading a Program
9 Program Loading and Running
1. 9.1 Loading
2. 9.2 Entry Point
3. 9.3 Run-Time Initialization
4. 9.4 Arguments to main
5. 9.5 Run-Time Relocation
6. 9.6 Additional Information
10Archiver Description
1. 10.1 Archiver Overview
2. 10.2 The Archiver's Role in the Software Development Flow
3. 10.3 Invoking the Archiver
4. 10.4 Archiver Examples
5. 10.5 Library Information Archiver Description
11Linking C/C++ Code
1. 11.1 Invoking the Linker Through the Compiler (-z Option)
2. 11.2 Linker Code Optimizations
3. 11.3 Controlling the Linking Process
12Linker Description
1. 12.1 Linker Overview
2. 12.2 The Linker's Role in the Software Development Flow
3. 12.3 Invoking the Linker
4. 12.4 Linker Options
5. 12.5 Linker Command Files
6. 12.6 Linker Symbols
7. 12.7 Default Placement Algorithm
  1. 12.7.1 How the Allocation Algorithm Creates Output Sections
  2. 12.7.2 Reducing Memory Fragmentation
8. 12.8 Using Linker-Generated Copy Tables
9. 12.9 Partial (Incremental) Linking
10. 12.10 Linking C/C++ Code
11. 12.11 Linker Example
13Object File Utilities
1. 13.1 Invoking the Object File Display Utility
2. 13.2 Invoking the Disassembler
3. 13.3 Invoking the Name Utility
4. 13.4 Invoking the Strip Utility
14C++ Name Demangler
1. 14.1 Invoking the C++ Name Demangler
2. 14.2 Sample Usage of the C++ Name Demangler
A XML Link Information File Description
1. A.1 XML Information File Element Types
2. A.2 Document Elements
  1. A.2.1 Header Elements
  2. A.2.2 Input File List
  3. A.2.3 Object Component List
  4. A.2.4 Logical Group List
  5. A.2.5 Placement Map
  6. A.2.6 Far Call Trampoline List
  7. A.2.7 Symbol Table
B Unsupported Tools and Features
1. B.1 List of Unsupported Tools and Features
C Glossary
1. 528
D Revision History

4.14.12 Unroll-and-jam

The compiler can unroll an outer loop that encloses an innermost loop. This transformation makes an extra iteration of the outer loop, and as a result, there is another copy of the inner loop. The second "inner loop" is then "fused" back into the original inner loop. As a result, the fused inner loop performs two iterations of the outer loop for each execution of the inner loop. This transformation is called "unroll-and-jam" and can increase available parallelism and function unit utilization.

The compiler can perform unroll-and-jam if the compiler detects that there is not sufficient parallelism available in the inner loop to effectively utilize the computational resources on the CPU.

This type of optimization is performed if both the --opt_for_speed (-mf) option is set to level 3 or higher (level 4 is the default) and the --opt_level (-o) option is set to any level other than "off" (off is the default if --vectypes=off). This optimization can improve performance, but results in increased code size and reduced debuggability.