Invention Grant
US08171464B2 Efficient code generation using loop peeling for SIMD loop code with multile misaligned statements 失效
使用多重不对齐语句的SIMD循环码循环剥离进行有效的代码生成

Efficient code generation using loop peeling for SIMD loop code with multile misaligned statements
Abstract:
An approach is provided for vectorizing misaligned references in compiled code for SIMD architectures that support only aligned loads and stores. In this framework, a loop is first simdized as if the memory unit imposes no alignment constraints. The compiler then inserts data reorganization operations to satisfy the actual alignment requirements of the hardware. Finally, the code generation algorithm generates SIMD codes based on the data reorganization graph, addressing realistic issues such as runtime alignments, unknown loop bounds, residual iteration counts, and multiple statements with arbitrary alignment combinations. Loop peeling is used to reduce the computational overhead associated with misaligned data. A loop prologue and epilogue are peeled from individual iterations in the simdized loop, and vector-splicing instructions are applied to the peeled iterations, while the steady-state loop body incurs no additional computational overhead.
Information query
Patent Agency Ranking
0/0