GCC
Latest release: 4.7, March 2012
http://gcc.gnu.org

Architecture
  Generate instructions specific to Abu Dhabi  -march=bdver2

Optimization Levels
  Disable all optimizations (default)  -O0
  Local optimizations                  -O1
  Global optimizations                 -O2
  Additional aggressive optimizations  -O3
  Maximize performance                 -Ofast

Additional Optimizations
  Schedule instructions                    -fschedule-insns -fschedule-insns2 -fsched-pressure
  AVX tuning                               -mprefer-avx128
  Enable unrolling                         -funroll-all-loops
  Disable prefetch instructions for loops  -fno-prefetch-loop-arrays
  Inline string operations                 -minline-all-stringops
  Link to ACML 5.1.0                       -L/opt/acml5.1.0/gfortran64_fma4/lib -lacml
  OpenMP                                   -fopenmp
  Profile guided optimization              -fprofile-generate, -fprofile-use
  Turn off partial redundancy elimination  -fno-tree-pre
  Vectorization                            -ftree-vectorize

Open64

Optimization Levels
  Disable all optimizations  -O0
  Local optimizations        -O1
  Global optimizations       -O2

Additional Optimizations
  Autoparallelization                                       -apo
  Feedback directed optimization                            -fb-create, -fb-opt
  Huge pages                                                -HP
  Interprocedural Analysis and Optimizations                -ipa
  Link to ACML 5.1.0                                        -L/opt/acml5.1.0/open64_64_fma4/lib -lacml
  Loop nest optimizations: vectorization, fission, fusion  -LNO:simd=n -LNO:fission=n -LNO:fusion=n
  Multicore scalability                                     -mso
  OpenMP                                                    -mp
  Prefetch (disabled by default)                            -LNO:prefetch -LNO:prefetch_ahead
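As a rough illustration of how the GCC options listed above combine on one command line, the sketch below assembles a single-pass build and a two-pass profile-guided build. The file names `app.c` and `app` are hypothetical, and the particular mix of flags is an assumed starting point rather than a recommendation from this guide. The script only prints the commands so they can be reviewed before being run.

```shell
#!/bin/sh
# Hypothetical file names; replace with your own source and output.
SRC="app.c"
OUT="app"

# Options taken from the GCC tables in this guide.
ARCH="-march=bdver2"
LEVEL="-O3"
EXTRA="-funroll-all-loops -mprefer-avx128"

# Single-pass optimized build.
BUILD_CMD="gcc $LEVEL $ARCH $EXTRA -o $OUT $SRC"

# Two-pass profile-guided build: instrument, run a training workload, rebuild.
PGO_GEN_CMD="gcc $LEVEL $ARCH -fprofile-generate -o $OUT $SRC"
PGO_USE_CMD="gcc $LEVEL $ARCH -fprofile-use -o $OUT $SRC"

# Print the commands for review instead of executing them.
echo "$BUILD_CMD"
echo "$PGO_GEN_CMD"
echo "./$OUT"
echo "$PGO_USE_CMD"
```

Between the two profile-guided steps, the instrumented binary has to be run on a representative workload so that profile data exists for the second compile. Code built with `-march=bdver2` may use instructions that other processors do not support, so the resulting binary should be run on the target system.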
Advanced Micro Devices One AMD Place P.O. Box 3453 Sunnyvale, CA 94088-3453
© 2012 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo, AMD Opteron, and combinations thereof are trademarks of Advanced Micro Devices, Inc. Linux is a registered trademark of Linus Torvalds. Other names are for informational purposes only and may be trademarks of their respective owners.
AMD Opteron processors code named Abu Dhabi and AMD Opteron processors code named Seoul
PGI
Latest release: 12.3, March 2012
http://www.pgroup.com

Architecture
  Generate instructions that run on Abu Dhabi  -tp bulldozer

Optimization Levels
  Disable all optimizations                           -O0
  Local optimization                                  -O1
  Global optimization                                 -O2
  Aggressive global optimization                      -O3
  Hoist guarded invariant floating point expressions  -O4
  Maximize performance                                -fast

Additional Optimizations
  Huge pages                    -Msmartalloc=huge
  Autoparallelize loops         -Mconcur
  Vectorization                 -Mvect
  Interprocedural optimization  -Mipa=fast,inline
  Link to ACML 5.1.0            -L/opt/acml5.1.0/pgi64_fma4/lib -lacml
  OpenMP                        -mp
  Prefetch                      -Mvect=prefetch
  Profile guided optimization   -Mpfi, -Mpfo
  Unrolling                     -Munroll

Intel

Optimization Levels
  Disable all optimizations                    -O0
  Speed optimization without code growth       -O1
  Enable optimization including vectorization  -O2
  Aggressive optimization                      -O3
  Maximize performance                         -fast

Additional Optimizations
  Aggressive unrolling                         -unroll-aggressive
  Disable improved precision floating divides  -no-prec-div
  Enable vectorization                         -vec
  Interprocedural optimization                 -ipo
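To show how the PGI options listed above might be combined, the sketch below builds two example command lines. It assumes the PGI C driver `pgcc`, which this guide does not name, and uses hypothetical file names `app.c` and `app`; the choice of flags is illustrative only. The script prints the commands rather than executing them.

```shell
#!/bin/sh
# Hypothetical file names; replace with your own source and output.
SRC="app.c"
OUT="app"

# Options taken from the PGI tables in this guide.
TARGET="-tp bulldozer"
LEVEL="-fast"

# Optimized build with interprocedural optimization and inlining.
PGI_CMD="pgcc $TARGET $LEVEL -Mipa=fast,inline -o $OUT $SRC"

# Same optimization level with OpenMP enabled.
PGI_OMP_CMD="pgcc $TARGET $LEVEL -mp -o $OUT $SRC"

# Print the commands for review instead of executing them.
echo "$PGI_CMD"
echo "$PGI_OMP_CMD"
```

For Fortran sources, the corresponding PGI Fortran driver would take the same options; the commands here are a starting point to adapt, not a tuned configuration.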