SGI Techpubs Library

Origin 2000 and Onyx2 Performance Tuning and Optimization Guide
(document number: 007-3430-003 / published: 2001-08-02)

Origin2000 and Onyx2® Performance Tuning and Optimization Guide

Document Number: 007-3430-003

Front Matter

List of Figures
List of Examples
List of Tables


Table of Contents

About This Guide
Who Can Benefit from This Guide
What the Guide Contains
Related Documents
Text Conventions
Obtaining Publications
Reader Comments

1. Understanding SN0 Architecture
Understanding Scalable Multiprocessor Memory
Understanding Scalable Shared Memory
Understanding MIPS R10000 Architecture
Summary

2. SN0 Memory Management
Dealing With Nonuniform Access Time
IRIX Memory Locality Management
Achieving Good Performance in a NUMA System
Summary

3. Tuning for a Single Process
Getting the Right Answers
Exploiting Existing Tuned Code
Summary

4. Profiling and Analyzing Program Behavior
Profiling Tools
Analyzing Performance with perfex
Using SpeedShop
Using Address Space Profiling
Summary

5. Using Basic Compiler Optimizations
Understanding Compiler Options
Exploiting Software Pipelining
Informing the Compiler
Exploiting Interprocedural Analysis
Summary

6. Optimizing Cache Utilization
Understanding the Levels of the Memory Hierarchy
Identifying Cache Problems with Perfex and SpeedShop
Using Other Cache Techniques
Summary

7. Using Loop Nest Optimization
Understanding Loop Nest Optimizations
Using Outer Loop Unrolling
Using Loop Interchange
Controlling Cache Blocking
Using Loop Fusion and Fission
Using Prefetching
Using Array Padding
Using Gather-Scatter and Vector Intrinsics
Summary

8. Tuning for Parallel Processing
Understanding Parallel Speedup and Amdahl's Law
Compiling Serial Code for Parallel Execution
Explicit Models of Parallel Computation
Tuning Parallel Code for SN0
Scalability and Data Placement
Using Data Distribution Directives
Non-MP Library Programs and Dplace
Summary

A. Bentley's Rules Updated
Space-for-Time Rules
Time-for-Space Rules
Loop Rules
Logic Rules
Procedure Design Rules
Expression Rules

B. R10000 Counter Event Types
Counter Events In Detail

C. Useful Scripts and Code
Program adi2

Glossary

Index



Copyright © 1993-2007 SGI, Inc. All rights reserved.