Tools

Performance Portable C++

By Jeff Keasler, May 07, 2008

Performance portability means that code can achieve good performance across a range of computer architectures while maintaining a single body of source code.

Jeff is a computer scientist at Lawrence Livermore National Laboratory where he contributes to several software projects managed through the ASC program.

Programmers have two basic ways of organizing arrays of data; see Figure 1. The performance of each choice can vary greatly as code is ported from machine to machine and compiler to compiler.

It's easy to switch between the Array-like and Struct-like implementations in Figure 1 by hiding the array details behind a class API. Listing One shows how a coordinate array is implemented as a performance portable Point class. There are two important features of the Point class implementation:

Methods are inlined.
Methods return direct references to the underlying data.

Together, these two features let almost all compilers efficiently optimize most (if not all) class overhead, especially when interprocedural analysis has been enabled in the compiler optimization flags.

If you use classes having the above form, you can quickly switch between array layouts as you port code. The easiest way to do this is to create a configuration header file with system-specific layout choices, and #include that configuration file at the top of each array class header file.

If you don't hide the array implementation as I describe here, you can end up completely rewriting your software when switching from one form of array layout to the other.

<b></b>
(a) 

double x[10000] ;         
double y[10000] ; 
double z[10000] ;         

<b>(b)</b>

struct {           
  double x,y,z ;
} point[10000] ;

Figure 1: (a) Array-like, (b) Struct-like.

  
#define ML_STRUCT 0
#define ML_ARRAY  1

#if POINT_MEM == ML_ARRAY
class Point {
public:
   Point(const int size) : m_x(size), m_y(size) {}
   inline double &x(const int idx) { return m_x[idx] ; }
   inline double &y(const int idx) { return m_y[idx] ; }
private:
  Point() ;
  std::vector<double> m_x ;
  std::vector<double> m_y ;
} ;
#else /* ML_STRUCT */
class Point {
public:
   Point(const int size) : m_p(size) {}
   inline double &x(const int idx) { return m_p[idx].x ; }
   inline double &y(const int idx) { return m_p[idx].y ; }
private:
  struct Coord { double x, y ; } ;
  Point() ;
  std::vector<Coord> m_p ;
} ;
#endif

Listing One

1 2 3 4 5 Next

More Insights

INFO-LINK


	To upload an avatar photo, first complete your Disqus profile. \| View the list of supported HTML tags you can use to style comments. \| Please read our commenting policy.

Tools

Performance Portable C++

Related Reading

More Insights

Currently we allow the following HTML tags in comments:

Single tags

Matching tags

Tools Recent Articles

Most Popular

This month's Dr. Dobb's Journal

Upcoming Events

Featured Reports

Featured Whitepapers

Most Recent Premium Content

Tools

Performance Portable C++

Related Reading

News

Commentary

Slideshow

Video

Most Popular

More Insights

White Papers

Reports

Webcasts

Currently we allow the following HTML tags in comments:

Single tags

Matching tags

Tools Recent Articles

Most Popular

This month's Dr. Dobb's Journal

Upcoming Events

Featured Reports

Featured Whitepapers

Most Recent Premium Content