Intel Cluster Studio XE 2012 and Hybrid MPI/OpenMP Applications: Optimize Up To the Node Level

Intel Cluster Studio XE 2012 provides an MPI hybrid development suite that targets developers on high-performance clusters.

Rebuilding the Tower of Hanoi

Sometimes seemingly serial computations may be coded with parallel algorithms if you can figure out how to present the final results in the proper order.

The CoreInfo 3.03 Utility Provides Detailed Intel Instruction Sets

The newest version of CoreInfo provides a detailed summary of all the instructions sets that the CPUs found on the system support.

Linked Lists Are, Like, So Last Century

Give up your sequential linked lists in favor of tree structures for all of your unordered data when processing that data in parallel.

Seeing the Light with Backtracking

Use the processing speed of computers to generate and explore different possible configurations to solve Akari puzzles.

Boost Performance for Your Android Native Code

The Android NDK is a toolset that lets you embed components that make use of native code in your Android applications.

Calling IPP Functions from C# Code

Intel Integrated Performance Primitives, also known as IPP, is a library of highly optimized math software functions for digital media and data-processing applications.

Three Parallel Backtracking Designs

Clay presents three design versions of the parallel NQueens() function.

Retrieving Detailed Information About Your Intel Multicore CPU Features with Intel IPP

There are several ways of retrieving detailed information about your Intel multicore CPU features. I discovered two functions in IPP that provide most of the information I usually need.

Implicit CPU Vectorization with Intel OpenCL SDK 1.5

Intel OpenCL SDK 1.5 includes important performance enhancements specifically designed for the latest Intel 2nd Generation Core Processors and outlines a path for future performance improvements.

Welcome to the Parallel Jungle!

Herb Sutter dives into the repercussions of parallel's reach from mobile devices, to the desktop, to clusters, and — at the highest level of granularity — to the cloud. This welter of different parallel implementations presents significant challenges for programming. The free lunch of sequential programming is well and truly over.

Low-Cost, Low-Power Servers Begin Their Ascent

As the cloud is quickly demonstrating, small, seemingly underpowered servers are quite enough for most development tasks. Buying one of these low-cost, low-power units makes clear how much developer-friendlier they are than datacenter solutions

Booting an Intel Architecture System, Part II: Advanced Initialization

Booting an Intel Architecture System, Part I: Early Initialization

  • NEW Research!! State of Parallel Programming 2012
    The Parallel Programming Landscape: Multicore has gone mainstream — but are developers ready?
    A recent survey of Dr. Dobb's readers reveals how much experience software developers have with parallel programming and what challenges they face in developing software that exploits multicore architectures.
    Download the Research Report Now
  • The ROI from Optimizing Software Performance with Intel® Parallel Studio XE
    Intel Parallel Studio XE delivers ROI solutions to development organizations. This comprehensive tool offering for the entire software development life cycle helps improve productivity, enhance quality, increase performance and reduce overall costs.
  • Simplifying High Performance
    Intel introduced Intel® Parallel Studio 2011, a tool suite for Microsoft™ Windows™ Visual Studio™ C++ developers, with the singular objective of providing the essential performance tools for application development on Intel® Architecture. These tools provide significant innovation, and enable unprecedented developer productivity when building, debugging, and tuning parallel applications for multicore. With the introduction of Intel® Parallel Building Blocks (Intel® PBB), developers have methods to introduce and extend parallelism in C/C++ applications for higher performance and efficiencies.
  • Automatic Parallelism with the Intel® Math Kernel Library (Intel® MKL)
    The Intel® Math Kernel Library (Intel® MKL) provides software developers optimized and automatically parallelized mathematical library routines. The primary advantage of Intel MKL is that it makes the highest performance levels easily accessible to software developers. Within the software, we do automated dispatching to amortize the value of the underlying hardware features.
  • The Answers to Cracking the Parallelism Puzzle
    Intel® Threading Building Blocks (Intel® TBB) has grown fantastically popular with C++ developers over the past five years. It has been ported to many platforms and used in many applications, including recently in well-known Adobe™ products. This article introduces an expanded family of parallel models, with Intel TBB at the very center. The author introduces the complementary models that expand upon what Intel TBB can do in a compatible and complementary manner that makes Intel® Parallel Building Blocks (Intel® PBB) well worth understanding and using.
  • The World's First Sudoku™ 'Thirty-Niner'
    Lars Peters Endresen and Håvard Graff, two talented engineers from Oslo, share with us how they created what may be the world's first Sudoku puzzle that has 39 clues.
  • Parallelizing Intel® Integrated Performance Primitive Functions + Envivio Case Study
    Intel® Integrated Performance Primitives (Intel® IPP) is an extensive library of multicore-ready, highly optimized software functions for multimedia, data processing, and communications applications. Intel® IPP is included in Intel® Composer XE, which is a component of Intel® Parallel Studio XE; and Intel® C++ Composer XE, which is a component of Intel® C++ Studio XE.
  • Intel® Cilk™ Plus: A C/C++ Language Extension for Parallel Programming (excerpt from PUM #7)
    Intel® Cilk™ Plus adds fine-grained task parallelism support to C and C++, making it easy to add parallelism to both new and existing software, and efficiently exploit multiple processors.
  • Using Serial Modeling Tools to Tame the Parallel Beast + Creative Assembly Case Study
    Intel® Parallel Advisor is a new tool for developers who need to add parallelism to existing serial code, and it makes it easier to parallelize. It increases the programmer's return on investment by focusing effort where it matters. It helps the programmer identify problems early, so that little effort is wasted on unproductive directions. The key to Intel Parallel Advisor's success is its reliance on a well-proven method of introducing parallelism: serial modeling.
PPoPP '12: ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
  • February 25-29
    PPoPP '12 will be held in New Orleans, Louisiana.
4th Annual Cloud Computing Summit 2012 HPC 2012: 20th High Performance Computing Symposium
  • March 26-29
    HPC 2012 will be held in Orlando, Florida.
HIPS: International Workshop on High-Level Parallel Programming Models and Supportive Environments
  • May 21-25
    HIPS will be held in Shanghai, China.
26th International Conference on Supercomputing SC12
  • November 10-16
    SC12 will be held in Salt Lake City, Utah.

Optimization Notice

Intel® compilers, associated libraries and associated development tools may include or utilize options that optimize for instruction sets that are available in both Intel® and non-Intel microprocessors (for example SIMD instruction sets), but do not optimize equally for non-Intel microprocessors.  In addition, certain compiler options for Intel compilers, including some that are not specific to Intel micro-architecture, are reserved for Intel microprocessors.  For a detailed description of Intel compiler options, including the instruction sets and specific microprocessors they implicate, please refer to the "Intel® Compiler User and Reference Guides" under "Compiler Options."  Many library routines that are part of Intel® compiler products are more highly optimized for Intel microprocessors than for other microprocessors.  While the compilers and libraries in Intel® compiler products offer optimizations for both Intel and Intel-compatible microprocessors, depending on the options you select, your code and other factors, you likely will get extra performance on Intel microprocessors.

Intel® compilers, associated libraries and associated development tools may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors.  These optimizations include Intel® Streaming SIMD Extensions 2 (Intel® SSE2), Intel® Streaming SIMD Extensions 3 (Intel® SSE3), and Supplemental Streaming SIMD Extensions 3 (Intel® SSSE3) instruction sets and other optimizations.  Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel.  Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors.

While Intel believes our compilers and libraries are excellent choices to assist in obtaining the best performance on Intel® and non-Intel microprocessors, Intel recommends that you evaluate other compilers and libraries to determine which best meet your requirements.  We hope to win your business by striving to offer the best performance of any compiler or library; please let us know if you find we do not.

Notice revision #20101101

Enabling People and Organizations to Harness the Transformative Power of Technology