This paper from Lawrence Berkeley National Laboratory introduces the Roofline model as a way to judge the suitability of your kernel to any given architecture.
|Paper Title||Roofline: An Insightful Visual Performance Model for Floating-Point Programs and Multicore Architectures|
|Author||Samuel Williams, Andrew Waterman and David Patterson|
The operational intensity introduced in the Roofline Model -- operations per byte of DRAM traffic -- is a simple model that can be used to determine what architectures are the best match for a given computational kernel or, conversely, in what ways to optimize a kernel so it performs better on a given architecture. Operational intensity is not typically provided directly by performance tools but can be estimated from other readily available measurements.