Abstract

FFTW: An Adaptive Software Architecture for the FFT

In the Proceedings of the 1998 ICASSP conference (vol. 3, pp. 1381-1384)
[Full text]

Abstract

FFT literature has been mostly concerned with minimizing the number of floating-point operations performed by an algorithm. Unfortunately, on present-day microprocessors this measure is far less important than it used to be, and interactions with the processor pipeline and the memory hierarchy have a larger impact on performance. Consequently, one must know the details of a computer architecture in order to design a fast algorithm. In this paper, we propose an adaptive FFT program that tunes the computation automatically for any particular hardware. We compared our program, called FFTW, with over 40 implementations of the FFT on 7 machines. Our tests show that FFTW's self-optimizing approach usually yields significantly better performance than all other publicly available software. FFTW also compares favorably with machine-specific, vendor-optimized libraries.

This page maintained by: Matteo Frigo. Last updated: Mon Sep 16 16:14:09 2019