A self-sorting in-place prime factor real/half-complex FFT algorithm

Abstract A new fast Fourier transform algorithm for real or half-complex (conjugate-symmetric) input data is described. Based on the decomposition of N (the length of the transform) into mutually prime factors, the algorithm performs transforms in-place and without pre- or post-reordering of the data. With large-scale scientific computing in mind, the emphasis is on reducing the number of additions required. Compared with the best available algorithm based on specializing the conventional FFT to the real/half-complex case, the number of multiplications is also reduced by about 50%. On the Cray X-MP, a transform package based on the new algorithm runs up to 20% faster than the previous fastest available routines, besides halving the storage requirements.