The natural architectures are architectures, derived from the signal graph of the corresponding algorithm. That is why they are considered to be the most appropriate architectures for parallel realization of this algorithm. For Fast Fourier Transform algorithm (FFT) two types of natural architectures are known – the direct and the indirect hypercube. The direct hypercube has been investigated and analyzed a long time ago. The development of the concept of Indirect Hypercube, although quite old, is too difficult, controversal and still unfinished. Fast Hartley transform (FHT)/Real-valued Fast Fourier transform (RFFT) algorithms are important Fourier-related transforms, because they lower twice the operational and memory requirements when the input data is real-valued. These types of algorithms, however, have an irregular computational structure, which makes their parallel implementation a very difficult task. The aim of this paper is, based on the results achieved so far, to present further development of the concept Indirect Hypercube. A method of parametric synthesis of an indirect hypercube is described as a model of parallel FFT algorithms of a transpose type with different granularity/radix. This method is generalized for relevant RFFT/FHT and FCT algorithms. Two types of SIMD array architectures are described (radix-2 and radix-4), based on the indirect hypercube concept. These architectures are implemented as fast FFT/RFFT/FHT processors for real time applications. The performance estimation, as well as the estimation of resource utilization is carried out.
1. [Hartley-1] Hartley, R. V. L. A More Symmetrical Fourier Analysis Applied to Transmission Problems. Proc. IRE 30, 1942, 144-150.
2. [Danielson-1] Danielson, G. C. and C. Lanczos. Some Improvements in Practical Fourier Analysis and Their Application to X-ray Scattering From Liquids. J. Franklin Inst., 233, April 1942, 365-380, 435-452.
3. [Cooley-1] Cooley, J. W., J. W. Tukey. “An Algorithm for the Machine Calculation of Complexes Fourier Series. – Math. Comput., 19, 1965, No. 90.
4. [Singleton-1] Singleton, C. A. Method for Computing the Fast Fourier Transform with Auxiliary Memory and Limited High-Speed Storage. – IEEE Transactions on Audio and Electroacoustics, AU-15, June 1967, 91-98.
5. [Bergland-1] Bergland, Glenn D. A Fast Fourier Transform Algorithm for Real-valued Series. – Communications of the ACM, 11, Oct. 1968, 10, 703-710.
6. [Stone-1] Stone, H. S. Parallel Processing with the Perfect Shuffle. – IEEE Trans. Computers, C-20, 1971, 153-161.
7. [Ahmed-1] Ahmed, N., T. Natarajan and K. R. Rao. Discrete Cosine Transform. – IEEE Transactions on Computers, Jan. 1974, 90-93.
8. [Pease-1] Pease, M. C. The Indirect Binary n-cube Microprocessor Array. – IEEE Trans. on Computers, May 1977.
9. [Bracewell-1] Bracewell, R. N. Discrete Hartley Transform. – J. Opt. Soc. Am., 73, 1983, 12, 1832-1835.
10. [Bracewell-2] Bracewell, R. N. The Fast Hartley Transform. – Proc. IEEE, 72, 1984, 8, 1010–1018.
11. [Ulman-1] Ulman, Ronald F. An Algorithm for Fast Hartley Transform. Technical Report, Stanford University, 1984.
12. [Bhuyan-1] Bhuyan, L. Interconnection Networks for Parallel and Distributed Processing. – Computer, June 1987, 9-12.
13. [Sorensen-1] Sorensen, H. V., D. L. Jones, M. T. Heideman, C. S. Burrus. Real-valued Fast Fourier Transform Algorithms. – IEEE Trans. Acoust. Speech Sig. Processing, 35, 1987, 849–863.
14. [Marchesi-1] Marchesi, M., G. Orlandi, F. Piazza. A Systolic Circuit for Fast Hartley Transform. IEEE International Symposium on Circuits and Systems, 7-9 June 1988, 2685-2688.
15. [Rao-1] Rao, K., P. Yip. Discrete Cosine Transform: Algorithms, Advantages, Applications. Boston, Academic Press, ISBN 0-12-580203-X, 1990.
16. [Zapata-1] Zapata, E. L., F. Arguello, F. F. Rivera, J. D. Bruguera. Multidimensional Fast Hartley Transform onto SIMD Hypercubes. – Microprocessing and Microprogramming, 29, September 1990, 2, 121-134.
17. [Chang-1] Chang, L.-W., S.-W. Lee. Systolic Arrays for the Discrete Hartley Transform. – IEEE Transactions on Signal Processing, 39, Nov. 1991, 11, 2411-2418.
18. [Zapata-2] Zapata, E. L., F. Arguello. A VLSI Constant Geometry Architecture for the Fast Hartley and Fourier Transforms. – IEEE Transactions on Parallel and Distributed Systems, 3, Jan. 1992, 1, 58-70.
19. [Liu-1] Liu, K. J. R., C.-T. Chiu. Unified Parallel Lattice Structures for Time-recursive Discrete Cosine/Sine/Hartley Transforms. – IEEE Transactions on Signal Processing, Mar. 1993, 1357-1377.
20. [Arguello-1] Arguello, F., J. D. Bruguera, R. Doallo, E. L. Zapata. Parallel Architecture for Fast Transforms with Trigonometric Kernel. – IEEE Transactions on Parallel and Distributed Systems, 5, Oct. 1994, 10, 1091–1099.
21. [Uniyal-1] Uniyal, P. R. Transforming Real-valued Sequences: Fast Fourier Versus Fast Hartley Transform Algorithms. – IEEE Transactions on Signal Processing, 42, Nov. 1994, 11, 3249-3254.
22. [Mazzeo-1] Mazzeo, A. and U. Villano. Parallel 1D-FFT Computation on Constant-Valance Multicomputers. – Software – Practice and Experience, 25, June 1995, 6, 681-704.
23. [Zhenyu Liu-1] Zhenyu, Liu, Yang Song, Takeshi Ikenaga, Satoshi Goto. A VLSI Array Processing Oriented Fast Fourier Transform Algorithm and Hardware Implementation. GLSVLSI'05, 17–19 April 2005, Chicago, Illinois, USA.
24. [Jones-1] Jones, K. J. Design and Parallel Computation of Regularised Fast Hartley Transform. – Vision, Image and Signal Processing – IEEE Proceedings, 153, 9 Feb. 2006, 1, 70-78.
25. [Phil-1] Philipov, Ph., V. Lazarov, Z. Zlatev, M. Ivanova. A Parallel Architecture for Radix-2 Fast Fourier Transform. IEEE John Vincent Atanasoff 2006 International Symposium on Modern Computing, Sofia, 2006, 229-234.
26. [Phil-2] Philipov, Philip, Ilian Costov, Vladimir. Lazarov, Zlaty Zlatev, Milena Ivanova. Implementation of a Parallel Architecture for Radix-2 Fast Fourier Transform. – Information Technologies and Control, ISSN 1312–2622, 2008, No. 2, 12-16.
27. [Phil-3] Philipov, Ph. Investigation of the Indirect Hypercube as a Natural Architecture for Realization of the Fast Fourier Transform Algorithm. PhD Thesis, Sofia, 2010.