To: benchfft@theory.lcs.mit.edu
Subject: SUBMIT
-------------------
@@SUBMIT@@
@ submitter = Joachim Wesner
@ submitter email = joachim.wesner@frankfurt.netsurf.de
@ submitter organization = NONE
@ computer manufacturer =
@ computer model =
@ CPU manufacturer = Intel
@ CPU model = Pentium MMX
@ CPU speed = 233 MHz
@ RAM = 56 MB
@ L2 cache size = 512 kB
@ operating system = Linux 2.0.30
@ C compiler = gcc 2.7.2
@ C compiler flags = -pedantic -ansi -O6 -fomit-frame-pointer -Wall
@ Fortran compiler = g77 0.5.21
@ Fortran compiler flags = -O6 -fomit-frame-pointer
@ remarks =
@ FFTW version = FFTW V1.2
@ floating-point precision = double
@ floating-point size = 8 bytes
------------------------------------------------------
@@@@ bench.1d.p2.log
Benchmarking for sizes:
2 (0.000228882 MB)
4 (0.000534058 MB)
8 (0.000839233 MB)
16 (0.00164795 MB)
32 (0.00297546 MB)
64 (0.00616455 MB)
128 (0.0119019 MB)
256 (0.0238037 MB)
512 (0.0476074 MB)
1024 (0.0939941 MB)
2048 (0.189575 MB)
4096 (0.37915 MB)
8192 (0.765991 MB)
16384 (1.51184 MB)
32768 (3.02368 MB)
65536 (6.09973 MB)
131072 (12.1995 MB)
262144 (25.4987 MB)
Maximum array size = 360360
Benchmarking FFTs:
0. Arndt DIF
1. Arndt DIT
2. Arndt Split-Radix
3. Arndt 4-step
4. Bailey
5. Beauregard
6. Bergland
7. Brenner
8. Burrus
9. CWP (min N)
10. CWP (best N)
11. Edelblute
12. FFTPACK
13. FFTPACK (f2c)
14. FFTW
15. FFTW_ESTIMATE
16. Frigo-old
17. Green
18. GSL
19. GSL DIT
20. GSL DIF
21. Krukar
22. Mayer (Buneman)
23. Mayer (simple)
24. Mayer (lookup)
25. Monro
26. NAPACK (f2c)
27. Nielsen
28. NR (C)
29. NR (F)
30. Ooura (C)
31. Ooura (F)
32. QFT
33. Ransom
34. SCIPORT
35. Singleton
36. Singleton (f2c)
37. Sorensen
38. Sorensen DIT
39. Temperton
40. Temperton (f2c)
41. Valkenburg
Computing normalized averages (42 transforms).
Benchmarking for array size = 2 (power of 2):
0. Arndt DIF: elapsed time t=1.1 s, 2097152 iters, t-(init.)=0.26 s t(norm)=0.0619888, mflops=80.6597 (err=5.6e-17)
1.
Arndt DIT: elapsed time t=1.13 s, 2097152 iters, t-(init.)=0.24 s t(norm)=0.0572205, mflops=87.3813 (err=5.6e-17) 2. Arndt Split-Radix: elapsed time t=1.05 s, 1048576 iters, t-(init.)=0.63 s t(norm)=0.300407, mflops=16.6441 (err=5.6e-17) 3. Arndt 4-step: elapsed time t=1.09 s, 131072 iters, t-(init.)=1.07 s t(norm)=4.08173, mflops=1.22497 (err=5.6e-17) 4. Bailey: elapsed time t=1.09 s, 524288 iters, t-(init.)=0.85 s t(norm)=0.810623, mflops=6.16809 (err=5.6e-17) 5. Beauregard: elapsed time t=1.33 s, 524288 iters, t-(init.)=1.1 s t(norm)=1.04904, mflops=4.76625 (err=8.4e-17) 6. Bergland: elapsed time t=1.19 s, 524288 iters, t-(init.)=1.12 s t(norm)=1.06812, mflops=4.68114 (err=8.4e-17) 7. Brenner: elapsed time t=1.5 s, 524288 iters, t-(init.)=1.29 s t(norm)=1.23024, mflops=4.06425 (err=8.4e-17) 8. Burrus: elapsed time t=1.04 s, 1048576 iters, t-(init.)=0.69 s t(norm)=0.329018, mflops=15.1968 (err=5.6e-17) 9. CWP (min N): elapsed time t=1.1 s, 262144 iters, t-(init.)=1.07 s t(norm)=2.04086, mflops=2.44994 10. CWP (best N) (N=3): elapsed time t=1.29 s, 262144 iters, t-(init.)=1.11 s t(norm)=2.11716, mflops=2.36166 11. Skipping fft (Edelblute can't handle N <= 2). 12. FFTPACK: elapsed time t=1.61 s, 1048576 iters, t-(init.)=1.33 s t(norm)=0.634193, mflops=7.88403 (err=8.4e-17) 13. FFTPACK (f2c): elapsed time t=1.71 s, 1048576 iters, t-(init.)=1.27 s t(norm)=0.605583, mflops=8.2565 (err=8.4e-17) FFTW_MEASURE plan: (cost = 4.959106e-07) FFTW_NOTW 2 14. FFTW: elapsed time t=1.2 s, 2097152 iters, t-(init.)=0.51 s t(norm)=0.121593, mflops=41.1206 (err=8.4e-17) FFTW_ESTIMATE plan: (cost = 1.820000e+02) FFTW_NOTW 2 15. FFTW_ESTIMATE: elapsed time t=1.18 s, 2097152 iters, t-(init.)=0.3 s t(norm)=0.0715256, mflops=69.9051 (err=8.4e-17) 16. Frigo-old: elapsed time t=1.05 s, 2097152 iters, t-(init.)=0.28 s t(norm)=0.0667572, mflops=74.8983 (err=8.4e-17) 17. Skipping fft (Green can't handle this size.). 18. 
GSL: elapsed time t=1.17 s, 524288 iters, t-(init.)=0.95 s t(norm)=0.905991, mflops=5.51882 (err=8.4e-17) 19. GSL DIT: elapsed time t=1.36 s, 524288 iters, t-(init.)=1.14 s t(norm)=1.08719, mflops=4.59902 (err=8.4e-17) 20. GSL DIF: elapsed time t=1.43 s, 524288 iters, t-(init.)=1.25 s t(norm)=1.19209, mflops=4.1943 (err=8.4e-17) 21. Krukar: elapsed time t=1.15 s, 2097152 iters, t-(init.)=0.34 s t(norm)=0.0810623, mflops=61.6809 (err=8.4e-17) 22. Skipping fft (Mayer can't handle N <= 2). 23. Skipping fft (Mayer can't handle N <= 2). 24. Skipping fft (Mayer can't handle N <= 2). 25. Skipping fft (Monro can't handle N <= 2). 26. NAPACK (f2c): elapsed time t=1.23 s, 262144 iters, t-(init.)=1.2 s t(norm)=2.28882, mflops=2.18453 (err=8.3e-17) 27. Nielsen: elapsed time t=1.81 s, 262144 iters, t-(init.)=1.78 s t(norm)=3.39508, mflops=1.47272 (err=5.6e-17) 28. NR (C): elapsed time t=1.06 s, 524288 iters, t-(init.)=0.92 s t(norm)=0.87738, mflops=5.69878 (err=8.4e-17) 29. NR (F): elapsed time t=1.4 s, 524288 iters, t-(init.)=1.2 s t(norm)=1.14441, mflops=4.36907 (err=8.4e-17) 30. Ooura (C): elapsed time t=1.96 s, 4194304 iters, t-(init.)=0.16 s t(norm)=0.0190735, mflops=262.144 (err=8.4e-17) 31. Ooura (F): elapsed time t=1.17 s, 2097152 iters, t-(init.)=0.38 s t(norm)=0.0905991, mflops=55.1882 (err=8.4e-17) 32. Skipping fft (QFT requires N >= 16). 33. Skipping fft (Ransom doesn't work for N=2). 34. Skipping fft (SCIPORT can't handle N < 4). 35. Singleton: elapsed time t=1.43 s, 524288 iters, t-(init.)=1.22 s t(norm)=1.16348, mflops=4.29744 (err=8.4e-17) 36. Singleton (f2c): elapsed time t=1.41 s, 524288 iters, t-(init.)=1.19 s t(norm)=1.13487, mflops=4.40578 (err=8.4e-17) 37. Sorensen: elapsed time t=1.94 s, 2097152 iters, t-(init.)=1.12 s t(norm)=0.267029, mflops=18.7246 (err=5.6e-17) 38. Sorensen DIT: elapsed time t=1.03 s, 1048576 iters, t-(init.)=0.69 s t(norm)=0.329018, mflops=15.1968 (err=5.6e-17) 39. 
Temperton: elapsed time t=1.07 s, 262144 iters, t-(init.)=1.01 s t(norm)=1.92642, mflops=2.59549 (err=8.4e-17) 40. Temperton (f2c): elapsed time t=1.01 s, 262144 iters, t-(init.)=0.98 s t(norm)=1.8692, mflops=2.67494 (err=8.4e-17) 41. Valkenburg: elapsed time t=1.37 s, 524288 iters, t-(init.)=1.19 s t(norm)=1.13487, mflops=4.40578 (err=8.3e-17) Top mflops for N=2 = 262.144 Normalized results and averages for N=2: fft 0: mflops = 80.6597 (norm. = 0.307692), norm. avg. (of 1) = 0.307692 fft 1: mflops = 87.3813 (norm. = 0.333333), norm. avg. (of 1) = 0.333333 fft 2: mflops = 16.6441 (norm. = 0.0634921), norm. avg. (of 1) = 0.0634921 fft 3: mflops = 1.22497 (norm. = 0.0046729), norm. avg. (of 1) = 0.0046729 fft 4: mflops = 6.16809 (norm. = 0.0235294), norm. avg. (of 1) = 0.0235294 fft 5: mflops = 4.76625 (norm. = 0.0181818), norm. avg. (of 1) = 0.0181818 fft 6: mflops = 4.68114 (norm. = 0.0178571), norm. avg. (of 1) = 0.0178571 fft 7: mflops = 4.06425 (norm. = 0.0155039), norm. avg. (of 1) = 0.0155039 fft 8: mflops = 15.1968 (norm. = 0.057971), norm. avg. (of 1) = 0.057971 fft 9: mflops = 2.44994 (norm. = 0.00934579), norm. avg. (of 1) = 0.00934579 fft 10: mflops = 2.36166 (norm. = 0.00900901), norm. avg. (of 1) = 0.00900901 fft 11: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 12: mflops = 7.88403 (norm. = 0.0300752), norm. avg. (of 1) = 0.0300752 fft 13: mflops = 8.2565 (norm. = 0.0314961), norm. avg. (of 1) = 0.0314961 fft 14: mflops = 41.1206 (norm. = 0.156863), norm. avg. (of 1) = 0.156863 fft 15: mflops = 69.9051 (norm. = 0.266667), norm. avg. (of 1) = 0.266667 fft 16: mflops = 74.8983 (norm. = 0.285714), norm. avg. (of 1) = 0.285714 fft 17: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 18: mflops = 5.51882 (norm. = 0.0210526), norm. avg. (of 1) = 0.0210526 fft 19: mflops = 4.59902 (norm. = 0.0175439), norm. avg. (of 1) = 0.0175439 fft 20: mflops = 4.1943 (norm. = 0.016), norm. avg. (of 1) = 0.016 fft 21: mflops = 61.6809 (norm. 
= 0.235294), norm. avg. (of 1) = 0.235294 fft 22: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 23: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 24: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 25: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 26: mflops = 2.18453 (norm. = 0.00833333), norm. avg. (of 1) = 0.00833333 fft 27: mflops = 1.47272 (norm. = 0.00561798), norm. avg. (of 1) = 0.00561798 fft 28: mflops = 5.69878 (norm. = 0.0217391), norm. avg. (of 1) = 0.0217391 fft 29: mflops = 4.36907 (norm. = 0.0166667), norm. avg. (of 1) = 0.0166667 fft 30: mflops = 262.144 (norm. = 1), norm. avg. (of 1) = 1 fft 31: mflops = 55.1882 (norm. = 0.210526), norm. avg. (of 1) = 0.210526 fft 32: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 33: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 34: mflops = -1 (norm. = -0.0038147), norm. avg. (of 0) = -1 fft 35: mflops = 4.29744 (norm. = 0.0163934), norm. avg. (of 1) = 0.0163934 fft 36: mflops = 4.40578 (norm. = 0.0168067), norm. avg. (of 1) = 0.0168067 fft 37: mflops = 18.7246 (norm. = 0.0714286), norm. avg. (of 1) = 0.0714286 fft 38: mflops = 15.1968 (norm. = 0.057971), norm. avg. (of 1) = 0.057971 fft 39: mflops = 2.59549 (norm. = 0.00990099), norm. avg. (of 1) = 0.00990099 fft 40: mflops = 2.67494 (norm. = 0.0102041), norm. avg. (of 1) = 0.0102041 fft 41: mflops = 4.40578 (norm. = 0.0168067), norm. avg. (of 1) = 0.0168067 Benchmarking for array size = 4 (power of 2): 0. Arndt DIF: elapsed time t=1.28 s, 1048576 iters, t-(init.)=0.76 s t(norm)=0.0905991, mflops=55.1882 (err=9.6e-17) 1. Arndt DIT: elapsed time t=1.31 s, 1048576 iters, t-(init.)=0.62 s t(norm)=0.0739098, mflops=67.6501 (err=9.6e-17) 2. Arndt Split-Radix: elapsed time t=1.53 s, 524288 iters, t-(init.)=1.19 s t(norm)=0.283718, mflops=17.6231 (err=1.5e-16) 3. Arndt 4-step: elapsed time t=1.08 s, 131072 iters, t-(init.)=1.03 s t(norm)=0.982285, mflops=5.09017 (err=1.5e-16) 4. 
Bailey: elapsed time t=1.22 s, 262144 iters, t-(init.)=1.18 s t(norm)=0.562668, mflops=8.88624 (err=1.5e-16) 5. Beauregard: elapsed time t=1.07 s, 131072 iters, t-(init.)=1.05 s t(norm)=1.00136, mflops=4.99322 (err=1.4e-16) 6. Bergland: elapsed time t=1.62 s, 524288 iters, t-(init.)=1.25 s t(norm)=0.298023, mflops=16.7772 (err=7.6e-17) 7. Brenner: elapsed time t=1.35 s, 262144 iters, t-(init.)=1.25 s t(norm)=0.596046, mflops=8.38861 (err=1.4e-16) 8. Burrus: elapsed time t=1.88 s, 524288 iters, t-(init.)=1.5 s t(norm)=0.357628, mflops=13.981 (err=1.5e-16) 9. CWP (min N): elapsed time t=1.32 s, 262144 iters, t-(init.)=1.28 s t(norm)=0.610352, mflops=8.192 10. CWP (best N) (N=15): elapsed time t=1.07 s, 65536 iters, t-(init.)=1 s t(norm)=1.90735, mflops=2.62144 11. Edelblute: elapsed time t=1.63 s, 524288 iters, t-(init.)=1.33 s t(norm)=0.317097, mflops=15.7681 (err=1.5e-16) 12. FFTPACK: elapsed time t=1.01 s, 524288 iters, t-(init.)=0.66 s t(norm)=0.157356, mflops=31.775 (err=1.2e-16) 13. FFTPACK (f2c): elapsed time t=1.58 s, 524288 iters, t-(init.)=1.22 s t(norm)=0.290871, mflops=17.1898 (err=1.2e-16) FFTW_MEASURE plan: (cost = 1.029968e-06) FFTW_NOTW 4 14. FFTW: elapsed time t=1.13 s, 1048576 iters, t-(init.)=0.39 s t(norm)=0.0464916, mflops=107.546 (err=1.4e-16) FFTW_ESTIMATE plan: (cost = 3.176000e+02) FFTW_NOTW 4 15. FFTW_ESTIMATE: elapsed time t=1.12 s, 1048576 iters, t-(init.)=0.41 s t(norm)=0.0488758, mflops=102.3 (err=1.4e-16) 16. Frigo-old: elapsed time t=1.92 s, 2097152 iters, t-(init.)=0.23 s t(norm)=0.0137091, mflops=364.722 (err=1.4e-16) 17. Skipping fft (Green can't handle this size.). 18. GSL: elapsed time t=1.7 s, 524288 iters, t-(init.)=1.31 s t(norm)=0.312328, mflops=16.0088 (err=1.4e-16) 19. GSL DIT: elapsed time t=1.45 s, 262144 iters, t-(init.)=1.3 s t(norm)=0.619888, mflops=8.06597 (err=1.4e-16) 20. GSL DIF: elapsed time t=1.51 s, 262144 iters, t-(init.)=1.32 s t(norm)=0.629425, mflops=7.94376 (err=1.8e-16) 21. 
Krukar: elapsed time t=1 s, 1048576 iters, t-(init.)=0.25 s t(norm)=0.0298023, mflops=167.772 (err=1.4e-16) 22. Mayer (Buneman): elapsed time t=1.18 s, 524288 iters, t-(init.)=0.76 s t(norm)=0.181198, mflops=27.5941 (err=8.1e-17) 23. Mayer (simple): elapsed time t=1.17 s, 524288 iters, t-(init.)=0.92 s t(norm)=0.219345, mflops=22.7951 24. Mayer (lookup): elapsed time t=1.19 s, 524288 iters, t-(init.)=0.94 s t(norm)=0.224113, mflops=22.3101 (err=8.1e-17) 25. Monro: elapsed time t=1.54 s, 131072 iters, t-(init.)=1.51 s t(norm)=1.44005, mflops=3.47211 (err=8.1e-17) 26. NAPACK (f2c): elapsed time t=1.35 s, 131072 iters, t-(init.)=1.33 s t(norm)=1.26839, mflops=3.94202 (err=1.8e-16) 27. Nielsen: elapsed time t=1.03 s, 131072 iters, t-(init.)=1.01 s t(norm)=0.963211, mflops=5.19097 (err=1.5e-16) 28. NR (C): elapsed time t=1.22 s, 262144 iters, t-(init.)=1.06 s t(norm)=0.505447, mflops=9.89223 (err=1.4e-16) 29. NR (F): elapsed time t=1.63 s, 262144 iters, t-(init.)=1.58 s t(norm)=0.753403, mflops=6.63656 (err=1.4e-16) 30. Ooura (C): elapsed time t=1.54 s, 1048576 iters, t-(init.)=0.98 s t(norm)=0.116825, mflops=42.799 (err=9.8e-17) 31. Ooura (F): elapsed time t=1.7 s, 1048576 iters, t-(init.)=1.03 s t(norm)=0.122786, mflops=40.7214 (err=9.8e-17) 32. Skipping fft (QFT requires N >= 16). 33. Ransom: elapsed time t=1.52 s, 131072 iters, t-(init.)=1.48 s t(norm)=1.41144, mflops=3.54249 (err=2.1e-16) 34. SCIPORT: elapsed time t=1.84 s, 524288 iters, t-(init.)=1.41 s t(norm)=0.33617, mflops=14.8734 (err=8.0e-09) 35. Singleton: elapsed time t=1.07 s, 262144 iters, t-(init.)=1.03 s t(norm)=0.491142, mflops=10.1803 (err=1.5e-16) 36. Singleton (f2c): elapsed time t=1.08 s, 262144 iters, t-(init.)=0.89 s t(norm)=0.424385, mflops=11.7818 (err=1.1e-16) 37. Sorensen: elapsed time t=1.44 s, 524288 iters, t-(init.)=1.26 s t(norm)=0.300407, mflops=16.6441 (err=1.5e-16) 38. 
Sorensen DIT: elapsed time t=1.78 s, 524288 iters, t-(init.)=1.61 s t(norm)=0.383854, mflops=13.0258 (err=8.1e-17) 39. Temperton: elapsed time t=1.39 s, 262144 iters, t-(init.)=1.22 s t(norm)=0.581741, mflops=8.59489 (err=1.7e-16) 40. Temperton (f2c): elapsed time t=1.34 s, 262144 iters, t-(init.)=1.12 s t(norm)=0.534058, mflops=9.36229 (err=1.7e-16) 41. Valkenburg: elapsed time t=1.3 s, 131072 iters, t-(init.)=1.28 s t(norm)=1.2207, mflops=4.096 (err=1.8e-16) Top mflops for N=4 = 364.722 Normalized results and averages for N=4: fft 0: mflops = 55.1882 (norm. = 0.151316), norm. avg. (of 2) = 0.229504 fft 1: mflops = 67.6501 (norm. = 0.185484), norm. avg. (of 2) = 0.259409 fft 2: mflops = 17.6231 (norm. = 0.0483193), norm. avg. (of 2) = 0.0559057 fft 3: mflops = 5.09017 (norm. = 0.0139563), norm. avg. (of 2) = 0.0093146 fft 4: mflops = 8.88624 (norm. = 0.0243644), norm. avg. (of 2) = 0.0239469 fft 5: mflops = 4.99322 (norm. = 0.0136905), norm. avg. (of 2) = 0.0159361 fft 6: mflops = 16.7772 (norm. = 0.046), norm. avg. (of 2) = 0.0319286 fft 7: mflops = 8.38861 (norm. = 0.023), norm. avg. (of 2) = 0.0192519 fft 8: mflops = 13.981 (norm. = 0.0383333), norm. avg. (of 2) = 0.0481522 fft 9: mflops = 8.192 (norm. = 0.0224609), norm. avg. (of 2) = 0.0159034 fft 10: mflops = 2.62144 (norm. = 0.0071875), norm. avg. (of 2) = 0.00809825 fft 11: mflops = 15.7681 (norm. = 0.0432331), norm. avg. (of 1) = 0.0432331 fft 12: mflops = 31.775 (norm. = 0.0871212), norm. avg. (of 2) = 0.0585982 fft 13: mflops = 17.1898 (norm. = 0.0471311), norm. avg. (of 2) = 0.0393136 fft 14: mflops = 107.546 (norm. = 0.294872), norm. avg. (of 2) = 0.225867 fft 15: mflops = 102.3 (norm. = 0.280488), norm. avg. (of 2) = 0.273577 fft 16: mflops = 364.722 (norm. = 1), norm. avg. (of 2) = 0.642857 fft 17: mflops = -1 (norm. = -0.00274181), norm. avg. (of 0) = -1 fft 18: mflops = 16.0088 (norm. = 0.0438931), norm. avg. (of 2) = 0.0324729 fft 19: mflops = 8.06597 (norm. = 0.0221154), norm. avg. 
(of 2) = 0.0198296 fft 20: mflops = 7.94376 (norm. = 0.0217803), norm. avg. (of 2) = 0.0188902 fft 21: mflops = 167.772 (norm. = 0.46), norm. avg. (of 2) = 0.347647 fft 22: mflops = 27.5941 (norm. = 0.0756579), norm. avg. (of 1) = 0.0756579 fft 23: mflops = 22.7951 (norm. = 0.0625), norm. avg. (of 1) = 0.0625 fft 24: mflops = 22.3101 (norm. = 0.0611702), norm. avg. (of 1) = 0.0611702 fft 25: mflops = 3.47211 (norm. = 0.00951987), norm. avg. (of 1) = 0.00951987 fft 26: mflops = 3.94202 (norm. = 0.0108083), norm. avg. (of 2) = 0.0095708 fft 27: mflops = 5.19097 (norm. = 0.0142327), norm. avg. (of 2) = 0.00992533 fft 28: mflops = 9.89223 (norm. = 0.0271226), norm. avg. (of 2) = 0.0244309 fft 29: mflops = 6.63656 (norm. = 0.0181962), norm. avg. (of 2) = 0.0174314 fft 30: mflops = 42.799 (norm. = 0.117347), norm. avg. (of 2) = 0.558673 fft 31: mflops = 40.7214 (norm. = 0.11165), norm. avg. (of 2) = 0.161088 fft 32: mflops = -1 (norm. = -0.00274181), norm. avg. (of 0) = -1 fft 33: mflops = 3.54249 (norm. = 0.00971284), norm. avg. (of 1) = 0.00971284 fft 34: mflops = 14.8734 (norm. = 0.0407801), norm. avg. (of 1) = 0.0407801 fft 35: mflops = 10.1803 (norm. = 0.0279126), norm. avg. (of 2) = 0.022153 fft 36: mflops = 11.7818 (norm. = 0.0323034), norm. avg. (of 2) = 0.024555 fft 37: mflops = 16.6441 (norm. = 0.0456349), norm. avg. (of 2) = 0.0585317 fft 38: mflops = 13.0258 (norm. = 0.0357143), norm. avg. (of 2) = 0.0468427 fft 39: mflops = 8.59489 (norm. = 0.0235656), norm. avg. (of 2) = 0.0167333 fft 40: mflops = 9.36229 (norm. = 0.0256696), norm. avg. (of 2) = 0.0179369 fft 41: mflops = 4.096 (norm. = 0.0112305), norm. avg. (of 2) = 0.0140186 Benchmarking for array size = 8 (power of 2): 0. Arndt DIF: elapsed time t=1.42 s, 524288 iters, t-(init.)=0.61 s t(norm)=0.0484784, mflops=103.139 (err=1.6e-16) 1. Arndt DIT: elapsed time t=1.44 s, 524288 iters, t-(init.)=0.75 s t(norm)=0.0596046, mflops=83.8861 (err=1.6e-16) 2. 
Arndt Split-Radix: elapsed time t=1.96 s, 262144 iters, t-(init.)=1.81 s t(norm)=0.287692, mflops=17.3797 (err=2.0e-16) 3. Arndt 4-step: elapsed time t=1.39 s, 65536 iters, t-(init.)=1.37 s t(norm)=0.871023, mflops=5.74038 (err=2.1e-16) 4. Bailey: elapsed time t=1.2 s, 131072 iters, t-(init.)=1.16 s t(norm)=0.368754, mflops=13.5592 (err=1.6e-16) 5. Beauregard: elapsed time t=1.16 s, 65536 iters, t-(init.)=1.13 s t(norm)=0.718435, mflops=6.95958 (err=1.3e-16) 6. Bergland: elapsed time t=1.87 s, 262144 iters, t-(init.)=1.47 s t(norm)=0.23365, mflops=21.3995 (err=2.0e-16) 7. Brenner: elapsed time t=1.49 s, 131072 iters, t-(init.)=1.43 s t(norm)=0.454585, mflops=10.999 (err=1.3e-16) 8. Burrus: elapsed time t=1.39 s, 131072 iters, t-(init.)=1.35 s t(norm)=0.429153, mflops=11.6508 (err=1.9e-16) 9. CWP (min N): elapsed time t=1 s, 131072 iters, t-(init.)=0.92 s t(norm)=0.29246, mflops=17.0963 10. CWP (best N) (N=15): elapsed time t=1.07 s, 65536 iters, t-(init.)=0.89 s t(norm)=0.565847, mflops=8.83631 11. Edelblute: elapsed time t=1.23 s, 131072 iters, t-(init.)=1.19 s t(norm)=0.378291, mflops=13.2173 (err=1.9e-16) 12. FFTPACK: elapsed time t=1.19 s, 262144 iters, t-(init.)=0.83 s t(norm)=0.131925, mflops=37.9003 (err=1.8e-16) 13. FFTPACK (f2c): elapsed time t=1.91 s, 262144 iters, t-(init.)=1.51 s t(norm)=0.240008, mflops=20.8326 (err=1.8e-16) FFTW_MEASURE plan: (cost = 2.670288e-06) FFTW_NOTW 8 14. FFTW: elapsed time t=1.48 s, 524288 iters, t-(init.)=0.68 s t(norm)=0.0540415, mflops=92.5214 (err=1.0e-16) FFTW_ESTIMATE plan: (cost = 4.688000e+02) FFTW_NOTW 8 15. FFTW_ESTIMATE: elapsed time t=1.49 s, 524288 iters, t-(init.)=0.73 s t(norm)=0.0580152, mflops=86.1843 (err=1.0e-16) 16. Frigo-old: elapsed time t=1.27 s, 524288 iters, t-(init.)=0.54 s t(norm)=0.0429153, mflops=116.508 (err=1.0e-16) 17. Green: elapsed time t=1.56 s, 262144 iters, t-(init.)=1.28 s t(norm)=0.203451, mflops=24.576 (err=1.2e-16) 18. 
GSL: elapsed time t=1.88 s, 262144 iters, t-(init.)=1.53 s t(norm)=0.243187, mflops=20.5603 (err=1.2e-16) 19. GSL DIT: elapsed time t=1.37 s, 131072 iters, t-(init.)=1.3 s t(norm)=0.413259, mflops=12.099 (err=1.2e-16) 20. GSL DIF: elapsed time t=1.41 s, 131072 iters, t-(init.)=1.22 s t(norm)=0.387828, mflops=12.8923 (err=1.6e-16) 21. Krukar: elapsed time t=1.25 s, 524288 iters, t-(init.)=0.63 s t(norm)=0.0500679, mflops=99.8644 (err=1.2e-16) 22. Mayer (Buneman): elapsed time t=1.17 s, 262144 iters, t-(init.)=0.83 s t(norm)=0.131925, mflops=37.9003 (err=1.5e-16) 23. Mayer (simple): elapsed time t=1.16 s, 262144 iters, t-(init.)=0.91 s t(norm)=0.144641, mflops=34.5684 24. Mayer (lookup): elapsed time t=1.17 s, 262144 iters, t-(init.)=0.81 s t(norm)=0.128746, mflops=38.8361 (err=1.5e-16) 25. Monro: elapsed time t=1.08 s, 65536 iters, t-(init.)=1.06 s t(norm)=0.67393, mflops=7.41917 (err=1.3e-08) 26. NAPACK (f2c): elapsed time t=1.43 s, 65536 iters, t-(init.)=1.37 s t(norm)=0.871023, mflops=5.74038 (err=2.8e-16) 27. Nielsen: elapsed time t=1.54 s, 131072 iters, t-(init.)=1.38 s t(norm)=0.43869, mflops=11.3976 (err=1.3e-15) 28. NR (C): elapsed time t=1.21 s, 131072 iters, t-(init.)=1.14 s t(norm)=0.362396, mflops=13.7971 (err=1.2e-16) 29. NR (F): elapsed time t=1.63 s, 131072 iters, t-(init.)=1.59 s t(norm)=0.505447, mflops=9.89223 (err=1.5e-16) 30. Ooura (C): elapsed time t=1.57 s, 524288 iters, t-(init.)=0.77 s t(norm)=0.0611941, mflops=81.7072 (err=1.6e-16) 31. Ooura (F): elapsed time t=1.9 s, 524288 iters, t-(init.)=1.1 s t(norm)=0.0874201, mflops=57.1951 (err=1.6e-16) 32. Skipping fft (QFT requires N >= 16). 33. Ransom: elapsed time t=1.82 s, 65536 iters, t-(init.)=1.76 s t(norm)=1.11898, mflops=4.46836 (err=7.5e-16) 34. SCIPORT: elapsed time t=1.43 s, 131072 iters, t-(init.)=1.37 s t(norm)=0.435511, mflops=11.4808 (err=4.5e-08) 35. Singleton: elapsed time t=1.45 s, 131072 iters, t-(init.)=1.35 s t(norm)=0.429153, mflops=11.6508 (err=1.2e-16) 36. 
Singleton (f2c): elapsed time t=1.31 s, 131072 iters, t-(init.)=1.13 s t(norm)=0.359217, mflops=13.9192 (err=1.2e-16) 37. Sorensen: elapsed time t=1.6 s, 262144 iters, t-(init.)=1.22 s t(norm)=0.193914, mflops=25.7847 (err=2.4e-16) 38. Sorensen DIT: elapsed time t=1.27 s, 131072 iters, t-(init.)=1.17 s t(norm)=0.371933, mflops=13.4433 (err=1.8e-16) 39. Temperton: elapsed time t=1.3 s, 131072 iters, t-(init.)=1.15 s t(norm)=0.365575, mflops=13.6771 (err=7.5e-09) 40. Temperton (f2c): elapsed time t=1.13 s, 131072 iters, t-(init.)=0.98 s t(norm)=0.311534, mflops=16.0496 (err=1.4e-16) 41. Valkenburg: elapsed time t=1.89 s, 65536 iters, t-(init.)=1.79 s t(norm)=1.13805, mflops=4.39347 (err=1.3e-16) Top mflops for N=8 = 116.508 Normalized results and averages for N=8: fft 0: mflops = 103.139 (norm. = 0.885246), norm. avg. (of 3) = 0.448085 fft 1: mflops = 83.8861 (norm. = 0.72), norm. avg. (of 3) = 0.412939 fft 2: mflops = 17.3797 (norm. = 0.149171), norm. avg. (of 3) = 0.0869942 fft 3: mflops = 5.74038 (norm. = 0.0492701), norm. avg. (of 3) = 0.0226331 fft 4: mflops = 13.5592 (norm. = 0.116379), norm. avg. (of 3) = 0.0547577 fft 5: mflops = 6.95958 (norm. = 0.0597345), norm. avg. (of 3) = 0.0305356 fft 6: mflops = 21.3995 (norm. = 0.183673), norm. avg. (of 3) = 0.0825102 fft 7: mflops = 10.999 (norm. = 0.0944056), norm. avg. (of 3) = 0.0443032 fft 8: mflops = 11.6508 (norm. = 0.1), norm. avg. (of 3) = 0.0654348 fft 9: mflops = 17.0963 (norm. = 0.146739), norm. avg. (of 3) = 0.0595153 fft 10: mflops = 8.83631 (norm. = 0.0758427), norm. avg. (of 3) = 0.0306797 fft 11: mflops = 13.2173 (norm. = 0.113445), norm. avg. (of 2) = 0.0783392 fft 12: mflops = 37.9003 (norm. = 0.325301), norm. avg. (of 3) = 0.147499 fft 13: mflops = 20.8326 (norm. = 0.178808), norm. avg. (of 3) = 0.0858117 fft 14: mflops = 92.5214 (norm. = 0.794118), norm. avg. (of 3) = 0.415284 fft 15: mflops = 86.1843 (norm. = 0.739726), norm. avg. (of 3) = 0.42896 fft 16: mflops = 116.508 (norm. = 1), norm. avg. 
(of 3) = 0.761905 fft 17: mflops = 24.576 (norm. = 0.210938), norm. avg. (of 1) = 0.210938 fft 18: mflops = 20.5603 (norm. = 0.176471), norm. avg. (of 3) = 0.0804721 fft 19: mflops = 12.099 (norm. = 0.103846), norm. avg. (of 3) = 0.0478351 fft 20: mflops = 12.8923 (norm. = 0.110656), norm. avg. (of 3) = 0.0494787 fft 21: mflops = 99.8644 (norm. = 0.857143), norm. avg. (of 3) = 0.517479 fft 22: mflops = 37.9003 (norm. = 0.325301), norm. avg. (of 2) = 0.20048 fft 23: mflops = 34.5684 (norm. = 0.296703), norm. avg. (of 2) = 0.179602 fft 24: mflops = 38.8361 (norm. = 0.333333), norm. avg. (of 2) = 0.197252 fft 25: mflops = 7.41917 (norm. = 0.0636792), norm. avg. (of 2) = 0.0365996 fft 26: mflops = 5.74038 (norm. = 0.0492701), norm. avg. (of 3) = 0.0228039 fft 27: mflops = 11.3976 (norm. = 0.0978261), norm. avg. (of 3) = 0.0392256 fft 28: mflops = 13.7971 (norm. = 0.118421), norm. avg. (of 3) = 0.0557609 fft 29: mflops = 9.89223 (norm. = 0.0849057), norm. avg. (of 3) = 0.0399228 fft 30: mflops = 81.7072 (norm. = 0.701299), norm. avg. (of 3) = 0.606215 fft 31: mflops = 57.1951 (norm. = 0.490909), norm. avg. (of 3) = 0.271029 fft 32: mflops = -1 (norm. = -0.00858307), norm. avg. (of 0) = -1 fft 33: mflops = 4.46836 (norm. = 0.0383523), norm. avg. (of 2) = 0.0240326 fft 34: mflops = 11.4808 (norm. = 0.0985401), norm. avg. (of 2) = 0.0696601 fft 35: mflops = 11.6508 (norm. = 0.1), norm. avg. (of 3) = 0.048102 fft 36: mflops = 13.9192 (norm. = 0.119469), norm. avg. (of 3) = 0.056193 fft 37: mflops = 25.7847 (norm. = 0.221311), norm. avg. (of 3) = 0.112792 fft 38: mflops = 13.4433 (norm. = 0.115385), norm. avg. (of 3) = 0.06969 fft 39: mflops = 13.6771 (norm. = 0.117391), norm. avg. (of 3) = 0.050286 fft 40: mflops = 16.0496 (norm. = 0.137755), norm. avg. (of 3) = 0.0578763 fft 41: mflops = 4.39347 (norm. = 0.0377095), norm. avg. (of 3) = 0.0219156 Benchmarking for array size = 16 (power of 2): 0. 
Arndt DIF: elapsed time t=1.82 s, 131072 iters, t-(init.)=1.47 s t(norm)=0.175238, mflops=28.5327 (err=1.6e-16) 1. Arndt DIT: elapsed time t=1.8 s, 131072 iters, t-(init.)=1.46 s t(norm)=0.174046, mflops=28.7281 (err=1.2e-16) 2. Arndt Split-Radix: elapsed time t=1.15 s, 65536 iters, t-(init.)=0.99 s t(norm)=0.236034, mflops=21.1834 (err=1.4e-16) 3. Arndt 4-step: elapsed time t=1.13 s, 32768 iters, t-(init.)=1.11 s t(norm)=0.529289, mflops=9.44663 (err=1.3e-16) 4. Bailey: elapsed time t=1.24 s, 65536 iters, t-(init.)=1.12 s t(norm)=0.267029, mflops=18.7246 (err=1.4e-16) 5. Beauregard: elapsed time t=1.37 s, 32768 iters, t-(init.)=1.36 s t(norm)=0.648499, mflops=7.71012 (err=1.9e-16) 6. Bergland: elapsed time t=1.8 s, 131072 iters, t-(init.)=1.4 s t(norm)=0.166893, mflops=29.9593 (err=2.0e-16) 7. Brenner: elapsed time t=1.45 s, 65536 iters, t-(init.)=1.42 s t(norm)=0.338554, mflops=14.7687 (err=1.5e-16) 8. Burrus: elapsed time t=1.7 s, 65536 iters, t-(init.)=1.53 s t(norm)=0.36478, mflops=13.7069 (err=1.3e-16) 9. CWP (min N): elapsed time t=1.76 s, 131072 iters, t-(init.)=1.47 s t(norm)=0.175238, mflops=28.5327 10. CWP (best N) (N=28): elapsed time t=1.86 s, 65536 iters, t-(init.)=1.55 s t(norm)=0.369549, mflops=13.53 11. Edelblute: elapsed time t=1.57 s, 65536 iters, t-(init.)=1.53 s t(norm)=0.36478, mflops=13.7069 (err=1.3e-16) 12. FFTPACK: elapsed time t=1.09 s, 131072 iters, t-(init.)=0.72 s t(norm)=0.0858307, mflops=58.2542 (err=1.5e-16) 13. FFTPACK (f2c): elapsed time t=1.12 s, 65536 iters, t-(init.)=0.94 s t(norm)=0.224113, mflops=22.3101 (err=1.3e-16) FFTW_MEASURE plan: (cost = 6.713867e-06) FFTW_TWIDDLE 2 FFTW_NOTW 8 14. FFTW: elapsed time t=1.78 s, 262144 iters, t-(init.)=1.04 s t(norm)=0.0619888, mflops=80.6597 (err=1.6e-16) FFTW_ESTIMATE plan: (cost = 4.256000e+02) FFTW_NOTW 16 15. FFTW_ESTIMATE: elapsed time t=1.74 s, 262144 iters, t-(init.)=1.03 s t(norm)=0.0613928, mflops=81.4428 (err=1.6e-16) 16. 
Frigo-old: elapsed time t=1.97 s, 262144 iters, t-(init.)=1.18 s t(norm)=0.0703335, mflops=71.0899 (err=1.6e-16) 17. Green: elapsed time t=1.16 s, 65536 iters, t-(init.)=1.12 s t(norm)=0.267029, mflops=18.7246 (err=1.9e-16) 18. GSL: elapsed time t=1.72 s, 131072 iters, t-(init.)=1.31 s t(norm)=0.156164, mflops=32.0176 (err=1.5e-16) 19. GSL DIT: elapsed time t=1.3 s, 65536 iters, t-(init.)=1.22 s t(norm)=0.290871, mflops=17.1898 (err=1.6e-16) 20. GSL DIF: elapsed time t=1.32 s, 65536 iters, t-(init.)=1.19 s t(norm)=0.283718, mflops=17.6231 (err=1.9e-16) 21. Krukar: elapsed time t=1.48 s, 262144 iters, t-(init.)=0.88 s t(norm)=0.0524521, mflops=95.3251 (err=1.7e-16) 22. Mayer (Buneman): elapsed time t=1.79 s, 131072 iters, t-(init.)=1.64 s t(norm)=0.195503, mflops=25.575 (err=1.3e-16) 23. Mayer (simple): elapsed time t=1.59 s, 131072 iters, t-(init.)=1.31 s t(norm)=0.156164, mflops=32.0176 24. Mayer (lookup): elapsed time t=1.51 s, 131072 iters, t-(init.)=1.17 s t(norm)=0.139475, mflops=35.8488 (err=1.4e-16) 25. Monro: elapsed time t=1.79 s, 65536 iters, t-(init.)=1.68 s t(norm)=0.400543, mflops=12.483 (err=1.2e-08) 26. NAPACK (f2c): elapsed time t=1.41 s, 32768 iters, t-(init.)=1.38 s t(norm)=0.658035, mflops=7.59838 (err=3.5e-16) 27. Nielsen: elapsed time t=1.97 s, 65536 iters, t-(init.)=1.79 s t(norm)=0.426769, mflops=11.7159 (err=1.3e-16) 28. NR (C): elapsed time t=1.15 s, 65536 iters, t-(init.)=1.01 s t(norm)=0.240803, mflops=20.7639 (err=1.6e-16) 29. NR (F): elapsed time t=1.58 s, 65536 iters, t-(init.)=1.39 s t(norm)=0.331402, mflops=15.0874 (err=1.7e-16) 30. Ooura (C): elapsed time t=1.11 s, 131072 iters, t-(init.)=0.74 s t(norm)=0.0882149, mflops=56.6798 (err=1.4e-16) 31. Ooura (F): elapsed time t=1.26 s, 131072 iters, t-(init.)=0.93 s t(norm)=0.110865, mflops=45.1 (err=1.4e-16) 32. QFT: elapsed time t=1 s, 131072 iters, t-(init.)=0.75 s t(norm)=0.089407, mflops=55.9241 (err=1.3e-16) 33. 
Ransom: elapsed time t=1.02 s, 32768 iters, t-(init.)=0.96 s t(norm)=0.457764, mflops=10.9227 (err=6.0e-16) 34. SCIPORT: elapsed time t=1.92 s, 65536 iters, t-(init.)=1.8 s t(norm)=0.429153, mflops=11.6508 (err=5.2e-08) 35. Singleton: elapsed time t=1.09 s, 65536 iters, t-(init.)=1.06 s t(norm)=0.252724, mflops=19.7845 (err=1.7e-16) 36. Singleton (f2c): elapsed time t=1.08 s, 65536 iters, t-(init.)=0.9 s t(norm)=0.214577, mflops=23.3017 (err=1.9e-16) 37. Sorensen: elapsed time t=1.75 s, 131072 iters, t-(init.)=1.44 s t(norm)=0.171661, mflops=29.1271 (err=1.3e-16) 38. Sorensen DIT: elapsed time t=1.58 s, 65536 iters, t-(init.)=1.43 s t(norm)=0.340939, mflops=14.6654 (err=1.4e-16) 39. Temperton: elapsed time t=1.31 s, 65536 iters, t-(init.)=1.2 s t(norm)=0.286102, mflops=17.4763 (err=2.9e-08) 40. Temperton (f2c): elapsed time t=1.14 s, 65536 iters, t-(init.)=0.94 s t(norm)=0.224113, mflops=22.3101 (err=1.5e-16) 41. Valkenburg: elapsed time t=1.22 s, 16384 iters, t-(init.)=1.21 s t(norm)=1.15395, mflops=4.33296 (err=3.0e-16) Top mflops for N=16 = 95.3251 Normalized results and averages for N=16: fft 0: mflops = 28.5327 (norm. = 0.29932), norm. avg. (of 4) = 0.410893 fft 1: mflops = 28.7281 (norm. = 0.30137), norm. avg. (of 4) = 0.385047 fft 2: mflops = 21.1834 (norm. = 0.222222), norm. avg. (of 4) = 0.120801 fft 3: mflops = 9.44663 (norm. = 0.0990991), norm. avg. (of 4) = 0.0417496 fft 4: mflops = 18.7246 (norm. = 0.196429), norm. avg. (of 4) = 0.0901754 fft 5: mflops = 7.71012 (norm. = 0.0808824), norm. avg. (of 4) = 0.0431223 fft 6: mflops = 29.9593 (norm. = 0.314286), norm. avg. (of 4) = 0.140454 fft 7: mflops = 14.7687 (norm. = 0.15493), norm. avg. (of 4) = 0.0719598 fft 8: mflops = 13.7069 (norm. = 0.143791), norm. avg. (of 4) = 0.0850238 fft 9: mflops = 28.5327 (norm. = 0.29932), norm. avg. (of 4) = 0.119466 fft 10: mflops = 13.53 (norm. = 0.141935), norm. avg. (of 4) = 0.0584937 fft 11: mflops = 13.7069 (norm. = 0.143791), norm. avg. 
(of 3) = 0.100156 fft 12: mflops = 58.2542 (norm. = 0.611111), norm. avg. (of 4) = 0.263402 fft 13: mflops = 22.3101 (norm. = 0.234043), norm. avg. (of 4) = 0.122869 fft 14: mflops = 80.6597 (norm. = 0.846154), norm. avg. (of 4) = 0.523002 fft 15: mflops = 81.4428 (norm. = 0.854369), norm. avg. (of 4) = 0.535312 fft 16: mflops = 71.0899 (norm. = 0.745763), norm. avg. (of 4) = 0.757869 fft 17: mflops = 18.7246 (norm. = 0.196429), norm. avg. (of 2) = 0.203683 fft 18: mflops = 32.0176 (norm. = 0.335878), norm. avg. (of 4) = 0.144324 fft 19: mflops = 17.1898 (norm. = 0.180328), norm. avg. (of 4) = 0.0809583 fft 20: mflops = 17.6231 (norm. = 0.184874), norm. avg. (of 4) = 0.0833275 fft 21: mflops = 95.3251 (norm. = 1), norm. avg. (of 4) = 0.638109 fft 22: mflops = 25.575 (norm. = 0.268293), norm. avg. (of 3) = 0.223084 fft 23: mflops = 32.0176 (norm. = 0.335878), norm. avg. (of 3) = 0.231694 fft 24: mflops = 35.8488 (norm. = 0.376068), norm. avg. (of 3) = 0.256857 fft 25: mflops = 12.483 (norm. = 0.130952), norm. avg. (of 3) = 0.0680505 fft 26: mflops = 7.59838 (norm. = 0.0797101), norm. avg. (of 4) = 0.0370305 fft 27: mflops = 11.7159 (norm. = 0.122905), norm. avg. (of 4) = 0.0601454 fft 28: mflops = 20.7639 (norm. = 0.217822), norm. avg. (of 4) = 0.0962762 fft 29: mflops = 15.0874 (norm. = 0.158273), norm. avg. (of 4) = 0.0695105 fft 30: mflops = 56.6798 (norm. = 0.594595), norm. avg. (of 4) = 0.60331 fft 31: mflops = 45.1 (norm. = 0.473118), norm. avg. (of 4) = 0.321551 fft 32: mflops = 55.9241 (norm. = 0.586667), norm. avg. (of 1) = 0.586667 fft 33: mflops = 10.9227 (norm. = 0.114583), norm. avg. (of 3) = 0.0542161 fft 34: mflops = 11.6508 (norm. = 0.122222), norm. avg. (of 3) = 0.0871808 fft 35: mflops = 19.7845 (norm. = 0.207547), norm. avg. (of 4) = 0.0879633 fft 36: mflops = 23.3017 (norm. = 0.244444), norm. avg. (of 4) = 0.103256 fft 37: mflops = 29.1271 (norm. = 0.305556), norm. avg. (of 4) = 0.160983 fft 38: mflops = 14.6654 (norm. = 0.153846), norm. avg. 
(of 4) = 0.090729 fft 39: mflops = 17.4763 (norm. = 0.183333), norm. avg. (of 4) = 0.0835478 fft 40: mflops = 22.3101 (norm. = 0.234043), norm. avg. (of 4) = 0.101918 fft 41: mflops = 4.33296 (norm. = 0.0454545), norm. avg. (of 4) = 0.0278003 Benchmarking for array size = 32 (power of 2): 0. Arndt DIF: elapsed time t=1.94 s, 65536 iters, t-(init.)=1.57 s t(norm)=0.149727, mflops=33.3941 (err=6.0e-16) 1. Arndt DIT: elapsed time t=1.9 s, 65536 iters, t-(init.)=1.51 s t(norm)=0.144005, mflops=34.7211 (err=5.5e-16) 2. Arndt Split-Radix: elapsed time t=1.29 s, 32768 iters, t-(init.)=1.22 s t(norm)=0.232697, mflops=21.4872 (err=3.6e-16) 3. Arndt 4-step: elapsed time t=1.32 s, 16384 iters, t-(init.)=1.3 s t(norm)=0.495911, mflops=10.0825 (err=3.2e-16) 4. Bailey: elapsed time t=1.25 s, 32768 iters, t-(init.)=1.09 s t(norm)=0.207901, mflops=24.0499 (err=6.7e-16) 5. Beauregard: elapsed time t=1.65 s, 16384 iters, t-(init.)=1.64 s t(norm)=0.62561, mflops=7.9922 (err=6.6e-16) 6. Bergland: elapsed time t=1.84 s, 65536 iters, t-(init.)=1.73 s t(norm)=0.164986, mflops=30.3057 (err=6.4e-16) 7. Brenner: elapsed time t=1.5 s, 32768 iters, t-(init.)=1.43 s t(norm)=0.272751, mflops=18.3317 (err=6.0e-16) 8. Burrus: elapsed time t=1.96 s, 32768 iters, t-(init.)=1.93 s t(norm)=0.368118, mflops=13.5826 (err=3.5e-16) 9. CWP (min N) (N=33): elapsed time t=1.31 s, 32768 iters, t-(init.)=1.17 s t(norm)=0.22316, mflops=22.4055 10. CWP (best N) (N=35): elapsed time t=1.38 s, 32768 iters, t-(init.)=1.24 s t(norm)=0.236511, mflops=21.1406 11. Edelblute: elapsed time t=1.78 s, 32768 iters, t-(init.)=1.59 s t(norm)=0.303268, mflops=16.487 (err=3.5e-16) 12. FFTPACK: elapsed time t=1.52 s, 65536 iters, t-(init.)=1.27 s t(norm)=0.121117, mflops=41.2825 (err=4.4e-16) 13. FFTPACK (f2c): elapsed time t=1.43 s, 32768 iters, t-(init.)=1.31 s t(norm)=0.249863, mflops=20.011 (err=5.3e-16) FFTW_MEASURE plan: (cost = 1.403809e-05) FFTW_TWIDDLE 4 FFTW_NOTW 8 14. 
FFTW: elapsed time t=1.9 s, 131072 iters, t-(init.)=1.17 s t(norm)=0.0557899, mflops=89.6219 (err=5.6e-16) FFTW_ESTIMATE plan: (cost = 3.200000e+01) FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.08 s, 65536 iters, t-(init.)=0.75 s t(norm)=0.0715256, mflops=69.9051 (err=6.1e-16) 16. Frigo-old: elapsed time t=1.35 s, 65536 iters, t-(init.)=0.98 s t(norm)=0.0934601, mflops=53.4988 (err=5.5e-16) 17. Green: elapsed time t=1.34 s, 32768 iters, t-(init.)=1.14 s t(norm)=0.217438, mflops=22.9951 (err=6.9e-16) 18. GSL: elapsed time t=1.07 s, 32768 iters, t-(init.)=1.03 s t(norm)=0.196457, mflops=25.4509 (err=5.0e-16) 19. GSL DIT: elapsed time t=1.29 s, 32768 iters, t-(init.)=1.11 s t(norm)=0.211716, mflops=23.6166 (err=6.1e-16) 20. GSL DIF: elapsed time t=1.29 s, 32768 iters, t-(init.)=1.26 s t(norm)=0.240326, mflops=20.8051 (err=4.3e-16) 21. Krukar: elapsed time t=1.83 s, 131072 iters, t-(init.)=1.12 s t(norm)=0.0534058, mflops=93.6229 (err=8.6e-16) 22. Mayer (Buneman): elapsed time t=1.98 s, 65536 iters, t-(init.)=1.75 s t(norm)=0.166893, mflops=29.9593 (err=3.5e-16) 23. Mayer (simple): elapsed time t=1.65 s, 65536 iters, t-(init.)=1.4 s t(norm)=0.133514, mflops=37.4491 24. Mayer (lookup): elapsed time t=1.59 s, 65536 iters, t-(init.)=1.25 s t(norm)=0.119209, mflops=41.943 (err=5.9e-16) 25. Monro: elapsed time t=1.63 s, 32768 iters, t-(init.)=1.6 s t(norm)=0.305176, mflops=16.384 (err=1.2e-07) 26. NAPACK (f2c): elapsed time t=1.51 s, 16384 iters, t-(init.)=1.47 s t(norm)=0.56076, mflops=8.91646 (err=9.3e-16) 27. Nielsen: elapsed time t=1.89 s, 32768 iters, t-(init.)=1.69 s t(norm)=0.322342, mflops=15.5115 (err=3.1e-15) 28. NR (C): elapsed time t=1.14 s, 32768 iters, t-(init.)=1.11 s t(norm)=0.211716, mflops=23.6166 (err=6.1e-16) 29. NR (F): elapsed time t=1.58 s, 32768 iters, t-(init.)=1.47 s t(norm)=0.28038, mflops=17.8329 (err=7.0e-16) 30. Ooura (C): elapsed time t=1.28 s, 65536 iters, t-(init.)=1.01 s t(norm)=0.0963211, mflops=51.9097 (err=4.3e-16) 31. 
Ooura (F): elapsed time t=1.46 s, 65536 iters, t-(init.)=1.07 s t(norm)=0.102043, mflops=48.9989 (err=4.3e-16) 32. QFT: elapsed time t=1.44 s, 65536 iters, t-(init.)=1.06 s t(norm)=0.101089, mflops=49.4611 (err=4.6e-16) 33. Ransom: elapsed time t=1.19 s, 16384 iters, t-(init.)=1.16 s t(norm)=0.442505, mflops=11.2993 (err=3.8e-15) 34. SCIPORT: elapsed time t=1.21 s, 16384 iters, t-(init.)=1.19 s t(norm)=0.453949, mflops=11.0145 (err=3.1e-07) 35. Singleton: elapsed time t=1.15 s, 32768 iters, t-(init.)=1.01 s t(norm)=0.192642, mflops=25.9549 (err=7.1e-16) 36. Singleton (f2c): elapsed time t=1.13 s, 32768 iters, t-(init.)=1.1 s t(norm)=0.209808, mflops=23.8313 (err=5.8e-16) 37. Sorensen: elapsed time t=1.83 s, 65536 iters, t-(init.)=1.47 s t(norm)=0.14019, mflops=35.6659 (err=3.4e-16) 38. Sorensen DIT: elapsed time t=1.79 s, 32768 iters, t-(init.)=1.71 s t(norm)=0.326157, mflops=15.3301 (err=5.1e-16) 39. Temperton: elapsed time t=1.52 s, 32768 iters, t-(init.)=1.46 s t(norm)=0.278473, mflops=17.9551 (err=1.8e-07) 40. Temperton (f2c): elapsed time t=1.51 s, 32768 iters, t-(init.)=1.46 s t(norm)=0.278473, mflops=17.9551 (err=5.1e-16) 41. Valkenburg: elapsed time t=1.5 s, 8192 iters, t-(init.)=1.5 s t(norm)=1.14441, mflops=4.36907 (err=8.4e-16) Top mflops for N=32 = 93.6229 Normalized results and averages for N=32: fft 0: mflops = 33.3941 (norm. = 0.356688), norm. avg. (of 5) = 0.400052 fft 1: mflops = 34.7211 (norm. = 0.370861), norm. avg. (of 5) = 0.38221 fft 2: mflops = 21.4872 (norm. = 0.229508), norm. avg. (of 5) = 0.142543 fft 3: mflops = 10.0825 (norm. = 0.107692), norm. avg. (of 5) = 0.0549381 fft 4: mflops = 24.0499 (norm. = 0.256881), norm. avg. (of 5) = 0.123516 fft 5: mflops = 7.9922 (norm. = 0.0853659), norm. avg. (of 5) = 0.051571 fft 6: mflops = 30.3057 (norm. = 0.323699), norm. avg. (of 5) = 0.177103 fft 7: mflops = 18.3317 (norm. = 0.195804), norm. avg. (of 5) = 0.0967286 fft 8: mflops = 13.5826 (norm. = 0.145078), norm. avg. 
(of 5) = 0.0970346 fft 9: mflops = 22.4055 (norm. = 0.239316), norm. avg. (of 5) = 0.143436 fft 10: mflops = 21.1406 (norm. = 0.225806), norm. avg. (of 5) = 0.0919562 fft 11: mflops = 16.487 (norm. = 0.176101), norm. avg. (of 4) = 0.119142 fft 12: mflops = 41.2825 (norm. = 0.440945), norm. avg. (of 5) = 0.298911 fft 13: mflops = 20.011 (norm. = 0.21374), norm. avg. (of 5) = 0.141044 fft 14: mflops = 89.6219 (norm. = 0.957265), norm. avg. (of 5) = 0.609854 fft 15: mflops = 69.9051 (norm. = 0.746667), norm. avg. (of 5) = 0.577583 fft 16: mflops = 53.4988 (norm. = 0.571429), norm. avg. (of 5) = 0.720581 fft 17: mflops = 22.9951 (norm. = 0.245614), norm. avg. (of 3) = 0.21766 fft 18: mflops = 25.4509 (norm. = 0.271845), norm. avg. (of 5) = 0.169828 fft 19: mflops = 23.6166 (norm. = 0.252252), norm. avg. (of 5) = 0.115217 fft 20: mflops = 20.8051 (norm. = 0.222222), norm. avg. (of 5) = 0.111106 fft 21: mflops = 93.6229 (norm. = 1), norm. avg. (of 5) = 0.710487 fft 22: mflops = 29.9593 (norm. = 0.32), norm. avg. (of 4) = 0.247313 fft 23: mflops = 37.4491 (norm. = 0.4), norm. avg. (of 4) = 0.27377 fft 24: mflops = 41.943 (norm. = 0.448), norm. avg. (of 4) = 0.304643 fft 25: mflops = 16.384 (norm. = 0.175), norm. avg. (of 4) = 0.0947879 fft 26: mflops = 8.91646 (norm. = 0.0952381), norm. avg. (of 5) = 0.048672 fft 27: mflops = 15.5115 (norm. = 0.16568), norm. avg. (of 5) = 0.0812524 fft 28: mflops = 23.6166 (norm. = 0.252252), norm. avg. (of 5) = 0.127471 fft 29: mflops = 17.8329 (norm. = 0.190476), norm. avg. (of 5) = 0.0937036 fft 30: mflops = 51.9097 (norm. = 0.554455), norm. avg. (of 5) = 0.593539 fft 31: mflops = 48.9989 (norm. = 0.523364), norm. avg. (of 5) = 0.361914 fft 32: mflops = 49.4611 (norm. = 0.528302), norm. avg. (of 2) = 0.557484 fft 33: mflops = 11.2993 (norm. = 0.12069), norm. avg. (of 4) = 0.0708345 fft 34: mflops = 11.0145 (norm. = 0.117647), norm. avg. (of 4) = 0.0947974 fft 35: mflops = 25.9549 (norm. = 0.277228), norm. avg. 
(of 5) = 0.125816 fft 36: mflops = 23.8313 (norm. = 0.254545), norm. avg. (of 5) = 0.133514 fft 37: mflops = 35.6659 (norm. = 0.380952), norm. avg. (of 5) = 0.204977 fft 38: mflops = 15.3301 (norm. = 0.163743), norm. avg. (of 5) = 0.105332 fft 39: mflops = 17.9551 (norm. = 0.191781), norm. avg. (of 5) = 0.105194 fft 40: mflops = 17.9551 (norm. = 0.191781), norm. avg. (of 5) = 0.11989 fft 41: mflops = 4.36907 (norm. = 0.0466667), norm. avg. (of 5) = 0.0315736 Benchmarking for array size = 64 (power of 2): 0. Arndt DIF: elapsed time t=1.26 s, 16384 iters, t-(init.)=1.23 s t(norm)=0.195503, mflops=25.575 (err=2.3e-16) 1. Arndt DIT: elapsed time t=1.24 s, 16384 iters, t-(init.)=1.08 s t(norm)=0.171661, mflops=29.1271 (err=3.4e-16) 2. Arndt Split-Radix: elapsed time t=1.42 s, 16384 iters, t-(init.)=1.31 s t(norm)=0.208219, mflops=24.0132 (err=4.4e-16) 3. Arndt 4-step: elapsed time t=1.21 s, 8192 iters, t-(init.)=1.19 s t(norm)=0.378291, mflops=13.2173 (err=3.6e-16) 4. Bailey: elapsed time t=1.37 s, 16384 iters, t-(init.)=1.18 s t(norm)=0.187556, mflops=26.6587 (err=4.8e-16) 5. Beauregard: elapsed time t=1 s, 4096 iters, t-(init.)=0.99 s t(norm)=0.629425, mflops=7.94376 (err=3.8e-16) 6. Bergland: elapsed time t=1.08 s, 16384 iters, t-(init.)=1.04 s t(norm)=0.165304, mflops=30.2474 (err=2.5e-16) 7. Brenner: elapsed time t=1.54 s, 16384 iters, t-(init.)=1.38 s t(norm)=0.219345, mflops=22.7951 (err=3.8e-16) 8. Burrus: elapsed time t=1.07 s, 8192 iters, t-(init.)=1.06 s t(norm)=0.336965, mflops=14.8383 (err=4.2e-16) 9. CWP (min N) (N=65): elapsed time t=1.53 s, 16384 iters, t-(init.)=1.41 s t(norm)=0.224113, mflops=22.3101 10. CWP (best N) (N=84): elapsed time t=1.54 s, 16384 iters, t-(init.)=1.35 s t(norm)=0.214577, mflops=23.3017 11. Edelblute: elapsed time t=1.98 s, 16384 iters, t-(init.)=1.88 s t(norm)=0.298818, mflops=16.7326 (err=3.6e-16) 12. FFTPACK: elapsed time t=1.6 s, 32768 iters, t-(init.)=1.23 s t(norm)=0.0977516, mflops=51.15 (err=4.2e-16) 13. 
FFTPACK (f2c): elapsed time t=1.69 s, 16384 iters, t-(init.)=1.67 s t(norm)=0.265439, mflops=18.8367 (err=4.5e-16) FFTW_MEASURE plan: (cost = 3.784180e-05) FFTW_TWIDDLE 8 FFTW_NOTW 8 14. FFTW: elapsed time t=1.29 s, 32768 iters, t-(init.)=0.96 s t(norm)=0.0762939, mflops=65.536 (err=4.7e-16) FFTW_ESTIMATE plan: (cost = 7.680000e+02) FFTW_TWIDDLE 2 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.3 s, 32768 iters, t-(init.)=0.92 s t(norm)=0.073115, mflops=68.3854 (err=2.6e-16) 16. Frigo-old: elapsed time t=1.17 s, 32768 iters, t-(init.)=0.83 s t(norm)=0.0659625, mflops=75.8007 (err=4.5e-16) 17. Green: elapsed time t=1.45 s, 16384 iters, t-(init.)=1.29 s t(norm)=0.20504, mflops=24.3855 (err=3.7e-16) 18. GSL: elapsed time t=1.1 s, 16384 iters, t-(init.)=1.04 s t(norm)=0.165304, mflops=30.2474 (err=3.8e-16) 19. GSL DIT: elapsed time t=1.38 s, 16384 iters, t-(init.)=1.2 s t(norm)=0.190735, mflops=26.2144 (err=3.2e-16) 20. GSL DIF: elapsed time t=1.36 s, 16384 iters, t-(init.)=1.18 s t(norm)=0.187556, mflops=26.6587 (err=3.1e-16) 21. Krukar: elapsed time t=1.84 s, 32768 iters, t-(init.)=1.69 s t(norm)=0.134309, mflops=37.2276 (err=5.3e-16) 22. Mayer (Buneman): elapsed time t=1.21 s, 16384 iters, t-(init.)=1.09 s t(norm)=0.173251, mflops=28.8599 (err=2.0e-16) 23. Mayer (simple): elapsed time t=1 s, 16384 iters, t-(init.)=0.82 s t(norm)=0.130335, mflops=38.3625 24. Mayer (lookup): elapsed time t=1.9 s, 32768 iters, t-(init.)=1.66 s t(norm)=0.131925, mflops=37.9003 (err=3.4e-16) 25. Monro: elapsed time t=1.64 s, 16384 iters, t-(init.)=1.6 s t(norm)=0.254313, mflops=19.6608 (err=4.9e-08) 26. NAPACK (f2c): elapsed time t=1.59 s, 8192 iters, t-(init.)=1.53 s t(norm)=0.486374, mflops=10.2802 (err=1.0e-15) 27. Nielsen: elapsed time t=1.83 s, 16384 iters, t-(init.)=1.65 s t(norm)=0.26226, mflops=19.065 (err=6.5e-15) 28. NR (C): elapsed time t=1.19 s, 16384 iters, t-(init.)=0.99 s t(norm)=0.157356, mflops=31.775 (err=3.2e-16) 29. 
NR (F): elapsed time t=1.63 s, 16384 iters, t-(init.)=1.45 s t(norm)=0.230471, mflops=21.6947 (err=3.4e-16) 30. Ooura (C): elapsed time t=1.63 s, 32768 iters, t-(init.)=1.43 s t(norm)=0.113646, mflops=43.9962 (err=2.9e-16) 31. Ooura (F): elapsed time t=1.79 s, 32768 iters, t-(init.)=1.56 s t(norm)=0.123978, mflops=40.3298 (err=2.9e-16) 32. QFT: elapsed time t=1.89 s, 32768 iters, t-(init.)=1.52 s t(norm)=0.120799, mflops=41.3912 (err=5.9e-16) 33. Ransom: elapsed time t=1.83 s, 16384 iters, t-(init.)=1.8 s t(norm)=0.286102, mflops=17.4763 (err=2.5e-15) 34. SCIPORT: elapsed time t=1.47 s, 8192 iters, t-(init.)=1.45 s t(norm)=0.460943, mflops=10.8473 (err=2.0e-07) 35. Singleton: elapsed time t=1.2 s, 16384 iters, t-(init.)=1.17 s t(norm)=0.185966, mflops=26.8866 (err=3.5e-16) 36. Singleton (f2c): elapsed time t=1.22 s, 16384 iters, t-(init.)=1.11 s t(norm)=0.17643, mflops=28.3399 (err=4.6e-16) 37. Sorensen: elapsed time t=1.96 s, 32768 iters, t-(init.)=1.73 s t(norm)=0.137488, mflops=36.3668 (err=3.4e-16) 38. Sorensen DIT: elapsed time t=1.97 s, 16384 iters, t-(init.)=1.91 s t(norm)=0.303586, mflops=16.4698 (err=5.4e-16) 39. Temperton: elapsed time t=1.55 s, 16384 iters, t-(init.)=1.38 s t(norm)=0.219345, mflops=22.7951 (err=1.2e-07) 40. Temperton (f2c): elapsed time t=1.39 s, 16384 iters, t-(init.)=1.36 s t(norm)=0.216166, mflops=23.1304 (err=3.7e-16) 41. Valkenburg: elapsed time t=1.77 s, 4096 iters, t-(init.)=1.76 s t(norm)=1.11898, mflops=4.46836 (err=8.5e-16) Top mflops for N=64 = 75.8007 Normalized results and averages for N=64: fft 0: mflops = 25.575 (norm. = 0.337398), norm. avg. (of 6) = 0.38961 fft 1: mflops = 29.1271 (norm. = 0.384259), norm. avg. (of 6) = 0.382551 fft 2: mflops = 24.0132 (norm. = 0.316794), norm. avg. (of 6) = 0.171584 fft 3: mflops = 13.2173 (norm. = 0.17437), norm. avg. (of 6) = 0.0748434 fft 4: mflops = 26.6587 (norm. = 0.351695), norm. avg. (of 6) = 0.161546 fft 5: mflops = 7.94376 (norm. = 0.104798), norm. avg. 
(of 6) = 0.0604422 fft 6: mflops = 30.2474 (norm. = 0.399038), norm. avg. (of 6) = 0.214092 fft 7: mflops = 22.7951 (norm. = 0.300725), norm. avg. (of 6) = 0.130728 fft 8: mflops = 14.8383 (norm. = 0.195755), norm. avg. (of 6) = 0.113488 fft 9: mflops = 22.3101 (norm. = 0.294326), norm. avg. (of 6) = 0.168585 fft 10: mflops = 23.3017 (norm. = 0.307407), norm. avg. (of 6) = 0.127865 fft 11: mflops = 16.7326 (norm. = 0.220745), norm. avg. (of 5) = 0.139463 fft 12: mflops = 51.15 (norm. = 0.674797), norm. avg. (of 6) = 0.361558 fft 13: mflops = 18.8367 (norm. = 0.248503), norm. avg. (of 6) = 0.158954 fft 14: mflops = 65.536 (norm. = 0.864583), norm. avg. (of 6) = 0.652309 fft 15: mflops = 68.3854 (norm. = 0.902174), norm. avg. (of 6) = 0.631682 fft 16: mflops = 75.8007 (norm. = 1), norm. avg. (of 6) = 0.767151 fft 17: mflops = 24.3855 (norm. = 0.321705), norm. avg. (of 4) = 0.243671 fft 18: mflops = 30.2474 (norm. = 0.399038), norm. avg. (of 6) = 0.20803 fft 19: mflops = 26.2144 (norm. = 0.345833), norm. avg. (of 6) = 0.153653 fft 20: mflops = 26.6587 (norm. = 0.351695), norm. avg. (of 6) = 0.151205 fft 21: mflops = 37.2276 (norm. = 0.491124), norm. avg. (of 6) = 0.673927 fft 22: mflops = 28.8599 (norm. = 0.380734), norm. avg. (of 5) = 0.273997 fft 23: mflops = 38.3625 (norm. = 0.506098), norm. avg. (of 5) = 0.320236 fft 24: mflops = 37.9003 (norm. = 0.5), norm. avg. (of 5) = 0.343714 fft 25: mflops = 19.6608 (norm. = 0.259375), norm. avg. (of 5) = 0.127705 fft 26: mflops = 10.2802 (norm. = 0.135621), norm. avg. (of 6) = 0.0631635 fft 27: mflops = 19.065 (norm. = 0.251515), norm. avg. (of 6) = 0.10963 fft 28: mflops = 31.775 (norm. = 0.419192), norm. avg. (of 6) = 0.176091 fft 29: mflops = 21.6947 (norm. = 0.286207), norm. avg. (of 6) = 0.125787 fft 30: mflops = 43.9962 (norm. = 0.58042), norm. avg. (of 6) = 0.591353 fft 31: mflops = 40.3298 (norm. = 0.532051), norm. avg. (of 6) = 0.39027 fft 32: mflops = 41.3912 (norm. = 0.546053), norm. avg. 
(of 3) = 0.553674 fft 33: mflops = 17.4763 (norm. = 0.230556), norm. avg. (of 5) = 0.102779 fft 34: mflops = 10.8473 (norm. = 0.143103), norm. avg. (of 5) = 0.104459 fft 35: mflops = 26.8866 (norm. = 0.354701), norm. avg. (of 6) = 0.163964 fft 36: mflops = 28.3399 (norm. = 0.373874), norm. avg. (of 6) = 0.173574 fft 37: mflops = 36.3668 (norm. = 0.479769), norm. avg. (of 6) = 0.250775 fft 38: mflops = 16.4698 (norm. = 0.217277), norm. avg. (of 6) = 0.123989 fft 39: mflops = 22.7951 (norm. = 0.300725), norm. avg. (of 6) = 0.137783 fft 40: mflops = 23.1304 (norm. = 0.305147), norm. avg. (of 6) = 0.150767 fft 41: mflops = 4.46836 (norm. = 0.0589489), norm. avg. (of 6) = 0.0361361 Benchmarking for array size = 128 (power of 2): 0. Arndt DIF: elapsed time t=1.33 s, 8192 iters, t-(init.)=1.3 s t(norm)=0.177111, mflops=28.2309 (err=3.8e-16) 1. Arndt DIT: elapsed time t=1.3 s, 8192 iters, t-(init.)=1.11 s t(norm)=0.151225, mflops=33.0632 (err=5.1e-16) 2. Arndt Split-Radix: elapsed time t=1.54 s, 8192 iters, t-(init.)=1.51 s t(norm)=0.205721, mflops=24.3047 (err=6.1e-16) 3. Arndt 4-step: elapsed time t=1.43 s, 4096 iters, t-(init.)=1.42 s t(norm)=0.386919, mflops=12.9226 (err=3.3e-16) 4. Bailey: elapsed time t=1.46 s, 8192 iters, t-(init.)=1.32 s t(norm)=0.179836, mflops=27.8032 (err=6.1e-16) 5. Beauregard: elapsed time t=1.17 s, 2048 iters, t-(init.)=1.15 s t(norm)=0.6267, mflops=7.9783 (err=9.3e-16) 6. Bergland: elapsed time t=1.16 s, 8192 iters, t-(init.)=1.02 s t(norm)=0.138964, mflops=35.9805 (err=6.2e-16) 7. Brenner: elapsed time t=1.66 s, 8192 iters, t-(init.)=1.54 s t(norm)=0.209808, mflops=23.8313 (err=6.6e-16) 8. Burrus: elapsed time t=1.14 s, 4096 iters, t-(init.)=1.12 s t(norm)=0.305176, mflops=16.384 (err=5.3e-16) 9. CWP (min N) (N=130): elapsed time t=1.61 s, 8192 iters, t-(init.)=1.58 s t(norm)=0.215258, mflops=23.2279 10. CWP (best N) (N=140): elapsed time t=1.51 s, 8192 iters, t-(init.)=1.47 s t(norm)=0.200272, mflops=24.9661 11. 
Edelblute: elapsed time t=1.07 s, 4096 iters, t-(init.)=1.02 s t(norm)=0.277928, mflops=17.9903 (err=6.7e-16) 12. FFTPACK: elapsed time t=1.81 s, 16384 iters, t-(init.)=1.51 s t(norm)=0.102861, mflops=48.6095 (err=5.3e-16) 13. FFTPACK (f2c): elapsed time t=1.87 s, 8192 iters, t-(init.)=1.83 s t(norm)=0.249318, mflops=20.0547 (err=6.2e-16) FFTW_MEASURE plan: (cost = 9.277344e-05) FFTW_TWIDDLE 8 FFTW_NOTW 16 14. FFTW: elapsed time t=1.58 s, 16384 iters, t-(init.)=1.21 s t(norm)=0.0824247, mflops=60.6614 (err=4.6e-16) FFTW_ESTIMATE plan: (cost = 1.075200e+03) FFTW_TWIDDLE 4 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.39 s, 16384 iters, t-(init.)=1.06 s t(norm)=0.0722068, mflops=69.2456 (err=3.6e-16) 16. Frigo-old: elapsed time t=1.31 s, 16384 iters, t-(init.)=1.01 s t(norm)=0.0688008, mflops=72.6736 (err=4.4e-16) 17. Green: elapsed time t=1.8 s, 8192 iters, t-(init.)=1.61 s t(norm)=0.219345, mflops=22.7951 (err=6.9e-16) 18. GSL: elapsed time t=1.24 s, 8192 iters, t-(init.)=1.2 s t(norm)=0.163487, mflops=30.5835 (err=8.2e-16) 19. GSL DIT: elapsed time t=1.49 s, 8192 iters, t-(init.)=1.3 s t(norm)=0.177111, mflops=28.2309 (err=7.5e-16) 20. GSL DIF: elapsed time t=1.46 s, 8192 iters, t-(init.)=1.32 s t(norm)=0.179836, mflops=27.8032 (err=7.6e-16) 21. Krukar: elapsed time t=1.4 s, 8192 iters, t-(init.)=1.34 s t(norm)=0.182561, mflops=27.3882 (err=6.5e-16) 22. Mayer (Buneman): elapsed time t=1.3 s, 8192 iters, t-(init.)=1.24 s t(norm)=0.168937, mflops=29.5969 (err=3.1e-16) 23. Mayer (simple): elapsed time t=1.05 s, 8192 iters, t-(init.)=1.02 s t(norm)=0.138964, mflops=35.9805 24. Mayer (lookup): elapsed time t=1.01 s, 8192 iters, t-(init.)=0.84 s t(norm)=0.114441, mflops=43.6907 (err=3.5e-16) 25. Monro: elapsed time t=1.68 s, 8192 iters, t-(init.)=1.65 s t(norm)=0.224795, mflops=22.2425 (err=8.3e-08) 26. NAPACK (f2c): elapsed time t=1.76 s, 4096 iters, t-(init.)=1.74 s t(norm)=0.474112, mflops=10.546 (err=1.6e-15) 27. 
Nielsen: elapsed time t=1.12 s, 4096 iters, t-(init.)=1.09 s t(norm)=0.297001, mflops=16.8349 (err=1.7e-15) 28. NR (C): elapsed time t=1.28 s, 8192 iters, t-(init.)=1.25 s t(norm)=0.170299, mflops=29.3601 (err=7.5e-16) 29. NR (F): elapsed time t=1.72 s, 8192 iters, t-(init.)=1.69 s t(norm)=0.230244, mflops=21.7161 (err=6.9e-16) 30. Ooura (C): elapsed time t=1.8 s, 16384 iters, t-(init.)=1.41 s t(norm)=0.0960486, mflops=52.057 (err=6.7e-16) 31. Ooura (F): elapsed time t=1 s, 8192 iters, t-(init.)=0.92 s t(norm)=0.12534, mflops=39.8915 (err=6.7e-16) 32. QFT: elapsed time t=1.14 s, 8192 iters, t-(init.)=0.94 s t(norm)=0.128065, mflops=39.0427 (err=4.9e-16) 33. Ransom: elapsed time t=1.06 s, 4096 iters, t-(init.)=1.04 s t(norm)=0.283378, mflops=17.6443 (err=1.7e-15) 34. SCIPORT: elapsed time t=1.75 s, 4096 iters, t-(init.)=1.73 s t(norm)=0.471388, mflops=10.607 (err=1.6e-07) 35. Singleton: elapsed time t=1.36 s, 8192 iters, t-(init.)=1.28 s t(norm)=0.174386, mflops=28.672 (err=6.2e-16) 36. Singleton (f2c): elapsed time t=1.29 s, 8192 iters, t-(init.)=1.26 s t(norm)=0.171661, mflops=29.1271 (err=5.7e-16) 37. Sorensen: elapsed time t=1.05 s, 8192 iters, t-(init.)=1.02 s t(norm)=0.138964, mflops=35.9805 (err=4.3e-16) 38. Sorensen DIT: elapsed time t=1.06 s, 4096 iters, t-(init.)=1.04 s t(norm)=0.283378, mflops=17.6443 (err=4.0e-16) 39. Temperton: elapsed time t=1.78 s, 8192 iters, t-(init.)=1.65 s t(norm)=0.224795, mflops=22.2425 (err=9.9e-08) 40. Temperton (f2c): elapsed time t=1.54 s, 8192 iters, t-(init.)=1.44 s t(norm)=0.196184, mflops=25.4862 (err=7.7e-16) 41. Valkenburg: elapsed time t=1.02 s, 1024 iters, t-(init.)=1.02 s t(norm)=1.11171, mflops=4.49757 (err=8.6e-16) Top mflops for N=128 = 72.6736 Normalized results and averages for N=128: fft 0: mflops = 28.2309 (norm. = 0.388462), norm. avg. (of 7) = 0.389446 fft 1: mflops = 33.0632 (norm. = 0.454955), norm. avg. (of 7) = 0.392895 fft 2: mflops = 24.3047 (norm. = 0.334437), norm. avg. 
(of 7) = 0.194849 fft 3: mflops = 12.9226 (norm. = 0.177817), norm. avg. (of 7) = 0.0895539 fft 4: mflops = 27.8032 (norm. = 0.382576), norm. avg. (of 7) = 0.193122 fft 5: mflops = 7.9783 (norm. = 0.109783), norm. avg. (of 7) = 0.0674908 fft 6: mflops = 35.9805 (norm. = 0.495098), norm. avg. (of 7) = 0.254236 fft 7: mflops = 23.8313 (norm. = 0.327922), norm. avg. (of 7) = 0.158899 fft 8: mflops = 16.384 (norm. = 0.225446), norm. avg. (of 7) = 0.129482 fft 9: mflops = 23.2279 (norm. = 0.31962), norm. avg. (of 7) = 0.190161 fft 10: mflops = 24.9661 (norm. = 0.343537), norm. avg. (of 7) = 0.158675 fft 11: mflops = 17.9903 (norm. = 0.247549), norm. avg. (of 6) = 0.157477 fft 12: mflops = 48.6095 (norm. = 0.668874), norm. avg. (of 7) = 0.405461 fft 13: mflops = 20.0547 (norm. = 0.275956), norm. avg. (of 7) = 0.175668 fft 14: mflops = 60.6614 (norm. = 0.834711), norm. avg. (of 7) = 0.678366 fft 15: mflops = 69.2456 (norm. = 0.95283), norm. avg. (of 7) = 0.67756 fft 16: mflops = 72.6736 (norm. = 1), norm. avg. (of 7) = 0.800415 fft 17: mflops = 22.7951 (norm. = 0.313665), norm. avg. (of 5) = 0.25767 fft 18: mflops = 30.5835 (norm. = 0.420833), norm. avg. (of 7) = 0.23843 fft 19: mflops = 28.2309 (norm. = 0.388462), norm. avg. (of 7) = 0.187197 fft 20: mflops = 27.8032 (norm. = 0.382576), norm. avg. (of 7) = 0.184258 fft 21: mflops = 27.3882 (norm. = 0.376866), norm. avg. (of 7) = 0.63149 fft 22: mflops = 29.5969 (norm. = 0.407258), norm. avg. (of 6) = 0.296207 fft 23: mflops = 35.9805 (norm. = 0.495098), norm. avg. (of 6) = 0.349379 fft 24: mflops = 43.6907 (norm. = 0.60119), norm. avg. (of 6) = 0.386627 fft 25: mflops = 22.2425 (norm. = 0.306061), norm. avg. (of 6) = 0.157431 fft 26: mflops = 10.546 (norm. = 0.145115), norm. avg. (of 7) = 0.0748708 fft 27: mflops = 16.8349 (norm. = 0.231651), norm. avg. (of 7) = 0.127061 fft 28: mflops = 29.3601 (norm. = 0.404), norm. avg. (of 7) = 0.20865 fft 29: mflops = 21.7161 (norm. = 0.298817), norm. avg. 
(of 7) = 0.150506 fft 30: mflops = 52.057 (norm. = 0.716312), norm. avg. (of 7) = 0.609204 fft 31: mflops = 39.8915 (norm. = 0.548913), norm. avg. (of 7) = 0.412933 fft 32: mflops = 39.0427 (norm. = 0.537234), norm. avg. (of 4) = 0.549564 fft 33: mflops = 17.6443 (norm. = 0.242788), norm. avg. (of 6) = 0.126114 fft 34: mflops = 10.607 (norm. = 0.145954), norm. avg. (of 6) = 0.111374 fft 35: mflops = 28.672 (norm. = 0.394531), norm. avg. (of 7) = 0.196902 fft 36: mflops = 29.1271 (norm. = 0.400794), norm. avg. (of 7) = 0.206034 fft 37: mflops = 35.9805 (norm. = 0.495098), norm. avg. (of 7) = 0.285679 fft 38: mflops = 17.6443 (norm. = 0.242788), norm. avg. (of 7) = 0.140961 fft 39: mflops = 22.2425 (norm. = 0.306061), norm. avg. (of 7) = 0.161822 fft 40: mflops = 25.4862 (norm. = 0.350694), norm. avg. (of 7) = 0.179328 fft 41: mflops = 4.49757 (norm. = 0.0618873), norm. avg. (of 7) = 0.0398149 Benchmarking for array size = 256 (power of 2): 0. Arndt DIF: elapsed time t=1.53 s, 4096 iters, t-(init.)=1.37 s t(norm)=0.163317, mflops=30.6154 (err=4.8e-16) 1. Arndt DIT: elapsed time t=1.54 s, 4096 iters, t-(init.)=1.51 s t(norm)=0.180006, mflops=27.7768 (err=5.1e-16) 2. Arndt Split-Radix: elapsed time t=1.65 s, 4096 iters, t-(init.)=1.63 s t(norm)=0.194311, mflops=25.7319 (err=5.5e-16) 3. Arndt 4-step: elapsed time t=1.5 s, 2048 iters, t-(init.)=1.49 s t(norm)=0.355244, mflops=14.0748 (err=5.7e-16) 4. Bailey: elapsed time t=1.63 s, 4096 iters, t-(init.)=1.6 s t(norm)=0.190735, mflops=26.2144 (err=5.5e-16) 5. Beauregard: elapsed time t=1.33 s, 1024 iters, t-(init.)=1.33 s t(norm)=0.634193, mflops=7.88403 (err=4.8e-16) 6. Bergland: elapsed time t=1.24 s, 4096 iters, t-(init.)=1.21 s t(norm)=0.144243, mflops=34.6637 (err=5.7e-16) 7. Brenner: elapsed time t=1.73 s, 4096 iters, t-(init.)=1.58 s t(norm)=0.188351, mflops=26.5462 (err=4.8e-16) 8. Burrus: elapsed time t=1.21 s, 2048 iters, t-(init.)=1.2 s t(norm)=0.286102, mflops=17.4763 (err=5.4e-16) 9. 
CWP (min N) (N=260): elapsed time t=1.7 s, 4096 iters, t-(init.)=1.58 s t(norm)=0.188351, mflops=26.5462 10. CWP (best N) (N=280): elapsed time t=1.68 s, 4096 iters, t-(init.)=1.56 s t(norm)=0.185966, mflops=26.8866 11. Edelblute: elapsed time t=1.13 s, 2048 iters, t-(init.)=1.11 s t(norm)=0.264645, mflops=18.8933 (err=5.9e-16) 12. FFTPACK: elapsed time t=1.96 s, 8192 iters, t-(init.)=1.78 s t(norm)=0.106096, mflops=47.127 (err=4.4e-16) 13. FFTPACK (f2c): elapsed time t=1.08 s, 2048 iters, t-(init.)=1.06 s t(norm)=0.252724, mflops=19.7845 (err=4.5e-16) FFTW_MEASURE plan: (cost = 2.050781e-04) FFTW_TWIDDLE 4 FFTW_TWIDDLE 4 FFTW_NOTW 16 14. FFTW: elapsed time t=1.71 s, 8192 iters, t-(init.)=1.35 s t(norm)=0.0804663, mflops=62.1378 (err=4.5e-16) FFTW_ESTIMATE plan: (cost = 9.216000e+02) FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.8 s, 8192 iters, t-(init.)=1.42 s t(norm)=0.0846386, mflops=59.0747 (err=4.6e-16) 16. Frigo-old: elapsed time t=1.53 s, 8192 iters, t-(init.)=1.19 s t(norm)=0.0709295, mflops=70.4925 (err=4.5e-16) 17. Green: elapsed time t=1.02 s, 2048 iters, t-(init.)=1.01 s t(norm)=0.240803, mflops=20.7639 (err=4.8e-16) 18. GSL: elapsed time t=1.32 s, 4096 iters, t-(init.)=1.29 s t(norm)=0.15378, mflops=32.514 (err=4.7e-16) 19. GSL DIT: elapsed time t=1.62 s, 4096 iters, t-(init.)=1.59 s t(norm)=0.189543, mflops=26.3793 (err=5.0e-16) 20. GSL DIF: elapsed time t=1.56 s, 4096 iters, t-(init.)=1.47 s t(norm)=0.175238, mflops=28.5327 (err=4.9e-16) 21. Krukar: elapsed time t=1.37 s, 4096 iters, t-(init.)=1.32 s t(norm)=0.157356, mflops=31.775 (err=5.0e-16) 22. Mayer (Buneman): elapsed time t=1.47 s, 4096 iters, t-(init.)=1.33 s t(norm)=0.158548, mflops=31.5361 (err=4.7e-16) 23. Mayer (simple): elapsed time t=1.19 s, 4096 iters, t-(init.)=1.13 s t(norm)=0.134706, mflops=37.1177 24. Mayer (lookup): elapsed time t=1.15 s, 4096 iters, t-(init.)=1.01 s t(norm)=0.120401, mflops=41.5278 (err=5.7e-16) 25. 
Monro: elapsed time t=1.75 s, 4096 iters, t-(init.)=1.57 s t(norm)=0.187159, mflops=26.7153 (err=8.2e-08) 26. NAPACK (f2c): elapsed time t=1.89 s, 2048 iters, t-(init.)=1.82 s t(norm)=0.433922, mflops=11.5228 (err=3.9e-15) 27. Nielsen: elapsed time t=1.16 s, 2048 iters, t-(init.)=1.11 s t(norm)=0.264645, mflops=18.8933 (err=3.8e-15) 28. NR (C): elapsed time t=1.36 s, 4096 iters, t-(init.)=1.33 s t(norm)=0.158548, mflops=31.5361 (err=4.9e-16) 29. NR (F): elapsed time t=1.82 s, 4096 iters, t-(init.)=1.74 s t(norm)=0.207424, mflops=24.1052 (err=4.5e-16) 30. Ooura (C): elapsed time t=1.06 s, 4096 iters, t-(init.)=0.95 s t(norm)=0.113249, mflops=44.1506 (err=5.0e-16) 31. Ooura (F): elapsed time t=1.16 s, 4096 iters, t-(init.)=0.97 s t(norm)=0.115633, mflops=43.2402 (err=5.0e-16) 32. QFT: elapsed time t=1.44 s, 4096 iters, t-(init.)=1.26 s t(norm)=0.150204, mflops=33.2881 (err=7.0e-16) 33. Ransom: elapsed time t=1.89 s, 4096 iters, t-(init.)=1.73 s t(norm)=0.206232, mflops=24.2445 (err=2.0e-15) 34. SCIPORT: elapsed time t=1.03 s, 1024 iters, t-(init.)=1.02 s t(norm)=0.486374, mflops=10.2802 (err=1.4e-07) 35. Singleton: elapsed time t=1.43 s, 4096 iters, t-(init.)=1.39 s t(norm)=0.165701, mflops=30.1748 (err=5.0e-16) 36. Singleton (f2c): elapsed time t=1.49 s, 4096 iters, t-(init.)=1.46 s t(norm)=0.174046, mflops=28.7281 (err=5.4e-16) 37. Sorensen: elapsed time t=1.15 s, 4096 iters, t-(init.)=1.07 s t(norm)=0.127554, mflops=39.1991 (err=6.0e-16) 38. Sorensen DIT: elapsed time t=1.11 s, 2048 iters, t-(init.)=1.06 s t(norm)=0.252724, mflops=19.7845 (err=5.7e-16) 39. Temperton: elapsed time t=1.92 s, 4096 iters, t-(init.)=1.84 s t(norm)=0.219345, mflops=22.7951 (err=9.1e-08) 40. Temperton (f2c): elapsed time t=1.73 s, 4096 iters, t-(init.)=1.64 s t(norm)=0.195503, mflops=25.575 (err=4.5e-16) 41. 
Valkenburg: elapsed time t=1.16 s, 512 iters, t-(init.)=1.15 s t(norm)=1.09673, mflops=4.55903 (err=6.4e-16) Top mflops for N=256 = 70.4925 Normalized results and averages for N=256: fft 0: mflops = 30.6154 (norm. = 0.434307), norm. avg. (of 8) = 0.395054 fft 1: mflops = 27.7768 (norm. = 0.39404), norm. avg. (of 8) = 0.393038 fft 2: mflops = 25.7319 (norm. = 0.365031), norm. avg. (of 8) = 0.216122 fft 3: mflops = 14.0748 (norm. = 0.199664), norm. avg. (of 8) = 0.103318 fft 4: mflops = 26.2144 (norm. = 0.371875), norm. avg. (of 8) = 0.215466 fft 5: mflops = 7.88403 (norm. = 0.111842), norm. avg. (of 8) = 0.0730347 fft 6: mflops = 34.6637 (norm. = 0.491736), norm. avg. (of 8) = 0.283923 fft 7: mflops = 26.5462 (norm. = 0.376582), norm. avg. (of 8) = 0.186109 fft 8: mflops = 17.4763 (norm. = 0.247917), norm. avg. (of 8) = 0.144286 fft 9: mflops = 26.5462 (norm. = 0.376582), norm. avg. (of 8) = 0.213464 fft 10: mflops = 26.8866 (norm. = 0.38141), norm. avg. (of 8) = 0.186517 fft 11: mflops = 18.8933 (norm. = 0.268018), norm. avg. (of 7) = 0.173269 fft 12: mflops = 47.127 (norm. = 0.668539), norm. avg. (of 8) = 0.438345 fft 13: mflops = 19.7845 (norm. = 0.28066), norm. avg. (of 8) = 0.188792 fft 14: mflops = 62.1378 (norm. = 0.881481), norm. avg. (of 8) = 0.703756 fft 15: mflops = 59.0747 (norm. = 0.838028), norm. avg. (of 8) = 0.697619 fft 16: mflops = 70.4925 (norm. = 1), norm. avg. (of 8) = 0.825363 fft 17: mflops = 20.7639 (norm. = 0.294554), norm. avg. (of 6) = 0.263817 fft 18: mflops = 32.514 (norm. = 0.46124), norm. avg. (of 8) = 0.266281 fft 19: mflops = 26.3793 (norm. = 0.374214), norm. avg. (of 8) = 0.210574 fft 20: mflops = 28.5327 (norm. = 0.404762), norm. avg. (of 8) = 0.211821 fft 21: mflops = 31.775 (norm. = 0.450758), norm. avg. (of 8) = 0.608898 fft 22: mflops = 31.5361 (norm. = 0.447368), norm. avg. (of 7) = 0.317802 fft 23: mflops = 37.1177 (norm. = 0.526549), norm. avg. (of 7) = 0.374689 fft 24: mflops = 41.5278 (norm. = 0.589109), norm. avg. 
(of 7) = 0.415553 fft 25: mflops = 26.7153 (norm. = 0.378981), norm. avg. (of 7) = 0.189081 fft 26: mflops = 11.5228 (norm. = 0.163462), norm. avg. (of 8) = 0.0859447 fft 27: mflops = 18.8933 (norm. = 0.268018), norm. avg. (of 8) = 0.144681 fft 28: mflops = 31.5361 (norm. = 0.447368), norm. avg. (of 8) = 0.23849 fft 29: mflops = 24.1052 (norm. = 0.341954), norm. avg. (of 8) = 0.174437 fft 30: mflops = 44.1506 (norm. = 0.626316), norm. avg. (of 8) = 0.611343 fft 31: mflops = 43.2402 (norm. = 0.613402), norm. avg. (of 8) = 0.437992 fft 32: mflops = 33.2881 (norm. = 0.472222), norm. avg. (of 5) = 0.534095 fft 33: mflops = 24.2445 (norm. = 0.343931), norm. avg. (of 7) = 0.15723 fft 34: mflops = 10.2802 (norm. = 0.145833), norm. avg. (of 7) = 0.116297 fft 35: mflops = 30.1748 (norm. = 0.428058), norm. avg. (of 8) = 0.225796 fft 36: mflops = 28.7281 (norm. = 0.407534), norm. avg. (of 8) = 0.231221 fft 37: mflops = 39.1991 (norm. = 0.556075), norm. avg. (of 8) = 0.319478 fft 38: mflops = 19.7845 (norm. = 0.28066), norm. avg. (of 8) = 0.158423 fft 39: mflops = 22.7951 (norm. = 0.32337), norm. avg. (of 8) = 0.182016 fft 40: mflops = 25.575 (norm. = 0.362805), norm. avg. (of 8) = 0.202262 fft 41: mflops = 4.55903 (norm. = 0.0646739), norm. avg. (of 8) = 0.0429222 Benchmarking for array size = 512 (power of 2): 0. Arndt DIF: elapsed time t=1.6 s, 2048 iters, t-(init.)=1.57 s t(norm)=0.166363, mflops=30.0547 (err=5.4e-16) 1. Arndt DIT: elapsed time t=1.6 s, 2048 iters, t-(init.)=1.57 s t(norm)=0.166363, mflops=30.0547 (err=5.5e-16) 2. Arndt Split-Radix: elapsed time t=1.76 s, 2048 iters, t-(init.)=1.73 s t(norm)=0.183317, mflops=27.2751 (err=6.0e-16) 3. Arndt 4-step: elapsed time t=1.69 s, 1024 iters, t-(init.)=1.67 s t(norm)=0.353919, mflops=14.1275 (err=5.5e-16) 4. Bailey: elapsed time t=1.97 s, 2048 iters, t-(init.)=1.89 s t(norm)=0.200272, mflops=24.9661 (err=5.7e-16) 5. 
Beauregard: elapsed time t=1.52 s, 512 iters, t-(init.)=1.52 s t(norm)=0.64426, mflops=7.76084 (err=6.6e-16) 6. Bergland: elapsed time t=1.38 s, 2048 iters, t-(init.)=1.29 s t(norm)=0.136693, mflops=36.5782 (err=5.5e-16) 7. Brenner: elapsed time t=1.86 s, 2048 iters, t-(init.)=1.73 s t(norm)=0.183317, mflops=27.2751 (err=5.7e-16) 8. Burrus: elapsed time t=1.29 s, 1024 iters, t-(init.)=1.28 s t(norm)=0.271267, mflops=18.432 (err=5.8e-16) 9. CWP (min N) (N=520): elapsed time t=1.87 s, 2048 iters, t-(init.)=1.74 s t(norm)=0.184377, mflops=27.1183 10. CWP (best N) (N=560): elapsed time t=1.83 s, 2048 iters, t-(init.)=1.67 s t(norm)=0.17696, mflops=28.255 11. Edelblute: elapsed time t=1.19 s, 1024 iters, t-(init.)=1.17 s t(norm)=0.247955, mflops=20.1649 (err=6.1e-16) 12. FFTPACK: elapsed time t=1.47 s, 2048 iters, t-(init.)=1.33 s t(norm)=0.140932, mflops=35.4781 (err=5.5e-16) 13. FFTPACK (f2c): elapsed time t=1.38 s, 1024 iters, t-(init.)=1.31 s t(norm)=0.277625, mflops=18.0099 (err=5.4e-16) FFTW_MEASURE plan: (cost = 5.664062e-04) FFTW_TWIDDLE 4 FFTW_TWIDDLE 8 FFTW_NOTW 16 14. FFTW: elapsed time t=1.16 s, 2048 iters, t-(init.)=0.96 s t(norm)=0.101725, mflops=49.152 (err=5.2e-16) FFTW_ESTIMATE plan: (cost = 1.843200e+03) FFTW_TWIDDLE 16 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.31 s, 2048 iters, t-(init.)=1.11 s t(norm)=0.11762, mflops=42.5098 (err=5.4e-16) 16. Frigo-old: elapsed time t=1.42 s, 2048 iters, t-(init.)=1.28 s t(norm)=0.135634, mflops=36.864 (err=5.4e-16) 17. Green: elapsed time t=1.09 s, 1024 iters, t-(init.)=1.08 s t(norm)=0.228882, mflops=21.8453 (err=5.9e-16) 18. GSL: elapsed time t=1.73 s, 2048 iters, t-(init.)=1.64 s t(norm)=0.173781, mflops=28.7719 (err=6.1e-16) 19. GSL DIT: elapsed time t=1.75 s, 2048 iters, t-(init.)=1.66 s t(norm)=0.1759, mflops=28.4253 (err=5.9e-16) 20. GSL DIF: elapsed time t=1.68 s, 2048 iters, t-(init.)=1.54 s t(norm)=0.163184, mflops=30.6402 (err=5.6e-16) 21. 
Krukar: elapsed time t=1.76 s, 2048 iters, t-(init.)=1.67 s t(norm)=0.17696, mflops=28.255 (err=5.6e-16) 22. Mayer (Buneman): elapsed time t=1.54 s, 2048 iters, t-(init.)=1.5 s t(norm)=0.158946, mflops=31.4573 (err=5.7e-16) 23. Mayer (simple): elapsed time t=1.24 s, 2048 iters, t-(init.)=1.11 s t(norm)=0.11762, mflops=42.5098 24. Mayer (lookup): elapsed time t=1.21 s, 2048 iters, t-(init.)=1.1 s t(norm)=0.11656, mflops=42.8963 (err=5.1e-16) 25. Monro: elapsed time t=1.86 s, 2048 iters, t-(init.)=1.72 s t(norm)=0.182258, mflops=27.4337 (err=7.5e-08) 26. NAPACK (f2c): elapsed time t=1.07 s, 512 iters, t-(init.)=1.03 s t(norm)=0.436571, mflops=11.4529 (err=6.0e-15) 27. Nielsen: elapsed time t=1.27 s, 1024 iters, t-(init.)=1.19 s t(norm)=0.252194, mflops=19.826 (err=3.0e-15) 28. NR (C): elapsed time t=1.48 s, 2048 iters, t-(init.)=1.44 s t(norm)=0.152588, mflops=32.768 (err=5.9e-16) 29. NR (F): elapsed time t=1.94 s, 2048 iters, t-(init.)=1.83 s t(norm)=0.193914, mflops=25.7847 (err=5.8e-16) 30. Ooura (C): elapsed time t=1.16 s, 2048 iters, t-(init.)=1.01 s t(norm)=0.107023, mflops=46.7187 (err=5.3e-16) 31. Ooura (F): elapsed time t=1.26 s, 2048 iters, t-(init.)=1.14 s t(norm)=0.120799, mflops=41.3912 (err=5.3e-16) 32. QFT: elapsed time t=1.14 s, 1024 iters, t-(init.)=1.04 s t(norm)=0.220405, mflops=22.6855 (err=7.4e-16) 33. Ransom: elapsed time t=1.08 s, 1024 iters, t-(init.)=1.03 s t(norm)=0.218285, mflops=22.9058 (err=1.6e-15) 34. SCIPORT: elapsed time t=1.4 s, 512 iters, t-(init.)=1.35 s t(norm)=0.572205, mflops=8.73813 (err=1.3e-07) 35. Singleton: elapsed time t=1.57 s, 2048 iters, t-(init.)=1.43 s t(norm)=0.151528, mflops=32.9971 (err=7.9e-16) 36. Singleton (f2c): elapsed time t=1.6 s, 2048 iters, t-(init.)=1.57 s t(norm)=0.166363, mflops=30.0547 (err=8.0e-16) 37. Sorensen: elapsed time t=1.26 s, 2048 iters, t-(init.)=1.23 s t(norm)=0.130335, mflops=38.3625 (err=5.7e-16) 38. 
Sorensen DIT: elapsed time t=1.16 s, 1024 iters, t-(init.)=1.14 s t(norm)=0.241597, mflops=20.6956 (err=5.5e-16) 39. Temperton: elapsed time t=1.12 s, 1024 iters, t-(init.)=1.1 s t(norm)=0.23312, mflops=21.4481 (err=9.2e-08) 40. Temperton (f2c): elapsed time t=1.19 s, 1024 iters, t-(init.)=1.16 s t(norm)=0.245836, mflops=20.3388 (err=6.1e-16) 41. Valkenburg: elapsed time t=1.39 s, 256 iters, t-(init.)=1.37 s t(norm)=1.16136, mflops=4.30528 (err=6.6e-16) Top mflops for N=512 = 49.152 Normalized results and averages for N=512: fft 0: mflops = 30.0547 (norm. = 0.611465), norm. avg. (of 9) = 0.419099 fft 1: mflops = 30.0547 (norm. = 0.611465), norm. avg. (of 9) = 0.417307 fft 2: mflops = 27.2751 (norm. = 0.554913), norm. avg. (of 9) = 0.253765 fft 3: mflops = 14.1275 (norm. = 0.287425), norm. avg. (of 9) = 0.123774 fft 4: mflops = 24.9661 (norm. = 0.507937), norm. avg. (of 9) = 0.247963 fft 5: mflops = 7.76084 (norm. = 0.157895), norm. avg. (of 9) = 0.0824636 fft 6: mflops = 36.5782 (norm. = 0.744186), norm. avg. (of 9) = 0.335064 fft 7: mflops = 27.2751 (norm. = 0.554913), norm. avg. (of 9) = 0.227087 fft 8: mflops = 18.432 (norm. = 0.375), norm. avg. (of 9) = 0.169921 fft 9: mflops = 27.1183 (norm. = 0.551724), norm. avg. (of 9) = 0.251048 fft 10: mflops = 28.255 (norm. = 0.57485), norm. avg. (of 9) = 0.229665 fft 11: mflops = 20.1649 (norm. = 0.410256), norm. avg. (of 8) = 0.202892 fft 12: mflops = 35.4781 (norm. = 0.721805), norm. avg. (of 9) = 0.469841 fft 13: mflops = 18.0099 (norm. = 0.366412), norm. avg. (of 9) = 0.208528 fft 14: mflops = 49.152 (norm. = 1), norm. avg. (of 9) = 0.736672 fft 15: mflops = 42.5098 (norm. = 0.864865), norm. avg. (of 9) = 0.716201 fft 16: mflops = 36.864 (norm. = 0.75), norm. avg. (of 9) = 0.81699 fft 17: mflops = 21.8453 (norm. = 0.444444), norm. avg. (of 7) = 0.289621 fft 18: mflops = 28.7719 (norm. = 0.585366), norm. avg. (of 9) = 0.301735 fft 19: mflops = 28.4253 (norm. = 0.578313), norm. avg. 
(of 9) = 0.251434 fft 20: mflops = 30.6402 (norm. = 0.623377), norm. avg. (of 9) = 0.257549 fft 21: mflops = 28.255 (norm. = 0.57485), norm. avg. (of 9) = 0.605115 fft 22: mflops = 31.4573 (norm. = 0.64), norm. avg. (of 8) = 0.358077 fft 23: mflops = 42.5098 (norm. = 0.864865), norm. avg. (of 8) = 0.435961 fft 24: mflops = 42.8963 (norm. = 0.872727), norm. avg. (of 8) = 0.4727 fft 25: mflops = 27.4337 (norm. = 0.55814), norm. avg. (of 8) = 0.235213 fft 26: mflops = 11.4529 (norm. = 0.23301), norm. avg. (of 9) = 0.102285 fft 27: mflops = 19.826 (norm. = 0.403361), norm. avg. (of 9) = 0.173423 fft 28: mflops = 32.768 (norm. = 0.666667), norm. avg. (of 9) = 0.286065 fft 29: mflops = 25.7847 (norm. = 0.52459), norm. avg. (of 9) = 0.213343 fft 30: mflops = 46.7187 (norm. = 0.950495), norm. avg. (of 9) = 0.649026 fft 31: mflops = 41.3912 (norm. = 0.842105), norm. avg. (of 9) = 0.482893 fft 32: mflops = 22.6855 (norm. = 0.461538), norm. avg. (of 6) = 0.522003 fft 33: mflops = 22.9058 (norm. = 0.466019), norm. avg. (of 8) = 0.195829 fft 34: mflops = 8.73813 (norm. = 0.177778), norm. avg. (of 8) = 0.123982 fft 35: mflops = 32.9971 (norm. = 0.671329), norm. avg. (of 9) = 0.2753 fft 36: mflops = 30.0547 (norm. = 0.611465), norm. avg. (of 9) = 0.273471 fft 37: mflops = 38.3625 (norm. = 0.780488), norm. avg. (of 9) = 0.370701 fft 38: mflops = 20.6956 (norm. = 0.421053), norm. avg. (of 9) = 0.187604 fft 39: mflops = 21.4481 (norm. = 0.436364), norm. avg. (of 9) = 0.210277 fft 40: mflops = 20.3388 (norm. = 0.413793), norm. avg. (of 9) = 0.225766 fft 41: mflops = 4.30528 (norm. = 0.0875912), norm. avg. (of 9) = 0.0478855 Benchmarking for array size = 1024 (power of 2): 0. Arndt DIF: elapsed time t=1.84 s, 1024 iters, t-(init.)=1.66 s t(norm)=0.15831, mflops=31.5836 (err=5.2e-16) 1. Arndt DIT: elapsed time t=1.83 s, 1024 iters, t-(init.)=1.67 s t(norm)=0.159264, mflops=31.3945 (err=4.9e-16) 2. 
Arndt Split-Radix: elapsed time t=1.93 s, 1024 iters, t-(init.)=1.75 s t(norm)=0.166893, mflops=29.9593 (err=5.1e-16) 3. Arndt 4-step: elapsed time t=1.76 s, 512 iters, t-(init.)=1.69 s t(norm)=0.322342, mflops=15.5115 (err=4.4e-16) 4. Bailey: elapsed time t=1.59 s, 512 iters, t-(init.)=1.49 s t(norm)=0.284195, mflops=17.5936 (err=5.6e-16) 5. Beauregard: elapsed time t=1.7 s, 256 iters, t-(init.)=1.68 s t(norm)=0.640869, mflops=7.8019 (err=5.1e-16) 6. Bergland: elapsed time t=1.55 s, 1024 iters, t-(init.)=1.39 s t(norm)=0.132561, mflops=37.7186 (err=5.0e-16) 7. Brenner: elapsed time t=1 s, 512 iters, t-(init.)=0.95 s t(norm)=0.181198, mflops=27.5941 (err=5.1e-16) 8. Burrus: elapsed time t=1.4 s, 512 iters, t-(init.)=1.37 s t(norm)=0.261307, mflops=19.1346 (err=5.2e-16) 9. CWP (min N) (N=1040): elapsed time t=1.02 s, 512 iters, t-(init.)=0.98 s t(norm)=0.18692, mflops=26.7494 10. CWP (best N) (N=1040): elapsed time t=1.02 s, 512 iters, t-(init.)=0.96 s t(norm)=0.183105, mflops=27.3067 11. Edelblute: elapsed time t=1.29 s, 512 iters, t-(init.)=1.26 s t(norm)=0.240326, mflops=20.8051 (err=5.2e-16) 12. FFTPACK: elapsed time t=1.09 s, 512 iters, t-(init.)=1 s t(norm)=0.190735, mflops=26.2144 (err=4.9e-16) 13. FFTPACK (f2c): elapsed time t=1.83 s, 512 iters, t-(init.)=1.74 s t(norm)=0.331879, mflops=15.0657 (err=4.7e-16) FFTW_MEASURE plan: (cost = 1.445312e-03) FFTW_TWIDDLE 2 FFTW_TWIDDLE 16 FFTW_NOTW 32 14. FFTW: elapsed time t=1.53 s, 1024 iters, t-(init.)=1.32 s t(norm)=0.125885, mflops=39.7188 (err=4.9e-16) FFTW_ESTIMATE plan: (cost = 1.126400e+04) FFTW_TWIDDLE 8 FFTW_TWIDDLE 4 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.56 s, 1024 iters, t-(init.)=1.36 s t(norm)=0.1297, mflops=38.5506 (err=4.8e-16) 16. Frigo-old: elapsed time t=1.05 s, 512 iters, t-(init.)=0.96 s t(norm)=0.183105, mflops=27.3067 (err=4.7e-16) 17. Green: elapsed time t=1.33 s, 512 iters, t-(init.)=1.29 s t(norm)=0.246048, mflops=20.3212 (err=5.9e-16) 18. 
GSL: elapsed time t=1.22 s, 512 iters, t-(init.)=1.13 s t(norm)=0.21553, mflops=23.1986 (err=4.9e-16) 19. GSL DIT: elapsed time t=1.94 s, 1024 iters, t-(init.)=1.87 s t(norm)=0.178337, mflops=28.0368 (err=5.1e-16) 20. GSL DIF: elapsed time t=1.85 s, 1024 iters, t-(init.)=1.65 s t(norm)=0.157356, mflops=31.775 (err=4.9e-16) 21. Krukar: elapsed time t=1.13 s, 512 iters, t-(init.)=1.1 s t(norm)=0.209808, mflops=23.8313 (err=5.4e-16) 22. Mayer (Buneman): elapsed time t=1.72 s, 1024 iters, t-(init.)=1.62 s t(norm)=0.154495, mflops=32.3635 (err=4.6e-16) 23. Mayer (simple): elapsed time t=1.41 s, 1024 iters, t-(init.)=1.3 s t(norm)=0.123978, mflops=40.3298 24. Mayer (lookup): elapsed time t=1.44 s, 1024 iters, t-(init.)=1.28 s t(norm)=0.12207, mflops=40.96 (err=4.6e-16) 25. Monro: elapsed time t=1.02 s, 512 iters, t-(init.)=0.99 s t(norm)=0.188828, mflops=26.4792 (err=8.4e-08) 26. NAPACK (f2c): elapsed time t=1.6 s, 256 iters, t-(init.)=1.55 s t(norm)=0.591278, mflops=8.45626 (err=1.5e-14) 27. Nielsen: elapsed time t=1.72 s, 512 iters, t-(init.)=1.63 s t(norm)=0.310898, mflops=16.0825 (err=6.2e-15) 28. NR (C): elapsed time t=1.65 s, 1024 iters, t-(init.)=1.56 s t(norm)=0.148773, mflops=33.6082 (err=5.1e-16) 29. NR (F): elapsed time t=1.06 s, 512 iters, t-(init.)=1.01 s t(norm)=0.192642, mflops=25.9549 (err=5.0e-16) 30. Ooura (C): elapsed time t=1.39 s, 1024 iters, t-(init.)=1.19 s t(norm)=0.113487, mflops=44.0578 (err=4.6e-16) 31. Ooura (F): elapsed time t=1.51 s, 1024 iters, t-(init.)=1.32 s t(norm)=0.125885, mflops=39.7188 (err=4.6e-16) 32. QFT: elapsed time t=1.5 s, 512 iters, t-(init.)=1.4 s t(norm)=0.267029, mflops=18.7246 (err=9.5e-16) 33. Ransom: elapsed time t=1.05 s, 512 iters, t-(init.)=1.02 s t(norm)=0.19455, mflops=25.7004 (err=1.8e-15) 34. SCIPORT: elapsed time t=1.84 s, 256 iters, t-(init.)=1.79 s t(norm)=0.682831, mflops=7.32246 (err=1.4e-07) 35. 
Singleton: elapsed time t=1.74 s, 1024 iters, t-(init.)=1.56 s t(norm)=0.148773, mflops=33.6082 (err=6.0e-16) 36. Singleton (f2c): elapsed time t=1.84 s, 1024 iters, t-(init.)=1.74 s t(norm)=0.165939, mflops=30.1315 (err=6.0e-16) 37. Sorensen: elapsed time t=1.6 s, 1024 iters, t-(init.)=1.44 s t(norm)=0.137329, mflops=36.4089 (err=4.9e-16) 38. Sorensen DIT: elapsed time t=1.26 s, 512 iters, t-(init.)=1.18 s t(norm)=0.225067, mflops=22.2156 (err=4.9e-16) 39. Temperton: elapsed time t=1.28 s, 512 iters, t-(init.)=1.18 s t(norm)=0.225067, mflops=22.2156 (err=9.8e-08) 40. Temperton (f2c): elapsed time t=1.38 s, 512 iters, t-(init.)=1.3 s t(norm)=0.247955, mflops=20.1649 (err=4.8e-16) 41. Valkenburg: elapsed time t=1.67 s, 128 iters, t-(init.)=1.66 s t(norm)=1.26648, mflops=3.94795 (err=8.2e-16) Top mflops for N=1024 = 44.0578 Normalized results and averages for N=1024: fft 0: mflops = 31.5836 (norm. = 0.716867), norm. avg. (of 10) = 0.448876 fft 1: mflops = 31.3945 (norm. = 0.712575), norm. avg. (of 10) = 0.446834 fft 2: mflops = 29.9593 (norm. = 0.68), norm. avg. (of 10) = 0.296389 fft 3: mflops = 15.5115 (norm. = 0.352071), norm. avg. (of 10) = 0.146604 fft 4: mflops = 17.5936 (norm. = 0.399329), norm. avg. (of 10) = 0.263099 fft 5: mflops = 7.8019 (norm. = 0.177083), norm. avg. (of 10) = 0.0919256 fft 6: mflops = 37.7186 (norm. = 0.856115), norm. avg. (of 10) = 0.387169 fft 7: mflops = 27.5941 (norm. = 0.626316), norm. avg. (of 10) = 0.26701 fft 8: mflops = 19.1346 (norm. = 0.434307), norm. avg. (of 10) = 0.19636 fft 9: mflops = 26.7494 (norm. = 0.607143), norm. avg. (of 10) = 0.286658 fft 10: mflops = 27.3067 (norm. = 0.619792), norm. avg. (of 10) = 0.268678 fft 11: mflops = 20.8051 (norm. = 0.472222), norm. avg. (of 9) = 0.232818 fft 12: mflops = 26.2144 (norm. = 0.595), norm. avg. (of 10) = 0.482357 fft 13: mflops = 15.0657 (norm. = 0.341954), norm. avg. (of 10) = 0.22187 fft 14: mflops = 39.7188 (norm. = 0.901515), norm. avg. 
(of 10) = 0.753156 fft 15: mflops = 38.5506 (norm. = 0.875), norm. avg. (of 10) = 0.732081 fft 16: mflops = 27.3067 (norm. = 0.619792), norm. avg. (of 10) = 0.79727 fft 17: mflops = 20.3212 (norm. = 0.46124), norm. avg. (of 8) = 0.311074 fft 18: mflops = 23.1986 (norm. = 0.526549), norm. avg. (of 10) = 0.324217 fft 19: mflops = 28.0368 (norm. = 0.636364), norm. avg. (of 10) = 0.289927 fft 20: mflops = 31.775 (norm. = 0.721212), norm. avg. (of 10) = 0.303915 fft 21: mflops = 23.8313 (norm. = 0.540909), norm. avg. (of 10) = 0.598694 fft 22: mflops = 32.3635 (norm. = 0.734568), norm. avg. (of 9) = 0.399909 fft 23: mflops = 40.3298 (norm. = 0.915385), norm. avg. (of 9) = 0.489231 fft 24: mflops = 40.96 (norm. = 0.929687), norm. avg. (of 9) = 0.523476 fft 25: mflops = 26.4792 (norm. = 0.60101), norm. avg. (of 9) = 0.275858 fft 26: mflops = 8.45626 (norm. = 0.191935), norm. avg. (of 10) = 0.11125 fft 27: mflops = 16.0825 (norm. = 0.365031), norm. avg. (of 10) = 0.192584 fft 28: mflops = 33.6082 (norm. = 0.762821), norm. avg. (of 10) = 0.33374 fft 29: mflops = 25.9549 (norm. = 0.589109), norm. avg. (of 10) = 0.250919 fft 30: mflops = 44.0578 (norm. = 1), norm. avg. (of 10) = 0.684124 fft 31: mflops = 39.7188 (norm. = 0.901515), norm. avg. (of 10) = 0.524756 fft 32: mflops = 18.7246 (norm. = 0.425), norm. avg. (of 7) = 0.508145 fft 33: mflops = 25.7004 (norm. = 0.583333), norm. avg. (of 9) = 0.238885 fft 34: mflops = 7.32246 (norm. = 0.166201), norm. avg. (of 9) = 0.128673 fft 35: mflops = 33.6082 (norm. = 0.762821), norm. avg. (of 10) = 0.324052 fft 36: mflops = 30.1315 (norm. = 0.683908), norm. avg. (of 10) = 0.314514 fft 37: mflops = 36.4089 (norm. = 0.826389), norm. avg. (of 10) = 0.41627 fft 38: mflops = 22.2156 (norm. = 0.504237), norm. avg. (of 10) = 0.219268 fft 39: mflops = 22.2156 (norm. = 0.504237), norm. avg. (of 10) = 0.239673 fft 40: mflops = 20.1649 (norm. = 0.457692), norm. avg. (of 10) = 0.248958 fft 41: mflops = 3.94795 (norm. = 0.0896084), norm. avg. 
(of 10) = 0.0520578 Benchmarking for array size = 2048 (power of 2): 0. Arndt DIF: elapsed time t=1.69 s, 256 iters, t-(init.)=1.59 s t(norm)=0.275699, mflops=18.1357 (err=4.6e-16) 1. Arndt DIT: elapsed time t=1.66 s, 256 iters, t-(init.)=1.59 s t(norm)=0.275699, mflops=18.1357 (err=4.6e-16) 2. Arndt Split-Radix: elapsed time t=1.03 s, 128 iters, t-(init.)=0.98 s t(norm)=0.339855, mflops=14.7122 (err=4.8e-16) 3. Arndt 4-step: elapsed time t=1.1 s, 128 iters, t-(init.)=1.05 s t(norm)=0.36413, mflops=13.7314 (err=4.7e-16) 4. Bailey: elapsed time t=1.83 s, 256 iters, t-(init.)=1.74 s t(norm)=0.301708, mflops=16.5723 (err=5.0e-16) 5. Beauregard: elapsed time t=1.03 s, 64 iters, t-(init.)=1.02 s t(norm)=0.707453, mflops=7.06761 (err=4.8e-16) 6. Bergland: elapsed time t=1.11 s, 256 iters, t-(init.)=1.03 s t(norm)=0.178597, mflops=27.996 (err=4.9e-16) 7. Brenner: elapsed time t=1.49 s, 256 iters, t-(init.)=1.42 s t(norm)=0.246221, mflops=20.3069 (err=5.0e-16) 8. Burrus: elapsed time t=1.21 s, 128 iters, t-(init.)=1.18 s t(norm)=0.409213, mflops=12.2186 (err=4.6e-16) 9. CWP (min N) (N=2145): elapsed time t=1.41 s, 256 iters, t-(init.)=1.32 s t(norm)=0.228882, mflops=21.8453 10. CWP (best N) (N=2184): elapsed time t=1.33 s, 256 iters, t-(init.)=1.23 s t(norm)=0.213276, mflops=23.4438 11. Edelblute: elapsed time t=1.18 s, 128 iters, t-(init.)=1.15 s t(norm)=0.398809, mflops=12.5373 (err=4.7e-16) 12. FFTPACK: elapsed time t=1.07 s, 256 iters, t-(init.)=0.97 s t(norm)=0.168193, mflops=29.7277 (err=4.5e-16) 13. FFTPACK (f2c): elapsed time t=1.85 s, 256 iters, t-(init.)=1.75 s t(norm)=0.303442, mflops=16.4776 (err=4.6e-16) FFTW_MEASURE plan: (cost = 3.437500e-03) FFTW_TWIDDLE 2 FFTW_TWIDDLE 32 FFTW_TWIDDLE 4 FFTW_NOTW 8 14. FFTW: elapsed time t=1.72 s, 512 iters, t-(init.)=1.52 s t(norm)=0.13178, mflops=37.9419 (err=4.7e-16) FFTW_ESTIMATE plan: (cost = 1.269760e+04) FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. 
FFTW_ESTIMATE: elapsed time t=1.76 s, 512 iters, t-(init.)=1.56 s t(norm)=0.135248, mflops=36.969 (err=4.4e-16) 16. Frigo-old: elapsed time t=1.24 s, 256 iters, t-(init.)=1.15 s t(norm)=0.199405, mflops=25.0746 (err=4.6e-16) 17. Green: elapsed time t=1.71 s, 256 iters, t-(init.)=1.62 s t(norm)=0.2809, mflops=17.7999 (err=5.8e-16) 18. GSL: elapsed time t=1.24 s, 256 iters, t-(init.)=1.13 s t(norm)=0.195937, mflops=25.5184 (err=4.7e-16) 19. GSL DIT: elapsed time t=1.69 s, 256 iters, t-(init.)=1.6 s t(norm)=0.277433, mflops=18.0224 (err=4.5e-16) 20. GSL DIF: elapsed time t=1.59 s, 256 iters, t-(init.)=1.5 s t(norm)=0.260093, mflops=19.2239 (err=4.4e-16) 21. Krukar: elapsed time t=1.4 s, 256 iters, t-(init.)=1.32 s t(norm)=0.228882, mflops=21.8453 (err=5.0e-16) 22. Mayer (Buneman): elapsed time t=1.03 s, 256 iters, t-(init.)=0.95 s t(norm)=0.164726, mflops=30.3535 (err=4.5e-16) 23. Mayer (simple): elapsed time t=1.7 s, 512 iters, t-(init.)=1.52 s t(norm)=0.13178, mflops=37.9419 24. Mayer (lookup): elapsed time t=1.92 s, 512 iters, t-(init.)=1.74 s t(norm)=0.150854, mflops=33.1446 (err=4.5e-16) 25. Monro: elapsed time t=1.95 s, 256 iters, t-(init.)=1.84 s t(norm)=0.319047, mflops=15.6717 (err=1.0e-07) 26. NAPACK (f2c): elapsed time t=1.96 s, 128 iters, t-(init.)=1.91 s t(norm)=0.66237, mflops=7.54865 (err=1.5e-14) 27. Nielsen: elapsed time t=1.2 s, 128 iters, t-(init.)=1.15 s t(norm)=0.398809, mflops=12.5373 (err=1.1e-14) 28. NR (C): elapsed time t=1.49 s, 256 iters, t-(init.)=1.4 s t(norm)=0.242753, mflops=20.597 (err=4.5e-16) 29. NR (F): elapsed time t=1.74 s, 256 iters, t-(init.)=1.65 s t(norm)=0.286102, mflops=17.4763 (err=4.5e-16) 30. Ooura (C): elapsed time t=1.94 s, 512 iters, t-(init.)=1.75 s t(norm)=0.151721, mflops=32.9552 (err=4.6e-16) 31. Ooura (F): elapsed time t=1.02 s, 256 iters, t-(init.)=0.94 s t(norm)=0.162992, mflops=30.6764 (err=4.6e-16) 32. QFT: elapsed time t=1.77 s, 256 iters, t-(init.)=1.67 s t(norm)=0.28957, mflops=17.267 (err=1.2e-15) 33. 
Ransom: elapsed time t=1.58 s, 256 iters, t-(init.)=1.51 s t(norm)=0.261827, mflops=19.0966 (err=2.1e-15) 34. SCIPORT: elapsed time t=1.06 s, 64 iters, t-(init.)=1.04 s t(norm)=0.721325, mflops=6.93169 (err=1.6e-07) 35. Singleton: elapsed time t=1.32 s, 256 iters, t-(init.)=1.23 s t(norm)=0.213276, mflops=23.4438 (err=5.9e-16) 36. Singleton (f2c): elapsed time t=1.38 s, 256 iters, t-(init.)=1.28 s t(norm)=0.221946, mflops=22.528 (err=5.9e-16) 37. Sorensen: elapsed time t=1.43 s, 256 iters, t-(init.)=1.34 s t(norm)=0.23235, mflops=21.5193 (err=4.5e-16) 38. Sorensen DIT: elapsed time t=1.16 s, 128 iters, t-(init.)=1.11 s t(norm)=0.384938, mflops=12.9891 (err=4.4e-16) 39. Temperton: elapsed time t=1.67 s, 256 iters, t-(init.)=1.58 s t(norm)=0.273965, mflops=18.2505 (err=1.0e-07) 40. Temperton (f2c): elapsed time t=1.69 s, 256 iters, t-(init.)=1.61 s t(norm)=0.279166, mflops=17.9105 (err=4.7e-16) 41. Valkenburg: elapsed time t=1.93 s, 64 iters, t-(init.)=1.9 s t(norm)=1.3178, mflops=3.79419 (err=7.4e-16) Top mflops for N=2048 = 37.9419 Normalized results and averages for N=2048: fft 0: mflops = 18.1357 (norm. = 0.477987), norm. avg. (of 11) = 0.451523 fft 1: mflops = 18.1357 (norm. = 0.477987), norm. avg. (of 11) = 0.449666 fft 2: mflops = 14.7122 (norm. = 0.387755), norm. avg. (of 11) = 0.304695 fft 3: mflops = 13.7314 (norm. = 0.361905), norm. avg. (of 11) = 0.166177 fft 4: mflops = 16.5723 (norm. = 0.436782), norm. avg. (of 11) = 0.278889 fft 5: mflops = 7.06761 (norm. = 0.186275), norm. avg. (of 11) = 0.100503 fft 6: mflops = 27.996 (norm. = 0.737864), norm. avg. (of 11) = 0.41905 fft 7: mflops = 20.3069 (norm. = 0.535211), norm. avg. (of 11) = 0.291392 fft 8: mflops = 12.2186 (norm. = 0.322034), norm. avg. (of 11) = 0.207785 fft 9: mflops = 21.8453 (norm. = 0.575758), norm. avg. (of 11) = 0.31294 fft 10: mflops = 23.4438 (norm. = 0.617886), norm. avg. (of 11) = 0.300424 fft 11: mflops = 12.5373 (norm. = 0.330435), norm. avg. 
(of 10) = 0.24258 fft 12: mflops = 29.7277 (norm. = 0.783505), norm. avg. (of 11) = 0.509734 fft 13: mflops = 16.4776 (norm. = 0.434286), norm. avg. (of 11) = 0.241181 fft 14: mflops = 37.9419 (norm. = 1), norm. avg. (of 11) = 0.775597 fft 15: mflops = 36.969 (norm. = 0.974359), norm. avg. (of 11) = 0.754107 fft 16: mflops = 25.0746 (norm. = 0.66087), norm. avg. (of 11) = 0.78487 fft 17: mflops = 17.7999 (norm. = 0.469136), norm. avg. (of 9) = 0.328636 fft 18: mflops = 25.5184 (norm. = 0.672566), norm. avg. (of 11) = 0.355885 fft 19: mflops = 18.0224 (norm. = 0.475), norm. avg. (of 11) = 0.306752 fft 20: mflops = 19.2239 (norm. = 0.506667), norm. avg. (of 11) = 0.322347 fft 21: mflops = 21.8453 (norm. = 0.575758), norm. avg. (of 11) = 0.596609 fft 22: mflops = 30.3535 (norm. = 0.8), norm. avg. (of 10) = 0.439918 fft 23: mflops = 37.9419 (norm. = 1), norm. avg. (of 10) = 0.540307 fft 24: mflops = 33.1446 (norm. = 0.873563), norm. avg. (of 10) = 0.558485 fft 25: mflops = 15.6717 (norm. = 0.413043), norm. avg. (of 10) = 0.289576 fft 26: mflops = 7.54865 (norm. = 0.198953), norm. avg. (of 11) = 0.119223 fft 27: mflops = 12.5373 (norm. = 0.330435), norm. avg. (of 11) = 0.205116 fft 28: mflops = 20.597 (norm. = 0.542857), norm. avg. (of 11) = 0.352751 fft 29: mflops = 17.4763 (norm. = 0.460606), norm. avg. (of 11) = 0.269982 fft 30: mflops = 32.9552 (norm. = 0.868571), norm. avg. (of 11) = 0.700892 fft 31: mflops = 30.6764 (norm. = 0.808511), norm. avg. (of 11) = 0.550551 fft 32: mflops = 17.267 (norm. = 0.45509), norm. avg. (of 8) = 0.501513 fft 33: mflops = 19.0966 (norm. = 0.503311), norm. avg. (of 10) = 0.265328 fft 34: mflops = 6.93169 (norm. = 0.182692), norm. avg. (of 10) = 0.134075 fft 35: mflops = 23.4438 (norm. = 0.617886), norm. avg. (of 11) = 0.350764 fft 36: mflops = 22.528 (norm. = 0.59375), norm. avg. (of 11) = 0.339899 fft 37: mflops = 21.5193 (norm. = 0.567164), norm. avg. (of 11) = 0.429988 fft 38: mflops = 12.9891 (norm. = 0.342342), norm. avg. 
(of 11) = 0.230456 fft 39: mflops = 18.2505 (norm. = 0.481013), norm. avg. (of 11) = 0.261613 fft 40: mflops = 17.9105 (norm. = 0.47205), norm. avg. (of 11) = 0.269239 fft 41: mflops = 3.79419 (norm. = 0.1), norm. avg. (of 11) = 0.0564161 Benchmarking for array size = 4096 (power of 2): 0. Arndt DIF: elapsed time t=1.01 s, 64 iters, t-(init.)=0.97 s t(norm)=0.308355, mflops=16.2151 (err=1.0e-15) 1. Arndt DIT: elapsed time t=1.97 s, 128 iters, t-(init.)=1.88 s t(norm)=0.298818, mflops=16.7326 (err=1.1e-15) 2. Arndt Split-Radix: elapsed time t=1.19 s, 64 iters, t-(init.)=1.15 s t(norm)=0.365575, mflops=13.6771 (err=1.0e-15) 3. Arndt 4-step: elapsed time t=1.14 s, 64 iters, t-(init.)=1.1 s t(norm)=0.349681, mflops=14.2988 (err=1.0e-15) 4. Bailey: elapsed time t=1.08 s, 64 iters, t-(init.)=1.03 s t(norm)=0.327428, mflops=15.2705 (err=1.0e-15) 5. Beauregard: elapsed time t=1.14 s, 32 iters, t-(init.)=1.12 s t(norm)=0.712077, mflops=7.02171 (err=1.0e-15) 6. Bergland: elapsed time t=1.22 s, 128 iters, t-(init.)=1.13 s t(norm)=0.179609, mflops=27.8383 (err=1.1e-15) 7. Brenner: elapsed time t=1.58 s, 128 iters, t-(init.)=1.49 s t(norm)=0.236829, mflops=21.1123 (err=1.1e-15) 8. Burrus: elapsed time t=1.37 s, 64 iters, t-(init.)=1.33 s t(norm)=0.422796, mflops=11.826 (err=1.0e-15) 9. CWP (min N) (N=4290): elapsed time t=1.58 s, 128 iters, t-(init.)=1.49 s t(norm)=0.236829, mflops=21.1123 10. CWP (best N) (N=4368): elapsed time t=1.44 s, 128 iters, t-(init.)=1.33 s t(norm)=0.211398, mflops=23.6521 11. Edelblute: elapsed time t=1.33 s, 64 iters, t-(init.)=1.29 s t(norm)=0.41008, mflops=12.1927 (err=1.1e-15) 12. FFTPACK: elapsed time t=1.21 s, 128 iters, t-(init.)=1.12 s t(norm)=0.178019, mflops=28.0869 (err=1.0e-15) 13. FFTPACK (f2c): elapsed time t=1.06 s, 64 iters, t-(init.)=1.01 s t(norm)=0.32107, mflops=15.5729 (err=1.0e-15) FFTW_MEASURE plan: (cost = 6.875000e-03) FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_TWIDDLE 4 FFTW_NOTW 16 14. 
FFTW: elapsed time t=1.82 s, 256 iters, t-(init.)=1.63 s t(norm)=0.129541, mflops=38.5979 (err=1.0e-15) FFTW_ESTIMATE plan: (cost = 2.539520e+04) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.91 s, 256 iters, t-(init.)=1.71 s t(norm)=0.135899, mflops=36.7921 (err=1.1e-15) 16. Frigo-old: elapsed time t=1.26 s, 128 iters, t-(init.)=1.17 s t(norm)=0.185966, mflops=26.8866 (err=1.1e-15) 17. Green: elapsed time t=1.82 s, 128 iters, t-(init.)=1.72 s t(norm)=0.273387, mflops=18.2891 (err=1.1e-15) 18. GSL: elapsed time t=1.31 s, 128 iters, t-(init.)=1.21 s t(norm)=0.192324, mflops=25.9978 (err=1.0e-15) 19. GSL DIT: elapsed time t=1.88 s, 128 iters, t-(init.)=1.78 s t(norm)=0.282923, mflops=17.6726 (err=1.0e-15) 20. GSL DIF: elapsed time t=1.8 s, 128 iters, t-(init.)=1.71 s t(norm)=0.271797, mflops=18.3961 (err=1.0e-15) 21. Krukar: elapsed time t=1.53 s, 128 iters, t-(init.)=1.44 s t(norm)=0.228882, mflops=21.8453 (err=1.1e-15) 22. Mayer (Buneman): elapsed time t=1.69 s, 128 iters, t-(init.)=1.6 s t(norm)=0.254313, mflops=19.6608 (err=1.1e-15) 23. Mayer (simple): elapsed time t=1.53 s, 128 iters, t-(init.)=1.43 s t(norm)=0.227292, mflops=21.9981 24. Mayer (lookup): elapsed time t=1.61 s, 128 iters, t-(init.)=1.51 s t(norm)=0.240008, mflops=20.8326 (err=1.1e-15) 25. Monro: elapsed time t=1.07 s, 64 iters, t-(init.)=1.02 s t(norm)=0.324249, mflops=15.4202 (err=1.1e-07) 26. NAPACK (f2c): elapsed time t=1.02 s, 32 iters, t-(init.)=1 s t(norm)=0.635783, mflops=7.86432 (err=4.5e-14) 27. Nielsen: elapsed time t=1.24 s, 64 iters, t-(init.)=1.19 s t(norm)=0.378291, mflops=13.2173 (err=2.2e-14) 28. NR (C): elapsed time t=1.65 s, 128 iters, t-(init.)=1.56 s t(norm)=0.247955, mflops=20.1649 (err=1.0e-15) 29. NR (F): elapsed time t=1.9 s, 128 iters, t-(init.)=1.8 s t(norm)=0.286102, mflops=17.4763 (err=1.0e-15) 30. Ooura (C): elapsed time t=1.08 s, 128 iters, t-(init.)=0.98 s t(norm)=0.155767, mflops=32.0993 (err=1.1e-15) 31. 
Ooura (F): elapsed time t=1.12 s, 128 iters, t-(init.)=1.03 s t(norm)=0.163714, mflops=30.541 (err=1.1e-15) 32. QFT: elapsed time t=1.18 s, 64 iters, t-(init.)=1.13 s t(norm)=0.359217, mflops=13.9192 (err=1.9e-15) 33. Ransom: elapsed time t=1.38 s, 128 iters, t-(init.)=1.28 s t(norm)=0.203451, mflops=24.576 (err=2.6e-15) 34. SCIPORT: elapsed time t=1.22 s, 32 iters, t-(init.)=1.19 s t(norm)=0.756582, mflops=6.60867 (err=1.7e-07) 35. Singleton: elapsed time t=1.41 s, 128 iters, t-(init.)=1.31 s t(norm)=0.208219, mflops=24.0132 (err=1.6e-15) 36. Singleton (f2c): elapsed time t=1.51 s, 128 iters, t-(init.)=1.42 s t(norm)=0.225703, mflops=22.153 (err=1.6e-15) 37. Sorensen: elapsed time t=1.68 s, 128 iters, t-(init.)=1.58 s t(norm)=0.251134, mflops=19.9097 (err=1.1e-15) 38. Sorensen DIT: elapsed time t=1.31 s, 64 iters, t-(init.)=1.27 s t(norm)=0.403722, mflops=12.3848 (err=1.0e-15) 39. Temperton: elapsed time t=1.76 s, 128 iters, t-(init.)=1.67 s t(norm)=0.265439, mflops=18.8367 (err=1.2e-07) 40. Temperton (f2c): elapsed time t=1.92 s, 128 iters, t-(init.)=1.83 s t(norm)=0.290871, mflops=17.1898 (err=1.0e-15) 41. Valkenburg: elapsed time t=1.07 s, 16 iters, t-(init.)=1.06 s t(norm)=1.34786, mflops=3.70958 (err=1.1e-15) Top mflops for N=4096 = 38.5979 Normalized results and averages for N=4096: fft 0: mflops = 16.2151 (norm. = 0.420103), norm. avg. (of 12) = 0.448904 fft 1: mflops = 16.7326 (norm. = 0.433511), norm. avg. (of 12) = 0.44832 fft 2: mflops = 13.6771 (norm. = 0.354348), norm. avg. (of 12) = 0.308833 fft 3: mflops = 14.2988 (norm. = 0.370455), norm. avg. (of 12) = 0.1832 fft 4: mflops = 15.2705 (norm. = 0.395631), norm. avg. (of 12) = 0.288617 fft 5: mflops = 7.02171 (norm. = 0.18192), norm. avg. (of 12) = 0.107287 fft 6: mflops = 27.8383 (norm. = 0.721239), norm. avg. (of 12) = 0.444233 fft 7: mflops = 21.1123 (norm. = 0.54698), norm. avg. (of 12) = 0.312691 fft 8: mflops = 11.826 (norm. = 0.306391), norm. avg. 
(of 12) = 0.216002 fft 9: mflops = 21.1123 (norm. = 0.54698), norm. avg. (of 12) = 0.332443 fft 10: mflops = 23.6521 (norm. = 0.612782), norm. avg. (of 12) = 0.326454 fft 11: mflops = 12.1927 (norm. = 0.315891), norm. avg. (of 11) = 0.249244 fft 12: mflops = 28.0869 (norm. = 0.727679), norm. avg. (of 12) = 0.527896 fft 13: mflops = 15.5729 (norm. = 0.403465), norm. avg. (of 12) = 0.254705 fft 14: mflops = 38.5979 (norm. = 1), norm. avg. (of 12) = 0.794297 fft 15: mflops = 36.7921 (norm. = 0.953216), norm. avg. (of 12) = 0.770699 fft 16: mflops = 26.8866 (norm. = 0.696581), norm. avg. (of 12) = 0.777512 fft 17: mflops = 18.2891 (norm. = 0.473837), norm. avg. (of 10) = 0.343156 fft 18: mflops = 25.9978 (norm. = 0.673554), norm. avg. (of 12) = 0.382357 fft 19: mflops = 17.6726 (norm. = 0.457865), norm. avg. (of 12) = 0.319345 fft 20: mflops = 18.3961 (norm. = 0.476608), norm. avg. (of 12) = 0.335202 fft 21: mflops = 21.8453 (norm. = 0.565972), norm. avg. (of 12) = 0.594056 fft 22: mflops = 19.6608 (norm. = 0.509375), norm. avg. (of 11) = 0.446232 fft 23: mflops = 21.9981 (norm. = 0.56993), norm. avg. (of 11) = 0.543 fft 24: mflops = 20.8326 (norm. = 0.539735), norm. avg. (of 11) = 0.55678 fft 25: mflops = 15.4202 (norm. = 0.39951), norm. avg. (of 11) = 0.29957 fft 26: mflops = 7.86432 (norm. = 0.20375), norm. avg. (of 12) = 0.126267 fft 27: mflops = 13.2173 (norm. = 0.342437), norm. avg. (of 12) = 0.216559 fft 28: mflops = 20.1649 (norm. = 0.522436), norm. avg. (of 12) = 0.366891 fft 29: mflops = 17.4763 (norm. = 0.452778), norm. avg. (of 12) = 0.285215 fft 30: mflops = 32.0993 (norm. = 0.831633), norm. avg. (of 12) = 0.711787 fft 31: mflops = 30.541 (norm. = 0.791262), norm. avg. (of 12) = 0.570611 fft 32: mflops = 13.9192 (norm. = 0.360619), norm. avg. (of 9) = 0.485858 fft 33: mflops = 24.576 (norm. = 0.636719), norm. avg. (of 11) = 0.299091 fft 34: mflops = 6.60867 (norm. = 0.171218), norm. avg. (of 11) = 0.137452 fft 35: mflops = 24.0132 (norm. = 0.622137), norm. 
avg. (of 12) = 0.373379 fft 36: mflops = 22.153 (norm. = 0.573944), norm. avg. (of 12) = 0.359403 fft 37: mflops = 19.9097 (norm. = 0.515823), norm. avg. (of 12) = 0.437141 fft 38: mflops = 12.3848 (norm. = 0.320866), norm. avg. (of 12) = 0.23799 fft 39: mflops = 18.8367 (norm. = 0.488024), norm. avg. (of 12) = 0.28048 fft 40: mflops = 17.1898 (norm. = 0.445355), norm. avg. (of 12) = 0.283916 fft 41: mflops = 3.70958 (norm. = 0.0961085), norm. avg. (of 12) = 0.0597238 Benchmarking for array size = 8192 (power of 2): 0. Arndt DIF: elapsed time t=1.16 s, 32 iters, t-(init.)=1.12 s t(norm)=0.328651, mflops=15.2137 (err=1.3e-15) 1. Arndt DIT: elapsed time t=1.11 s, 32 iters, t-(init.)=1.06 s t(norm)=0.311045, mflops=16.0749 (err=1.3e-15) 2. Arndt Split-Radix: elapsed time t=1.44 s, 32 iters, t-(init.)=1.39 s t(norm)=0.407879, mflops=12.2585 (err=1.3e-15) 3. Arndt 4-step: elapsed time t=1.38 s, 32 iters, t-(init.)=1.34 s t(norm)=0.393207, mflops=12.7159 (err=1.4e-15) 4. Bailey: elapsed time t=1.21 s, 32 iters, t-(init.)=1.16 s t(norm)=0.340388, mflops=14.6891 (err=1.3e-15) 5. Beauregard: elapsed time t=1.28 s, 16 iters, t-(init.)=1.26 s t(norm)=0.739464, mflops=6.76165 (err=1.3e-15) 6. Bergland: elapsed time t=1.45 s, 64 iters, t-(init.)=1.35 s t(norm)=0.198071, mflops=25.2435 (err=1.4e-15) 7. Brenner: elapsed time t=1.88 s, 64 iters, t-(init.)=1.79 s t(norm)=0.262627, mflops=19.0384 (err=1.4e-15) 8. Burrus: elapsed time t=1.63 s, 32 iters, t-(init.)=1.58 s t(norm)=0.463632, mflops=10.7844 (err=1.3e-15) 9. CWP (min N) (N=8580): elapsed time t=1.69 s, 64 iters, t-(init.)=1.58 s t(norm)=0.231816, mflops=21.5688 10. CWP (best N) (N=9240): elapsed time t=1.74 s, 64 iters, t-(init.)=1.63 s t(norm)=0.239152, mflops=20.9072 11. Edelblute: elapsed time t=1.59 s, 32 iters, t-(init.)=1.54 s t(norm)=0.451895, mflops=11.0645 (err=1.3e-15) 12. FFTPACK: elapsed time t=1.62 s, 64 iters, t-(init.)=1.53 s t(norm)=0.22448, mflops=22.2737 (err=1.3e-15) 13. 
FFTPACK (f2c): elapsed time t=1.31 s, 32 iters, t-(init.)=1.26 s t(norm)=0.369732, mflops=13.5233 (err=1.3e-15) FFTW_MEASURE plan: (cost = 1.562500e-02) FFTW_TWIDDLE 8 FFTW_TWIDDLE 32 FFTW_TWIDDLE 4 FFTW_NOTW 8 14. FFTW: elapsed time t=1.08 s, 64 iters, t-(init.)=0.98 s t(norm)=0.143785, mflops=34.7742 (err=1.3e-15) FFTW_ESTIMATE plan: (cost = 5.079040e+04) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.03 s, 64 iters, t-(init.)=0.93 s t(norm)=0.136449, mflops=36.6438 (err=1.4e-15) 16. Frigo-old: elapsed time t=1.55 s, 64 iters, t-(init.)=1.45 s t(norm)=0.212743, mflops=23.5026 (err=1.4e-15) 17. Green: elapsed time t=1.08 s, 32 iters, t-(init.)=1.03 s t(norm)=0.302241, mflops=16.5431 (err=1.4e-15) 18. GSL: elapsed time t=1.73 s, 64 iters, t-(init.)=1.63 s t(norm)=0.239152, mflops=20.9072 (err=1.3e-15) 19. GSL DIT: elapsed time t=1.16 s, 32 iters, t-(init.)=1.11 s t(norm)=0.325716, mflops=15.3508 (err=1.3e-15) 20. GSL DIF: elapsed time t=1.12 s, 32 iters, t-(init.)=1.07 s t(norm)=0.313979, mflops=15.9246 (err=1.3e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1 s, 32 iters, t-(init.)=0.95 s t(norm)=0.278766, mflops=17.9362 (err=1.3e-15) 23. Mayer (simple): elapsed time t=1.84 s, 64 iters, t-(init.)=1.74 s t(norm)=0.255291, mflops=19.5855 24. Mayer (lookup): elapsed time t=1.96 s, 64 iters, t-(init.)=1.86 s t(norm)=0.272898, mflops=18.3219 (err=1.4e-15) 25. Monro: elapsed time t=1.29 s, 32 iters, t-(init.)=1.25 s t(norm)=0.366798, mflops=13.6315 (err=1.2e-07) 26. NAPACK (f2c): elapsed time t=1.19 s, 16 iters, t-(init.)=1.17 s t(norm)=0.686646, mflops=7.28178 (err=4.1e-14) 27. Nielsen: elapsed time t=1.51 s, 32 iters, t-(init.)=1.46 s t(norm)=0.42842, mflops=11.6708 (err=1.1e-14) 28. NR (C): elapsed time t=1.06 s, 32 iters, t-(init.)=1.02 s t(norm)=0.299307, mflops=16.7053 (err=1.3e-15) 29. 
NR (F): elapsed time t=1.16 s, 32 iters, t-(init.)=1.11 s t(norm)=0.325716, mflops=15.3508 (err=1.3e-15) 30. Ooura (C): elapsed time t=1.28 s, 64 iters, t-(init.)=1.18 s t(norm)=0.173129, mflops=28.8803 (err=1.4e-15) 31. Ooura (F): elapsed time t=1.32 s, 64 iters, t-(init.)=1.22 s t(norm)=0.178997, mflops=27.9334 (err=1.4e-15) 32. QFT: elapsed time t=1.43 s, 32 iters, t-(init.)=1.38 s t(norm)=0.404945, mflops=12.3474 (err=2.8e-15) 33. Ransom: elapsed time t=1.75 s, 64 iters, t-(init.)=1.66 s t(norm)=0.243554, mflops=20.5293 (err=3.2e-15) 34. SCIPORT: elapsed time t=1.33 s, 16 iters, t-(init.)=1.3 s t(norm)=0.762939, mflops=6.5536 (err=1.9e-07) 35. Singleton: elapsed time t=1.72 s, 64 iters, t-(init.)=1.63 s t(norm)=0.239152, mflops=20.9072 (err=2.0e-15) 36. Singleton (f2c): elapsed time t=1.81 s, 64 iters, t-(init.)=1.72 s t(norm)=0.252357, mflops=19.8132 (err=2.0e-15) 37. Sorensen: elapsed time t=1.08 s, 32 iters, t-(init.)=1.02 s t(norm)=0.299307, mflops=16.7053 (err=1.4e-15) 38. Sorensen DIT: elapsed time t=1.56 s, 32 iters, t-(init.)=1.52 s t(norm)=0.446026, mflops=11.2101 (err=1.3e-15) 39. Temperton: elapsed time t=1.07 s, 32 iters, t-(init.)=1.02 s t(norm)=0.299307, mflops=16.7053 (err=1.4e-07) 40. Temperton (f2c): elapsed time t=1.23 s, 32 iters, t-(init.)=1.19 s t(norm)=0.349192, mflops=14.3188 (err=1.3e-15) 41. Valkenburg: elapsed time t=1.19 s, 8 iters, t-(init.)=1.17 s t(norm)=1.37329, mflops=3.64089 (err=1.4e-15) Top mflops for N=8192 = 36.6438 Normalized results and averages for N=8192: fft 0: mflops = 15.2137 (norm. = 0.415179), norm. avg. (of 13) = 0.44631 fft 1: mflops = 16.0749 (norm. = 0.438679), norm. avg. (of 13) = 0.447578 fft 2: mflops = 12.2585 (norm. = 0.334532), norm. avg. (of 13) = 0.310809 fft 3: mflops = 12.7159 (norm. = 0.347015), norm. avg. (of 13) = 0.195801 fft 4: mflops = 14.6891 (norm. = 0.400862), norm. avg. (of 13) = 0.297251 fft 5: mflops = 6.76165 (norm. = 0.184524), norm. avg. (of 13) = 0.113229 fft 6: mflops = 25.2435 (norm. 
= 0.688889), norm. avg. (of 13) = 0.463052 fft 7: mflops = 19.0384 (norm. = 0.519553), norm. avg. (of 13) = 0.328604 fft 8: mflops = 10.7844 (norm. = 0.294304), norm. avg. (of 13) = 0.222025 fft 9: mflops = 21.5688 (norm. = 0.588608), norm. avg. (of 13) = 0.352148 fft 10: mflops = 20.9072 (norm. = 0.570552), norm. avg. (of 13) = 0.345231 fft 11: mflops = 11.0645 (norm. = 0.301948), norm. avg. (of 12) = 0.253636 fft 12: mflops = 22.2737 (norm. = 0.607843), norm. avg. (of 13) = 0.534046 fft 13: mflops = 13.5233 (norm. = 0.369048), norm. avg. (of 13) = 0.2635 fft 14: mflops = 34.7742 (norm. = 0.94898), norm. avg. (of 13) = 0.806195 fft 15: mflops = 36.6438 (norm. = 1), norm. avg. (of 13) = 0.788338 fft 16: mflops = 23.5026 (norm. = 0.641379), norm. avg. (of 13) = 0.767041 fft 17: mflops = 16.5431 (norm. = 0.451456), norm. avg. (of 11) = 0.353002 fft 18: mflops = 20.9072 (norm. = 0.570552), norm. avg. (of 13) = 0.396834 fft 19: mflops = 15.3508 (norm. = 0.418919), norm. avg. (of 13) = 0.327004 fft 20: mflops = 15.9246 (norm. = 0.434579), norm. avg. (of 13) = 0.342847 fft 21: mflops = -1 (norm. = -0.0272898), norm. avg. (of 12) = 0.594056 fft 22: mflops = 17.9362 (norm. = 0.489474), norm. avg. (of 12) = 0.449836 fft 23: mflops = 19.5855 (norm. = 0.534483), norm. avg. (of 12) = 0.542291 fft 24: mflops = 18.3219 (norm. = 0.5), norm. avg. (of 12) = 0.552049 fft 25: mflops = 13.6315 (norm. = 0.372), norm. avg. (of 12) = 0.305606 fft 26: mflops = 7.28178 (norm. = 0.198718), norm. avg. (of 13) = 0.13184 fft 27: mflops = 11.6708 (norm. = 0.318493), norm. avg. (of 13) = 0.2244 fft 28: mflops = 16.7053 (norm. = 0.455882), norm. avg. (of 13) = 0.373737 fft 29: mflops = 15.3508 (norm. = 0.418919), norm. avg. (of 13) = 0.2955 fft 30: mflops = 28.8803 (norm. = 0.788136), norm. avg. (of 13) = 0.71766 fft 31: mflops = 27.9334 (norm. = 0.762295), norm. avg. (of 13) = 0.585356 fft 32: mflops = 12.3474 (norm. = 0.336957), norm. avg. (of 10) = 0.470968 fft 33: mflops = 20.5293 (norm. 
= 0.560241), norm. avg. (of 12) = 0.320853 fft 34: mflops = 6.5536 (norm. = 0.178846), norm. avg. (of 12) = 0.140901 fft 35: mflops = 20.9072 (norm. = 0.570552), norm. avg. (of 13) = 0.388546 fft 36: mflops = 19.8132 (norm. = 0.540698), norm. avg. (of 13) = 0.373349 fft 37: mflops = 16.7053 (norm. = 0.455882), norm. avg. (of 13) = 0.438582 fft 38: mflops = 11.2101 (norm. = 0.305921), norm. avg. (of 13) = 0.243216 fft 39: mflops = 16.7053 (norm. = 0.455882), norm. avg. (of 13) = 0.293973 fft 40: mflops = 14.3188 (norm. = 0.390756), norm. avg. (of 13) = 0.292134 fft 41: mflops = 3.64089 (norm. = 0.099359), norm. avg. (of 13) = 0.0627727 Benchmarking for array size = 16384 (power of 2): 0. Arndt DIF: elapsed time t=1.4 s, 16 iters, t-(init.)=1.35 s t(norm)=0.367846, mflops=13.5927 (err=1.7e-15) 1. Arndt DIT: elapsed time t=1.3 s, 16 iters, t-(init.)=1.25 s t(norm)=0.340598, mflops=14.6801 (err=1.8e-15) 2. Arndt Split-Radix: elapsed time t=1.65 s, 16 iters, t-(init.)=1.6 s t(norm)=0.435965, mflops=11.4688 (err=1.8e-15) 3. Arndt 4-step: elapsed time t=1.34 s, 16 iters, t-(init.)=1.29 s t(norm)=0.351497, mflops=14.2249 (err=1.8e-15) 4. Bailey: elapsed time t=1.52 s, 16 iters, t-(init.)=1.47 s t(norm)=0.400543, mflops=12.483 (err=1.7e-15) 5. Beauregard: elapsed time t=1.42 s, 8 iters, t-(init.)=1.4 s t(norm)=0.762939, mflops=6.5536 (err=1.8e-15) 6. Bergland: elapsed time t=1.61 s, 32 iters, t-(init.)=1.51 s t(norm)=0.205721, mflops=24.3047 (err=1.8e-15) 7. Brenner: elapsed time t=1.02 s, 16 iters, t-(init.)=0.97 s t(norm)=0.264304, mflops=18.9176 (err=1.8e-15) 8. Burrus: elapsed time t=1.83 s, 16 iters, t-(init.)=1.78 s t(norm)=0.485012, mflops=10.309 (err=1.8e-15) 9. CWP (min N) (N=17160): elapsed time t=1.82 s, 32 iters, t-(init.)=1.72 s t(norm)=0.234331, mflops=21.3373 10. CWP (best N) (N=17160): elapsed time t=1.81 s, 32 iters, t-(init.)=1.7 s t(norm)=0.231607, mflops=21.5883 11. 
Edelblute: elapsed time t=1.8 s, 16 iters, t-(init.)=1.75 s t(norm)=0.476837, mflops=10.4858 (err=1.8e-15) 12. FFTPACK: elapsed time t=1.8 s, 32 iters, t-(init.)=1.7 s t(norm)=0.231607, mflops=21.5883 (err=1.8e-15) 13. FFTPACK (f2c): elapsed time t=1.47 s, 16 iters, t-(init.)=1.42 s t(norm)=0.386919, mflops=12.9226 (err=1.8e-15) FFTW_MEASURE plan: (cost = 3.500000e-02) FFTW_TWIDDLE 4 FFTW_TWIDDLE 16 FFTW_TWIDDLE 4 FFTW_TWIDDLE 4 FFTW_NOTW 16 14. FFTW: elapsed time t=1.21 s, 32 iters, t-(init.)=1.12 s t(norm)=0.152588, mflops=32.768 (err=1.8e-15) FFTW_ESTIMATE plan: (cost = 1.441792e+05) FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.27 s, 32 iters, t-(init.)=1.17 s t(norm)=0.1594, mflops=31.3677 (err=1.8e-15) 16. Frigo-old: elapsed time t=1.72 s, 32 iters, t-(init.)=1.63 s t(norm)=0.22207, mflops=22.5154 (err=1.9e-15) 17. Green: elapsed time t=1.23 s, 16 iters, t-(init.)=1.18 s t(norm)=0.321524, mflops=15.5509 (err=1.8e-15) 18. GSL: elapsed time t=1.86 s, 32 iters, t-(init.)=1.76 s t(norm)=0.239781, mflops=20.8524 (err=1.8e-15) 19. GSL DIT: elapsed time t=1.36 s, 16 iters, t-(init.)=1.31 s t(norm)=0.356947, mflops=14.0077 (err=1.8e-15) 20. GSL DIF: elapsed time t=1.35 s, 16 iters, t-(init.)=1.3 s t(norm)=0.354222, mflops=14.1154 (err=1.8e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1.08 s, 16 iters, t-(init.)=1.04 s t(norm)=0.283378, mflops=17.6443 (err=1.8e-15) 23. Mayer (simple): elapsed time t=1 s, 16 iters, t-(init.)=0.95 s t(norm)=0.258854, mflops=19.3159 24. Mayer (lookup): elapsed time t=1.06 s, 16 iters, t-(init.)=1.01 s t(norm)=0.275203, mflops=18.1684 (err=1.9e-15) 25. Monro: elapsed time t=1.47 s, 16 iters, t-(init.)=1.42 s t(norm)=0.386919, mflops=12.9226 (err=1.4e-07) 26. NAPACK (f2c): elapsed time t=1.26 s, 8 iters, t-(init.)=1.24 s t(norm)=0.675746, mflops=7.39923 (err=2.3e-13) 27. 
Nielsen: elapsed time t=1.73 s, 16 iters, t-(init.)=1.68 s t(norm)=0.457764, mflops=10.9227 (err=1.3e-13) 28. NR (C): elapsed time t=1.23 s, 16 iters, t-(init.)=1.18 s t(norm)=0.321524, mflops=15.5509 (err=1.8e-15) 29. NR (F): elapsed time t=1.35 s, 16 iters, t-(init.)=1.3 s t(norm)=0.354222, mflops=14.1154 (err=1.8e-15) 30. Ooura (C): elapsed time t=1.44 s, 32 iters, t-(init.)=1.34 s t(norm)=0.182561, mflops=27.3882 (err=1.9e-15) 31. Ooura (F): elapsed time t=1.5 s, 32 iters, t-(init.)=1.4 s t(norm)=0.190735, mflops=26.2144 (err=1.9e-15) 32. QFT: elapsed time t=1.61 s, 16 iters, t-(init.)=1.56 s t(norm)=0.425066, mflops=11.7629 (err=3.8e-15) 33. Ransom: elapsed time t=1.61 s, 32 iters, t-(init.)=1.52 s t(norm)=0.207084, mflops=24.1448 (err=4.0e-15) 34. SCIPORT: elapsed time t=1.5 s, 8 iters, t-(init.)=1.47 s t(norm)=0.801086, mflops=6.24152 (err=2.1e-07) 35. Singleton: elapsed time t=1 s, 16 iters, t-(init.)=0.95 s t(norm)=0.258854, mflops=19.3159 (err=2.5e-15) 36. Singleton (f2c): elapsed time t=1.04 s, 16 iters, t-(init.)=0.99 s t(norm)=0.269754, mflops=18.5354 (err=2.5e-15) 37. Sorensen: elapsed time t=1.23 s, 16 iters, t-(init.)=1.18 s t(norm)=0.321524, mflops=15.5509 (err=1.9e-15) 38. Sorensen DIT: elapsed time t=1.72 s, 16 iters, t-(init.)=1.67 s t(norm)=0.455039, mflops=10.9881 (err=1.8e-15) 39. Temperton: elapsed time t=1.21 s, 16 iters, t-(init.)=1.16 s t(norm)=0.316075, mflops=15.819 (err=1.5e-07) 40. Temperton (f2c): elapsed time t=1.32 s, 16 iters, t-(init.)=1.27 s t(norm)=0.346048, mflops=14.4489 (err=1.8e-15) 41. Valkenburg: elapsed time t=1.34 s, 4 iters, t-(init.)=1.32 s t(norm)=1.43869, mflops=3.47539 (err=1.7e-15) Top mflops for N=16384 = 32.768 Normalized results and averages for N=16384: fft 0: mflops = 13.5927 (norm. = 0.414815), norm. avg. (of 14) = 0.44406 fft 1: mflops = 14.6801 (norm. = 0.448), norm. avg. (of 14) = 0.447609 fft 2: mflops = 11.4688 (norm. = 0.35), norm. avg. (of 14) = 0.313609 fft 3: mflops = 14.2249 (norm. 
= 0.434109), norm. avg. (of 14) = 0.212823 fft 4: mflops = 12.483 (norm. = 0.380952), norm. avg. (of 14) = 0.30323 fft 5: mflops = 6.5536 (norm. = 0.2), norm. avg. (of 14) = 0.119427 fft 6: mflops = 24.3047 (norm. = 0.741722), norm. avg. (of 14) = 0.482957 fft 7: mflops = 18.9176 (norm. = 0.57732), norm. avg. (of 14) = 0.346369 fft 8: mflops = 10.309 (norm. = 0.314607), norm. avg. (of 14) = 0.228638 fft 9: mflops = 21.3373 (norm. = 0.651163), norm. avg. (of 14) = 0.373506 fft 10: mflops = 21.5883 (norm. = 0.658824), norm. avg. (of 14) = 0.36763 fft 11: mflops = 10.4858 (norm. = 0.32), norm. avg. (of 13) = 0.258741 fft 12: mflops = 21.5883 (norm. = 0.658824), norm. avg. (of 14) = 0.542958 fft 13: mflops = 12.9226 (norm. = 0.394366), norm. avg. (of 14) = 0.272848 fft 14: mflops = 32.768 (norm. = 1), norm. avg. (of 14) = 0.820039 fft 15: mflops = 31.3677 (norm. = 0.957265), norm. avg. (of 14) = 0.800404 fft 16: mflops = 22.5154 (norm. = 0.687117), norm. avg. (of 14) = 0.761332 fft 17: mflops = 15.5509 (norm. = 0.474576), norm. avg. (of 12) = 0.363133 fft 18: mflops = 20.8524 (norm. = 0.636364), norm. avg. (of 14) = 0.413943 fft 19: mflops = 14.0077 (norm. = 0.427481), norm. avg. (of 14) = 0.334181 fft 20: mflops = 14.1154 (norm. = 0.430769), norm. avg. (of 14) = 0.349127 fft 21: mflops = -1 (norm. = -0.0305176), norm. avg. (of 12) = 0.594056 fft 22: mflops = 17.6443 (norm. = 0.538462), norm. avg. (of 13) = 0.456653 fft 23: mflops = 19.3159 (norm. = 0.589474), norm. avg. (of 13) = 0.54592 fft 24: mflops = 18.1684 (norm. = 0.554455), norm. avg. (of 13) = 0.552234 fft 25: mflops = 12.9226 (norm. = 0.394366), norm. avg. (of 13) = 0.312434 fft 26: mflops = 7.39923 (norm. = 0.225806), norm. avg. (of 14) = 0.138552 fft 27: mflops = 10.9227 (norm. = 0.333333), norm. avg. (of 14) = 0.232181 fft 28: mflops = 15.5509 (norm. = 0.474576), norm. avg. (of 14) = 0.38094 fft 29: mflops = 14.1154 (norm. = 0.430769), norm. avg. (of 14) = 0.305162 fft 30: mflops = 27.3882 (norm. 
= 0.835821), norm. avg. (of 14) = 0.7261 fft 31: mflops = 26.2144 (norm. = 0.8), norm. avg. (of 14) = 0.600687 fft 32: mflops = 11.7629 (norm. = 0.358974), norm. avg. (of 11) = 0.460787 fft 33: mflops = 24.1448 (norm. = 0.736842), norm. avg. (of 13) = 0.352852 fft 34: mflops = 6.24152 (norm. = 0.190476), norm. avg. (of 13) = 0.144715 fft 35: mflops = 19.3159 (norm. = 0.589474), norm. avg. (of 14) = 0.402898 fft 36: mflops = 18.5354 (norm. = 0.565657), norm. avg. (of 14) = 0.387085 fft 37: mflops = 15.5509 (norm. = 0.474576), norm. avg. (of 14) = 0.441153 fft 38: mflops = 10.9881 (norm. = 0.335329), norm. avg. (of 14) = 0.249795 fft 39: mflops = 15.819 (norm. = 0.482759), norm. avg. (of 14) = 0.307458 fft 40: mflops = 14.4489 (norm. = 0.440945), norm. avg. (of 14) = 0.302764 fft 41: mflops = 3.47539 (norm. = 0.106061), norm. avg. (of 14) = 0.0658647 Benchmarking for array size = 32768 (power of 2): 0. Arndt DIF: elapsed time t=1.71 s, 8 iters, t-(init.)=1.66 s t(norm)=0.42216, mflops=11.8439 (err=2.1e-15) 1. Arndt DIT: elapsed time t=1.61 s, 8 iters, t-(init.)=1.56 s t(norm)=0.396729, mflops=12.6031 (err=2.1e-15) 2. Arndt Split-Radix: elapsed time t=1.06 s, 4 iters, t-(init.)=1.03 s t(norm)=0.523885, mflops=9.54408 (err=2.1e-15) 3. Arndt 4-step: elapsed time t=1.58 s, 8 iters, t-(init.)=1.53 s t(norm)=0.389099, mflops=12.8502 (err=2.1e-15) 4. Bailey: elapsed time t=1.66 s, 8 iters, t-(init.)=1.6 s t(norm)=0.406901, mflops=12.288 (err=2.1e-15) 5. Beauregard: elapsed time t=1.59 s, 4 iters, t-(init.)=1.56 s t(norm)=0.793457, mflops=6.30154 (err=2.2e-15) 6. Bergland: elapsed time t=1.87 s, 16 iters, t-(init.)=1.78 s t(norm)=0.226339, mflops=22.0908 (err=2.2e-15) 7. Brenner: elapsed time t=1.28 s, 8 iters, t-(init.)=1.23 s t(norm)=0.312805, mflops=15.9844 (err=2.2e-15) 8. Burrus: elapsed time t=1.16 s, 4 iters, t-(init.)=1.13 s t(norm)=0.574748, mflops=8.69947 (err=2.1e-15) 9. 
CWP (min N) (N=34320): elapsed time t=1.98 s, 16 iters, t-(init.)=1.88 s t(norm)=0.239054, mflops=20.9157 10. CWP (best N) (N=34320): elapsed time t=1.98 s, 16 iters, t-(init.)=1.87 s t(norm)=0.237783, mflops=21.0276 11. Edelblute: elapsed time t=1.14 s, 4 iters, t-(init.)=1.12 s t(norm)=0.569661, mflops=8.77714 (err=2.1e-15) 12. FFTPACK: elapsed time t=1.9 s, 16 iters, t-(init.)=1.8 s t(norm)=0.228882, mflops=21.8453 (err=2.1e-15) 13. FFTPACK (f2c): elapsed time t=1.56 s, 8 iters, t-(init.)=1.52 s t(norm)=0.386556, mflops=12.9347 (err=2.1e-15) FFTW_MEASURE plan: (cost = 8.250000e-02) FFTW_TWIDDLE 32 FFTW_TWIDDLE 32 FFTW_TWIDDLE 4 FFTW_NOTW 8 14. FFTW: elapsed time t=1.31 s, 16 iters, t-(init.)=1.21 s t(norm)=0.153859, mflops=32.4972 (err=2.1e-15) FFTW_ESTIMATE plan: (cost = 2.883584e+05) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.4 s, 16 iters, t-(init.)=1.31 s t(norm)=0.166575, mflops=30.0165 (err=2.1e-15) 16. Frigo-old: elapsed time t=1.99 s, 16 iters, t-(init.)=1.89 s t(norm)=0.240326, mflops=20.8051 (err=2.2e-15) 17. Green: elapsed time t=1.34 s, 8 iters, t-(init.)=1.29 s t(norm)=0.328064, mflops=15.2409 (err=2.2e-15) 18. GSL: elapsed time t=1.97 s, 16 iters, t-(init.)=1.87 s t(norm)=0.237783, mflops=21.0276 (err=2.2e-15) 19. GSL DIT: elapsed time t=1.71 s, 8 iters, t-(init.)=1.66 s t(norm)=0.42216, mflops=11.8439 (err=2.2e-15) 20. GSL DIF: elapsed time t=1.73 s, 8 iters, t-(init.)=1.68 s t(norm)=0.427246, mflops=11.7029 (err=2.2e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1.27 s, 8 iters, t-(init.)=1.23 s t(norm)=0.312805, mflops=15.9844 (err=2.1e-15) 23. Mayer (simple): elapsed time t=1.17 s, 8 iters, t-(init.)=1.12 s t(norm)=0.284831, mflops=17.5543 24. Mayer (lookup): elapsed time t=1.28 s, 8 iters, t-(init.)=1.24 s t(norm)=0.315348, mflops=15.8555 (err=2.1e-15) 25. 
Monro: elapsed time t=1.87 s, 8 iters, t-(init.)=1.83 s t(norm)=0.465393, mflops=10.7436 (err=1.5e-07) 26. NAPACK (f2c): elapsed time t=1.43 s, 4 iters, t-(init.)=1.4 s t(norm)=0.712077, mflops=7.02171 (err=5.7e-13) 27. Nielsen: elapsed time t=1.91 s, 8 iters, t-(init.)=1.86 s t(norm)=0.473022, mflops=10.5703 (err=2.3e-13) 28. NR (C): elapsed time t=1.57 s, 8 iters, t-(init.)=1.52 s t(norm)=0.386556, mflops=12.9347 (err=2.2e-15) 29. NR (F): elapsed time t=1.69 s, 8 iters, t-(init.)=1.65 s t(norm)=0.419617, mflops=11.9156 (err=2.2e-15) 30. Ooura (C): elapsed time t=1.71 s, 16 iters, t-(init.)=1.61 s t(norm)=0.204722, mflops=24.4234 (err=2.2e-15) 31. Ooura (F): elapsed time t=1.77 s, 16 iters, t-(init.)=1.67 s t(norm)=0.212351, mflops=23.5459 (err=2.2e-15) 32. QFT: elapsed time t=1.9 s, 8 iters, t-(init.)=1.85 s t(norm)=0.470479, mflops=10.6275 (err=4.9e-15) 33. Ransom: elapsed time t=1.05 s, 8 iters, t-(init.)=1 s t(norm)=0.254313, mflops=19.6608 (err=3.6e-15) 34. SCIPORT: elapsed time t=1.63 s, 4 iters, t-(init.)=1.61 s t(norm)=0.818888, mflops=6.10584 (err=2.3e-07) 35. Singleton: elapsed time t=1.32 s, 8 iters, t-(init.)=1.27 s t(norm)=0.322978, mflops=15.4809 (err=3.2e-15) 36. Singleton (f2c): elapsed time t=1.36 s, 8 iters, t-(init.)=1.31 s t(norm)=0.33315, mflops=15.0082 (err=3.2e-15) 37. Sorensen: elapsed time t=1.52 s, 8 iters, t-(init.)=1.48 s t(norm)=0.376383, mflops=13.2843 (err=2.1e-15) 38. Sorensen DIT: elapsed time t=1.11 s, 4 iters, t-(init.)=1.09 s t(norm)=0.554403, mflops=9.01872 (err=2.1e-15) 39. Temperton: elapsed time t=1.43 s, 8 iters, t-(init.)=1.39 s t(norm)=0.353495, mflops=14.1445 (err=1.5e-07) 40. Temperton (f2c): elapsed time t=1.55 s, 8 iters, t-(init.)=1.5 s t(norm)=0.38147, mflops=13.1072 (err=2.2e-15) 41. Valkenburg: elapsed time t=1.52 s, 2 iters, t-(init.)=1.51 s t(norm)=1.53605, mflops=3.2551 (err=2.3e-15) Top mflops for N=32768 = 32.4972 Normalized results and averages for N=32768: fft 0: mflops = 11.8439 (norm. = 0.364458), norm. 
avg. (of 15) = 0.438753 fft 1: mflops = 12.6031 (norm. = 0.387821), norm. avg. (of 15) = 0.443623 fft 2: mflops = 9.54408 (norm. = 0.293689), norm. avg. (of 15) = 0.312281 fft 3: mflops = 12.8502 (norm. = 0.395425), norm. avg. (of 15) = 0.224996 fft 4: mflops = 12.288 (norm. = 0.378125), norm. avg. (of 15) = 0.308223 fft 5: mflops = 6.30154 (norm. = 0.19391), norm. avg. (of 15) = 0.124392 fft 6: mflops = 22.0908 (norm. = 0.679775), norm. avg. (of 15) = 0.496079 fft 7: mflops = 15.9844 (norm. = 0.49187), norm. avg. (of 15) = 0.356069 fft 8: mflops = 8.69947 (norm. = 0.267699), norm. avg. (of 15) = 0.231242 fft 9: mflops = 20.9157 (norm. = 0.643617), norm. avg. (of 15) = 0.391513 fft 10: mflops = 21.0276 (norm. = 0.647059), norm. avg. (of 15) = 0.386259 fft 11: mflops = 8.77714 (norm. = 0.270089), norm. avg. (of 14) = 0.259552 fft 12: mflops = 21.8453 (norm. = 0.672222), norm. avg. (of 15) = 0.551576 fft 13: mflops = 12.9347 (norm. = 0.398026), norm. avg. (of 15) = 0.281193 fft 14: mflops = 32.4972 (norm. = 1), norm. avg. (of 15) = 0.832036 fft 15: mflops = 30.0165 (norm. = 0.923664), norm. avg. (of 15) = 0.808621 fft 16: mflops = 20.8051 (norm. = 0.640212), norm. avg. (of 15) = 0.753257 fft 17: mflops = 15.2409 (norm. = 0.468992), norm. avg. (of 13) = 0.371276 fft 18: mflops = 21.0276 (norm. = 0.647059), norm. avg. (of 15) = 0.429484 fft 19: mflops = 11.8439 (norm. = 0.364458), norm. avg. (of 15) = 0.3362 fft 20: mflops = 11.7029 (norm. = 0.360119), norm. avg. (of 15) = 0.34986 fft 21: mflops = -1 (norm. = -0.0307719), norm. avg. (of 12) = 0.594056 fft 22: mflops = 15.9844 (norm. = 0.49187), norm. avg. (of 14) = 0.459169 fft 23: mflops = 17.5543 (norm. = 0.540179), norm. avg. (of 14) = 0.54551 fft 24: mflops = 15.8555 (norm. = 0.487903), norm. avg. (of 14) = 0.547639 fft 25: mflops = 10.7436 (norm. = 0.330601), norm. avg. (of 14) = 0.313731 fft 26: mflops = 7.02171 (norm. = 0.216071), norm. avg. (of 15) = 0.14372 fft 27: mflops = 10.5703 (norm. = 0.325269), norm. 
avg. (of 15) = 0.238387 fft 28: mflops = 12.9347 (norm. = 0.398026), norm. avg. (of 15) = 0.382079 fft 29: mflops = 11.9156 (norm. = 0.366667), norm. avg. (of 15) = 0.309262 fft 30: mflops = 24.4234 (norm. = 0.751553), norm. avg. (of 15) = 0.727797 fft 31: mflops = 23.5459 (norm. = 0.724551), norm. avg. (of 15) = 0.608945 fft 32: mflops = 10.6275 (norm. = 0.327027), norm. avg. (of 12) = 0.44964 fft 33: mflops = 19.6608 (norm. = 0.605), norm. avg. (of 14) = 0.370863 fft 34: mflops = 6.10584 (norm. = 0.187888), norm. avg. (of 14) = 0.147799 fft 35: mflops = 15.4809 (norm. = 0.476378), norm. avg. (of 15) = 0.407796 fft 36: mflops = 15.0082 (norm. = 0.461832), norm. avg. (of 15) = 0.392068 fft 37: mflops = 13.2843 (norm. = 0.408784), norm. avg. (of 15) = 0.438995 fft 38: mflops = 9.01872 (norm. = 0.277523), norm. avg. (of 15) = 0.251644 fft 39: mflops = 14.1445 (norm. = 0.435252), norm. avg. (of 15) = 0.315977 fft 40: mflops = 13.1072 (norm. = 0.403333), norm. avg. (of 15) = 0.309468 fft 41: mflops = 3.2551 (norm. = 0.100166), norm. avg. (of 15) = 0.0681514 Benchmarking for array size = 65536 (power of 2): 0. Arndt DIF: elapsed time t=1.06 s, 2 iters, t-(init.)=1.04 s t(norm)=0.495911, mflops=10.0825 (err=4.0e-15) 1. Arndt DIT: elapsed time t=1.02 s, 2 iters, t-(init.)=0.99 s t(norm)=0.472069, mflops=10.5917 (err=4.1e-15) 2. Arndt Split-Radix: elapsed time t=1.28 s, 2 iters, t-(init.)=1.26 s t(norm)=0.600815, mflops=8.32203 (err=4.1e-15) 3. Arndt 4-step: elapsed time t=1.58 s, 4 iters, t-(init.)=1.53 s t(norm)=0.36478, mflops=13.7069 (err=4.2e-15) 4. Bailey: elapsed time t=1 s, 2 iters, t-(init.)=0.98 s t(norm)=0.4673, mflops=10.6998 (err=4.0e-15) 5. Beauregard: elapsed time t=1.79 s, 2 iters, t-(init.)=1.76 s t(norm)=0.839233, mflops=5.95782 (err=4.2e-15) 6. Bergland: elapsed time t=1.14 s, 4 iters, t-(init.)=1.09 s t(norm)=0.259876, mflops=19.2399 (err=4.3e-15) 7. 
Brenner: elapsed time t=1.44 s, 4 iters, t-(init.)=1.39 s t(norm)=0.331402, mflops=15.0874 (err=4.3e-15) 8. Burrus: elapsed time t=1.38 s, 2 iters, t-(init.)=1.36 s t(norm)=0.648499, mflops=7.71012 (err=4.1e-15) 9. CWP (min N) (N=72072): elapsed time t=1.18 s, 4 iters, t-(init.)=1.13 s t(norm)=0.269413, mflops=18.5589 10. CWP (best N) (N=72072): elapsed time t=1.18 s, 4 iters, t-(init.)=1.12 s t(norm)=0.267029, mflops=18.7246 11. Edelblute: elapsed time t=1.35 s, 2 iters, t-(init.)=1.33 s t(norm)=0.634193, mflops=7.88403 (err=4.1e-15) 12. FFTPACK: elapsed time t=1.99 s, 8 iters, t-(init.)=1.89 s t(norm)=0.225306, mflops=22.1921 (err=4.2e-15) 13. FFTPACK (f2c): elapsed time t=1.65 s, 4 iters, t-(init.)=1.6 s t(norm)=0.38147, mflops=13.1072 (err=4.2e-15) FFTW_MEASURE plan: (cost = 1.800000e-01) FFTW_TWIDDLE 4 FFTW_TWIDDLE 32 FFTW_TWIDDLE 16 FFTW_NOTW 32 14. FFTW: elapsed time t=1.45 s, 8 iters, t-(init.)=1.35 s t(norm)=0.160933, mflops=31.0689 (err=4.3e-15) FFTW_ESTIMATE plan: (cost = 5.767168e+05) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.47 s, 8 iters, t-(init.)=1.37 s t(norm)=0.163317, mflops=30.6154 (err=4.3e-15) 16. Frigo-old: elapsed time t=1.14 s, 4 iters, t-(init.)=1.08 s t(norm)=0.257492, mflops=19.4181 (err=4.4e-15) 17. Green: elapsed time t=1.55 s, 4 iters, t-(init.)=1.5 s t(norm)=0.357628, mflops=13.981 (err=4.3e-15) 18. GSL: elapsed time t=1.02 s, 4 iters, t-(init.)=0.97 s t(norm)=0.231266, mflops=21.6201 (err=4.2e-15) 19. GSL DIT: elapsed time t=1.03 s, 2 iters, t-(init.)=1.01 s t(norm)=0.481606, mflops=10.3819 (err=4.2e-15) 20. GSL DIF: elapsed time t=1.05 s, 2 iters, t-(init.)=1.03 s t(norm)=0.491142, mflops=10.1803 (err=4.2e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1.57 s, 4 iters, t-(init.)=1.52 s t(norm)=0.362396, mflops=13.7971 (err=4.2e-15) 23. Mayer (simple): elapsed time t=1.51 s, 4 iters, t-(init.)=1.46 s t(norm)=0.348091, mflops=14.3641 24. 
Mayer (lookup): elapsed time t=1.6 s, 4 iters, t-(init.)=1.56 s t(norm)=0.371933, mflops=13.4433 (err=4.2e-15) 25. Monro: elapsed time t=1.1 s, 2 iters, t-(init.)=1.07 s t(norm)=0.510216, mflops=9.79978 (err=1.6e-07) 26. NAPACK (f2c): elapsed time t=1.51 s, 2 iters, t-(init.)=1.49 s t(norm)=0.710487, mflops=7.03742 (err=8.9e-13) 27. Nielsen: elapsed time t=1.2 s, 2 iters, t-(init.)=1.17 s t(norm)=0.557899, mflops=8.96219 (err=2.7e-13) 28. NR (C): elapsed time t=1.91 s, 4 iters, t-(init.)=1.86 s t(norm)=0.443459, mflops=11.275 (err=4.2e-15) 29. NR (F): elapsed time t=1.01 s, 2 iters, t-(init.)=0.98 s t(norm)=0.4673, mflops=10.6998 (err=4.2e-15) 30. Ooura (C): elapsed time t=1 s, 4 iters, t-(init.)=0.95 s t(norm)=0.226498, mflops=22.0753 (err=4.4e-15) 31. Ooura (F): elapsed time t=1.04 s, 4 iters, t-(init.)=0.99 s t(norm)=0.236034, mflops=21.1834 (err=4.4e-15) 32. QFT: elapsed time t=1.07 s, 2 iters, t-(init.)=1.04 s t(norm)=0.495911, mflops=10.0825 (err=7.9e-15) 33. Ransom: elapsed time t=1.28 s, 4 iters, t-(init.)=1.22 s t(norm)=0.290871, mflops=17.1898 (err=6.9e-15) 34. SCIPORT: elapsed time t=1.77 s, 2 iters, t-(init.)=1.74 s t(norm)=0.829697, mflops=6.0263 (err=2.5e-07) 35. Singleton: elapsed time t=1.41 s, 4 iters, t-(init.)=1.37 s t(norm)=0.326633, mflops=15.3077 (err=5.6e-15) 36. Singleton (f2c): elapsed time t=1.47 s, 4 iters, t-(init.)=1.43 s t(norm)=0.340939, mflops=14.6654 (err=5.6e-15) 37. Sorensen: elapsed time t=1.84 s, 4 iters, t-(init.)=1.79 s t(norm)=0.426769, mflops=11.7159 (err=4.2e-15) 38. Sorensen DIT: elapsed time t=1.29 s, 2 iters, t-(init.)=1.27 s t(norm)=0.605583, mflops=8.2565 (err=4.1e-15) 39. Temperton: elapsed time t=1.56 s, 4 iters, t-(init.)=1.51 s t(norm)=0.360012, mflops=13.8884 (err=1.8e-07) 40. Temperton (f2c): elapsed time t=1.77 s, 4 iters, t-(init.)=1.72 s t(norm)=0.41008, mflops=12.1927 (err=4.2e-15) 41. 
Valkenburg: elapsed time t=1.67 s, 1 iters, t-(init.)=1.66 s t(norm)=1.5831, mflops=3.15836 (err=4.0e-15) Top mflops for N=65536 = 31.0689 Normalized results and averages for N=65536: fft 0: mflops = 10.0825 (norm. = 0.324519), norm. avg. (of 16) = 0.431614 fft 1: mflops = 10.5917 (norm. = 0.340909), norm. avg. (of 16) = 0.437203 fft 2: mflops = 8.32203 (norm. = 0.267857), norm. avg. (of 16) = 0.309504 fft 3: mflops = 13.7069 (norm. = 0.441176), norm. avg. (of 16) = 0.238508 fft 4: mflops = 10.6998 (norm. = 0.344388), norm. avg. (of 16) = 0.310483 fft 5: mflops = 5.95782 (norm. = 0.191761), norm. avg. (of 16) = 0.128603 fft 6: mflops = 19.2399 (norm. = 0.619266), norm. avg. (of 16) = 0.503778 fft 7: mflops = 15.0874 (norm. = 0.485612), norm. avg. (of 16) = 0.364165 fft 8: mflops = 7.71012 (norm. = 0.248162), norm. avg. (of 16) = 0.2323 fft 9: mflops = 18.5589 (norm. = 0.597345), norm. avg. (of 16) = 0.404378 fft 10: mflops = 18.7246 (norm. = 0.602679), norm. avg. (of 16) = 0.399785 fft 11: mflops = 7.88403 (norm. = 0.253759), norm. avg. (of 15) = 0.259166 fft 12: mflops = 22.1921 (norm. = 0.714286), norm. avg. (of 16) = 0.561745 fft 13: mflops = 13.1072 (norm. = 0.421875), norm. avg. (of 16) = 0.289986 fft 14: mflops = 31.0689 (norm. = 1), norm. avg. (of 16) = 0.842534 fft 15: mflops = 30.6154 (norm. = 0.985401), norm. avg. (of 16) = 0.81967 fft 16: mflops = 19.4181 (norm. = 0.625), norm. avg. (of 16) = 0.745241 fft 17: mflops = 13.981 (norm. = 0.45), norm. avg. (of 14) = 0.376899 fft 18: mflops = 21.6201 (norm. = 0.695876), norm. avg. (of 16) = 0.446134 fft 19: mflops = 10.3819 (norm. = 0.334158), norm. avg. (of 16) = 0.336072 fft 20: mflops = 10.1803 (norm. = 0.32767), norm. avg. (of 16) = 0.348473 fft 21: mflops = -1 (norm. = -0.0321865), norm. avg. (of 12) = 0.594056 fft 22: mflops = 13.7971 (norm. = 0.444079), norm. avg. (of 15) = 0.458163 fft 23: mflops = 14.3641 (norm. = 0.462329), norm. avg. (of 15) = 0.539965 fft 24: mflops = 13.4433 (norm. 
= 0.432692), norm. avg. (of 15) = 0.539976 fft 25: mflops = 9.79978 (norm. = 0.315421), norm. avg. (of 15) = 0.313844 fft 26: mflops = 7.03742 (norm. = 0.22651), norm. avg. (of 16) = 0.148894 fft 27: mflops = 8.96219 (norm. = 0.288462), norm. avg. (of 16) = 0.241517 fft 28: mflops = 11.275 (norm. = 0.362903), norm. avg. (of 16) = 0.38088 fft 29: mflops = 10.6998 (norm. = 0.344388), norm. avg. (of 16) = 0.311458 fft 30: mflops = 22.0753 (norm. = 0.710526), norm. avg. (of 16) = 0.726717 fft 31: mflops = 21.1834 (norm. = 0.681818), norm. avg. (of 16) = 0.6135 fft 32: mflops = 10.0825 (norm. = 0.324519), norm. avg. (of 13) = 0.440016 fft 33: mflops = 17.1898 (norm. = 0.553279), norm. avg. (of 15) = 0.383024 fft 34: mflops = 6.0263 (norm. = 0.193966), norm. avg. (of 15) = 0.150876 fft 35: mflops = 15.3077 (norm. = 0.492701), norm. avg. (of 16) = 0.413103 fft 36: mflops = 14.6654 (norm. = 0.472028), norm. avg. (of 16) = 0.397066 fft 37: mflops = 11.7159 (norm. = 0.377095), norm. avg. (of 16) = 0.435127 fft 38: mflops = 8.2565 (norm. = 0.265748), norm. avg. (of 16) = 0.252525 fft 39: mflops = 13.8884 (norm. = 0.44702), norm. avg. (of 16) = 0.324167 fft 40: mflops = 12.1927 (norm. = 0.392442), norm. avg. (of 16) = 0.314654 fft 41: mflops = 3.15836 (norm. = 0.101657), norm. avg. (of 16) = 0.0702455 Benchmarking for array size = 131072 (power of 2): 0. Arndt DIF: elapsed time t=1.2 s, 1 iters, t-(init.)=1.17 s t(norm)=0.525082, mflops=9.52232 (err=2.8e-15) 1. Arndt DIT: elapsed time t=1.16 s, 1 iters, t-(init.)=1.14 s t(norm)=0.511618, mflops=9.77291 (err=2.8e-15) 2. Arndt Split-Radix: elapsed time t=1.52 s, 1 iters, t-(init.)=1.49 s t(norm)=0.668694, mflops=7.47726 (err=2.8e-15) 3. Arndt 4-step: elapsed time t=1.93 s, 2 iters, t-(init.)=1.88 s t(norm)=0.421861, mflops=11.8523 (err=2.8e-15) 4. Bailey: elapsed time t=1.04 s, 1 iters, t-(init.)=1.01 s t(norm)=0.453276, mflops=11.0308 (err=2.8e-15) 5. 
Beauregard: elapsed time t=1.94 s, 1 iters, t-(init.)=1.91 s t(norm)=0.857185, mflops=5.83305 (err=2.9e-15) 6. Bergland: elapsed time t=1.24 s, 2 iters, t-(init.)=1.19 s t(norm)=0.267029, mflops=18.7246 (err=2.9e-15) 7. Brenner: elapsed time t=1.67 s, 2 iters, t-(init.)=1.62 s t(norm)=0.363518, mflops=13.7545 (err=2.9e-15) 8. Burrus: elapsed time t=1.62 s, 1 iters, t-(init.)=1.59 s t(norm)=0.713573, mflops=7.00699 (err=2.8e-15) 9. CWP (min N) (N=144144): elapsed time t=1.27 s, 2 iters, t-(init.)=1.21 s t(norm)=0.271517, mflops=18.4151 10. CWP (best N) (N=144144): elapsed time t=1.25 s, 2 iters, t-(init.)=1.2 s t(norm)=0.269273, mflops=18.5685 11. Edelblute: elapsed time t=1.61 s, 1 iters, t-(init.)=1.58 s t(norm)=0.709085, mflops=7.05134 (err=2.8e-15) 12. FFTPACK: elapsed time t=1.19 s, 2 iters, t-(init.)=1.13 s t(norm)=0.253565, mflops=19.7188 (err=2.9e-15) 13. FFTPACK (f2c): elapsed time t=1.92 s, 2 iters, t-(init.)=1.87 s t(norm)=0.419617, mflops=11.9156 (err=2.9e-15) FFTW_MEASURE plan: (cost = 3.900000e-01) FFTW_TWIDDLE 32 FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_NOTW 16 14. FFTW: elapsed time t=1.56 s, 4 iters, t-(init.)=1.47 s t(norm)=0.16493, mflops=30.316 (err=2.8e-15) FFTW_ESTIMATE plan: (cost = 1.153434e+06) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.58 s, 4 iters, t-(init.)=1.48 s t(norm)=0.166052, mflops=30.1111 (err=2.9e-15) 16. Frigo-old: elapsed time t=1.21 s, 2 iters, t-(init.)=1.16 s t(norm)=0.260297, mflops=19.2088 (err=2.8e-15) 17. Green: elapsed time t=1.73 s, 2 iters, t-(init.)=1.68 s t(norm)=0.376982, mflops=13.2632 (err=2.9e-15) 18. GSL: elapsed time t=1.21 s, 2 iters, t-(init.)=1.16 s t(norm)=0.260297, mflops=19.2088 (err=2.9e-15) 19. GSL DIT: elapsed time t=1.21 s, 1 iters, t-(init.)=1.18 s t(norm)=0.52957, mflops=9.44163 (err=2.9e-15) 20. GSL DIF: elapsed time t=1.21 s, 1 iters, t-(init.)=1.18 s t(norm)=0.52957, mflops=9.44163 (err=2.9e-15) 21. Skipping fft (Krukar can't handle N > 4096). 
22. Mayer (Buneman): elapsed time t=1.07 s, 1 iters, t-(init.)=1.05 s t(norm)=0.471227, mflops=10.6106 (err=2.8e-15) 23. Mayer (simple): elapsed time t=1.02 s, 1 iters, t-(init.)=1 s t(norm)=0.448788, mflops=11.1411 24. Mayer (lookup): elapsed time t=1.06 s, 1 iters, t-(init.)=1.03 s t(norm)=0.462252, mflops=10.8166 (err=2.8e-15) 25. Monro: elapsed time t=1.31 s, 1 iters, t-(init.)=1.29 s t(norm)=0.578936, mflops=8.63653 (err=1.7e-07) 26. NAPACK (f2c): elapsed time t=1.66 s, 1 iters, t-(init.)=1.64 s t(norm)=0.736012, mflops=6.79337 (err=2.1e-12) 27. Nielsen: elapsed time t=1.29 s, 1 iters, t-(init.)=1.27 s t(norm)=0.569961, mflops=8.77254 (err=9.6e-13) 28. NR (C): elapsed time t=1.12 s, 1 iters, t-(init.)=1.09 s t(norm)=0.489179, mflops=10.2212 (err=2.9e-15) 29. NR (F): elapsed time t=1.16 s, 1 iters, t-(init.)=1.14 s t(norm)=0.511618, mflops=9.77291 (err=2.9e-15) 30. Ooura (C): elapsed time t=1.13 s, 2 iters, t-(init.)=1.07 s t(norm)=0.240102, mflops=20.8245 (err=2.8e-15) 31. Ooura (F): elapsed time t=1.16 s, 2 iters, t-(init.)=1.11 s t(norm)=0.249077, mflops=20.0741 (err=2.8e-15) 32. QFT: elapsed time t=1.25 s, 1 iters, t-(init.)=1.23 s t(norm)=0.552009, mflops=9.05782 (err=8.6e-15) 33. Ransom: elapsed time t=1.52 s, 2 iters, t-(init.)=1.46 s t(norm)=0.327615, mflops=15.2618 (err=4.0e-15) 34. SCIPORT: elapsed time t=1.9 s, 1 iters, t-(init.)=1.87 s t(norm)=0.839233, mflops=5.95782 (err=2.7e-07) 35. Singleton: elapsed time t=1.65 s, 2 iters, t-(init.)=1.6 s t(norm)=0.35903, mflops=13.9264 (err=4.3e-15) 36. Singleton (f2c): elapsed time t=1.71 s, 2 iters, t-(init.)=1.66 s t(norm)=0.372494, mflops=13.423 (err=4.2e-15) 37. Sorensen: elapsed time t=1.08 s, 1 iters, t-(init.)=1.06 s t(norm)=0.475715, mflops=10.5105 (err=2.8e-15) 38. Sorensen DIT: elapsed time t=1.56 s, 1 iters, t-(init.)=1.53 s t(norm)=0.686646, mflops=7.28178 (err=2.8e-15) 39. Temperton: elapsed time t=1.83 s, 2 iters, t-(init.)=1.79 s t(norm)=0.401665, mflops=12.4482 (err=2.0e-07) 40. 
Temperton (f2c): elapsed time t=1.04 s, 1 iters, t-(init.)=1.02 s t(norm)=0.457764, mflops=10.9227 (err=2.9e-15) 41. Valkenburg: elapsed time t=3.64 s, 1 iters, t-(init.)=3.62 s t(norm)=1.62461, mflops=3.07766 (err=3.1e-15) Top mflops for N=131072 = 30.316 Normalized results and averages for N=131072: fft 0: mflops = 9.52232 (norm. = 0.314103), norm. avg. (of 17) = 0.424701 fft 1: mflops = 9.77291 (norm. = 0.322368), norm. avg. (of 17) = 0.430448 fft 2: mflops = 7.47726 (norm. = 0.246644), norm. avg. (of 17) = 0.305807 fft 3: mflops = 11.8523 (norm. = 0.390957), norm. avg. (of 17) = 0.247475 fft 4: mflops = 11.0308 (norm. = 0.363861), norm. avg. (of 17) = 0.313623 fft 5: mflops = 5.83305 (norm. = 0.192408), norm. avg. (of 17) = 0.132356 fft 6: mflops = 18.7246 (norm. = 0.617647), norm. avg. (of 17) = 0.510476 fft 7: mflops = 13.7545 (norm. = 0.453704), norm. avg. (of 17) = 0.369432 fft 8: mflops = 7.00699 (norm. = 0.231132), norm. avg. (of 17) = 0.232231 fft 9: mflops = 18.4151 (norm. = 0.607438), norm. avg. (of 17) = 0.416323 fft 10: mflops = 18.5685 (norm. = 0.6125), norm. avg. (of 17) = 0.412298 fft 11: mflops = 7.05134 (norm. = 0.232595), norm. avg. (of 16) = 0.257505 fft 12: mflops = 19.7188 (norm. = 0.650442), norm. avg. (of 17) = 0.566963 fft 13: mflops = 11.9156 (norm. = 0.393048), norm. avg. (of 17) = 0.296048 fft 14: mflops = 30.316 (norm. = 1), norm. avg. (of 17) = 0.851797 fft 15: mflops = 30.1111 (norm. = 0.993243), norm. avg. (of 17) = 0.82988 fft 16: mflops = 19.2088 (norm. = 0.633621), norm. avg. (of 17) = 0.738675 fft 17: mflops = 13.2632 (norm. = 0.4375), norm. avg. (of 15) = 0.380939 fft 18: mflops = 19.2088 (norm. = 0.633621), norm. avg. (of 17) = 0.457162 fft 19: mflops = 9.44163 (norm. = 0.311441), norm. avg. (of 17) = 0.334623 fft 20: mflops = 9.44163 (norm. = 0.311441), norm. avg. (of 17) = 0.346295 fft 21: mflops = -1 (norm. = -0.0329859), norm. avg. (of 12) = 0.594056 fft 22: mflops = 10.6106 (norm. = 0.35), norm. avg. 
(of 16) = 0.451402 fft 23: mflops = 11.1411 (norm. = 0.3675), norm. avg. (of 16) = 0.529186 fft 24: mflops = 10.8166 (norm. = 0.356796), norm. avg. (of 16) = 0.528527 fft 25: mflops = 8.63653 (norm. = 0.284884), norm. avg. (of 16) = 0.312034 fft 26: mflops = 6.79337 (norm. = 0.224085), norm. avg. (of 17) = 0.153317 fft 27: mflops = 8.77254 (norm. = 0.28937), norm. avg. (of 17) = 0.244332 fft 28: mflops = 10.2212 (norm. = 0.337156), norm. avg. (of 17) = 0.378308 fft 29: mflops = 9.77291 (norm. = 0.322368), norm. avg. (of 17) = 0.312099 fft 30: mflops = 20.8245 (norm. = 0.686916), norm. avg. (of 17) = 0.724376 fft 31: mflops = 20.0741 (norm. = 0.662162), norm. avg. (of 17) = 0.616362 fft 32: mflops = 9.05782 (norm. = 0.29878), norm. avg. (of 14) = 0.429927 fft 33: mflops = 15.2618 (norm. = 0.503425), norm. avg. (of 16) = 0.390549 fft 34: mflops = 5.95782 (norm. = 0.196524), norm. avg. (of 16) = 0.153729 fft 35: mflops = 13.9264 (norm. = 0.459375), norm. avg. (of 17) = 0.415825 fft 36: mflops = 13.423 (norm. = 0.442771), norm. avg. (of 17) = 0.399754 fft 37: mflops = 10.5105 (norm. = 0.346698), norm. avg. (of 17) = 0.429925 fft 38: mflops = 7.28178 (norm. = 0.240196), norm. avg. (of 17) = 0.2518 fft 39: mflops = 12.4482 (norm. = 0.410615), norm. avg. (of 17) = 0.329252 fft 40: mflops = 10.9227 (norm. = 0.360294), norm. avg. (of 17) = 0.317339 fft 41: mflops = 3.07766 (norm. = 0.101519), norm. avg. (of 17) = 0.0720851 Benchmarking for array size = 262144 (power of 2): 0. Arndt DIF: elapsed time t=2.62 s, 1 iters, t-(init.)=2.56 s t(norm)=0.542535, mflops=9.216 (err=6.7e-15) 1. Arndt DIT: elapsed time t=2.55 s, 1 iters, t-(init.)=2.5 s t(norm)=0.529819, mflops=9.43718 (err=6.7e-15) 2. Arndt Split-Radix: elapsed time t=3.2 s, 1 iters, t-(init.)=3.15 s t(norm)=0.667572, mflops=7.48983 (err=6.7e-15) 3. Arndt 4-step: elapsed time t=1.77 s, 1 iters, t-(init.)=1.72 s t(norm)=0.364516, mflops=13.7168 (err=6.8e-15) 4. 
Bailey: elapsed time t=2.17 s, 1 iters, t-(init.)=2.12 s t(norm)=0.449287, mflops=11.1288 (err=6.7e-15) 5. Beauregard: elapsed time t=4.14 s, 1 iters, t-(init.)=4.09 s t(norm)=0.866784, mflops=5.76845 (err=6.8e-15) 6. Bergland: elapsed time t=1.27 s, 1 iters, t-(init.)=1.22 s t(norm)=0.258552, mflops=19.3385 (err=6.8e-15) 7. Brenner: elapsed time t=1.71 s, 1 iters, t-(init.)=1.66 s t(norm)=0.3518, mflops=14.2126 (err=6.9e-15) 8. Burrus: elapsed time t=3.41 s, 1 iters, t-(init.)=3.35 s t(norm)=0.709958, mflops=7.04267 (err=6.7e-15) 9. CWP (min N) (N=360360): elapsed time t=1.77 s, 1 iters, t-(init.)=1.7 s t(norm)=0.360277, mflops=13.8782 10. CWP (best N) (N=360360): elapsed time t=1.76 s, 1 iters, t-(init.)=1.68 s t(norm)=0.356038, mflops=14.0434 11. Edelblute: elapsed time t=3.4 s, 1 iters, t-(init.)=3.35 s t(norm)=0.709958, mflops=7.04267 (err=6.7e-15) 12. FFTPACK: elapsed time t=1.24 s, 1 iters, t-(init.)=1.19 s t(norm)=0.252194, mflops=19.826 (err=6.8e-15) 13. FFTPACK (f2c): elapsed time t=2 s, 1 iters, t-(init.)=1.95 s t(norm)=0.413259, mflops=12.099 (err=6.8e-15) FFTW_MEASURE plan: (cost = 8.300000e-01) FFTW_TWIDDLE 32 FFTW_TWIDDLE 32 FFTW_TWIDDLE 4 FFTW_TWIDDLE 4 FFTW_NOTW 16 14. FFTW: elapsed time t=1.63 s, 2 iters, t-(init.)=1.53 s t(norm)=0.162125, mflops=30.8405 (err=6.8e-15) FFTW_ESTIMATE plan: (cost = 2.988442e+06) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.74 s, 2 iters, t-(init.)=1.64 s t(norm)=0.173781, mflops=28.7719 (err=6.8e-15) 16. Frigo-old: elapsed time t=1.36 s, 1 iters, t-(init.)=1.31 s t(norm)=0.277625, mflops=18.0099 (err=6.9e-15) 17. Green: elapsed time t=1.78 s, 1 iters, t-(init.)=1.73 s t(norm)=0.366635, mflops=13.6375 (err=6.9e-15) 18. GSL: elapsed time t=1.21 s, 1 iters, t-(init.)=1.15 s t(norm)=0.243717, mflops=20.5156 (err=6.8e-15) 19. GSL DIT: elapsed time t=2.61 s, 1 iters, t-(init.)=2.56 s t(norm)=0.542535, mflops=9.216 (err=6.8e-15) 20. 
GSL DIF: elapsed time t=2.6 s, 1 iters, t-(init.)=2.55 s t(norm)=0.540415, mflops=9.25214 (err=6.8e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=2.31 s, 1 iters, t-(init.)=2.25 s t(norm)=0.476837, mflops=10.4858 (err=6.8e-15) 23. Mayer (simple): elapsed time t=2.23 s, 1 iters, t-(init.)=2.18 s t(norm)=0.462002, mflops=10.8225 24. Mayer (lookup): elapsed time t=2.34 s, 1 iters, t-(init.)=2.28 s t(norm)=0.483195, mflops=10.3478 (err=6.8e-15) 25. Monro: elapsed time t=2.71 s, 1 iters, t-(init.)=2.65 s t(norm)=0.561608, mflops=8.903 (err=1.8e-07) 26. NAPACK (f2c): elapsed time t=3.44 s, 1 iters, t-(init.)=3.38 s t(norm)=0.716315, mflops=6.98017 (err=3.7e-12) 27. Nielsen: elapsed time t=2.65 s, 1 iters, t-(init.)=2.6 s t(norm)=0.551012, mflops=9.07422 (err=2.2e-12) 28. NR (C): elapsed time t=2.41 s, 1 iters, t-(init.)=2.36 s t(norm)=0.500149, mflops=9.99702 (err=6.8e-15) 29. NR (F): elapsed time t=2.51 s, 1 iters, t-(init.)=2.46 s t(norm)=0.521342, mflops=9.59063 (err=6.8e-15) 30. Ooura (C): elapsed time t=1.19 s, 1 iters, t-(init.)=1.14 s t(norm)=0.241597, mflops=20.6956 (err=6.9e-15) 31. Ooura (F): elapsed time t=1.22 s, 1 iters, t-(init.)=1.17 s t(norm)=0.247955, mflops=20.1649 (err=6.9e-15) 32. QFT: elapsed time t=3.08 s, 1 iters, t-(init.)=3.03 s t(norm)=0.642141, mflops=7.78646 (err=1.4e-14) 33. Ransom: elapsed time t=1.36 s, 1 iters, t-(init.)=1.31 s t(norm)=0.277625, mflops=18.0099 (err=8.2e-15) 34. SCIPORT: elapsed time t=4.15 s, 1 iters, t-(init.)=4.11 s t(norm)=0.871023, mflops=5.74038 (err=2.8e-07) 35. Singleton: elapsed time t=1.7 s, 1 iters, t-(init.)=1.66 s t(norm)=0.3518, mflops=14.2126 (err=1.0e-14) 36. Singleton (f2c): elapsed time t=1.76 s, 1 iters, t-(init.)=1.71 s t(norm)=0.362396, mflops=13.7971 (err=1.0e-14) 37. Sorensen: elapsed time t=2.32 s, 1 iters, t-(init.)=2.28 s t(norm)=0.483195, mflops=10.3478 (err=6.8e-15) 38. 
Sorensen DIT: elapsed time t=3.33 s, 1 iters, t-(init.)=3.28 s t(norm)=0.695123, mflops=7.19298 (err=6.7e-15) 39. Temperton: elapsed time t=1.86 s, 1 iters, t-(init.)=1.81 s t(norm)=0.383589, mflops=13.0348 (err=2.0e-07) 40. Temperton (f2c): elapsed time t=2.12 s, 1 iters, t-(init.)=2.07 s t(norm)=0.43869, mflops=11.3976 (err=6.8e-15) 41. Valkenburg: elapsed time t=7.88 s, 1 iters, t-(init.)=7.83 s t(norm)=1.65939, mflops=3.01315 (err=6.8e-15) Top mflops for N=262144 = 30.8405 Normalized results and averages for N=262144: fft 0: mflops = 9.216 (norm. = 0.298828), norm. avg. (of 18) = 0.417708 fft 1: mflops = 9.43718 (norm. = 0.306), norm. avg. (of 18) = 0.423534 fft 2: mflops = 7.48983 (norm. = 0.242857), norm. avg. (of 18) = 0.30231 fft 3: mflops = 13.7168 (norm. = 0.444767), norm. avg. (of 18) = 0.258436 fft 4: mflops = 11.1288 (norm. = 0.360849), norm. avg. (of 18) = 0.316247 fft 5: mflops = 5.76845 (norm. = 0.187042), norm. avg. (of 18) = 0.135394 fft 6: mflops = 19.3385 (norm. = 0.627049), norm. avg. (of 18) = 0.516952 fft 7: mflops = 14.2126 (norm. = 0.460843), norm. avg. (of 18) = 0.374511 fft 8: mflops = 7.04267 (norm. = 0.228358), norm. avg. (of 18) = 0.232016 fft 9: mflops = 13.8782 (norm. = 0.45), norm. avg. (of 18) = 0.418194 fft 10: mflops = 14.0434 (norm. = 0.455357), norm. avg. (of 18) = 0.41469 fft 11: mflops = 7.04267 (norm. = 0.228358), norm. avg. (of 17) = 0.25579 fft 12: mflops = 19.826 (norm. = 0.642857), norm. avg. (of 18) = 0.571179 fft 13: mflops = 12.099 (norm. = 0.392308), norm. avg. (of 18) = 0.301396 fft 14: mflops = 30.8405 (norm. = 1), norm. avg. (of 18) = 0.86003 fft 15: mflops = 28.7719 (norm. = 0.932927), norm. avg. (of 18) = 0.835605 fft 16: mflops = 18.0099 (norm. = 0.583969), norm. avg. (of 18) = 0.73008 fft 17: mflops = 13.6375 (norm. = 0.442197), norm. avg. (of 16) = 0.384768 fft 18: mflops = 20.5156 (norm. = 0.665217), norm. avg. (of 18) = 0.468721 fft 19: mflops = 9.216 (norm. = 0.298828), norm. avg. 
(of 18) = 0.332635 fft 20: mflops = 9.25214 (norm. = 0.3), norm. avg. (of 18) = 0.343723 fft 21: mflops = -1 (norm. = -0.0324249), norm. avg. (of 12) = 0.594056 fft 22: mflops = 10.4858 (norm. = 0.34), norm. avg. (of 17) = 0.444849 fft 23: mflops = 10.8225 (norm. = 0.350917), norm. avg. (of 17) = 0.518699 fft 24: mflops = 10.3478 (norm. = 0.335526), norm. avg. (of 17) = 0.517174 fft 25: mflops = 8.903 (norm. = 0.288679), norm. avg. (of 17) = 0.31066 fft 26: mflops = 6.98017 (norm. = 0.226331), norm. avg. (of 18) = 0.157374 fft 27: mflops = 9.07422 (norm. = 0.294231), norm. avg. (of 18) = 0.247104 fft 28: mflops = 9.99702 (norm. = 0.324153), norm. avg. (of 18) = 0.3753 fft 29: mflops = 9.59063 (norm. = 0.310976), norm. avg. (of 18) = 0.312037 fft 30: mflops = 20.6956 (norm. = 0.671053), norm. avg. (of 18) = 0.721414 fft 31: mflops = 20.1649 (norm. = 0.653846), norm. avg. (of 18) = 0.618444 fft 32: mflops = 7.78646 (norm. = 0.252475), norm. avg. (of 15) = 0.418097 fft 33: mflops = 18.0099 (norm. = 0.583969), norm. avg. (of 17) = 0.401927 fft 34: mflops = 5.74038 (norm. = 0.186131), norm. avg. (of 17) = 0.155635 fft 35: mflops = 14.2126 (norm. = 0.460843), norm. avg. (of 18) = 0.418326 fft 36: mflops = 13.7971 (norm. = 0.447368), norm. avg. (of 18) = 0.4024 fft 37: mflops = 10.3478 (norm. = 0.335526), norm. avg. (of 18) = 0.424681 fft 38: mflops = 7.19298 (norm. = 0.233232), norm. avg. (of 18) = 0.250768 fft 39: mflops = 13.0348 (norm. = 0.422652), norm. avg. (of 18) = 0.334441 fft 40: mflops = 11.3976 (norm. = 0.369565), norm. avg. (of 18) = 0.32024 fft 41: mflops = 3.01315 (norm. = 0.0977011), norm. avg. 
(of 18) = 0.0735082 ------------------------------------------------------ @@@@ bench.1d.np2.log Benchmarking for sizes: 6 (0.000686646 MB) 9 (0.000915527 MB) 12 (0.00114441 MB) 15 (0.00137329 MB) 18 (0.00180054 MB) 24 (0.0022583 MB) 36 (0.0032959 MB) 80 (0.00738525 MB) 108 (0.00994873 MB) 210 (0.0192261 MB) 504 (0.0461426 MB) 1000 (0.0916748 MB) 1960 (0.179749 MB) 4725 (0.437393 MB) 10368 (0.960205 MB) 27000 (2.48291 MB) 75600 (6.98975 MB) 165375 (15.3664 MB) 362880 (38.6829 MB) Maximum array size = 720720 Benchmarking FFTs: 0. Brenner 1. CWP (min N) 2. CWP (best N) 3. FFTPACK 4. FFTPACK (f2c) 5. FFTW 6. FFTW_ESTIMATE 7. Frigo-old 8. GSL 9. Nielsen 10. Singleton 11. Singleton (f2c) 12. Temperton 13. Temperton (f2c) 14. Valkenburg Computing normalized averages (15 transforms). Benchmarking for array size = 6: 0. Skipping fft (Brenner has a bug for N=3*2^m). 1. CWP (min N): elapsed time t=1.88 s, 262144 iters, t-(init.)=1.57 s t(norm)=0.386148, mflops=12.9484 2. CWP (best N) (N=15): elapsed time t=1.05 s, 65536 iters, t-(init.)=0.88 s t(norm)=0.865759, mflops=5.77528 3. FFTPACK: elapsed time t=1.07 s, 262144 iters, t-(init.)=0.82 s t(norm)=0.201683, mflops=24.7914 (err=1.2e-16) 4. FFTPACK (f2c): elapsed time t=1.44 s, 262144 iters, t-(init.)=1.14 s t(norm)=0.280388, mflops=17.8324 (err=1.5e-16) FFTW_MEASURE plan: (cost = 1.907349e-06) FFTW_NOTW 6 5. FFTW: elapsed time t=1.03 s, 524288 iters, t-(init.)=0.42 s t(norm)=0.0516504, mflops=96.8046 (err=8.7e-17) FFTW_ESTIMATE plan: (cost = 4.116000e+02) FFTW_NOTW 6 6. FFTW_ESTIMATE: elapsed time t=1.02 s, 524288 iters, t-(init.)=0.4 s t(norm)=0.0491909, mflops=101.645 (err=8.7e-17) 7. Frigo-old: elapsed time t=1.77 s, 262144 iters, t-(init.)=1.47 s t(norm)=0.361553, mflops=13.8292 (err=2.6e-16) 8. GSL: elapsed time t=1.7 s, 262144 iters, t-(init.)=1.39 s t(norm)=0.341877, mflops=14.6252 (err=8.2e-17) 9. Nielsen: elapsed time t=1.11 s, 65536 iters, t-(init.)=1.05 s t(norm)=1.03301, mflops=4.84023 (err=5.7e-16) 10. 
Singleton: elapsed time t=1.23 s, 131072 iters, t-(init.)=1.18 s t(norm)=0.580452, mflops=8.61397 (err=1.2e-16) 11. Singleton (f2c): elapsed time t=1.15 s, 131072 iters, t-(init.)=1.11 s t(norm)=0.546019, mflops=9.15719 (err=1.2e-16) 12. Temperton: elapsed time t=1.19 s, 131072 iters, t-(init.)=1.06 s t(norm)=0.521423, mflops=9.58914 (err=3.9e-09) 13. Temperton (f2c): elapsed time t=1.14 s, 131072 iters, t-(init.)=1.08 s t(norm)=0.531261, mflops=9.41156 (err=2.1e-16) 14. Valkenburg: elapsed time t=1.22 s, 65536 iters, t-(init.)=1.14 s t(norm)=1.12155, mflops=4.45811 (err=2.4e-16) Top mflops for N=6 = 101.645 Normalized results and averages for N=6: fft 0: mflops = -1 (norm. = -0.00983818), norm. avg. (of 0) = -1 fft 1: mflops = 12.9484 (norm. = 0.127389), norm. avg. (of 1) = 0.127389 fft 2: mflops = 5.77528 (norm. = 0.0568182), norm. avg. (of 1) = 0.0568182 fft 3: mflops = 24.7914 (norm. = 0.243902), norm. avg. (of 1) = 0.243902 fft 4: mflops = 17.8324 (norm. = 0.175439), norm. avg. (of 1) = 0.175439 fft 5: mflops = 96.8046 (norm. = 0.952381), norm. avg. (of 1) = 0.952381 fft 6: mflops = 101.645 (norm. = 1), norm. avg. (of 1) = 1 fft 7: mflops = 13.8292 (norm. = 0.136054), norm. avg. (of 1) = 0.136054 fft 8: mflops = 14.6252 (norm. = 0.143885), norm. avg. (of 1) = 0.143885 fft 9: mflops = 4.84023 (norm. = 0.047619), norm. avg. (of 1) = 0.047619 fft 10: mflops = 8.61397 (norm. = 0.0847458), norm. avg. (of 1) = 0.0847458 fft 11: mflops = 9.15719 (norm. = 0.0900901), norm. avg. (of 1) = 0.0900901 fft 12: mflops = 9.58914 (norm. = 0.0943396), norm. avg. (of 1) = 0.0943396 fft 13: mflops = 9.41156 (norm. = 0.0925926), norm. avg. (of 1) = 0.0925926 fft 14: mflops = 4.45811 (norm. = 0.0438596), norm. avg. (of 1) = 0.0438596 Benchmarking for array size = 9: 0. Brenner: elapsed time t=1.16 s, 32768 iters, t-(init.)=1.11 s t(norm)=1.18736, mflops=4.21103 (err=4.8e-16) 1. CWP (min N): elapsed time t=1.29 s, 131072 iters, t-(init.)=1.09 s t(norm)=0.291491, mflops=17.1532 2. 
CWP (best N) (N=15): elapsed time t=1.05 s, 65536 iters, t-(init.)=0.95 s t(norm)=0.508103, mflops=9.84051 3. FFTPACK: elapsed time t=1.57 s, 262144 iters, t-(init.)=1.19 s t(norm)=0.159117, mflops=31.4235 (err=1.1e-16) 4. FFTPACK (f2c): elapsed time t=1.29 s, 131072 iters, t-(init.)=1.15 s t(norm)=0.307536, mflops=16.2582 (err=2.5e-16) FFTW_MEASURE plan: (cost = 3.967285e-06) FFTW_NOTW 9 5. FFTW: elapsed time t=1.07 s, 262144 iters, t-(init.)=0.67 s t(norm)=0.0895867, mflops=55.8119 (err=2.1e-16) FFTW_ESTIMATE plan: (cost = 4.851000e+02) FFTW_NOTW 9 6. FFTW_ESTIMATE: elapsed time t=1.06 s, 262144 iters, t-(init.)=0.6 s t(norm)=0.0802269, mflops=62.3233 (err=2.1e-16) 7. Frigo-old: elapsed time t=1.01 s, 65536 iters, t-(init.)=0.98 s t(norm)=0.524149, mflops=9.53927 (err=3.1e-16) 8. GSL: elapsed time t=1.48 s, 131072 iters, t-(init.)=1.34 s t(norm)=0.358347, mflops=13.953 (err=1.7e-16) 9. Nielsen: elapsed time t=1.45 s, 65536 iters, t-(init.)=1.34 s t(norm)=0.716693, mflops=6.97648 (err=9.6e-16) 10. Singleton: elapsed time t=1.42 s, 131072 iters, t-(init.)=1.19 s t(norm)=0.318233, mflops=15.7117 (err=1.5e-16) 11. Singleton (f2c): elapsed time t=1.39 s, 131072 iters, t-(init.)=1.33 s t(norm)=0.355672, mflops=14.0579 (err=1.5e-16) 12. Temperton: elapsed time t=1.58 s, 131072 iters, t-(init.)=1.54 s t(norm)=0.411831, mflops=12.1409 (err=1.9e-08) 13. Temperton (f2c): elapsed time t=1.39 s, 131072 iters, t-(init.)=1.17 s t(norm)=0.312885, mflops=15.9803 (err=1.3e-16) 14. Valkenburg: elapsed time t=1.14 s, 32768 iters, t-(init.)=1.08 s t(norm)=1.15527, mflops=4.328 (err=4.0e-16) Top mflops for N=9 = 62.3233 Normalized results and averages for N=9: fft 0: mflops = 4.21103 (norm. = 0.0675676), norm. avg. (of 1) = 0.0675676 fft 1: mflops = 17.1532 (norm. = 0.275229), norm. avg. (of 2) = 0.201309 fft 2: mflops = 9.84051 (norm. = 0.157895), norm. avg. (of 2) = 0.107356 fft 3: mflops = 31.4235 (norm. = 0.504202), norm. avg. (of 2) = 0.374052 fft 4: mflops = 16.2582 (norm. 
= 0.26087), norm. avg. (of 2) = 0.218154 fft 5: mflops = 55.8119 (norm. = 0.895522), norm. avg. (of 2) = 0.923952 fft 6: mflops = 62.3233 (norm. = 1), norm. avg. (of 2) = 1 fft 7: mflops = 9.53927 (norm. = 0.153061), norm. avg. (of 2) = 0.144558 fft 8: mflops = 13.953 (norm. = 0.223881), norm. avg. (of 2) = 0.183883 fft 9: mflops = 6.97648 (norm. = 0.11194), norm. avg. (of 2) = 0.0797797 fft 10: mflops = 15.7117 (norm. = 0.252101), norm. avg. (of 2) = 0.168423 fft 11: mflops = 14.0579 (norm. = 0.225564), norm. avg. (of 2) = 0.157827 fft 12: mflops = 12.1409 (norm. = 0.194805), norm. avg. (of 2) = 0.144572 fft 13: mflops = 15.9803 (norm. = 0.25641), norm. avg. (of 2) = 0.174501 fft 14: mflops = 4.328 (norm. = 0.0694444), norm. avg. (of 2) = 0.056652 Benchmarking for array size = 12: 0. Skipping fft (Brenner has a bug for N=3*2^m). 1. CWP (min N): elapsed time t=1.45 s, 131072 iters, t-(init.)=1.18 s t(norm)=0.20927, mflops=23.8926 2. CWP (best N) (N=15): elapsed time t=1.05 s, 65536 iters, t-(init.)=0.91 s t(norm)=0.322772, mflops=15.4908 3. FFTPACK: elapsed time t=1.8 s, 262144 iters, t-(init.)=1.22 s t(norm)=0.108182, mflops=46.2185 (err=2.0e-16) 4. FFTPACK (f2c): elapsed time t=1.63 s, 131072 iters, t-(init.)=1.36 s t(norm)=0.241192, mflops=20.7304 (err=2.6e-16) FFTW_MEASURE plan: (cost = 4.730225e-06) FFTW_NOTW 12 5. FFTW: elapsed time t=1.25 s, 262144 iters, t-(init.)=0.68 s t(norm)=0.060298, mflops=82.9214 (err=1.4e-16) FFTW_ESTIMATE plan: (cost = 4.920000e+02) FFTW_NOTW 12 6. FFTW_ESTIMATE: elapsed time t=1.24 s, 262144 iters, t-(init.)=0.72 s t(norm)=0.063845, mflops=78.3147 (err=1.4e-16) 7. Frigo-old: elapsed time t=1.96 s, 131072 iters, t-(init.)=1.75 s t(norm)=0.310358, mflops=16.1105 (err=2.9e-16) 8. GSL: elapsed time t=1.81 s, 131072 iters, t-(init.)=1.53 s t(norm)=0.271341, mflops=18.427 (err=3.2e-16) 9. Nielsen: elapsed time t=1.66 s, 65536 iters, t-(init.)=1.53 s t(norm)=0.542682, mflops=9.21349 (err=6.0e-16) 10. 
Singleton: elapsed time t=1.02 s, 65536 iters, t-(init.)=0.99 s t(norm)=0.351147, mflops=14.239 (err=2.5e-16) 11. Singleton (f2c): elapsed time t=1.84 s, 131072 iters, t-(init.)=1.56 s t(norm)=0.276662, mflops=18.0726 (err=2.5e-16) 12. Temperton: elapsed time t=1.8 s, 131072 iters, t-(init.)=1.59 s t(norm)=0.281982, mflops=17.7316 (err=8.2e-09) 13. Temperton (f2c): elapsed time t=1.76 s, 131072 iters, t-(init.)=1.49 s t(norm)=0.264247, mflops=18.9217 (err=1.7e-16) 14. Valkenburg: elapsed time t=1.67 s, 32768 iters, t-(init.)=1.66 s t(norm)=1.17759, mflops=4.24598 (err=4.2e-16) Top mflops for N=12 = 82.9214 Normalized results and averages for N=12: fft 0: mflops = -1 (norm. = -0.0120596), norm. avg. (of 1) = 0.0675676 fft 1: mflops = 23.8926 (norm. = 0.288136), norm. avg. (of 3) = 0.230251 fft 2: mflops = 15.4908 (norm. = 0.186813), norm. avg. (of 3) = 0.133842 fft 3: mflops = 46.2185 (norm. = 0.557377), norm. avg. (of 3) = 0.43516 fft 4: mflops = 20.7304 (norm. = 0.25), norm. avg. (of 3) = 0.228769 fft 5: mflops = 82.9214 (norm. = 1), norm. avg. (of 3) = 0.949301 fft 6: mflops = 78.3147 (norm. = 0.944444), norm. avg. (of 3) = 0.981481 fft 7: mflops = 16.1105 (norm. = 0.194286), norm. avg. (of 3) = 0.161134 fft 8: mflops = 18.427 (norm. = 0.222222), norm. avg. (of 3) = 0.196663 fft 9: mflops = 9.21349 (norm. = 0.111111), norm. avg. (of 3) = 0.0902235 fft 10: mflops = 14.239 (norm. = 0.171717), norm. avg. (of 3) = 0.169521 fft 11: mflops = 18.0726 (norm. = 0.217949), norm. avg. (of 3) = 0.177868 fft 12: mflops = 17.7316 (norm. = 0.213836), norm. avg. (of 3) = 0.16766 fft 13: mflops = 18.9217 (norm. = 0.228188), norm. avg. (of 3) = 0.192397 fft 14: mflops = 4.24598 (norm. = 0.0512048), norm. avg. (of 3) = 0.0548363 Benchmarking for array size = 15: 0. Brenner: elapsed time t=1.69 s, 32768 iters, t-(init.)=1.66 s t(norm)=0.864442, mflops=5.78408 (err=4.1e-16) 1. CWP (min N): elapsed time t=1.06 s, 65536 iters, t-(init.)=0.92 s t(norm)=0.239544, mflops=20.873 2. 
CWP (best N): elapsed time t=1.05 s, 65536 iters, t-(init.)=1 s t(norm)=0.260374, mflops=19.2031 3. FFTPACK: elapsed time t=1.37 s, 131072 iters, t-(init.)=1.02 s t(norm)=0.132791, mflops=37.6532 (err=1.4e-16) 4. FFTPACK (f2c): elapsed time t=1.19 s, 65536 iters, t-(init.)=1.01 s t(norm)=0.262978, mflops=19.013 (err=3.0e-16) FFTW_MEASURE plan: (cost = 7.629395e-06) FFTW_NOTW 15 5. FFTW: elapsed time t=1.03 s, 131072 iters, t-(init.)=0.71 s t(norm)=0.0924328, mflops=54.0934 (err=1.9e-16) FFTW_ESTIMATE plan: (cost = 4.485000e+02) FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.03 s, 131072 iters, t-(init.)=0.78 s t(norm)=0.101546, mflops=49.2388 (err=1.9e-16) 7. Frigo-old: elapsed time t=1.85 s, 65536 iters, t-(init.)=1.81 s t(norm)=0.471277, mflops=10.6095 (err=2.6e-16) 8. GSL: elapsed time t=1.24 s, 65536 iters, t-(init.)=1.06 s t(norm)=0.275996, mflops=18.1162 (err=1.4e-16) 9. Nielsen: elapsed time t=1.05 s, 32768 iters, t-(init.)=0.99 s t(norm)=0.51554, mflops=9.69856 (err=4.3e-15) 10. Singleton: elapsed time t=1.37 s, 65536 iters, t-(init.)=1.34 s t(norm)=0.348901, mflops=14.3307 (err=2.2e-16) 11. Singleton (f2c): elapsed time t=1.35 s, 65536 iters, t-(init.)=1.19 s t(norm)=0.309845, mflops=16.1371 (err=2.2e-16) 12. Temperton: elapsed time t=1.15 s, 65536 iters, t-(init.)=1.02 s t(norm)=0.265581, mflops=18.8266 (err=1.0e-08) 13. Temperton (f2c): elapsed time t=1.12 s, 65536 iters, t-(init.)=1.09 s t(norm)=0.283808, mflops=17.6176 (err=1.8e-16) 14. Valkenburg: elapsed time t=1.3 s, 16384 iters, t-(init.)=1.29 s t(norm)=1.34353, mflops=3.72154 (err=2.2e-16) Top mflops for N=15 = 54.0934 Normalized results and averages for N=15: fft 0: mflops = 5.78408 (norm. = 0.106928), norm. avg. (of 2) = 0.0872476 fft 1: mflops = 20.873 (norm. = 0.38587), norm. avg. (of 4) = 0.269156 fft 2: mflops = 19.2031 (norm. = 0.355), norm. avg. (of 4) = 0.189132 fft 3: mflops = 37.6532 (norm. = 0.696078), norm. avg. (of 4) = 0.50039 fft 4: mflops = 19.013 (norm. = 0.351485), norm. avg. 
(of 4) = 0.259448 fft 5: mflops = 54.0934 (norm. = 1), norm. avg. (of 4) = 0.961976 fft 6: mflops = 49.2388 (norm. = 0.910256), norm. avg. (of 4) = 0.963675 fft 7: mflops = 10.6095 (norm. = 0.196133), norm. avg. (of 4) = 0.169883 fft 8: mflops = 18.1162 (norm. = 0.334906), norm. avg. (of 4) = 0.231223 fft 9: mflops = 9.69856 (norm. = 0.179293), norm. avg. (of 4) = 0.112491 fft 10: mflops = 14.3307 (norm. = 0.264925), norm. avg. (of 4) = 0.193372 fft 11: mflops = 16.1371 (norm. = 0.298319), norm. avg. (of 4) = 0.207981 fft 12: mflops = 18.8266 (norm. = 0.348039), norm. avg. (of 4) = 0.212755 fft 13: mflops = 17.6176 (norm. = 0.325688), norm. avg. (of 4) = 0.22572 fft 14: mflops = 3.72154 (norm. = 0.0687984), norm. avg. (of 4) = 0.0583268 Benchmarking for array size = 18: 0. Brenner: elapsed time t=1.19 s, 16384 iters, t-(init.)=1.18 s t(norm)=0.959536, mflops=5.21085 (err=4.1e-16) 1. CWP (min N): elapsed time t=1.28 s, 65536 iters, t-(init.)=1.08 s t(norm)=0.219555, mflops=22.7734 2. CWP (best N) (N=28): elapsed time t=1.84 s, 65536 iters, t-(init.)=1.49 s t(norm)=0.302904, mflops=16.5069 3. FFTPACK: elapsed time t=1 s, 65536 iters, t-(init.)=0.96 s t(norm)=0.19516, mflops=25.62 (err=2.5e-16) 4. FFTPACK (f2c): elapsed time t=1.54 s, 65536 iters, t-(init.)=1.39 s t(norm)=0.282575, mflops=17.6944 (err=2.8e-16) FFTW_MEASURE plan: (cost = 8.850098e-06) FFTW_TWIDDLE 3 FFTW_NOTW 6 5. FFTW: elapsed time t=1.18 s, 131072 iters, t-(init.)=0.84 s t(norm)=0.0853824, mflops=58.56 (err=2.0e-16) FFTW_ESTIMATE plan: (cost = 1.168200e+03) FFTW_TWIDDLE 2 FFTW_NOTW 9 6. FFTW_ESTIMATE: elapsed time t=1.25 s, 131072 iters, t-(init.)=0.92 s t(norm)=0.0935141, mflops=53.4679 (err=2.2e-16) 7. Frigo-old: elapsed time t=1.11 s, 32768 iters, t-(init.)=1.09 s t(norm)=0.443176, mflops=11.2822 (err=3.5e-16) 8. GSL: elapsed time t=1.46 s, 65536 iters, t-(init.)=1.29 s t(norm)=0.262246, mflops=19.0661 (err=2.1e-16) 9. 
Nielsen: elapsed time t=1.54 s, 32768 iters, t-(init.)=1.47 s t(norm)=0.597677, mflops=8.36572 (err=8.7e-16) 10. Singleton: elapsed time t=1.45 s, 65536 iters, t-(init.)=1.33 s t(norm)=0.270378, mflops=18.4926 (err=2.1e-16) 11. Singleton (f2c): elapsed time t=1.37 s, 65536 iters, t-(init.)=1.15 s t(norm)=0.233785, mflops=21.3871 (err=2.1e-16) 12. Temperton: elapsed time t=1.51 s, 65536 iters, t-(init.)=1.38 s t(norm)=0.280542, mflops=17.8226 (err=4.5e-08) 13. Temperton (f2c): elapsed time t=1.46 s, 65536 iters, t-(init.)=1.28 s t(norm)=0.260213, mflops=19.215 (err=2.6e-16) 14. Valkenburg: elapsed time t=1.43 s, 16384 iters, t-(init.)=1.38 s t(norm)=1.12217, mflops=4.45566 (err=4.2e-16) Top mflops for N=18 = 58.56 Normalized results and averages for N=18: fft 0: mflops = 5.21085 (norm. = 0.0889831), norm. avg. (of 3) = 0.0878261 fft 1: mflops = 22.7734 (norm. = 0.388889), norm. avg. (of 5) = 0.293102 fft 2: mflops = 16.5069 (norm. = 0.281879), norm. avg. (of 5) = 0.207681 fft 3: mflops = 25.62 (norm. = 0.4375), norm. avg. (of 5) = 0.487812 fft 4: mflops = 17.6944 (norm. = 0.302158), norm. avg. (of 5) = 0.26799 fft 5: mflops = 58.56 (norm. = 1), norm. avg. (of 5) = 0.969581 fft 6: mflops = 53.4679 (norm. = 0.913043), norm. avg. (of 5) = 0.953549 fft 7: mflops = 11.2822 (norm. = 0.192661), norm. avg. (of 5) = 0.174439 fft 8: mflops = 19.0661 (norm. = 0.325581), norm. avg. (of 5) = 0.250095 fft 9: mflops = 8.36572 (norm. = 0.142857), norm. avg. (of 5) = 0.118564 fft 10: mflops = 18.4926 (norm. = 0.315789), norm. avg. (of 5) = 0.217856 fft 11: mflops = 21.3871 (norm. = 0.365217), norm. avg. (of 5) = 0.239428 fft 12: mflops = 17.8226 (norm. = 0.304348), norm. avg. (of 5) = 0.231074 fft 13: mflops = 19.215 (norm. = 0.328125), norm. avg. (of 5) = 0.246201 fft 14: mflops = 4.45566 (norm. = 0.076087), norm. avg. (of 5) = 0.0618789 Benchmarking for array size = 24: 0. Skipping fft (Brenner has a bug for N=3*2^m). 1. 
CWP (min N): elapsed time t=1.44 s, 65536 iters, t-(init.)=1.27 s t(norm)=0.176107, mflops=28.3918 2. CWP (best N) (N=28): elapsed time t=1.86 s, 65536 iters, t-(init.)=1.55 s t(norm)=0.214934, mflops=23.263 3. FFTPACK: elapsed time t=1.2 s, 65536 iters, t-(init.)=1.13 s t(norm)=0.156694, mflops=31.9094 (err=1.7e-16) 4. FFTPACK (f2c): elapsed time t=1.99 s, 65536 iters, t-(init.)=1.82 s t(norm)=0.252374, mflops=19.8119 (err=2.5e-16) FFTW_MEASURE plan: (cost = 1.068115e-05) FFTW_TWIDDLE 2 FFTW_NOTW 12 5. FFTW: elapsed time t=1.41 s, 131072 iters, t-(init.)=0.89 s t(norm)=0.0617068, mflops=81.0283 (err=2.1e-16) FFTW_ESTIMATE plan: (cost = 1.248000e+03) FFTW_TWIDDLE 2 FFTW_NOTW 12 6. FFTW_ESTIMATE: elapsed time t=1.4 s, 131072 iters, t-(init.)=0.97 s t(norm)=0.0672535, mflops=74.3456 (err=2.1e-16) 7. Frigo-old: elapsed time t=1.9 s, 65536 iters, t-(init.)=1.77 s t(norm)=0.245441, mflops=20.3715 (err=3.8e-16) 8. GSL: elapsed time t=1.78 s, 65536 iters, t-(init.)=1.58 s t(norm)=0.219094, mflops=22.8213 (err=2.1e-16) 9. Nielsen: elapsed time t=1.52 s, 32768 iters, t-(init.)=1.37 s t(norm)=0.379948, mflops=13.1597 (err=1.7e-15) 10. Singleton: elapsed time t=1.1 s, 32768 iters, t-(init.)=1.08 s t(norm)=0.299521, mflops=16.6933 (err=2.1e-16) 11. Singleton (f2c): elapsed time t=1.93 s, 65536 iters, t-(init.)=1.65 s t(norm)=0.228801, mflops=21.8531 (err=2.1e-16) 12. Temperton: elapsed time t=1.87 s, 65536 iters, t-(init.)=1.83 s t(norm)=0.253761, mflops=19.7036 (err=8.0e-09) 13. Temperton (f2c): elapsed time t=1.65 s, 65536 iters, t-(init.)=1.56 s t(norm)=0.21632, mflops=23.1139 (err=2.4e-16) 14. Valkenburg: elapsed time t=1.03 s, 8192 iters, t-(init.)=1.02 s t(norm)=1.13152, mflops=4.41883 (err=6.1e-16) Top mflops for N=24 = 81.0283 Normalized results and averages for N=24: fft 0: mflops = -1 (norm. = -0.0123414), norm. avg. (of 3) = 0.0878261 fft 1: mflops = 28.3918 (norm. = 0.350394), norm. avg. (of 6) = 0.302651 fft 2: mflops = 23.263 (norm. = 0.287097), norm. avg. 
(of 6) = 0.220917 fft 3: mflops = 31.9094 (norm. = 0.393805), norm. avg. (of 6) = 0.472144 fft 4: mflops = 19.8119 (norm. = 0.244505), norm. avg. (of 6) = 0.264076 fft 5: mflops = 81.0283 (norm. = 1), norm. avg. (of 6) = 0.974651 fft 6: mflops = 74.3456 (norm. = 0.917526), norm. avg. (of 6) = 0.947545 fft 7: mflops = 20.3715 (norm. = 0.251412), norm. avg. (of 6) = 0.187268 fft 8: mflops = 22.8213 (norm. = 0.281646), norm. avg. (of 6) = 0.255353 fft 9: mflops = 13.1597 (norm. = 0.162409), norm. avg. (of 6) = 0.125872 fft 10: mflops = 16.6933 (norm. = 0.206019), norm. avg. (of 6) = 0.215883 fft 11: mflops = 21.8531 (norm. = 0.269697), norm. avg. (of 6) = 0.244473 fft 12: mflops = 19.7036 (norm. = 0.243169), norm. avg. (of 6) = 0.23309 fft 13: mflops = 23.1139 (norm. = 0.285256), norm. avg. (of 6) = 0.25271 fft 14: mflops = 4.41883 (norm. = 0.0545343), norm. avg. (of 6) = 0.0606548 Benchmarking for array size = 36: 0. Brenner: elapsed time t=1.18 s, 8192 iters, t-(init.)=1.17 s t(norm)=0.767378, mflops=6.5157 (err=1.4e-15) 1. CWP (min N): elapsed time t=1.22 s, 32768 iters, t-(init.)=1.04 s t(norm)=0.170528, mflops=29.3206 2. CWP (best N): elapsed time t=1.22 s, 32768 iters, t-(init.)=1.02 s t(norm)=0.167249, mflops=29.8955 3. FFTPACK: elapsed time t=1.91 s, 65536 iters, t-(init.)=1.49 s t(norm)=0.122157, mflops=40.9308 (err=4.3e-16) 4. FFTPACK (f2c): elapsed time t=1.78 s, 32768 iters, t-(init.)=1.58 s t(norm)=0.259072, mflops=19.2997 (err=1.1e-15) FFTW_MEASURE plan: (cost = 1.770020e-05) FFTW_TWIDDLE 3 FFTW_NOTW 12 5. FFTW: elapsed time t=1.18 s, 65536 iters, t-(init.)=0.8 s t(norm)=0.0655878, mflops=76.2336 (err=6.2e-16) FFTW_ESTIMATE plan: (cost = 1.803600e+03) FFTW_TWIDDLE 3 FFTW_NOTW 12 6. FFTW_ESTIMATE: elapsed time t=1.18 s, 65536 iters, t-(init.)=0.82 s t(norm)=0.0672275, mflops=74.3743 (err=6.2e-16) 7. Frigo-old: elapsed time t=1.2 s, 16384 iters, t-(init.)=1.11 s t(norm)=0.364012, mflops=13.7358 (err=6.6e-16) 8. 
GSL: elapsed time t=1.59 s, 32768 iters, t-(init.)=1.38 s t(norm)=0.226278, mflops=22.0967 (err=4.2e-16) 9. Nielsen: elapsed time t=1.33 s, 16384 iters, t-(init.)=1.25 s t(norm)=0.409924, mflops=12.1974 (err=1.6e-15) 10. Singleton: elapsed time t=1.41 s, 32768 iters, t-(init.)=1.23 s t(norm)=0.201683, mflops=24.7914 (err=4.2e-16) 11. Singleton (f2c): elapsed time t=1.41 s, 32768 iters, t-(init.)=1.19 s t(norm)=0.195124, mflops=25.6248 (err=4.1e-16) 12. Temperton: elapsed time t=1.4 s, 32768 iters, t-(init.)=1.37 s t(norm)=0.224638, mflops=22.258 (err=6.4e-08) 13. Temperton (f2c): elapsed time t=1.37 s, 32768 iters, t-(init.)=1.25 s t(norm)=0.204962, mflops=24.3948 (err=3.5e-16) 14. Valkenburg: elapsed time t=1.74 s, 8192 iters, t-(init.)=1.73 s t(norm)=1.13467, mflops=4.40657 (err=8.4e-16) Top mflops for N=36 = 76.2336 Normalized results and averages for N=36: fft 0: mflops = 6.5157 (norm. = 0.0854701), norm. avg. (of 4) = 0.0872371 fft 1: mflops = 29.3206 (norm. = 0.384615), norm. avg. (of 7) = 0.31436 fft 2: mflops = 29.8955 (norm. = 0.392157), norm. avg. (of 7) = 0.24538 fft 3: mflops = 40.9308 (norm. = 0.536913), norm. avg. (of 7) = 0.481397 fft 4: mflops = 19.2997 (norm. = 0.253165), norm. avg. (of 7) = 0.262517 fft 5: mflops = 76.2336 (norm. = 1), norm. avg. (of 7) = 0.978272 fft 6: mflops = 74.3743 (norm. = 0.97561), norm. avg. (of 7) = 0.951554 fft 7: mflops = 13.7358 (norm. = 0.18018), norm. avg. (of 7) = 0.186255 fft 8: mflops = 22.0967 (norm. = 0.289855), norm. avg. (of 7) = 0.260282 fft 9: mflops = 12.1974 (norm. = 0.16), norm. avg. (of 7) = 0.130747 fft 10: mflops = 24.7914 (norm. = 0.325203), norm. avg. (of 7) = 0.2315 fft 11: mflops = 25.6248 (norm. = 0.336134), norm. avg. (of 7) = 0.257567 fft 12: mflops = 22.258 (norm. = 0.291971), norm. avg. (of 7) = 0.241501 fft 13: mflops = 24.3948 (norm. = 0.32), norm. avg. (of 7) = 0.262323 fft 14: mflops = 4.40657 (norm. = 0.0578035), norm. avg. (of 7) = 0.0602474 Benchmarking for array size = 80: 0. 
Brenner: elapsed time t=1.98 s, 8192 iters, t-(init.)=1.96 s t(norm)=0.473071, mflops=10.5692 (err=5.1e-16) 1. CWP (min N): elapsed time t=1.47 s, 16384 iters, t-(init.)=1.32 s t(norm)=0.1593, mflops=31.3874 2. CWP (best N) (N=84): elapsed time t=1.54 s, 16384 iters, t-(init.)=1.48 s t(norm)=0.178609, mflops=27.9942 3. FFTPACK: elapsed time t=1.16 s, 16384 iters, t-(init.)=0.96 s t(norm)=0.115854, mflops=43.1577 (err=4.3e-16) 4. FFTPACK (f2c): elapsed time t=1.18 s, 8192 iters, t-(init.)=1.16 s t(norm)=0.279981, mflops=17.8584 (err=4.7e-16) FFTW_MEASURE plan: (cost = 5.615234e-05) FFTW_TWIDDLE 10 FFTW_NOTW 8 5. FFTW: elapsed time t=1.89 s, 32768 iters, t-(init.)=1.53 s t(norm)=0.0923213, mflops=54.1587 (err=3.7e-16) FFTW_ESTIMATE plan: (cost = 2.600000e+03) FFTW_TWIDDLE 5 FFTW_NOTW 16 6. FFTW_ESTIMATE: elapsed time t=1.83 s, 32768 iters, t-(init.)=1.33 s t(norm)=0.0802532, mflops=62.3028 (err=4.6e-16) 7. Frigo-old: elapsed time t=1.17 s, 8192 iters, t-(init.)=1.09 s t(norm)=0.263086, mflops=19.0052 (err=3.3e-16) 8. GSL: elapsed time t=1.77 s, 16384 iters, t-(init.)=1.53 s t(norm)=0.184643, mflops=27.0793 (err=4.1e-16) 9. Nielsen: elapsed time t=1.03 s, 8192 iters, t-(init.)=0.94 s t(norm)=0.226881, mflops=22.038 (err=8.1e-15) 10. Singleton: elapsed time t=1.57 s, 16384 iters, t-(init.)=1.38 s t(norm)=0.16654, mflops=30.0227 (err=4.3e-16) 11. Singleton (f2c): elapsed time t=1.62 s, 16384 iters, t-(init.)=1.57 s t(norm)=0.18947, mflops=26.3894 (err=3.5e-16) 12. Temperton: elapsed time t=1.69 s, 16384 iters, t-(init.)=1.47 s t(norm)=0.177402, mflops=28.1846 (err=1.7e-07) 13. Temperton (f2c): elapsed time t=1.68 s, 16384 iters, t-(init.)=1.62 s t(norm)=0.195504, mflops=25.5749 (err=4.0e-16) 14. Valkenburg: elapsed time t=1.26 s, 2048 iters, t-(init.)=1.25 s t(norm)=1.20681, mflops=4.14314 (err=5.4e-16) Top mflops for N=80 = 62.3028 Normalized results and averages for N=80: fft 0: mflops = 10.5692 (norm. = 0.169643), norm. avg. 
(of 5) = 0.103718 fft 1: mflops = 31.3874 (norm. = 0.503788), norm. avg. (of 8) = 0.338039 fft 2: mflops = 27.9942 (norm. = 0.449324), norm. avg. (of 8) = 0.270873 fft 3: mflops = 43.1577 (norm. = 0.692708), norm. avg. (of 8) = 0.507811 fft 4: mflops = 17.8584 (norm. = 0.286638), norm. avg. (of 8) = 0.265532 fft 5: mflops = 54.1587 (norm. = 0.869281), norm. avg. (of 8) = 0.964648 fft 6: mflops = 62.3028 (norm. = 1), norm. avg. (of 8) = 0.95761 fft 7: mflops = 19.0052 (norm. = 0.305046), norm. avg. (of 8) = 0.201104 fft 8: mflops = 27.0793 (norm. = 0.434641), norm. avg. (of 8) = 0.282077 fft 9: mflops = 22.038 (norm. = 0.353723), norm. avg. (of 8) = 0.158619 fft 10: mflops = 30.0227 (norm. = 0.481884), norm. avg. (of 8) = 0.262798 fft 11: mflops = 26.3894 (norm. = 0.423567), norm. avg. (of 8) = 0.278317 fft 12: mflops = 28.1846 (norm. = 0.452381), norm. avg. (of 8) = 0.267861 fft 13: mflops = 25.5749 (norm. = 0.410494), norm. avg. (of 8) = 0.280844 fft 14: mflops = 4.14314 (norm. = 0.0665), norm. avg. (of 8) = 0.061029 Benchmarking for array size = 108: 0. Brenner: elapsed time t=1.06 s, 2048 iters, t-(init.)=1.05 s t(norm)=0.702777, mflops=7.11463 (err=8.7e-16) 1. CWP (min N) (N=110): elapsed time t=1.3 s, 8192 iters, t-(init.)=1.17 s t(norm)=0.195774, mflops=25.5397 2. CWP (best N) (N=112): elapsed time t=1.09 s, 8192 iters, t-(init.)=0.97 s t(norm)=0.162308, mflops=30.8056 3. FFTPACK: elapsed time t=1.71 s, 16384 iters, t-(init.)=1.47 s t(norm)=0.122986, mflops=40.655 (err=3.4e-16) 4. FFTPACK (f2c): elapsed time t=1.66 s, 8192 iters, t-(init.)=1.56 s t(norm)=0.261031, mflops=19.1548 (err=7.1e-16) FFTW_MEASURE plan: (cost = 8.056641e-05) FFTW_TWIDDLE 9 FFTW_NOTW 12 5. FFTW: elapsed time t=1.29 s, 16384 iters, t-(init.)=0.97 s t(norm)=0.081154, mflops=61.6113 (err=3.0e-16) FFTW_ESTIMATE plan: (cost = 4.633200e+03) FFTW_TWIDDLE 9 FFTW_NOTW 12 6. FFTW_ESTIMATE: elapsed time t=1.3 s, 16384 iters, t-(init.)=1.03 s t(norm)=0.0861738, mflops=58.0223 (err=3.0e-16) 7. 
Frigo-old: elapsed time t=1.27 s, 4096 iters, t-(init.)=1.23 s t(norm)=0.411626, mflops=12.1469 (err=5.6e-16) 8. GSL: elapsed time t=1.42 s, 8192 iters, t-(init.)=1.33 s t(norm)=0.222546, mflops=22.4673 (err=3.2e-16) 9. Nielsen: elapsed time t=1.16 s, 4096 iters, t-(init.)=1.11 s t(norm)=0.371468, mflops=13.4601 (err=1.2e-15) 10. Singleton: elapsed time t=1.26 s, 8192 iters, t-(init.)=1.18 s t(norm)=0.197447, mflops=25.3233 (err=3.3e-16) 11. Singleton (f2c): elapsed time t=1.23 s, 8192 iters, t-(init.)=1.21 s t(norm)=0.202467, mflops=24.6954 (err=3.3e-16) 12. Temperton: elapsed time t=1.23 s, 8192 iters, t-(init.)=1.18 s t(norm)=0.197447, mflops=25.3233 (err=1.0e-07) 13. Temperton (f2c): elapsed time t=1.24 s, 8192 iters, t-(init.)=1.22 s t(norm)=0.20414, mflops=24.493 (err=3.1e-16) 14. Valkenburg: elapsed time t=1.69 s, 2048 iters, t-(init.)=1.67 s t(norm)=1.11775, mflops=4.47327 (err=6.6e-16) Top mflops for N=108 = 61.6113 Normalized results and averages for N=108: fft 0: mflops = 7.11463 (norm. = 0.115476), norm. avg. (of 6) = 0.105678 fft 1: mflops = 25.5397 (norm. = 0.41453), norm. avg. (of 9) = 0.346538 fft 2: mflops = 30.8056 (norm. = 0.5), norm. avg. (of 9) = 0.296331 fft 3: mflops = 40.655 (norm. = 0.659864), norm. avg. (of 9) = 0.524706 fft 4: mflops = 19.1548 (norm. = 0.310897), norm. avg. (of 9) = 0.270573 fft 5: mflops = 61.6113 (norm. = 1), norm. avg. (of 9) = 0.968576 fft 6: mflops = 58.0223 (norm. = 0.941748), norm. avg. (of 9) = 0.955847 fft 7: mflops = 12.1469 (norm. = 0.197154), norm. avg. (of 9) = 0.200665 fft 8: mflops = 22.4673 (norm. = 0.364662), norm. avg. (of 9) = 0.291253 fft 9: mflops = 13.4601 (norm. = 0.218468), norm. avg. (of 9) = 0.165269 fft 10: mflops = 25.3233 (norm. = 0.411017), norm. avg. (of 9) = 0.279267 fft 11: mflops = 24.6954 (norm. = 0.400826), norm. avg. (of 9) = 0.291929 fft 12: mflops = 25.3233 (norm. = 0.411017), norm. avg. (of 9) = 0.283767 fft 13: mflops = 24.493 (norm. = 0.397541), norm. avg. 
(of 9) = 0.293811 fft 14: mflops = 4.47327 (norm. = 0.0726048), norm. avg. (of 9) = 0.0623152 Benchmarking for array size = 210: 0. Brenner: elapsed time t=1.82 s, 2048 iters, t-(init.)=1.81 s t(norm)=0.545552, mflops=9.16504 (err=6.2e-16) 1. CWP (min N): elapsed time t=1.24 s, 4096 iters, t-(init.)=1.12 s t(norm)=0.168789, mflops=29.6227 2. CWP (best N): elapsed time t=1.24 s, 4096 iters, t-(init.)=1.22 s t(norm)=0.18386, mflops=27.1946 3. FFTPACK: elapsed time t=1.36 s, 4096 iters, t-(init.)=1.33 s t(norm)=0.200437, mflops=24.9454 (err=3.7e-16) 4. FFTPACK (f2c): elapsed time t=1.06 s, 2048 iters, t-(init.)=1.04 s t(norm)=0.313466, mflops=15.9507 (err=4.2e-16) FFTW_MEASURE plan: (cost = 2.343750e-04) FFTW_TWIDDLE 2 FFTW_TWIDDLE 7 FFTW_NOTW 15 5. FFTW: elapsed time t=1.93 s, 8192 iters, t-(init.)=1.62 s t(norm)=0.122071, mflops=40.9598 (err=2.8e-16) FFTW_ESTIMATE plan: (cost = 9.324000e+03) FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.04 s, 4096 iters, t-(init.)=0.95 s t(norm)=0.14317, mflops=34.9236 (err=3.5e-16) 7. Frigo-old: elapsed time t=1.46 s, 2048 iters, t-(init.)=1.4 s t(norm)=0.421974, mflops=11.8491 (err=4.6e-16) 8. GSL: elapsed time t=1.86 s, 4096 iters, t-(init.)=1.83 s t(norm)=0.27579, mflops=18.1297 (err=3.6e-16) 9. Nielsen: elapsed time t=1.11 s, 2048 iters, t-(init.)=1.04 s t(norm)=0.313466, mflops=15.9507 (err=8.6e-15) 10. Singleton: elapsed time t=1.75 s, 4096 iters, t-(init.)=1.73 s t(norm)=0.260719, mflops=19.1777 (err=3.4e-16) 11. Singleton (f2c): elapsed time t=1.63 s, 4096 iters, t-(init.)=1.59 s t(norm)=0.239621, mflops=20.8663 (err=3.4e-16) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=1.19 s, 512 iters, t-(init.)=1.18 s t(norm)=1.42265, mflops=3.51456 (err=6.1e-16) Top mflops for N=210 = 40.9598 Normalized results and averages for N=210: fft 0: mflops = 9.16504 (norm. = 0.223757), norm. avg. 
(of 7) = 0.122546 fft 1: mflops = 29.6227 (norm. = 0.723214), norm. avg. (of 10) = 0.384205 fft 2: mflops = 27.1946 (norm. = 0.663934), norm. avg. (of 10) = 0.333092 fft 3: mflops = 24.9454 (norm. = 0.609023), norm. avg. (of 10) = 0.533137 fft 4: mflops = 15.9507 (norm. = 0.389423), norm. avg. (of 10) = 0.282458 fft 5: mflops = 40.9598 (norm. = 1), norm. avg. (of 10) = 0.971718 fft 6: mflops = 34.9236 (norm. = 0.852632), norm. avg. (of 10) = 0.945526 fft 7: mflops = 11.8491 (norm. = 0.289286), norm. avg. (of 10) = 0.209527 fft 8: mflops = 18.1297 (norm. = 0.442623), norm. avg. (of 10) = 0.30639 fft 9: mflops = 15.9507 (norm. = 0.389423), norm. avg. (of 10) = 0.187684 fft 10: mflops = 19.1777 (norm. = 0.468208), norm. avg. (of 10) = 0.298161 fft 11: mflops = 20.8663 (norm. = 0.509434), norm. avg. (of 10) = 0.31368 fft 12: mflops = -1 (norm. = -0.0244142), norm. avg. (of 9) = 0.283767 fft 13: mflops = -1 (norm. = -0.0244142), norm. avg. (of 9) = 0.293811 fft 14: mflops = 3.51456 (norm. = 0.0858051), norm. avg. (of 10) = 0.0646642 Benchmarking for array size = 504: 0. Brenner: elapsed time t=1.2 s, 512 iters, t-(init.)=1.19 s t(norm)=0.513691, mflops=9.73348 (err=6.6e-16) 1. CWP (min N): elapsed time t=1.65 s, 2048 iters, t-(init.)=1.57 s t(norm)=0.169432, mflops=29.5104 2. CWP (best N): elapsed time t=1.65 s, 2048 iters, t-(init.)=1.57 s t(norm)=0.169432, mflops=29.5104 3. FFTPACK: elapsed time t=1.93 s, 2048 iters, t-(init.)=1.8 s t(norm)=0.194253, mflops=25.7397 (err=4.6e-16) 4. FFTPACK (f2c): elapsed time t=1.6 s, 1024 iters, t-(init.)=1.54 s t(norm)=0.332388, mflops=15.0427 (err=6.0e-16) FFTW_MEASURE plan: (cost = 6.640625e-04) FFTW_TWIDDLE 3 FFTW_TWIDDLE 6 FFTW_TWIDDLE 2 FFTW_NOTW 14 5. FFTW: elapsed time t=1.35 s, 2048 iters, t-(init.)=1.16 s t(norm)=0.125185, mflops=39.9408 (err=4.5e-16) FFTW_ESTIMATE plan: (cost = 2.147040e+04) FFTW_TWIDDLE 6 FFTW_TWIDDLE 6 FFTW_NOTW 14 6. 
FFTW_ESTIMATE: elapsed time t=1.47 s, 2048 iters, t-(init.)=1.28 s t(norm)=0.138135, mflops=36.1964 (err=4.5e-16) 7. Frigo-old: elapsed time t=1.87 s, 1024 iters, t-(init.)=1.79 s t(norm)=0.386347, mflops=12.9417 (err=6.0e-16) 8. GSL: elapsed time t=1.26 s, 1024 iters, t-(init.)=1.2 s t(norm)=0.259004, mflops=19.3047 (err=5.7e-16) 9. Nielsen: elapsed time t=1.64 s, 1024 iters, t-(init.)=1.58 s t(norm)=0.341022, mflops=14.6618 (err=5.4e-15) 10. Singleton: elapsed time t=1.08 s, 1024 iters, t-(init.)=1.06 s t(norm)=0.228787, mflops=21.8544 (err=6.6e-16) 11. Singleton (f2c): elapsed time t=1.95 s, 2048 iters, t-(init.)=1.91 s t(norm)=0.206124, mflops=24.2573 (err=6.6e-16) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=1.58 s, 256 iters, t-(init.)=1.57 s t(norm)=1.35545, mflops=3.6888 (err=7.6e-16) Top mflops for N=504 = 39.9408 Normalized results and averages for N=504: fft 0: mflops = 9.73348 (norm. = 0.243697), norm. avg. (of 8) = 0.13769 fft 1: mflops = 29.5104 (norm. = 0.738854), norm. avg. (of 11) = 0.416446 fft 2: mflops = 29.5104 (norm. = 0.738854), norm. avg. (of 11) = 0.369979 fft 3: mflops = 25.7397 (norm. = 0.644444), norm. avg. (of 11) = 0.543256 fft 4: mflops = 15.0427 (norm. = 0.376623), norm. avg. (of 11) = 0.291018 fft 5: mflops = 39.9408 (norm. = 1), norm. avg. (of 11) = 0.974289 fft 6: mflops = 36.1964 (norm. = 0.90625), norm. avg. (of 11) = 0.941955 fft 7: mflops = 12.9417 (norm. = 0.324022), norm. avg. (of 11) = 0.219936 fft 8: mflops = 19.3047 (norm. = 0.483333), norm. avg. (of 11) = 0.322476 fft 9: mflops = 14.6618 (norm. = 0.367089), norm. avg. (of 11) = 0.203994 fft 10: mflops = 21.8544 (norm. = 0.54717), norm. avg. (of 11) = 0.320798 fft 11: mflops = 24.2573 (norm. = 0.60733), norm. avg. (of 11) = 0.340375 fft 12: mflops = -1 (norm. = -0.025037), norm. avg. (of 9) = 0.283767 fft 13: mflops = -1 (norm. = -0.025037), norm. avg. 
(of 9) = 0.293811 fft 14: mflops = 3.6888 (norm. = 0.0923567), norm. avg. (of 11) = 0.0671817 Benchmarking for array size = 1000: 0. Brenner: elapsed time t=1.54 s, 256 iters, t-(init.)=1.5 s t(norm)=0.587949, mflops=8.50414 (err=8.0e-16) 1. CWP (min N) (N=1001): elapsed time t=1.14 s, 512 iters, t-(init.)=1.07 s t(norm)=0.209702, mflops=23.8434 2. CWP (best N) (N=1008): elapsed time t=1.84 s, 1024 iters, t-(init.)=1.7 s t(norm)=0.166586, mflops=30.0146 3. FFTPACK: elapsed time t=1.21 s, 512 iters, t-(init.)=1.11 s t(norm)=0.217541, mflops=22.9842 (err=6.1e-16) 4. FFTPACK (f2c): elapsed time t=1.88 s, 512 iters, t-(init.)=1.78 s t(norm)=0.34885, mflops=14.3328 (err=7.8e-16) FFTW_MEASURE plan: (cost = 1.562500e-03) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_NOTW 10 5. FFTW: elapsed time t=1.73 s, 1024 iters, t-(init.)=1.54 s t(norm)=0.150907, mflops=33.133 (err=6.3e-16) FFTW_ESTIMATE plan: (cost = 5.220000e+04) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_NOTW 10 6. FFTW_ESTIMATE: elapsed time t=1.73 s, 1024 iters, t-(init.)=1.54 s t(norm)=0.150907, mflops=33.133 (err=6.3e-16) 7. Frigo-old: elapsed time t=1.08 s, 256 iters, t-(init.)=1.05 s t(norm)=0.411564, mflops=12.1488 (err=6.3e-16) 8. GSL: elapsed time t=1.51 s, 512 iters, t-(init.)=1.41 s t(norm)=0.276336, mflops=18.0939 (err=6.3e-16) 9. Nielsen: elapsed time t=1.39 s, 512 iters, t-(init.)=1.3 s t(norm)=0.254778, mflops=19.6249 (err=1.3e-14) 10. Singleton: elapsed time t=1.9 s, 1024 iters, t-(init.)=1.72 s t(norm)=0.168545, mflops=29.6656 (err=9.0e-16) 11. Singleton (f2c): elapsed time t=1.9 s, 1024 iters, t-(init.)=1.74 s t(norm)=0.170505, mflops=29.3246 (err=9.1e-16) 12. Temperton: elapsed time t=1.07 s, 512 iters, t-(init.)=1 s t(norm)=0.195983, mflops=25.5124 (err=1.1e-07) 13. Temperton (f2c): elapsed time t=1.14 s, 512 iters, t-(init.)=1.09 s t(norm)=0.213622, mflops=23.4059 (err=6.4e-16) 14. 
Valkenburg: elapsed time t=1.85 s, 128 iters, t-(init.)=1.83 s t(norm)=1.4346, mflops=3.4853 (err=7.2e-16) Top mflops for N=1000 = 33.133 Normalized results and averages for N=1000: fft 0: mflops = 8.50414 (norm. = 0.256667), norm. avg. (of 9) = 0.15091 fft 1: mflops = 23.8434 (norm. = 0.719626), norm. avg. (of 12) = 0.441711 fft 2: mflops = 30.0146 (norm. = 0.905882), norm. avg. (of 12) = 0.414638 fft 3: mflops = 22.9842 (norm. = 0.693694), norm. avg. (of 12) = 0.555793 fft 4: mflops = 14.3328 (norm. = 0.432584), norm. avg. (of 12) = 0.302816 fft 5: mflops = 33.133 (norm. = 1), norm. avg. (of 12) = 0.976432 fft 6: mflops = 33.133 (norm. = 1), norm. avg. (of 12) = 0.946792 fft 7: mflops = 12.1488 (norm. = 0.366667), norm. avg. (of 12) = 0.232164 fft 8: mflops = 18.0939 (norm. = 0.546099), norm. avg. (of 12) = 0.341111 fft 9: mflops = 19.6249 (norm. = 0.592308), norm. avg. (of 12) = 0.236353 fft 10: mflops = 29.6656 (norm. = 0.895349), norm. avg. (of 12) = 0.368677 fft 11: mflops = 29.3246 (norm. = 0.885057), norm. avg. (of 12) = 0.385765 fft 12: mflops = 25.5124 (norm. = 0.77), norm. avg. (of 10) = 0.332391 fft 13: mflops = 23.4059 (norm. = 0.706422), norm. avg. (of 10) = 0.335072 fft 14: mflops = 3.4853 (norm. = 0.105191), norm. avg. (of 12) = 0.0703492 Benchmarking for array size = 1960: 0. Brenner: elapsed time t=1.81 s, 128 iters, t-(init.)=1.76 s t(norm)=0.64145, mflops=7.79484 (err=7.3e-16) 1. CWP (min N) (N=1980): elapsed time t=1.2 s, 256 iters, t-(init.)=1.13 s t(norm)=0.20592, mflops=24.2813 2. CWP (best N) (N=1980): elapsed time t=1.2 s, 256 iters, t-(init.)=1.13 s t(norm)=0.20592, mflops=24.2813 3. FFTPACK: elapsed time t=1.01 s, 128 iters, t-(init.)=0.97 s t(norm)=0.353526, mflops=14.1432 (err=5.6e-16) 4. FFTPACK (f2c): elapsed time t=1.34 s, 128 iters, t-(init.)=1.29 s t(norm)=0.470154, mflops=10.6348 (err=6.3e-16) FFTW_MEASURE plan: (cost = 3.593750e-03) FFTW_TWIDDLE 7 FFTW_TWIDDLE 10 FFTW_TWIDDLE 2 FFTW_NOTW 14 5. 
FFTW: elapsed time t=1.94 s, 512 iters, t-(init.)=1.75 s t(norm)=0.159451, mflops=31.3575 (err=5.7e-16) FFTW_ESTIMATE plan: (cost = 9.662800e+04) FFTW_TWIDDLE 10 FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 14 6. FFTW_ESTIMATE: elapsed time t=1.95 s, 512 iters, t-(init.)=1.76 s t(norm)=0.160362, mflops=31.1794 (err=5.6e-16) 7. Frigo-old: elapsed time t=1.3 s, 128 iters, t-(init.)=1.26 s t(norm)=0.45922, mflops=10.888 (err=6.9e-16) 8. GSL: elapsed time t=1.79 s, 256 iters, t-(init.)=1.7 s t(norm)=0.309791, mflops=16.1399 (err=7.0e-16) 9. Nielsen: elapsed time t=1.1 s, 128 iters, t-(init.)=1.06 s t(norm)=0.386328, mflops=12.9424 (err=1.5e-14) 10. Singleton: elapsed time t=1.56 s, 256 iters, t-(init.)=1.48 s t(norm)=0.269701, mflops=18.5391 (err=7.7e-16) 11. Singleton (f2c): elapsed time t=1.49 s, 256 iters, t-(init.)=1.41 s t(norm)=0.256944, mflops=19.4595 (err=7.7e-16) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=1.13 s, 32 iters, t-(init.)=1.12 s t(norm)=1.63278, mflops=3.06226 (err=6.1e-16) Top mflops for N=1960 = 31.3575 Normalized results and averages for N=1960: fft 0: mflops = 7.79484 (norm. = 0.24858), norm. avg. (of 10) = 0.160677 fft 1: mflops = 24.2813 (norm. = 0.774336), norm. avg. (of 13) = 0.467298 fft 2: mflops = 24.2813 (norm. = 0.774336), norm. avg. (of 13) = 0.442307 fft 3: mflops = 14.1432 (norm. = 0.451031), norm. avg. (of 13) = 0.547734 fft 4: mflops = 10.6348 (norm. = 0.339147), norm. avg. (of 13) = 0.30561 fft 5: mflops = 31.3575 (norm. = 1), norm. avg. (of 13) = 0.978245 fft 6: mflops = 31.1794 (norm. = 0.994318), norm. avg. (of 13) = 0.950448 fft 7: mflops = 10.888 (norm. = 0.347222), norm. avg. (of 13) = 0.241014 fft 8: mflops = 16.1399 (norm. = 0.514706), norm. avg. (of 13) = 0.354465 fft 9: mflops = 12.9424 (norm. = 0.412736), norm. avg. (of 13) = 0.249921 fft 10: mflops = 18.5391 (norm. = 0.591216), norm. avg. 
(of 13) = 0.385796 fft 11: mflops = 19.4595 (norm. = 0.620567), norm. avg. (of 13) = 0.403827 fft 12: mflops = -1 (norm. = -0.0318903), norm. avg. (of 10) = 0.332391 fft 13: mflops = -1 (norm. = -0.0318903), norm. avg. (of 10) = 0.335072 fft 14: mflops = 3.06226 (norm. = 0.0976563), norm. avg. (of 13) = 0.0724497 Benchmarking for array size = 4725: 0. Brenner: elapsed time t=1.45 s, 32 iters, t-(init.)=1.42 s t(norm)=0.769413, mflops=6.49846 (err=1.4e-15) 1. CWP (min N) (N=5005): elapsed time t=1.93 s, 128 iters, t-(init.)=1.82 s t(norm)=0.246537, mflops=20.2809 2. CWP (best N) (N=5040): elapsed time t=1.65 s, 128 iters, t-(init.)=1.54 s t(norm)=0.208609, mflops=23.9683 3. FFTPACK: elapsed time t=1.06 s, 64 iters, t-(init.)=1 s t(norm)=0.27092, mflops=18.4556 (err=1.2e-15) 4. FFTPACK (f2c): elapsed time t=1.54 s, 64 iters, t-(init.)=1.48 s t(norm)=0.400962, mflops=12.47 (err=1.3e-15) FFTW_MEASURE plan: (cost = 1.000000e-02) FFTW_TWIDDLE 7 FFTW_TWIDDLE 9 FFTW_TWIDDLE 5 FFTW_NOTW 15 5. FFTW: elapsed time t=1.32 s, 128 iters, t-(init.)=1.21 s t(norm)=0.163907, mflops=30.5052 (err=1.2e-15) FFTW_ESTIMATE plan: (cost = 1.946700e+05) FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 5 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.31 s, 128 iters, t-(init.)=1.19 s t(norm)=0.161198, mflops=31.0179 (err=1.2e-15) 7. Frigo-old: elapsed time t=1.15 s, 32 iters, t-(init.)=1.13 s t(norm)=0.61228, mflops=8.1662 (err=1.3e-15) 8. GSL: elapsed time t=1.17 s, 64 iters, t-(init.)=1.11 s t(norm)=0.300721, mflops=16.6267 (err=1.3e-15) 9. Nielsen: elapsed time t=1.65 s, 64 iters, t-(init.)=1.59 s t(norm)=0.430763, mflops=11.6073 (err=4.3e-14) 10. Singleton: elapsed time t=1.06 s, 64 iters, t-(init.)=1.01 s t(norm)=0.273629, mflops=18.2729 (err=1.8e-15) 11. Singleton (f2c): elapsed time t=1.04 s, 64 iters, t-(init.)=0.99 s t(norm)=0.268211, mflops=18.642 (err=1.8e-15) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. 
Valkenburg: elapsed time t=1.45 s, 16 iters, t-(init.)=1.44 s t(norm)=1.5605, mflops=3.2041 (err=1.3e-15) Top mflops for N=4725 = 31.0179 Normalized results and averages for N=4725: fft 0: mflops = 6.49846 (norm. = 0.209507), norm. avg. (of 11) = 0.165116 fft 1: mflops = 20.2809 (norm. = 0.653846), norm. avg. (of 14) = 0.480623 fft 2: mflops = 23.9683 (norm. = 0.772727), norm. avg. (of 14) = 0.465908 fft 3: mflops = 18.4556 (norm. = 0.595), norm. avg. (of 14) = 0.55111 fft 4: mflops = 12.47 (norm. = 0.402027), norm. avg. (of 14) = 0.312497 fft 5: mflops = 30.5052 (norm. = 0.983471), norm. avg. (of 14) = 0.978618 fft 6: mflops = 31.0179 (norm. = 1), norm. avg. (of 14) = 0.953988 fft 7: mflops = 8.1662 (norm. = 0.263274), norm. avg. (of 14) = 0.242604 fft 8: mflops = 16.6267 (norm. = 0.536036), norm. avg. (of 14) = 0.367434 fft 9: mflops = 11.6073 (norm. = 0.374214), norm. avg. (of 14) = 0.258799 fft 10: mflops = 18.2729 (norm. = 0.589109), norm. avg. (of 14) = 0.400318 fft 11: mflops = 18.642 (norm. = 0.60101), norm. avg. (of 14) = 0.417912 fft 12: mflops = -1 (norm. = -0.0322395), norm. avg. (of 10) = 0.332391 fft 13: mflops = -1 (norm. = -0.0322395), norm. avg. (of 10) = 0.335072 fft 14: mflops = 3.2041 (norm. = 0.103299), norm. avg. (of 14) = 0.0746532 Benchmarking for array size = 10368: 0. Brenner: elapsed time t=1.67 s, 16 iters, t-(init.)=1.64 s t(norm)=0.741102, mflops=6.74671 (err=1.1e-15) 1. CWP (min N) (N=10920): elapsed time t=1.09 s, 32 iters, t-(init.)=1.03 s t(norm)=0.232724, mflops=21.4847 2. CWP (best N) (N=11088): elapsed time t=1.06 s, 32 iters, t-(init.)=0.99 s t(norm)=0.223686, mflops=22.3527 3. FFTPACK: elapsed time t=1.93 s, 64 iters, t-(init.)=1.81 s t(norm)=0.204481, mflops=24.4522 (err=9.8e-16) 4. FFTPACK (f2c): elapsed time t=1.59 s, 32 iters, t-(init.)=1.52 s t(norm)=0.343437, mflops=14.5587 (err=1.1e-15) FFTW_MEASURE plan: (cost = 2.062500e-02) FFTW_TWIDDLE 3 FFTW_TWIDDLE 32 FFTW_TWIDDLE 9 FFTW_NOTW 12 5. 
FFTW: elapsed time t=1.4 s, 64 iters, t-(init.)=1.28 s t(norm)=0.144605, mflops=34.5769 (err=8.8e-16) FFTW_ESTIMATE plan: (cost = 1.254528e+05) FFTW_TWIDDLE 9 FFTW_TWIDDLE 6 FFTW_TWIDDLE 6 FFTW_NOTW 32 6. FFTW_ESTIMATE: elapsed time t=1.47 s, 64 iters, t-(init.)=1.34 s t(norm)=0.151384, mflops=33.0287 (err=9.4e-16) 7. Frigo-old: elapsed time t=1 s, 16 iters, t-(init.)=0.97 s t(norm)=0.438335, mflops=11.4068 (err=1.0e-15) 8. GSL: elapsed time t=1.19 s, 32 iters, t-(init.)=1.13 s t(norm)=0.255319, mflops=19.5834 (err=9.4e-16) 9. Nielsen: elapsed time t=1.02 s, 16 iters, t-(init.)=0.99 s t(norm)=0.447372, mflops=11.1764 (err=1.1e-14) 10. Singleton: elapsed time t=1.3 s, 32 iters, t-(init.)=1.24 s t(norm)=0.280173, mflops=17.8461 (err=1.3e-15) 11. Singleton (f2c): elapsed time t=1.35 s, 32 iters, t-(init.)=1.29 s t(norm)=0.29147, mflops=17.1544 (err=1.3e-15) 12. Temperton: elapsed time t=1.37 s, 32 iters, t-(init.)=1.32 s t(norm)=0.298248, mflops=16.7646 (err=2.2e-07) 13. Temperton (f2c): elapsed time t=1.42 s, 32 iters, t-(init.)=1.36 s t(norm)=0.307286, mflops=16.2715 (err=9.7e-16) 14. Valkenburg: elapsed time t=1.54 s, 8 iters, t-(init.)=1.52 s t(norm)=1.37375, mflops=3.63967 (err=1.3e-15) Top mflops for N=10368 = 34.5769 Normalized results and averages for N=10368: fft 0: mflops = 6.74671 (norm. = 0.195122), norm. avg. (of 12) = 0.167616 fft 1: mflops = 21.4847 (norm. = 0.621359), norm. avg. (of 15) = 0.490005 fft 2: mflops = 22.3527 (norm. = 0.646465), norm. avg. (of 15) = 0.477945 fft 3: mflops = 24.4522 (norm. = 0.707182), norm. avg. (of 15) = 0.561515 fft 4: mflops = 14.5587 (norm. = 0.421053), norm. avg. (of 15) = 0.319734 fft 5: mflops = 34.5769 (norm. = 1), norm. avg. (of 15) = 0.980044 fft 6: mflops = 33.0287 (norm. = 0.955224), norm. avg. (of 15) = 0.95407 fft 7: mflops = 11.4068 (norm. = 0.329897), norm. avg. (of 15) = 0.248424 fft 8: mflops = 19.5834 (norm. = 0.566372), norm. avg. (of 15) = 0.380696 fft 9: mflops = 11.1764 (norm. = 0.323232), norm. avg. 
(of 15) = 0.263095 fft 10: mflops = 17.8461 (norm. = 0.516129), norm. avg. (of 15) = 0.408039 fft 11: mflops = 17.1544 (norm. = 0.496124), norm. avg. (of 15) = 0.423126 fft 12: mflops = 16.7646 (norm. = 0.484848), norm. avg. (of 11) = 0.34625 fft 13: mflops = 16.2715 (norm. = 0.470588), norm. avg. (of 11) = 0.347391 fft 14: mflops = 3.63967 (norm. = 0.105263), norm. avg. (of 15) = 0.0766939 Benchmarking for array size = 27000: 0. Brenner: elapsed time t=1.41 s, 4 iters, t-(init.)=1.39 s t(norm)=0.874306, mflops=5.71882 (err=3.6e-15) 1. CWP (min N) (N=27720): elapsed time t=1.54 s, 16 iters, t-(init.)=1.45 s t(norm)=0.228011, mflops=21.9287 2. CWP (best N) (N=27720): elapsed time t=1.52 s, 16 iters, t-(init.)=1.43 s t(norm)=0.224866, mflops=22.2354 3. FFTPACK: elapsed time t=1.63 s, 16 iters, t-(init.)=1.55 s t(norm)=0.243736, mflops=20.514 (err=3.4e-15) 4. FFTPACK (f2c): elapsed time t=1.31 s, 8 iters, t-(init.)=1.27 s t(norm)=0.399413, mflops=12.5184 (err=3.5e-15) FFTW_MEASURE plan: (cost = 7.750000e-02) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_TWIDDLE 3 FFTW_TWIDDLE 2 FFTW_TWIDDLE 3 FFTW_NOTW 15 5. FFTW: elapsed time t=1.17 s, 16 iters, t-(init.)=1.08 s t(norm)=0.169829, mflops=29.4413 (err=3.5e-15) FFTW_ESTIMATE plan: (cost = 1.231200e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_TWIDDLE 9 FFTW_TWIDDLE 2 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.23 s, 16 iters, t-(init.)=1.15 s t(norm)=0.180837, mflops=27.6493 (err=3.5e-15) 7. Frigo-old: elapsed time t=1.93 s, 8 iters, t-(init.)=1.89 s t(norm)=0.594402, mflops=8.41181 (err=3.6e-15) 8. GSL: elapsed time t=1.96 s, 16 iters, t-(init.)=1.88 s t(norm)=0.295629, mflops=16.9131 (err=3.4e-15) 9. Nielsen: elapsed time t=1.32 s, 8 iters, t-(init.)=1.28 s t(norm)=0.402558, mflops=12.4206 (err=2.0e-13) 10. Singleton: elapsed time t=1.04 s, 8 iters, t-(init.)=1.01 s t(norm)=0.317644, mflops=15.7409 (err=5.0e-15) 11. 
Singleton (f2c): elapsed time t=1.06 s, 8 iters, t-(init.)=1.02 s t(norm)=0.320789, mflops=15.5866 (err=5.0e-15) 12. Temperton: elapsed time t=1 s, 8 iters, t-(init.)=0.96 s t(norm)=0.301919, mflops=16.5608 (err=1.4e-07) 13. Temperton (f2c): elapsed time t=1.1 s, 8 iters, t-(init.)=1.06 s t(norm)=0.333368, mflops=14.9984 (err=3.6e-15) 14. Valkenburg: elapsed time t=1.27 s, 2 iters, t-(init.)=1.26 s t(norm)=1.58507, mflops=3.15443 (err=3.4e-15) Top mflops for N=27000 = 29.4413 Normalized results and averages for N=27000: fft 0: mflops = 5.71882 (norm. = 0.194245), norm. avg. (of 13) = 0.169665 fft 1: mflops = 21.9287 (norm. = 0.744828), norm. avg. (of 16) = 0.505931 fft 2: mflops = 22.2354 (norm. = 0.755245), norm. avg. (of 16) = 0.495277 fft 3: mflops = 20.514 (norm. = 0.696774), norm. avg. (of 16) = 0.569969 fft 4: mflops = 12.5184 (norm. = 0.425197), norm. avg. (of 16) = 0.326326 fft 5: mflops = 29.4413 (norm. = 1), norm. avg. (of 16) = 0.981291 fft 6: mflops = 27.6493 (norm. = 0.93913), norm. avg. (of 16) = 0.953136 fft 7: mflops = 8.41181 (norm. = 0.285714), norm. avg. (of 16) = 0.250754 fft 8: mflops = 16.9131 (norm. = 0.574468), norm. avg. (of 16) = 0.392807 fft 9: mflops = 12.4206 (norm. = 0.421875), norm. avg. (of 16) = 0.273019 fft 10: mflops = 15.7409 (norm. = 0.534653), norm. avg. (of 16) = 0.415952 fft 11: mflops = 15.5866 (norm. = 0.529412), norm. avg. (of 16) = 0.429769 fft 12: mflops = 16.5608 (norm. = 0.5625), norm. avg. (of 12) = 0.364271 fft 13: mflops = 14.9984 (norm. = 0.509434), norm. avg. (of 12) = 0.360895 fft 14: mflops = 3.15443 (norm. = 0.107143), norm. avg. (of 16) = 0.0785969 Benchmarking for array size = 75600: 0. Brenner: elapsed time t=1.09 s, 1 iters, t-(init.)=1.07 s t(norm)=0.87334, mflops=5.72515 (err=4.7e-15) 1. CWP (min N) (N=80080): elapsed time t=1.33 s, 4 iters, t-(init.)=1.27 s t(norm)=0.259145, mflops=19.2942 2. CWP (best N) (N=80080): elapsed time t=1.34 s, 4 iters, t-(init.)=1.28 s t(norm)=0.261186, mflops=19.1435 3. 
FFTPACK: elapsed time t=1.74 s, 4 iters, t-(init.)=1.68 s t(norm)=0.342806, mflops=14.5855 (err=4.7e-15) 4. FFTPACK (f2c): elapsed time t=1.25 s, 2 iters, t-(init.)=1.22 s t(norm)=0.497886, mflops=10.0425 (err=4.7e-15) FFTW_MEASURE plan: (cost = 2.400000e-01) FFTW_TWIDDLE 8 FFTW_TWIDDLE 7 FFTW_TWIDDLE 5 FFTW_TWIDDLE 6 FFTW_TWIDDLE 3 FFTW_NOTW 15 5. FFTW: elapsed time t=1.94 s, 8 iters, t-(init.)=1.83 s t(norm)=0.186707, mflops=26.7799 (err=4.7e-15) FFTW_ESTIMATE plan: (cost = 2.971080e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 9 FFTW_TWIDDLE 8 FFTW_TWIDDLE 7 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1 s, 4 iters, t-(init.)=0.93 s t(norm)=0.189768, mflops=26.348 (err=4.7e-15) 7. Frigo-old: elapsed time t=1.59 s, 2 iters, t-(init.)=1.56 s t(norm)=0.636641, mflops=7.85372 (err=4.7e-15) 8. GSL: elapsed time t=1.71 s, 4 iters, t-(init.)=1.65 s t(norm)=0.336685, mflops=14.8507 (err=4.7e-15) 9. Nielsen: elapsed time t=1.33 s, 2 iters, t-(init.)=1.3 s t(norm)=0.530534, mflops=9.42447 (err=4.8e-13) 10. Singleton: elapsed time t=1.98 s, 4 iters, t-(init.)=1.92 s t(norm)=0.391779, mflops=12.7623 (err=6.1e-15) 11. Singleton (f2c): elapsed time t=1.98 s, 4 iters, t-(init.)=1.92 s t(norm)=0.391779, mflops=12.7623 (err=6.1e-15) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=2.16 s, 1 iters, t-(init.)=2.15 s t(norm)=1.75484, mflops=2.84926 (err=4.5e-15) Top mflops for N=75600 = 26.7799 Normalized results and averages for N=75600: fft 0: mflops = 5.72515 (norm. = 0.213785), norm. avg. (of 14) = 0.172816 fft 1: mflops = 19.2942 (norm. = 0.720472), norm. avg. (of 17) = 0.518551 fft 2: mflops = 19.1435 (norm. = 0.714844), norm. avg. (of 17) = 0.508192 fft 3: mflops = 14.5855 (norm. = 0.544643), norm. avg. (of 17) = 0.568479 fft 4: mflops = 10.0425 (norm. = 0.375), norm. avg. (of 17) = 0.329189 fft 5: mflops = 26.7799 (norm. = 1), norm. avg. (of 17) = 0.982391 fft 6: mflops = 26.348 (norm. 
= 0.983871), norm. avg. (of 17) = 0.954944 fft 7: mflops = 7.85372 (norm. = 0.293269), norm. avg. (of 17) = 0.253255 fft 8: mflops = 14.8507 (norm. = 0.554545), norm. avg. (of 17) = 0.402321 fft 9: mflops = 9.42447 (norm. = 0.351923), norm. avg. (of 17) = 0.27766 fft 10: mflops = 12.7623 (norm. = 0.476563), norm. avg. (of 17) = 0.419518 fft 11: mflops = 12.7623 (norm. = 0.476563), norm. avg. (of 17) = 0.432521 fft 12: mflops = -1 (norm. = -0.0373414), norm. avg. (of 12) = 0.364271 fft 13: mflops = -1 (norm. = -0.0373414), norm. avg. (of 12) = 0.360895 fft 14: mflops = 2.84926 (norm. = 0.106395), norm. avg. (of 17) = 0.0802321 Benchmarking for array size = 165375: 0. Brenner: elapsed time t=2.74 s, 1 iters, t-(init.)=2.71 s t(norm)=0.945292, mflops=5.28937 (err=1.2e-14) 1. CWP (min N) (N=180180): elapsed time t=1.69 s, 2 iters, t-(init.)=1.63 s t(norm)=0.284285, mflops=17.588 2. CWP (best N) (N=180180): elapsed time t=1.74 s, 2 iters, t-(init.)=1.67 s t(norm)=0.291262, mflops=17.1667 3. FFTPACK: elapsed time t=1.46 s, 1 iters, t-(init.)=1.43 s t(norm)=0.498807, mflops=10.0239 (err=1.2e-14) 4. FFTPACK (f2c): elapsed time t=1.87 s, 1 iters, t-(init.)=1.84 s t(norm)=0.641822, mflops=7.79032 (err=1.2e-14) FFTW_MEASURE plan: (cost = 6.500000e-01) FFTW_TWIDDLE 5 FFTW_TWIDDLE 7 FFTW_TWIDDLE 7 FFTW_TWIDDLE 9 FFTW_TWIDDLE 5 FFTW_NOTW 15 5. FFTW: elapsed time t=1.27 s, 2 iters, t-(init.)=1.2 s t(norm)=0.20929, mflops=23.8903 (err=1.2e-14) FFTW_ESTIMATE plan: (cost = 8.367975e+06) FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 7 FFTW_TWIDDLE 5 FFTW_TWIDDLE 5 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.28 s, 2 iters, t-(init.)=1.22 s t(norm)=0.212778, mflops=23.4987 (err=1.2e-14) 7. Frigo-old: elapsed time t=2.32 s, 1 iters, t-(init.)=2.29 s t(norm)=0.798789, mflops=6.25947 (err=1.2e-14) 8. GSL: elapsed time t=1.01 s, 1 iters, t-(init.)=0.98 s t(norm)=0.34184, mflops=14.6267 (err=1.2e-14) 9. 
Nielsen: elapsed time t=1.74 s, 1 iters, t-(init.)=1.71 s t(norm)=0.596476, mflops=8.38257 (err=1.7e-12) 10. Singleton: elapsed time t=1.21 s, 1 iters, t-(init.)=1.18 s t(norm)=0.411603, mflops=12.1476 (err=1.8e-14) 11. Singleton (f2c): elapsed time t=1.2 s, 1 iters, t-(init.)=1.17 s t(norm)=0.408115, mflops=12.2514 (err=1.8e-14) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=5.25 s, 1 iters, t-(init.)=5.21 s t(norm)=1.81733, mflops=2.75128 (err=1.2e-14) Top mflops for N=165375 = 23.8903 Normalized results and averages for N=165375: fft 0: mflops = 5.28937 (norm. = 0.221402), norm. avg. (of 15) = 0.176055 fft 1: mflops = 17.588 (norm. = 0.736196), norm. avg. (of 18) = 0.530643 fft 2: mflops = 17.1667 (norm. = 0.718563), norm. avg. (of 18) = 0.51988 fft 3: mflops = 10.0239 (norm. = 0.41958), norm. avg. (of 18) = 0.560207 fft 4: mflops = 7.79032 (norm. = 0.326087), norm. avg. (of 18) = 0.329017 fft 5: mflops = 23.8903 (norm. = 1), norm. avg. (of 18) = 0.98337 fft 6: mflops = 23.4987 (norm. = 0.983607), norm. avg. (of 18) = 0.956537 fft 7: mflops = 6.25947 (norm. = 0.262009), norm. avg. (of 18) = 0.253742 fft 8: mflops = 14.6267 (norm. = 0.612245), norm. avg. (of 18) = 0.413984 fft 9: mflops = 8.38257 (norm. = 0.350877), norm. avg. (of 18) = 0.281728 fft 10: mflops = 12.1476 (norm. = 0.508475), norm. avg. (of 18) = 0.42446 fft 11: mflops = 12.2514 (norm. = 0.512821), norm. avg. (of 18) = 0.436982 fft 12: mflops = -1 (norm. = -0.041858), norm. avg. (of 12) = 0.364271 fft 13: mflops = -1 (norm. = -0.041858), norm. avg. (of 12) = 0.360895 fft 14: mflops = 2.75128 (norm. = 0.115163), norm. avg. (of 18) = 0.0821727 Benchmarking for array size = 362880: 0. Brenner: elapsed time t=6.01 s, 1 iters, t-(init.)=5.94 s t(norm)=0.886292, mflops=5.64148 (err=7.7e-15) 1. CWP (min N) (N=720720): elapsed time t=3.64 s, 1 iters, t-(init.)=3.5 s t(norm)=0.522226, mflops=9.5744 2. 
CWP (best N) (N=720720): elapsed time t=3.67 s, 1 iters, t-(init.)=3.54 s t(norm)=0.528194, mflops=9.46621 3. FFTPACK: elapsed time t=2.39 s, 1 iters, t-(init.)=2.32 s t(norm)=0.346161, mflops=14.4441 (err=7.5e-15) 4. FFTPACK (f2c): elapsed time t=3.52 s, 1 iters, t-(init.)=3.45 s t(norm)=0.514766, mflops=9.71316 (err=7.5e-15) FFTW_MEASURE plan: (cost = 1.340000e+00) FFTW_TWIDDLE 5 FFTW_TWIDDLE 16 FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 6 FFTW_NOTW 12 5. FFTW: elapsed time t=1.32 s, 1 iters, t-(init.)=1.25 s t(norm)=0.186509, mflops=26.8083 (err=7.6e-15) FFTW_ESTIMATE plan: (cost = 7.511616e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 9 FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 32 6. FFTW_ESTIMATE: elapsed time t=1.39 s, 1 iters, t-(init.)=1.32 s t(norm)=0.196954, mflops=25.3867 (err=7.6e-15) 7. Frigo-old: elapsed time t=4.35 s, 1 iters, t-(init.)=4.28 s t(norm)=0.638608, mflops=7.82953 (err=7.6e-15) 8. GSL: elapsed time t=2.08 s, 1 iters, t-(init.)=2.02 s t(norm)=0.301399, mflops=16.5893 (err=7.5e-15) 9. Nielsen: elapsed time t=4.1 s, 1 iters, t-(init.)=4.03 s t(norm)=0.601306, mflops=8.31523 (err=3.5e-12) 10. Singleton: elapsed time t=3.15 s, 1 iters, t-(init.)=3.08 s t(norm)=0.459559, mflops=10.88 (err=1.1e-14) 11. Singleton (f2c): elapsed time t=3.15 s, 1 iters, t-(init.)=3.08 s t(norm)=0.459559, mflops=10.88 (err=1.1e-14) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=12 s, 1 iters, t-(init.)=11.94 s t(norm)=1.78154, mflops=2.80657 (err=7.9e-15) Top mflops for N=362880 = 26.8083 Normalized results and averages for N=362880: fft 0: mflops = 5.64148 (norm. = 0.210438), norm. avg. (of 16) = 0.178204 fft 1: mflops = 9.5744 (norm. = 0.357143), norm. avg. (of 19) = 0.521511 fft 2: mflops = 9.46621 (norm. = 0.353107), norm. avg. (of 19) = 0.511102 fft 3: mflops = 14.4441 (norm. = 0.538793), norm. avg. (of 19) = 0.55908 fft 4: mflops = 9.71316 (norm. 
= 0.362319), norm. avg. (of 19) = 0.330769 fft 5: mflops = 26.8083 (norm. = 1), norm. avg. (of 19) = 0.984245 fft 6: mflops = 25.3867 (norm. = 0.94697), norm. avg. (of 19) = 0.956033 fft 7: mflops = 7.82953 (norm. = 0.292056), norm. avg. (of 19) = 0.255758 fft 8: mflops = 16.5893 (norm. = 0.618812), norm. avg. (of 19) = 0.424764 fft 9: mflops = 8.31523 (norm. = 0.310174), norm. avg. (of 19) = 0.283225 fft 10: mflops = 10.88 (norm. = 0.405844), norm. avg. (of 19) = 0.42348 fft 11: mflops = 10.88 (norm. = 0.405844), norm. avg. (of 19) = 0.435343 fft 12: mflops = -1 (norm. = -0.0373019), norm. avg. (of 12) = 0.364271 fft 13: mflops = -1 (norm. = -0.0373019), norm. avg. (of 12) = 0.360895 fft 14: mflops = 2.80657 (norm. = 0.10469), norm. avg. (of 19) = 0.0833579 ------------------------------------------------------ @@@@ bench.3d.p2.log Benchmarking for sizes: 4x4x4 (0.00128174 MB) 8x8x8 (0.00830078 MB) 16x16x16 (0.0633545 MB) 32x32x32 (0.501587 MB) 64x64x64 (4.00305 MB) 256x64x32 (8.01184 MB) 16x1024x64 (16.047 MB) 128x128x128 (32.006 MB) Maximum array size N = 2097152 Benchmarking FFTs: 0. FFTW 1. HARM 2. HARM (f2c) 3. NR (C) 4. NR (F) 5. PDA 6. PDA (f2c) 7. Singleton 8. Singleton (f2c) 9. Temperton 10. Temperton (f2c) Computing normalized averages (11 transforms). Benchmarking for array size = 4x4x4 (power of 2): 0. FFTW: elapsed time t=1.08 s, 32768 iters, t-(init.)=0.81 s t(norm)=0.064373, mflops=77.6723 (err=3.0e-16) 1. Skipping fft (all dimensions must be > 4 for HARM). 2. Skipping fft (all dimensions must be > 4 for HARM). 3. NR (C): elapsed time t=1.14 s, 16384 iters, t-(init.)=0.98 s t(norm)=0.155767, mflops=32.0993 (err=3.0e-16) 4. NR (F): elapsed time t=1.65 s, 16384 iters, t-(init.)=1.5 s t(norm)=0.238419, mflops=20.9715 (err=3.0e-16) 5. PDA: elapsed time t=1.12 s, 8192 iters, t-(init.)=1.09 s t(norm)=0.346502, mflops=14.4299 (err=2.9e-16) 6. PDA (f2c): elapsed time t=1.67 s, 8192 iters, t-(init.)=1.62 s t(norm)=0.514984, mflops=9.70904 (err=3.3e-16) 7. 
Singleton: elapsed time t=1.42 s, 16384 iters, t-(init.)=1.32 s t(norm)=0.209808, mflops=23.8313 (err=3.0e-16) 8. Singleton (f2c): elapsed time t=1.08 s, 16384 iters, t-(init.)=0.97 s t(norm)=0.154177, mflops=32.4302 (err=2.2e-16) 9. Temperton: elapsed time t=1.9 s, 32768 iters, t-(init.)=1.52 s t(norm)=0.120799, mflops=41.3912 (err=4.1e-16) 10. Temperton (f2c): elapsed time t=1.37 s, 16384 iters, t-(init.)=1.33 s t(norm)=0.211398, mflops=23.6521 (err=3.0e-16) Top mflops for N=64 = 77.6723 Normalized results and averages for N=64: fft 0: mflops = 77.6723 (norm. = 1), norm. avg. (of 1) = 1 fft 1: mflops = -1 (norm. = -0.0128746), norm. avg. (of 0) = -1 fft 2: mflops = -1 (norm. = -0.0128746), norm. avg. (of 0) = -1 fft 3: mflops = 32.0993 (norm. = 0.413265), norm. avg. (of 1) = 0.413265 fft 4: mflops = 20.9715 (norm. = 0.27), norm. avg. (of 1) = 0.27 fft 5: mflops = 14.4299 (norm. = 0.18578), norm. avg. (of 1) = 0.18578 fft 6: mflops = 9.70904 (norm. = 0.125), norm. avg. (of 1) = 0.125 fft 7: mflops = 23.8313 (norm. = 0.306818), norm. avg. (of 1) = 0.306818 fft 8: mflops = 32.4302 (norm. = 0.417526), norm. avg. (of 1) = 0.417526 fft 9: mflops = 41.3912 (norm. = 0.532895), norm. avg. (of 1) = 0.532895 fft 10: mflops = 23.6521 (norm. = 0.304511), norm. avg. (of 1) = 0.304511 Benchmarking for array size = 8x8x8 (power of 2): 0. FFTW: elapsed time t=1.34 s, 4096 iters, t-(init.)=1.14 s t(norm)=0.0603994, mflops=82.7823 (err=2.8e-16) 1. HARM: elapsed time t=1.34 s, 2048 iters, t-(init.)=1.2 s t(norm)=0.127157, mflops=39.3216 (err=3.3e-16) 2. HARM (f2c): elapsed time t=1.6 s, 2048 iters, t-(init.)=1.57 s t(norm)=0.166363, mflops=30.0547 (err=3.2e-16) 3. NR (C): elapsed time t=1.3 s, 2048 iters, t-(init.)=1.26 s t(norm)=0.133514, mflops=37.4491 (err=3.1e-16) 4. NR (F): elapsed time t=1.8 s, 2048 iters, t-(init.)=1.67 s t(norm)=0.17696, mflops=28.255 (err=3.1e-16) 5. PDA: elapsed time t=1.11 s, 1024 iters, t-(init.)=1.04 s t(norm)=0.220405, mflops=22.6855 (err=2.5e-16) 6. 
PDA (f2c): elapsed time t=1.74 s, 1024 iters, t-(init.)=1.71 s t(norm)=0.362396, mflops=13.7971 (err=2.6e-16) 7. Singleton: elapsed time t=1.5 s, 2048 iters, t-(init.)=1.34 s t(norm)=0.141992, mflops=35.2134 (err=3.4e-16) 8. Singleton (f2c): elapsed time t=1.28 s, 2048 iters, t-(init.)=1.1 s t(norm)=0.11656, mflops=42.8963 (err=3.4e-16) 9. Temperton: elapsed time t=1.03 s, 2048 iters, t-(init.)=0.87 s t(norm)=0.0921885, mflops=54.2367 (err=1.1e-08) 10. Temperton (f2c): elapsed time t=1.39 s, 2048 iters, t-(init.)=1.29 s t(norm)=0.136693, mflops=36.5782 (err=3.0e-16) Top mflops for N=512 = 82.7823 Normalized results and averages for N=512: fft 0: mflops = 82.7823 (norm. = 1), norm. avg. (of 2) = 1 fft 1: mflops = 39.3216 (norm. = 0.475), norm. avg. (of 1) = 0.475 fft 2: mflops = 30.0547 (norm. = 0.363057), norm. avg. (of 1) = 0.363057 fft 3: mflops = 37.4491 (norm. = 0.452381), norm. avg. (of 2) = 0.432823 fft 4: mflops = 28.255 (norm. = 0.341317), norm. avg. (of 2) = 0.305659 fft 5: mflops = 22.6855 (norm. = 0.274038), norm. avg. (of 2) = 0.229909 fft 6: mflops = 13.7971 (norm. = 0.166667), norm. avg. (of 2) = 0.145833 fft 7: mflops = 35.2134 (norm. = 0.425373), norm. avg. (of 2) = 0.366096 fft 8: mflops = 42.8963 (norm. = 0.518182), norm. avg. (of 2) = 0.467854 fft 9: mflops = 54.2367 (norm. = 0.655172), norm. avg. (of 2) = 0.594034 fft 10: mflops = 36.5782 (norm. = 0.44186), norm. avg. (of 2) = 0.373186 Benchmarking for array size = 16x16x16 (power of 2): 0. FFTW: elapsed time t=1.54 s, 256 iters, t-(init.)=1.35 s t(norm)=0.107288, mflops=46.6034 (err=3.1e-16) 1. HARM: elapsed time t=1.31 s, 128 iters, t-(init.)=1.21 s t(norm)=0.192324, mflops=25.9978 (err=3.2e-16) 2. HARM (f2c): elapsed time t=1.6 s, 128 iters, t-(init.)=1.51 s t(norm)=0.240008, mflops=20.8326 (err=3.1e-16) 3. NR (C): elapsed time t=1.1 s, 64 iters, t-(init.)=1.05 s t(norm)=0.333786, mflops=14.9797 (err=3.5e-16) 4. 
NR (F): elapsed time t=1.22 s, 64 iters, t-(init.)=1.18 s t(norm)=0.375112, mflops=13.3294 (err=3.5e-16) 5. PDA: elapsed time t=1.46 s, 128 iters, t-(init.)=1.37 s t(norm)=0.217756, mflops=22.9615 (err=2.9e-16) 6. PDA (f2c): elapsed time t=1.17 s, 64 iters, t-(init.)=1.12 s t(norm)=0.356038, mflops=14.0434 (err=2.9e-16) 7. Singleton: elapsed time t=1.08 s, 64 iters, t-(init.)=1.04 s t(norm)=0.330607, mflops=15.1237 (err=3.5e-16) 8. Singleton (f2c): elapsed time t=1.84 s, 128 iters, t-(init.)=1.75 s t(norm)=0.278155, mflops=17.9756 (err=3.3e-16) 9. Temperton: elapsed time t=1.08 s, 128 iters, t-(init.)=0.99 s t(norm)=0.157356, mflops=31.775 (err=6.0e-08) 10. Temperton (f2c): elapsed time t=1.53 s, 128 iters, t-(init.)=1.44 s t(norm)=0.228882, mflops=21.8453 (err=3.1e-16) Top mflops for N=4096 = 46.6034 Normalized results and averages for N=4096: fft 0: mflops = 46.6034 (norm. = 1), norm. avg. (of 3) = 1 fft 1: mflops = 25.9978 (norm. = 0.557851), norm. avg. (of 2) = 0.516426 fft 2: mflops = 20.8326 (norm. = 0.44702), norm. avg. (of 2) = 0.405039 fft 3: mflops = 14.9797 (norm. = 0.321429), norm. avg. (of 3) = 0.395692 fft 4: mflops = 13.3294 (norm. = 0.286017), norm. avg. (of 3) = 0.299111 fft 5: mflops = 22.9615 (norm. = 0.492701), norm. avg. (of 3) = 0.317506 fft 6: mflops = 14.0434 (norm. = 0.301339), norm. avg. (of 3) = 0.197669 fft 7: mflops = 15.1237 (norm. = 0.324519), norm. avg. (of 3) = 0.352237 fft 8: mflops = 17.9756 (norm. = 0.385714), norm. avg. (of 3) = 0.440474 fft 9: mflops = 31.775 (norm. = 0.681818), norm. avg. (of 3) = 0.623295 fft 10: mflops = 21.8453 (norm. = 0.46875), norm. avg. (of 3) = 0.405041 Benchmarking for array size = 32x32x32 (power of 2): 0. FFTW: elapsed time t=1.09 s, 16 iters, t-(init.)=0.99 s t(norm)=0.125885, mflops=39.7188 (err=4.1e-16) 1. HARM: elapsed time t=1.97 s, 16 iters, t-(init.)=1.87 s t(norm)=0.237783, mflops=21.0276 (err=4.6e-16) 2. 
HARM (f2c): elapsed time t=1.25 s, 8 iters, t-(init.)=1.2 s t(norm)=0.305176, mflops=16.384 (err=4.3e-16) 3. NR (C): elapsed time t=1.74 s, 8 iters, t-(init.)=1.7 s t(norm)=0.432332, mflops=11.5652 (err=4.2e-16) 4. NR (F): elapsed time t=1.88 s, 8 iters, t-(init.)=1.83 s t(norm)=0.465393, mflops=10.7436 (err=4.2e-16) 5. PDA: elapsed time t=1.08 s, 8 iters, t-(init.)=1.03 s t(norm)=0.261943, mflops=19.0882 (err=3.5e-16) 6. PDA (f2c): elapsed time t=1.61 s, 8 iters, t-(init.)=1.56 s t(norm)=0.396729, mflops=12.6031 (err=3.6e-16) 7. Singleton: elapsed time t=1.63 s, 8 iters, t-(init.)=1.58 s t(norm)=0.401815, mflops=12.4435 (err=4.4e-16) 8. Singleton (f2c): elapsed time t=1.45 s, 8 iters, t-(init.)=1.4 s t(norm)=0.356038, mflops=14.0434 (err=4.2e-16) 9. Temperton: elapsed time t=1.7 s, 16 iters, t-(init.)=1.61 s t(norm)=0.204722, mflops=24.4234 (err=9.6e-08) 10. Temperton (f2c): elapsed time t=1.23 s, 8 iters, t-(init.)=1.18 s t(norm)=0.30009, mflops=16.6617 (err=4.0e-16) Top mflops for N=32768 = 39.7188 Normalized results and averages for N=32768: fft 0: mflops = 39.7188 (norm. = 1), norm. avg. (of 4) = 1 fft 1: mflops = 21.0276 (norm. = 0.529412), norm. avg. (of 3) = 0.520754 fft 2: mflops = 16.384 (norm. = 0.4125), norm. avg. (of 3) = 0.407526 fft 3: mflops = 11.5652 (norm. = 0.291176), norm. avg. (of 4) = 0.369563 fft 4: mflops = 10.7436 (norm. = 0.270492), norm. avg. (of 4) = 0.291957 fft 5: mflops = 19.0882 (norm. = 0.480583), norm. avg. (of 4) = 0.358275 fft 6: mflops = 12.6031 (norm. = 0.317308), norm. avg. (of 4) = 0.227578 fft 7: mflops = 12.4435 (norm. = 0.313291), norm. avg. (of 4) = 0.3425 fft 8: mflops = 14.0434 (norm. = 0.353571), norm. avg. (of 4) = 0.418748 fft 9: mflops = 24.4234 (norm. = 0.614907), norm. avg. (of 4) = 0.621198 fft 10: mflops = 16.6617 (norm. = 0.419492), norm. avg. (of 4) = 0.408653 Benchmarking for array size = 64x64x64 (power of 2): 0. 
FFTW: elapsed time t=1.6 s, 2 iters, t-(init.)=1.51 s t(norm)=0.160005, mflops=31.249 (err=4.2e-16) 1. HARM: elapsed time t=1.23 s, 1 iters, t-(init.)=1.18 s t(norm)=0.250075, mflops=19.994 (err=4.9e-16) 2. HARM (f2c): elapsed time t=1.61 s, 1 iters, t-(init.)=1.56 s t(norm)=0.330607, mflops=15.1237 (err=4.3e-16) 3. NR (C): elapsed time t=2.53 s, 1 iters, t-(init.)=2.48 s t(norm)=0.525581, mflops=9.51329 (err=4.9e-16) 4. NR (F): elapsed time t=2.72 s, 1 iters, t-(init.)=2.67 s t(norm)=0.565847, mflops=8.83631 (err=4.9e-16) 5. PDA: elapsed time t=1.26 s, 1 iters, t-(init.)=1.22 s t(norm)=0.258552, mflops=19.3385 (err=4.4e-16) 6. PDA (f2c): elapsed time t=1.94 s, 1 iters, t-(init.)=1.89 s t(norm)=0.400543, mflops=12.483 (err=4.4e-16) 7. Singleton: elapsed time t=2.14 s, 1 iters, t-(init.)=2.09 s t(norm)=0.442929, mflops=11.2885 (err=5.0e-16) 8. Singleton (f2c): elapsed time t=1.88 s, 1 iters, t-(init.)=1.82 s t(norm)=0.385708, mflops=12.9632 (err=4.8e-16) 9. Temperton: elapsed time t=1.03 s, 1 iters, t-(init.)=0.99 s t(norm)=0.209808, mflops=23.8313 (err=1.4e-07) 10. Temperton (f2c): elapsed time t=1.52 s, 1 iters, t-(init.)=1.47 s t(norm)=0.311534, mflops=16.0496 (err=4.4e-16) Top mflops for N=262144 = 31.249 Normalized results and averages for N=262144: fft 0: mflops = 31.249 (norm. = 1), norm. avg. (of 5) = 1 fft 1: mflops = 19.994 (norm. = 0.639831), norm. avg. (of 4) = 0.550523 fft 2: mflops = 15.1237 (norm. = 0.483974), norm. avg. (of 4) = 0.426638 fft 3: mflops = 9.51329 (norm. = 0.304435), norm. avg. (of 5) = 0.356537 fft 4: mflops = 8.83631 (norm. = 0.282772), norm. avg. (of 5) = 0.29012 fft 5: mflops = 19.3385 (norm. = 0.618852), norm. avg. (of 5) = 0.410391 fft 6: mflops = 12.483 (norm. = 0.399471), norm. avg. (of 5) = 0.261957 fft 7: mflops = 11.2885 (norm. = 0.361244), norm. avg. (of 5) = 0.346249 fft 8: mflops = 12.9632 (norm. = 0.414835), norm. avg. (of 5) = 0.417966 fft 9: mflops = 23.8313 (norm. = 0.762626), norm. avg. 
(of 5) = 0.649484 fft 10: mflops = 16.0496 (norm. = 0.513605), norm. avg. (of 5) = 0.429644 Benchmarking for array size = 256x64x32 (power of 2): 0. FFTW: elapsed time t=1.68 s, 1 iters, t-(init.)=1.58 s t(norm)=0.158611, mflops=31.5236 (err=5.5e-16) 1. HARM: elapsed time t=2.75 s, 1 iters, t-(init.)=2.64 s t(norm)=0.265021, mflops=18.8664 (err=6.1e-16) 2. HARM (f2c): elapsed time t=3.55 s, 1 iters, t-(init.)=3.45 s t(norm)=0.346334, mflops=14.4369 (err=5.9e-16) 3. NR (C): elapsed time t=5.4 s, 1 iters, t-(init.)=5.3 s t(norm)=0.53205, mflops=9.39762 (err=5.7e-16) 4. NR (F): elapsed time t=5.83 s, 1 iters, t-(init.)=5.72 s t(norm)=0.574212, mflops=8.70758 (err=5.7e-16) 5. PDA: elapsed time t=2.62 s, 1 iters, t-(init.)=2.53 s t(norm)=0.253979, mflops=19.6867 (err=4.9e-16) 6. PDA (f2c): elapsed time t=4.03 s, 1 iters, t-(init.)=3.93 s t(norm)=0.39452, mflops=12.6736 (err=5.1e-16) 7. Singleton: elapsed time t=4.73 s, 1 iters, t-(init.)=4.63 s t(norm)=0.464791, mflops=10.7575 (err=7.2e-16) 8. Singleton (f2c): elapsed time t=4.25 s, 1 iters, t-(init.)=4.16 s t(norm)=0.417609, mflops=11.9729 (err=7.1e-16) 9. Temperton: elapsed time t=2.23 s, 1 iters, t-(init.)=2.13 s t(norm)=0.213824, mflops=23.3837 (err=1.5e-07) 10. Temperton (f2c): elapsed time t=3.23 s, 1 iters, t-(init.)=3.13 s t(norm)=0.314211, mflops=15.9129 (err=5.5e-16) Top mflops for N=524288 = 31.5236 Normalized results and averages for N=524288: fft 0: mflops = 31.5236 (norm. = 1), norm. avg. (of 6) = 1 fft 1: mflops = 18.8664 (norm. = 0.598485), norm. avg. (of 5) = 0.560116 fft 2: mflops = 14.4369 (norm. = 0.457971), norm. avg. (of 5) = 0.432905 fft 3: mflops = 9.39762 (norm. = 0.298113), norm. avg. (of 6) = 0.3468 fft 4: mflops = 8.70758 (norm. = 0.276224), norm. avg. (of 6) = 0.287804 fft 5: mflops = 19.6867 (norm. = 0.624506), norm. avg. (of 6) = 0.446077 fft 6: mflops = 12.6736 (norm. = 0.402036), norm. avg. (of 6) = 0.285303 fft 7: mflops = 10.7575 (norm. = 0.341253), norm. avg. 
(of 6) = 0.345416 fft 8: mflops = 11.9729 (norm. = 0.379808), norm. avg. (of 6) = 0.411606 fft 9: mflops = 23.3837 (norm. = 0.741784), norm. avg. (of 6) = 0.664867 fft 10: mflops = 15.9129 (norm. = 0.504792), norm. avg. (of 6) = 0.442169 Benchmarking for array size = 16x1024x64 (power of 2): 0. FFTW: elapsed time t=3.56 s, 1 iters, t-(init.)=3.36 s t(norm)=0.160217, mflops=31.2076 (err=6.0e-16) 1. HARM: elapsed time t=5.7 s, 1 iters, t-(init.)=5.51 s t(norm)=0.262737, mflops=19.0304 (err=5.8e-16) 2. HARM (f2c): elapsed time t=7.35 s, 1 iters, t-(init.)=7.16 s t(norm)=0.341415, mflops=14.6449 (err=5.5e-16) 3. NR (C): elapsed time t=11.26 s, 1 iters, t-(init.)=11.07 s t(norm)=0.527859, mflops=9.47223 (err=7.3e-16) 4. NR (F): elapsed time t=12.16 s, 1 iters, t-(init.)=11.96 s t(norm)=0.570297, mflops=8.76736 (err=7.3e-16) 5. PDA: elapsed time t=5.61 s, 1 iters, t-(init.)=5.42 s t(norm)=0.258446, mflops=19.3464 (err=6.0e-16) 6. PDA (f2c): elapsed time t=8.61 s, 1 iters, t-(init.)=8.42 s t(norm)=0.401497, mflops=12.4534 (err=6.0e-16) 7. Singleton: elapsed time t=9.44 s, 1 iters, t-(init.)=9.24 s t(norm)=0.440598, mflops=11.3482 (err=7.1e-16) 8. Singleton (f2c): elapsed time t=8.39 s, 1 iters, t-(init.)=8.19 s t(norm)=0.39053, mflops=12.8031 (err=6.9e-16) 9. Skipping fft (Temperton can't handle dimensions > 256). 10. Skipping fft (Temperton can't handle dimensions > 256). Top mflops for N=1048576 = 31.2076 Normalized results and averages for N=1048576: fft 0: mflops = 31.2076 (norm. = 1), norm. avg. (of 7) = 1 fft 1: mflops = 19.0304 (norm. = 0.6098), norm. avg. (of 6) = 0.568396 fft 2: mflops = 14.6449 (norm. = 0.469274), norm. avg. (of 6) = 0.438966 fft 3: mflops = 9.47223 (norm. = 0.303523), norm. avg. (of 7) = 0.340618 fft 4: mflops = 8.76736 (norm. = 0.280936), norm. avg. (of 7) = 0.286823 fft 5: mflops = 19.3464 (norm. = 0.619926), norm. avg. (of 7) = 0.470912 fft 6: mflops = 12.4534 (norm. = 0.39905), norm. avg. (of 7) = 0.301553 fft 7: mflops = 11.3482 (norm. 
= 0.363636), norm. avg. (of 7) = 0.348019 fft 8: mflops = 12.8031 (norm. = 0.410256), norm. avg. (of 7) = 0.411413 fft 9: mflops = -1 (norm. = -0.0320435), norm. avg. (of 6) = 0.664867 fft 10: mflops = -1 (norm. = -0.0320435), norm. avg. (of 6) = 0.442169 Benchmarking for array size = 128x128x128 (power of 2): 0. FFTW: elapsed time t=7.37 s, 1 iters, t-(init.)=6.98 s t(norm)=0.158492, mflops=31.5474 (err=6.6e-16) 1. HARM: elapsed time t=12.24 s, 1 iters, t-(init.)=11.85 s t(norm)=0.269072, mflops=18.5824 (err=6.3e-16) 2. HARM (f2c): elapsed time t=15.56 s, 1 iters, t-(init.)=15.17 s t(norm)=0.344458, mflops=14.5156 (err=6.1e-16) 3. NR (C): elapsed time t=23.86 s, 1 iters, t-(init.)=23.47 s t(norm)=0.532922, mflops=9.38223 (err=7.5e-16) 4. NR (F): elapsed time t=25.61 s, 1 iters, t-(init.)=25.21 s t(norm)=0.572432, mflops=8.73467 (err=7.5e-16) 5. PDA: elapsed time t=11.14 s, 1 iters, t-(init.)=10.74 s t(norm)=0.243868, mflops=20.5029 (err=6.7e-16) 6. PDA (f2c): elapsed time t=17.05 s, 1 iters, t-(init.)=16.66 s t(norm)=0.378291, mflops=13.2173 (err=6.7e-16) 7. Singleton: elapsed time t=22.65 s, 1 iters, t-(init.)=22.25 s t(norm)=0.50522, mflops=9.89667 (err=8.0e-16) 8. Singleton (f2c): elapsed time t=21.26 s, 1 iters, t-(init.)=20.86 s t(norm)=0.473658, mflops=10.5561 (err=7.9e-16) 9. Temperton: elapsed time t=12.63 s, 1 iters, t-(init.)=12.23 s t(norm)=0.277701, mflops=18.005 (err=1.5e-07) 10. Temperton (f2c): elapsed time t=17.24 s, 1 iters, t-(init.)=16.84 s t(norm)=0.382378, mflops=13.0761 (err=7.1e-16) Top mflops for N=2097152 = 31.5474 Normalized results and averages for N=2097152: fft 0: mflops = 31.5474 (norm. = 1), norm. avg. (of 8) = 1 fft 1: mflops = 18.5824 (norm. = 0.58903), norm. avg. (of 7) = 0.571344 fft 2: mflops = 14.5156 (norm. = 0.460119), norm. avg. (of 7) = 0.441988 fft 3: mflops = 9.38223 (norm. = 0.297401), norm. avg. (of 8) = 0.335215 fft 4: mflops = 8.73467 (norm. = 0.276874), norm. avg. (of 8) = 0.285579 fft 5: mflops = 20.5029 (norm. 
= 0.649907), norm. avg. (of 8) = 0.493287 fft 6: mflops = 13.2173 (norm. = 0.418968), norm. avg. (of 8) = 0.31623 fft 7: mflops = 9.89667 (norm. = 0.313708), norm. avg. (of 8) = 0.34373 fft 8: mflops = 10.5561 (norm. = 0.334612), norm. avg. (of 8) = 0.401813 fft 9: mflops = 18.005 (norm. = 0.570728), norm. avg. (of 7) = 0.651419 fft 10: mflops = 13.0761 (norm. = 0.414489), norm. avg. (of 7) = 0.438214 ------------------------------------------------------ @@@@ bench.3d.np2.log Benchmarking for sizes: 5x5x5 (0.0022583 MB) 6x6x6 (0.00369263 MB) 7x7x7 (0.00567627 MB) 9x9x9 (0.0116577 MB) 10x10x10 (0.0158386 MB) 11x11x11 (0.0209351 MB) 12x12x12 (0.0270386 MB) 13x13x13 (0.0342407 MB) 14x14x14 (0.0426331 MB) 15x15x15 (0.0523071 MB) 24x25x28 (0.257751 MB) 48x48x48 (1.68982 MB) 49x49x49 (1.79755 MB) 60x60x60 (3.29877 MB) 72x60x56 (3.69482 MB) 75x75x75 (6.44086 MB) 80x80x80 (7.81628 MB) 84x84x84 (9.04791 MB) 96x96x96 (13.5045 MB) 105x105x105 (17.6689 MB) 112x112x112 (21.4427 MB) 120x120x120 (26.3728 MB) 144x144x144 (45.5692 MB) Maximum array size N = 2985984 Benchmarking FFTs: 0. FFTW 1. PDA 2. PDA (f2c) 3. Singleton 4. Singleton (f2c) 5. Temperton 6. Temperton (f2c) Computing normalized averages (7 transforms). Benchmarking for array size = 5x5x5: 0. FFTW: elapsed time t=1.25 s, 8192 iters, t-(init.)=1.11 s t(norm)=0.155616, mflops=32.1305 (err=2.9e-16) 1. PDA: elapsed time t=1.24 s, 4096 iters, t-(init.)=1.16 s t(norm)=0.325251, mflops=15.3728 (err=2.5e-16) 2. PDA (f2c): elapsed time t=1.71 s, 4096 iters, t-(init.)=1.66 s t(norm)=0.465445, mflops=10.7424 (err=2.4e-16) 3. Singleton: elapsed time t=1.64 s, 16384 iters, t-(init.)=1.27 s t(norm)=0.0890233, mflops=56.1651 (err=3.2e-16) 4. Singleton (f2c): elapsed time t=1.77 s, 16384 iters, t-(init.)=1.42 s t(norm)=0.0995379, mflops=50.2321 (err=2.6e-16) 5. Temperton: elapsed time t=1.39 s, 8192 iters, t-(init.)=1.31 s t(norm)=0.183654, mflops=27.2251 (err=3.8e-08) 6. 
Temperton (f2c): elapsed time t=1.43 s, 8192 iters, t-(init.)=1.31 s t(norm)=0.183654, mflops=27.2251 (err=1.7e-16) Top mflops for N=125 = 56.1651 Normalized results and averages for N=125: fft 0: mflops = 32.1305 (norm. = 0.572072), norm. avg. (of 1) = 0.572072 fft 1: mflops = 15.3728 (norm. = 0.273707), norm. avg. (of 1) = 0.273707 fft 2: mflops = 10.7424 (norm. = 0.191265), norm. avg. (of 1) = 0.191265 fft 3: mflops = 56.1651 (norm. = 1), norm. avg. (of 1) = 1 fft 4: mflops = 50.2321 (norm. = 0.894366), norm. avg. (of 1) = 0.894366 fft 5: mflops = 27.2251 (norm. = 0.484733), norm. avg. (of 1) = 0.484733 fft 6: mflops = 27.2251 (norm. = 0.484733), norm. avg. (of 1) = 0.484733 Benchmarking for array size = 6x6x6: 0. FFTW: elapsed time t=1.44 s, 8192 iters, t-(init.)=1.17 s t(norm)=0.0852642, mflops=58.6413 (err=2.3e-16) 1. PDA: elapsed time t=1.14 s, 2048 iters, t-(init.)=1.12 s t(norm)=0.326482, mflops=15.3148 (err=3.7e-16) 2. PDA (f2c): elapsed time t=1.51 s, 2048 iters, t-(init.)=1.49 s t(norm)=0.434337, mflops=11.5118 (err=3.8e-16) 3. Singleton: elapsed time t=1.22 s, 4096 iters, t-(init.)=1.09 s t(norm)=0.158868, mflops=31.4726 (err=3.0e-16) 4. Singleton (f2c): elapsed time t=1.07 s, 4096 iters, t-(init.)=1.04 s t(norm)=0.151581, mflops=32.9857 (err=3.0e-16) 5. Temperton: elapsed time t=1.14 s, 4096 iters, t-(init.)=1.09 s t(norm)=0.158868, mflops=31.4726 (err=1.7e-08) 6. Temperton (f2c): elapsed time t=1.26 s, 4096 iters, t-(init.)=1.17 s t(norm)=0.170528, mflops=29.3206 (err=2.9e-16) Top mflops for N=216 = 58.6413 Normalized results and averages for N=216: fft 0: mflops = 58.6413 (norm. = 1), norm. avg. (of 2) = 0.786036 fft 1: mflops = 15.3148 (norm. = 0.261161), norm. avg. (of 2) = 0.267434 fft 2: mflops = 11.5118 (norm. = 0.196309), norm. avg. (of 2) = 0.193787 fft 3: mflops = 31.4726 (norm. = 0.536697), norm. avg. (of 2) = 0.768349 fft 4: mflops = 32.9857 (norm. = 0.5625), norm. avg. (of 2) = 0.728433 fft 5: mflops = 31.4726 (norm. = 0.536697), norm. 
avg. (of 2) = 0.510715 fft 6: mflops = 29.3206 (norm. = 0.5), norm. avg. (of 2) = 0.492366 Benchmarking for array size = 7x7x7: 0. FFTW: elapsed time t=1.1 s, 2048 iters, t-(init.)=1.07 s t(norm)=0.180859, mflops=27.6458 (err=2.1e-16) 1. PDA: elapsed time t=1.88 s, 1024 iters, t-(init.)=1.86 s t(norm)=0.628782, mflops=7.95188 (err=3.7e-16) 2. PDA (f2c): elapsed time t=1.01 s, 512 iters, t-(init.)=0.98 s t(norm)=0.662588, mflops=7.54617 (err=3.4e-16) 3. Singleton: elapsed time t=1.31 s, 2048 iters, t-(init.)=1.25 s t(norm)=0.211284, mflops=23.6648 (err=4.2e-16) 4. Singleton (f2c): elapsed time t=1.13 s, 2048 iters, t-(init.)=1.03 s t(norm)=0.174098, mflops=28.7194 (err=4.2e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=343 = 28.7194 Normalized results and averages for N=343: fft 0: mflops = 27.6458 (norm. = 0.962617), norm. avg. (of 3) = 0.844896 fft 1: mflops = 7.95188 (norm. = 0.276882), norm. avg. (of 3) = 0.270583 fft 2: mflops = 7.54617 (norm. = 0.262755), norm. avg. (of 3) = 0.216776 fft 3: mflops = 23.6648 (norm. = 0.824), norm. avg. (of 3) = 0.786899 fft 4: mflops = 28.7194 (norm. = 1), norm. avg. (of 3) = 0.818955 fft 5: mflops = -1 (norm. = -0.0348197), norm. avg. (of 2) = 0.510715 fft 6: mflops = -1 (norm. = -0.0348197), norm. avg. (of 2) = 0.492366 Benchmarking for array size = 9x9x9: 0. FFTW: elapsed time t=1.87 s, 2048 iters, t-(init.)=1.67 s t(norm)=0.117622, mflops=42.509 (err=4.1e-16) 1. PDA: elapsed time t=1.82 s, 1024 iters, t-(init.)=1.77 s t(norm)=0.249331, mflops=20.0537 (err=3.2e-16) 2. PDA (f2c): elapsed time t=1.45 s, 512 iters, t-(init.)=1.42 s t(norm)=0.400056, mflops=12.4983 (err=3.6e-16) 3. Singleton: elapsed time t=1.1 s, 1024 iters, t-(init.)=0.99 s t(norm)=0.139456, mflops=35.8536 (err=3.6e-16) 4. Singleton (f2c): elapsed time t=1.08 s, 1024 iters, t-(init.)=0.97 s t(norm)=0.136639, mflops=36.5928 (err=3.6e-16) 5. 
Temperton: elapsed time t=1.21 s, 1024 iters, t-(init.)=1.16 s t(norm)=0.163403, mflops=30.5992 (err=5.3e-08) 6. Temperton (f2c): elapsed time t=1.28 s, 1024 iters, t-(init.)=1.21 s t(norm)=0.170446, mflops=29.3347 (err=3.5e-16) Top mflops for N=729 = 42.509 Normalized results and averages for N=729: fft 0: mflops = 42.509 (norm. = 1), norm. avg. (of 4) = 0.883672 fft 1: mflops = 20.0537 (norm. = 0.471751), norm. avg. (of 4) = 0.320875 fft 2: mflops = 12.4983 (norm. = 0.294014), norm. avg. (of 4) = 0.236086 fft 3: mflops = 35.8536 (norm. = 0.843434), norm. avg. (of 4) = 0.801033 fft 4: mflops = 36.5928 (norm. = 0.860825), norm. avg. (of 4) = 0.829423 fft 5: mflops = 30.5992 (norm. = 0.719828), norm. avg. (of 3) = 0.580419 fft 6: mflops = 29.3347 (norm. = 0.690083), norm. avg. (of 3) = 0.558272 Benchmarking for array size = 10x10x10: 0. FFTW: elapsed time t=1.32 s, 1024 iters, t-(init.)=1.16 s t(norm)=0.11367, mflops=43.9869 (err=2.6e-16) 1. PDA: elapsed time t=1.27 s, 512 iters, t-(init.)=1.21 s t(norm)=0.23714, mflops=21.0846 (err=2.8e-16) 2. PDA (f2c): elapsed time t=1.87 s, 512 iters, t-(init.)=1.81 s t(norm)=0.354729, mflops=14.0953 (err=3.1e-16) 3. Singleton: elapsed time t=1.7 s, 1024 iters, t-(init.)=1.53 s t(norm)=0.149927, mflops=33.3496 (err=3.8e-16) 4. Singleton (f2c): elapsed time t=1.6 s, 1024 iters, t-(init.)=1.44 s t(norm)=0.141108, mflops=35.4339 (err=3.7e-16) 5. Temperton: elapsed time t=1.63 s, 1024 iters, t-(init.)=1.5 s t(norm)=0.146987, mflops=34.0165 (err=2.5e-08) 6. Temperton (f2c): elapsed time t=1.91 s, 1024 iters, t-(init.)=1.76 s t(norm)=0.172465, mflops=28.9914 (err=2.8e-16) Top mflops for N=1000 = 43.9869 Normalized results and averages for N=1000: fft 0: mflops = 43.9869 (norm. = 1), norm. avg. (of 5) = 0.906938 fft 1: mflops = 21.0846 (norm. = 0.479339), norm. avg. (of 5) = 0.352568 fft 2: mflops = 14.0953 (norm. = 0.320442), norm. avg. (of 5) = 0.252957 fft 3: mflops = 33.3496 (norm. = 0.75817), norm. avg. 
(of 5) = 0.79246 fft 4: mflops = 35.4339 (norm. = 0.805556), norm. avg. (of 5) = 0.824649 fft 5: mflops = 34.0165 (norm. = 0.773333), norm. avg. (of 4) = 0.628648 fft 6: mflops = 28.9914 (norm. = 0.659091), norm. avg. (of 4) = 0.583477 Benchmarking for array size = 11x11x11: 0. FFTW: elapsed time t=1.65 s, 512 iters, t-(init.)=1.54 s t(norm)=0.217744, mflops=22.9627 (err=2.4e-16) 1. PDA: elapsed time t=1.14 s, 128 iters, t-(init.)=1.12 s t(norm)=0.633438, mflops=7.89343 (err=4.2e-16) 2. PDA (f2c): elapsed time t=1.24 s, 128 iters, t-(init.)=1.22 s t(norm)=0.689995, mflops=7.24643 (err=4.3e-16) 3. Singleton: elapsed time t=1.87 s, 512 iters, t-(init.)=1.8 s t(norm)=0.254506, mflops=19.6459 (err=3.8e-16) 4. Singleton (f2c): elapsed time t=1.65 s, 512 iters, t-(init.)=1.53 s t(norm)=0.21633, mflops=23.1128 (err=3.8e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1331 = 23.1128 Normalized results and averages for N=1331: fft 0: mflops = 22.9627 (norm. = 0.993506), norm. avg. (of 6) = 0.921366 fft 1: mflops = 7.89343 (norm. = 0.341518), norm. avg. (of 6) = 0.350726 fft 2: mflops = 7.24643 (norm. = 0.313525), norm. avg. (of 6) = 0.263052 fft 3: mflops = 19.6459 (norm. = 0.85), norm. avg. (of 6) = 0.80205 fft 4: mflops = 23.1128 (norm. = 1), norm. avg. (of 6) = 0.853874 fft 5: mflops = -1 (norm. = -0.0432661), norm. avg. (of 4) = 0.628648 fft 6: mflops = -1 (norm. = -0.0432661), norm. avg. (of 4) = 0.583477 Benchmarking for array size = 12x12x12: 0. FFTW: elapsed time t=1.19 s, 512 iters, t-(init.)=1.06 s t(norm)=0.1114, mflops=44.8832 (err=3.3e-16) 1. PDA: elapsed time t=1.08 s, 256 iters, t-(init.)=1 s t(norm)=0.210189, mflops=23.7881 (err=3.1e-16) 2. PDA (f2c): elapsed time t=1.74 s, 256 iters, t-(init.)=1.69 s t(norm)=0.35522, mflops=14.0758 (err=3.5e-16) 3. Singleton: elapsed time t=1 s, 256 iters, t-(init.)=0.95 s t(norm)=0.19968, mflops=25.0401 (err=3.7e-16) 4. 
Singleton (f2c): elapsed time t=1.97 s, 512 iters, t-(init.)=1.84 s t(norm)=0.193374, mflops=25.8566 (err=3.7e-16) 5. Temperton: elapsed time t=1.69 s, 512 iters, t-(init.)=1.56 s t(norm)=0.163948, mflops=30.4976 (err=1.6e-08) 6. Temperton (f2c): elapsed time t=1.94 s, 512 iters, t-(init.)=1.78 s t(norm)=0.187068, mflops=26.7282 (err=3.9e-16) Top mflops for N=1728 = 44.8832 Normalized results and averages for N=1728: fft 0: mflops = 44.8832 (norm. = 1), norm. avg. (of 7) = 0.932599 fft 1: mflops = 23.7881 (norm. = 0.53), norm. avg. (of 7) = 0.376337 fft 2: mflops = 14.0758 (norm. = 0.313609), norm. avg. (of 7) = 0.270274 fft 3: mflops = 25.0401 (norm. = 0.557895), norm. avg. (of 7) = 0.767171 fft 4: mflops = 25.8566 (norm. = 0.576087), norm. avg. (of 7) = 0.81419 fft 5: mflops = 30.4976 (norm. = 0.679487), norm. avg. (of 5) = 0.638816 fft 6: mflops = 26.7282 (norm. = 0.595506), norm. avg. (of 5) = 0.585882 Benchmarking for array size = 13x13x13: 0. FFTW: elapsed time t=1.56 s, 256 iters, t-(init.)=1.46 s t(norm)=0.233834, mflops=21.3827 (err=2.3e-16) 1. PDA: elapsed time t=1.04 s, 64 iters, t-(init.)=1.02 s t(norm)=0.653455, mflops=7.65164 (err=8.7e-16) 2. PDA (f2c): elapsed time t=1.14 s, 64 iters, t-(init.)=1.12 s t(norm)=0.717519, mflops=6.96846 (err=8.4e-16) 3. Singleton: elapsed time t=1.81 s, 256 iters, t-(init.)=1.71 s t(norm)=0.273874, mflops=18.2565 (err=5.3e-16) 4. Singleton (f2c): elapsed time t=1.64 s, 256 iters, t-(init.)=1.54 s t(norm)=0.246647, mflops=20.2719 (err=5.3e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=2197 = 21.3827 Normalized results and averages for N=2197: fft 0: mflops = 21.3827 (norm. = 1), norm. avg. (of 8) = 0.941024 fft 1: mflops = 7.65164 (norm. = 0.357843), norm. avg. (of 8) = 0.374025 fft 2: mflops = 6.96846 (norm. = 0.325893), norm. avg. (of 8) = 0.277226 fft 3: mflops = 18.2565 (norm. = 0.853801), norm. avg. 
(of 8) = 0.778 fft 4: mflops = 20.2719 (norm. = 0.948052), norm. avg. (of 8) = 0.830923 fft 5: mflops = -1 (norm. = -0.0467669), norm. avg. (of 5) = 0.638816 fft 6: mflops = -1 (norm. = -0.0467669), norm. avg. (of 5) = 0.585882 Benchmarking for array size = 14x14x14: 0. FFTW: elapsed time t=1.35 s, 256 iters, t-(init.)=1.22 s t(norm)=0.152052, mflops=32.8836 (err=2.6e-16) 1. PDA: elapsed time t=1.68 s, 128 iters, t-(init.)=1.62 s t(norm)=0.403809, mflops=12.3821 (err=3.7e-16) 2. PDA (f2c): elapsed time t=1.95 s, 128 iters, t-(init.)=1.89 s t(norm)=0.471111, mflops=10.6132 (err=3.6e-16) 3. Singleton: elapsed time t=1.12 s, 128 iters, t-(init.)=1.07 s t(norm)=0.266714, mflops=18.7467 (err=4.1e-16) 4. Singleton (f2c): elapsed time t=1.99 s, 256 iters, t-(init.)=1.88 s t(norm)=0.234309, mflops=21.3393 (err=4.1e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=2744 = 32.8836 Normalized results and averages for N=2744: fft 0: mflops = 32.8836 (norm. = 1), norm. avg. (of 9) = 0.947577 fft 1: mflops = 12.3821 (norm. = 0.376543), norm. avg. (of 9) = 0.374305 fft 2: mflops = 10.6132 (norm. = 0.322751), norm. avg. (of 9) = 0.282285 fft 3: mflops = 18.7467 (norm. = 0.570093), norm. avg. (of 9) = 0.754899 fft 4: mflops = 21.3393 (norm. = 0.648936), norm. avg. (of 9) = 0.810702 fft 5: mflops = -1 (norm. = -0.0304103), norm. avg. (of 5) = 0.638816 fft 6: mflops = -1 (norm. = -0.0304103), norm. avg. (of 5) = 0.585882 Benchmarking for array size = 15x15x15: 0. FFTW: elapsed time t=1.52 s, 256 iters, t-(init.)=1.36 s t(norm)=0.134299, mflops=37.2304 (err=3.7e-16) 1. PDA: elapsed time t=1.18 s, 128 iters, t-(init.)=1.11 s t(norm)=0.219223, mflops=22.8078 (err=3.5e-16) 2. PDA (f2c): elapsed time t=1.86 s, 128 iters, t-(init.)=1.78 s t(norm)=0.351547, mflops=14.2228 (err=3.4e-16) 3. Singleton: elapsed time t=1.11 s, 128 iters, t-(init.)=1.03 s t(norm)=0.203423, mflops=24.5793 (err=4.2e-16) 4. 
Singleton (f2c): elapsed time t=1.14 s, 128 iters, t-(init.)=1.07 s t(norm)=0.211323, mflops=23.6604 (err=4.1e-16) 5. Temperton: elapsed time t=1.01 s, 128 iters, t-(init.)=0.94 s t(norm)=0.185649, mflops=26.9326 (err=1.8e-08) 6. Temperton (f2c): elapsed time t=1.08 s, 128 iters, t-(init.)=1.01 s t(norm)=0.199473, mflops=25.066 (err=3.8e-16) Top mflops for N=3375 = 37.2304 Normalized results and averages for N=3375: fft 0: mflops = 37.2304 (norm. = 1), norm. avg. (of 10) = 0.95282 fft 1: mflops = 22.8078 (norm. = 0.612613), norm. avg. (of 10) = 0.398136 fft 2: mflops = 14.2228 (norm. = 0.382022), norm. avg. (of 10) = 0.292259 fft 3: mflops = 24.5793 (norm. = 0.660194), norm. avg. (of 10) = 0.745429 fft 4: mflops = 23.6604 (norm. = 0.635514), norm. avg. (of 10) = 0.793184 fft 5: mflops = 26.9326 (norm. = 0.723404), norm. avg. (of 6) = 0.652914 fft 6: mflops = 25.066 (norm. = 0.673267), norm. avg. (of 6) = 0.600447 Benchmarking for array size = 24x25x28: 0. FFTW: elapsed time t=1.37 s, 32 iters, t-(init.)=1.27 s t(norm)=0.168305, mflops=29.7081 (err=3.5e-16) 1. PDA: elapsed time t=1.98 s, 32 iters, t-(init.)=1.88 s t(norm)=0.249144, mflops=20.0687 (err=3.8e-16) 2. PDA (f2c): elapsed time t=1.44 s, 16 iters, t-(init.)=1.39 s t(norm)=0.368415, mflops=13.5717 (err=3.8e-16) 3. Singleton: elapsed time t=1.12 s, 16 iters, t-(init.)=1.07 s t(norm)=0.2836, mflops=17.6305 (err=4.5e-16) 4. Singleton (f2c): elapsed time t=1.1 s, 16 iters, t-(init.)=1.05 s t(norm)=0.278299, mflops=17.9663 (err=4.4e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=16800 = 29.7081 Normalized results and averages for N=16800: fft 0: mflops = 29.7081 (norm. = 1), norm. avg. (of 11) = 0.957109 fft 1: mflops = 20.0687 (norm. = 0.675532), norm. avg. (of 11) = 0.423353 fft 2: mflops = 13.5717 (norm. = 0.456835), norm. avg. (of 11) = 0.30722 fft 3: mflops = 17.6305 (norm. = 0.593458), norm. avg. 
(of 11) = 0.731613 fft 4: mflops = 17.9663 (norm. = 0.604762), norm. avg. (of 11) = 0.776054 fft 5: mflops = -1 (norm. = -0.0336609), norm. avg. (of 6) = 0.652914 fft 6: mflops = -1 (norm. = -0.0336609), norm. avg. (of 6) = 0.600447 Benchmarking for array size = 48x48x48: 0. FFTW: elapsed time t=1.49 s, 4 iters, t-(init.)=1.4 s t(norm)=0.188887, mflops=26.4708 (err=4.4e-16) 1. PDA: elapsed time t=1.03 s, 2 iters, t-(init.)=0.99 s t(norm)=0.267141, mflops=18.7167 (err=4.0e-16) 2. PDA (f2c): elapsed time t=1.5 s, 2 iters, t-(init.)=1.46 s t(norm)=0.393965, mflops=12.6915 (err=4.2e-16) 3. Singleton: elapsed time t=1.47 s, 2 iters, t-(init.)=1.43 s t(norm)=0.38587, mflops=12.9577 (err=3.7e-16) 4. Singleton (f2c): elapsed time t=1.5 s, 2 iters, t-(init.)=1.46 s t(norm)=0.393965, mflops=12.6915 (err=3.6e-16) 5. Temperton: elapsed time t=1.94 s, 4 iters, t-(init.)=1.85 s t(norm)=0.249601, mflops=20.032 (err=1.1e-07) 6. Temperton (f2c): elapsed time t=1.05 s, 2 iters, t-(init.)=1.01 s t(norm)=0.272537, mflops=18.3461 (err=4.8e-16) Top mflops for N=110592 = 26.4708 Normalized results and averages for N=110592: fft 0: mflops = 26.4708 (norm. = 1), norm. avg. (of 12) = 0.960683 fft 1: mflops = 18.7167 (norm. = 0.707071), norm. avg. (of 12) = 0.446997 fft 2: mflops = 12.6915 (norm. = 0.479452), norm. avg. (of 12) = 0.321573 fft 3: mflops = 12.9577 (norm. = 0.48951), norm. avg. (of 12) = 0.711438 fft 4: mflops = 12.6915 (norm. = 0.479452), norm. avg. (of 12) = 0.751337 fft 5: mflops = 20.032 (norm. = 0.756757), norm. avg. (of 7) = 0.667748 fft 6: mflops = 18.3461 (norm. = 0.693069), norm. avg. (of 7) = 0.613678 Benchmarking for array size = 49x49x49: 0. FFTW: elapsed time t=1.77 s, 4 iters, t-(init.)=1.69 s t(norm)=0.213201, mflops=23.452 (err=4.3e-16) 1. PDA: elapsed time t=1.7 s, 2 iters, t-(init.)=1.65 s t(norm)=0.41631, mflops=12.0103 (err=5.7e-16) 2. PDA (f2c): elapsed time t=1.99 s, 2 iters, t-(init.)=1.95 s t(norm)=0.492003, mflops=10.1625 (err=5.7e-16) 3. 
Singleton: elapsed time t=1.59 s, 2 iters, t-(init.)=1.54 s t(norm)=0.388556, mflops=12.8681 (err=6.8e-16) 4. Singleton (f2c): elapsed time t=1.5 s, 2 iters, t-(init.)=1.46 s t(norm)=0.368372, mflops=13.5733 (err=6.8e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=117649 = 23.452 Normalized results and averages for N=117649: fft 0: mflops = 23.452 (norm. = 1), norm. avg. (of 13) = 0.963707 fft 1: mflops = 12.0103 (norm. = 0.512121), norm. avg. (of 13) = 0.452006 fft 2: mflops = 10.1625 (norm. = 0.433333), norm. avg. (of 13) = 0.33017 fft 3: mflops = 12.8681 (norm. = 0.548701), norm. avg. (of 13) = 0.69892 fft 4: mflops = 13.5733 (norm. = 0.578767), norm. avg. (of 13) = 0.738063 fft 5: mflops = -1 (norm. = -0.0426403), norm. avg. (of 7) = 0.667748 fft 6: mflops = -1 (norm. = -0.0426403), norm. avg. (of 7) = 0.613678 Benchmarking for array size = 60x60x60: 0. FFTW: elapsed time t=1.48 s, 2 iters, t-(init.)=1.4 s t(norm)=0.182879, mflops=27.3405 (err=4.8e-16) 1. PDA: elapsed time t=1.03 s, 1 iters, t-(init.)=0.99 s t(norm)=0.258643, mflops=19.3316 (err=4.4e-16) 2. PDA (f2c): elapsed time t=1.52 s, 1 iters, t-(init.)=1.47 s t(norm)=0.384046, mflops=13.0193 (err=4.4e-16) 3. Singleton: elapsed time t=1.77 s, 1 iters, t-(init.)=1.73 s t(norm)=0.451973, mflops=11.0626 (err=5.4e-16) 4. Singleton (f2c): elapsed time t=1.77 s, 1 iters, t-(init.)=1.73 s t(norm)=0.451973, mflops=11.0626 (err=5.4e-16) 5. Temperton: elapsed time t=1.96 s, 2 iters, t-(init.)=1.88 s t(norm)=0.245581, mflops=20.3599 (err=1.9e-08) 6. Temperton (f2c): elapsed time t=1.09 s, 1 iters, t-(init.)=1.04 s t(norm)=0.271706, mflops=18.4022 (err=5.4e-16) Top mflops for N=216000 = 27.3405 Normalized results and averages for N=216000: fft 0: mflops = 27.3405 (norm. = 1), norm. avg. (of 14) = 0.9663 fft 1: mflops = 19.3316 (norm. = 0.707071), norm. avg. (of 14) = 0.470225 fft 2: mflops = 13.0193 (norm. = 0.47619), norm. avg. 
(of 14) = 0.3406 fft 3: mflops = 11.0626 (norm. = 0.404624), norm. avg. (of 14) = 0.677899 fft 4: mflops = 11.0626 (norm. = 0.404624), norm. avg. (of 14) = 0.714246 fft 5: mflops = 20.3599 (norm. = 0.744681), norm. avg. (of 8) = 0.677365 fft 6: mflops = 18.4022 (norm. = 0.673077), norm. avg. (of 8) = 0.621103 Benchmarking for array size = 72x60x56: 0. FFTW: elapsed time t=1.66 s, 2 iters, t-(init.)=1.57 s t(norm)=0.181438, mflops=27.5576 (err=4.2e-16) 1. PDA: elapsed time t=1.3 s, 1 iters, t-(init.)=1.26 s t(norm)=0.291226, mflops=17.1688 (err=4.7e-16) 2. PDA (f2c): elapsed time t=1.81 s, 1 iters, t-(init.)=1.77 s t(norm)=0.409103, mflops=12.2219 (err=4.7e-16) 3. Singleton: elapsed time t=2.14 s, 1 iters, t-(init.)=2.1 s t(norm)=0.485376, mflops=10.3013 (err=5.2e-16) 4. Singleton (f2c): elapsed time t=2.13 s, 1 iters, t-(init.)=2.09 s t(norm)=0.483065, mflops=10.3506 (err=5.2e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=241920 = 27.5576 Normalized results and averages for N=241920: fft 0: mflops = 27.5576 (norm. = 1), norm. avg. (of 15) = 0.968546 fft 1: mflops = 17.1688 (norm. = 0.623016), norm. avg. (of 15) = 0.480411 fft 2: mflops = 12.2219 (norm. = 0.443503), norm. avg. (of 15) = 0.34746 fft 3: mflops = 10.3013 (norm. = 0.37381), norm. avg. (of 15) = 0.657626 fft 4: mflops = 10.3506 (norm. = 0.375598), norm. avg. (of 15) = 0.691669 fft 5: mflops = -1 (norm. = -0.0362877), norm. avg. (of 8) = 0.677365 fft 6: mflops = -1 (norm. = -0.0362877), norm. avg. (of 8) = 0.621103 Benchmarking for array size = 75x75x75: 0. FFTW: elapsed time t=1.39 s, 1 iters, t-(init.)=1.31 s t(norm)=0.166173, mflops=30.0891 (err=5.2e-16) 1. PDA: elapsed time t=2.08 s, 1 iters, t-(init.)=2 s t(norm)=0.253699, mflops=19.7084 (err=5.8e-16) 2. PDA (f2c): elapsed time t=3.33 s, 1 iters, t-(init.)=3.25 s t(norm)=0.412261, mflops=12.1282 (err=5.8e-16) 3. 
Singleton: elapsed time t=2.97 s, 1 iters, t-(init.)=2.89 s t(norm)=0.366595, mflops=13.639 (err=6.6e-16) 4. Singleton (f2c): elapsed time t=3.05 s, 1 iters, t-(init.)=2.97 s t(norm)=0.376743, mflops=13.2716 (err=6.6e-16) 5. Temperton: elapsed time t=1.94 s, 1 iters, t-(init.)=1.86 s t(norm)=0.23594, mflops=21.1918 (err=1.4e-07) 6. Temperton (f2c): elapsed time t=2.13 s, 1 iters, t-(init.)=2.05 s t(norm)=0.260042, mflops=19.2277 (err=7.0e-16) Top mflops for N=421875 = 30.0891 Normalized results and averages for N=421875: fft 0: mflops = 30.0891 (norm. = 1), norm. avg. (of 16) = 0.970512 fft 1: mflops = 19.7084 (norm. = 0.655), norm. avg. (of 16) = 0.491323 fft 2: mflops = 12.1282 (norm. = 0.403077), norm. avg. (of 16) = 0.350936 fft 3: mflops = 13.639 (norm. = 0.453287), norm. avg. (of 16) = 0.644855 fft 4: mflops = 13.2716 (norm. = 0.441077), norm. avg. (of 16) = 0.676007 fft 5: mflops = 21.1918 (norm. = 0.704301), norm. avg. (of 9) = 0.680358 fft 6: mflops = 19.2277 (norm. = 0.639024), norm. avg. (of 9) = 0.623094 Benchmarking for array size = 80x80x80: 0. FFTW: elapsed time t=1.95 s, 1 iters, t-(init.)=1.86 s t(norm)=0.191546, mflops=26.1034 (err=3.9e-16) 1. PDA: elapsed time t=2.73 s, 1 iters, t-(init.)=2.64 s t(norm)=0.271871, mflops=18.3911 (err=3.8e-16) 2. PDA (f2c): elapsed time t=3.92 s, 1 iters, t-(init.)=3.83 s t(norm)=0.394419, mflops=12.6769 (err=3.7e-16) 3. Singleton: elapsed time t=4.07 s, 1 iters, t-(init.)=3.97 s t(norm)=0.408837, mflops=12.2298 (err=4.8e-16) 4. Singleton (f2c): elapsed time t=3.97 s, 1 iters, t-(init.)=3.87 s t(norm)=0.398538, mflops=12.5458 (err=4.7e-16) 5. Temperton: elapsed time t=2.6 s, 1 iters, t-(init.)=2.5 s t(norm)=0.257454, mflops=19.421 (err=1.7e-07) 6. Temperton (f2c): elapsed time t=2.91 s, 1 iters, t-(init.)=2.82 s t(norm)=0.290408, mflops=17.2172 (err=5.2e-16) Top mflops for N=512000 = 26.1034 Normalized results and averages for N=512000: fft 0: mflops = 26.1034 (norm. = 1), norm. avg. 
(of 17) = 0.972247 fft 1: mflops = 18.3911 (norm. = 0.704545), norm. avg. (of 17) = 0.503865 fft 2: mflops = 12.6769 (norm. = 0.48564), norm. avg. (of 17) = 0.35886 fft 3: mflops = 12.2298 (norm. = 0.468514), norm. avg. (of 17) = 0.634482 fft 4: mflops = 12.5458 (norm. = 0.48062), norm. avg. (of 17) = 0.664514 fft 5: mflops = 19.421 (norm. = 0.744), norm. avg. (of 10) = 0.686722 fft 6: mflops = 17.2172 (norm. = 0.659574), norm. avg. (of 10) = 0.626742 Benchmarking for array size = 84x84x84: 0. FFTW: elapsed time t=2.39 s, 1 iters, t-(init.)=2.27 s t(norm)=0.199714, mflops=25.0358 (err=4.8e-16) 1. PDA: elapsed time t=3.72 s, 1 iters, t-(init.)=3.61 s t(norm)=0.317607, mflops=15.7427 (err=4.7e-16) 2. PDA (f2c): elapsed time t=5.18 s, 1 iters, t-(init.)=5.07 s t(norm)=0.446057, mflops=11.2093 (err=5.0e-16) 3. Singleton: elapsed time t=5.89 s, 1 iters, t-(init.)=5.78 s t(norm)=0.508523, mflops=9.8324 (err=5.4e-16) 4. Singleton (f2c): elapsed time t=5.74 s, 1 iters, t-(init.)=5.63 s t(norm)=0.495326, mflops=10.0944 (err=5.4e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=592704 = 25.0358 Normalized results and averages for N=592704: fft 0: mflops = 25.0358 (norm. = 1), norm. avg. (of 18) = 0.973789 fft 1: mflops = 15.7427 (norm. = 0.628809), norm. avg. (of 18) = 0.510807 fft 2: mflops = 11.2093 (norm. = 0.447732), norm. avg. (of 18) = 0.363797 fft 3: mflops = 9.8324 (norm. = 0.392734), norm. avg. (of 18) = 0.621051 fft 4: mflops = 10.0944 (norm. = 0.403197), norm. avg. (of 18) = 0.649996 fft 5: mflops = -1 (norm. = -0.0399428), norm. avg. (of 10) = 0.686722 fft 6: mflops = -1 (norm. = -0.0399428), norm. avg. (of 10) = 0.626742 Benchmarking for array size = 96x96x96: 0. FFTW: elapsed time t=3.24 s, 1 iters, t-(init.)=3.07 s t(norm)=0.175651, mflops=28.4656 (err=4.8e-16) 1. PDA: elapsed time t=4.73 s, 1 iters, t-(init.)=4.56 s t(norm)=0.260902, mflops=19.1643 (err=4.9e-16) 2. 
PDA (f2c): elapsed time t=6.69 s, 1 iters, t-(init.)=6.52 s t(norm)=0.373043, mflops=13.4033 (err=4.8e-16) 3. Singleton: elapsed time t=8.36 s, 1 iters, t-(init.)=8.19 s t(norm)=0.468593, mflops=10.6702 (err=5.5e-16) 4. Singleton (f2c): elapsed time t=8.41 s, 1 iters, t-(init.)=8.24 s t(norm)=0.471454, mflops=10.6055 (err=5.5e-16) 5. Temperton: elapsed time t=5.18 s, 1 iters, t-(init.)=5.01 s t(norm)=0.286648, mflops=17.443 (err=1.6e-07) 6. Temperton (f2c): elapsed time t=6.12 s, 1 iters, t-(init.)=5.95 s t(norm)=0.340431, mflops=14.6873 (err=5.2e-16) Top mflops for N=884736 = 28.4656 Normalized results and averages for N=884736: fft 0: mflops = 28.4656 (norm. = 1), norm. avg. (of 19) = 0.975168 fft 1: mflops = 19.1643 (norm. = 0.673246), norm. avg. (of 19) = 0.519356 fft 2: mflops = 13.4033 (norm. = 0.470859), norm. avg. (of 19) = 0.369432 fft 3: mflops = 10.6702 (norm. = 0.374847), norm. avg. (of 19) = 0.608093 fft 4: mflops = 10.6055 (norm. = 0.372573), norm. avg. (of 19) = 0.635395 fft 5: mflops = 17.443 (norm. = 0.612774), norm. avg. (of 11) = 0.68 fft 6: mflops = 14.6873 (norm. = 0.515966), norm. avg. (of 11) = 0.616672 Benchmarking for array size = 105x105x105: 0. FFTW: elapsed time t=4.62 s, 1 iters, t-(init.)=4.4 s t(norm)=0.188698, mflops=26.4974 (err=4.8e-16) 1. PDA: elapsed time t=7.49 s, 1 iters, t-(init.)=7.27 s t(norm)=0.31178, mflops=16.037 (err=5.5e-16) 2. PDA (f2c): elapsed time t=10.8 s, 1 iters, t-(init.)=10.58 s t(norm)=0.453732, mflops=11.0197 (err=5.5e-16) 3. Singleton: elapsed time t=9.91 s, 1 iters, t-(init.)=9.69 s t(norm)=0.415564, mflops=12.0319 (err=6.5e-16) 4. Singleton (f2c): elapsed time t=9.81 s, 1 iters, t-(init.)=9.59 s t(norm)=0.411275, mflops=12.1573 (err=6.5e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1157625 = 26.4974 Normalized results and averages for N=1157625: fft 0: mflops = 26.4974 (norm. = 1), norm. avg. 
(of 20) = 0.97641 fft 1: mflops = 16.037 (norm. = 0.605227), norm. avg. (of 20) = 0.52365 fft 2: mflops = 11.0197 (norm. = 0.415879), norm. avg. (of 20) = 0.371754 fft 3: mflops = 12.0319 (norm. = 0.454076), norm. avg. (of 20) = 0.600392 fft 4: mflops = 12.1573 (norm. = 0.458811), norm. avg. (of 20) = 0.626566 fft 5: mflops = -1 (norm. = -0.0377395), norm. avg. (of 11) = 0.68 fft 6: mflops = -1 (norm. = -0.0377395), norm. avg. (of 11) = 0.616672 Benchmarking for array size = 112x112x112: 0. FFTW: elapsed time t=5.85 s, 1 iters, t-(init.)=5.59 s t(norm)=0.194831, mflops=25.6633 (err=5.1e-16) 1. PDA: elapsed time t=9.52 s, 1 iters, t-(init.)=9.25 s t(norm)=0.322395, mflops=15.5089 (err=5.6e-16) 2. PDA (f2c): elapsed time t=12.97 s, 1 iters, t-(init.)=12.71 s t(norm)=0.442988, mflops=11.287 (err=5.7e-16) 3. Singleton: elapsed time t=12.54 s, 1 iters, t-(init.)=12.27 s t(norm)=0.427652, mflops=11.6917 (err=6.3e-16) 4. Singleton (f2c): elapsed time t=12.41 s, 1 iters, t-(init.)=12.14 s t(norm)=0.423121, mflops=11.8169 (err=6.3e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1404928 = 25.6633 Normalized results and averages for N=1404928: fft 0: mflops = 25.6633 (norm. = 1), norm. avg. (of 21) = 0.977533 fft 1: mflops = 15.5089 (norm. = 0.604324), norm. avg. (of 21) = 0.527491 fft 2: mflops = 11.287 (norm. = 0.439811), norm. avg. (of 21) = 0.374995 fft 3: mflops = 11.6917 (norm. = 0.455583), norm. avg. (of 21) = 0.593497 fft 4: mflops = 11.8169 (norm. = 0.460461), norm. avg. (of 21) = 0.618656 fft 5: mflops = -1 (norm. = -0.0389662), norm. avg. (of 11) = 0.68 fft 6: mflops = -1 (norm. = -0.0389662), norm. avg. (of 11) = 0.616672 Benchmarking for array size = 120x120x120: 0. FFTW: elapsed time t=6.79 s, 1 iters, t-(init.)=6.47 s t(norm)=0.180699, mflops=27.6703 (err=4.7e-16) 1. PDA: elapsed time t=9.48 s, 1 iters, t-(init.)=9.16 s t(norm)=0.255828, mflops=19.5444 (err=4.8e-16) 2. 
PDA (f2c): elapsed time t=13.86 s, 1 iters, t-(init.)=13.53 s t(norm)=0.377877, mflops=13.2318 (err=4.9e-16) 3. Singleton: elapsed time t=19.03 s, 1 iters, t-(init.)=18.69 s t(norm)=0.521989, mflops=9.57874 (err=5.6e-16) 4. Singleton (f2c): elapsed time t=19.08 s, 1 iters, t-(init.)=18.76 s t(norm)=0.523944, mflops=9.543 (err=5.6e-16) 5. Temperton: elapsed time t=10.25 s, 1 iters, t-(init.)=9.93 s t(norm)=0.277333, mflops=18.0289 (err=2.0e-08) 6. Temperton (f2c): elapsed time t=11.23 s, 1 iters, t-(init.)=10.9 s t(norm)=0.304424, mflops=16.4245 (err=5.7e-16) Top mflops for N=1728000 = 27.6703 Normalized results and averages for N=1728000: fft 0: mflops = 27.6703 (norm. = 1), norm. avg. (of 22) = 0.978554 fft 1: mflops = 19.5444 (norm. = 0.706332), norm. avg. (of 22) = 0.53562 fft 2: mflops = 13.2318 (norm. = 0.478197), norm. avg. (of 22) = 0.379686 fft 3: mflops = 9.57874 (norm. = 0.346174), norm. avg. (of 22) = 0.582255 fft 4: mflops = 9.543 (norm. = 0.344883), norm. avg. (of 22) = 0.606212 fft 5: mflops = 18.0289 (norm. = 0.651561), norm. avg. (of 12) = 0.67763 fft 6: mflops = 16.4245 (norm. = 0.593578), norm. avg. (of 12) = 0.614747 Benchmarking for array size = 144x144x144: 0. FFTW: elapsed time t=12 s, 1 iters, t-(init.)=11.44 s t(norm)=0.178116, mflops=28.0716 (err=6.6e-16) 1. PDA: elapsed time t=17.04 s, 1 iters, t-(init.)=16.46 s t(norm)=0.256275, mflops=19.5103 (err=5.7e-16) 2. PDA (f2c): elapsed time t=25.49 s, 1 iters, t-(init.)=24.92 s t(norm)=0.387994, mflops=12.8868 (err=5.8e-16) 3. Singleton: elapsed time t=30.49 s, 1 iters, t-(init.)=29.92 s t(norm)=0.465842, mflops=10.7333 (err=6.0e-16) 4. Singleton (f2c): elapsed time t=30.88 s, 1 iters, t-(init.)=30.31 s t(norm)=0.471914, mflops=10.5952 (err=5.8e-16) 5. Temperton: elapsed time t=18.01 s, 1 iters, t-(init.)=17.45 s t(norm)=0.271689, mflops=18.4034 (err=1.8e-07) 6. 
Temperton (f2c): elapsed time t=20.43 s, 1 iters, t-(init.)=19.86 s t(norm)=0.309212, mflops=16.1702 (err=6.6e-16) Top mflops for N=2985984 = 28.0716 Normalized results and averages for N=2985984: fft 0: mflops = 28.0716 (norm. = 1), norm. avg. (of 23) = 0.979487 fft 1: mflops = 19.5103 (norm. = 0.695018), norm. avg. (of 23) = 0.542551 fft 2: mflops = 12.8868 (norm. = 0.459069), norm. avg. (of 23) = 0.383137 fft 3: mflops = 10.7333 (norm. = 0.382353), norm. avg. (of 23) = 0.573563 fft 4: mflops = 10.5952 (norm. = 0.377433), norm. avg. (of 23) = 0.596265 fft 5: mflops = 18.4034 (norm. = 0.655587), norm. avg. (of 13) = 0.675934 fft 6: mflops = 16.1702 (norm. = 0.576032), norm. avg. (of 13) = 0.611769 ------------------------------------------------------ @@@@ bench.1d.p2.dat N, Arndt DIF, Arndt DIT, Arndt Split-Radix, Arndt 4-step, Bailey, Beauregard, Bergland, Brenner, Burrus, CWP (min N), CWP (best N), Edelblute, FFTPACK, FFTPACK (f2c), FFTW, FFTW_ESTIMATE, Frigo-old, Green, GSL, GSL DIT, GSL DIF, Krukar, Mayer (Buneman), Mayer (simple), Mayer (lookup), Monro, NAPACK (f2c), Nielsen, NR (C), NR (F), Ooura (C), Ooura (F), QFT, Ransom, SCIPORT, Singleton, Singleton (f2c), Sorensen, Sorensen DIT, Temperton, Temperton (f2c), Valkenburg 2, 80.6597, 87.3813, 16.6441, 1.22497, 6.16809, 4.76625, 4.68114, 4.06425, 15.1968, 2.44994, 2.36166, , 7.88403, 8.2565, 41.1206, 69.9051, 74.8983, , 5.51882, 4.59902, 4.1943, 61.6809, , , , , 2.18453, 1.47272, 5.69878, 4.36907, 262.144, 55.1882, , , , 4.29744, 4.40578, 18.7246, 15.1968, 2.59549, 2.67494, 4.40578 4, 55.1882, 67.6501, 17.6231, 5.09017, 8.88624, 4.99322, 16.7772, 8.38861, 13.981, 8.192, 2.62144, 15.7681, 31.775, 17.1898, 107.546, 102.3, 364.722, , 16.0088, 8.06597, 7.94376, 167.772, 27.5941, 22.7951, 22.3101, 3.47211, 3.94202, 5.19097, 9.89223, 6.63656, 42.799, 40.7214, , 3.54249, 14.8734, 10.1803, 11.7818, 16.6441, 13.0258, 8.59489, 9.36229, 4.096 8, 103.139, 83.8861, 17.3797, 5.74038, 13.5592, 6.95958, 21.3995, 10.999, 
11.6508, 17.0963, 8.83631, 13.2173, 37.9003, 20.8326, 92.5214, 86.1843, 116.508, 24.576, 20.5603, 12.099, 12.8923, 99.8644, 37.9003, 34.5684, 38.8361, 7.41917, 5.74038, 11.3976, 13.7971, 9.89223, 81.7072, 57.1951, , 4.46836, 11.4808, 11.6508, 13.9192, 25.7847, 13.4433, 13.6771, 16.0496, 4.39347 16, 28.5327, 28.7281, 21.1834, 9.44663, 18.7246, 7.71012, 29.9593, 14.7687, 13.7069, 28.5327, 13.53, 13.7069, 58.2542, 22.3101, 80.6597, 81.4428, 71.0899, 18.7246, 32.0176, 17.1898, 17.6231, 95.3251, 25.575, 32.0176, 35.8488, 12.483, 7.59838, 11.7159, 20.7639, 15.0874, 56.6798, 45.1, 55.9241, 10.9227, 11.6508, 19.7845, 23.3017, 29.1271, 14.6654, 17.4763, 22.3101, 4.33296 32, 33.3941, 34.7211, 21.4872, 10.0825, 24.0499, 7.9922, 30.3057, 18.3317, 13.5826, 22.4055, 21.1406, 16.487, 41.2825, 20.011, 89.6219, 69.9051, 53.4988, 22.9951, 25.4509, 23.6166, 20.8051, 93.6229, 29.9593, 37.4491, 41.943, 16.384, 8.91646, 15.5115, 23.6166, 17.8329, 51.9097, 48.9989, 49.4611, 11.2993, 11.0145, 25.9549, 23.8313, 35.6659, 15.3301, 17.9551, 17.9551, 4.36907 64, 25.575, 29.1271, 24.0132, 13.2173, 26.6587, 7.94376, 30.2474, 22.7951, 14.8383, 22.3101, 23.3017, 16.7326, 51.15, 18.8367, 65.536, 68.3854, 75.8007, 24.3855, 30.2474, 26.2144, 26.6587, 37.2276, 28.8599, 38.3625, 37.9003, 19.6608, 10.2802, 19.065, 31.775, 21.6947, 43.9962, 40.3298, 41.3912, 17.4763, 10.8473, 26.8866, 28.3399, 36.3668, 16.4698, 22.7951, 23.1304, 4.46836 128, 28.2309, 33.0632, 24.3047, 12.9226, 27.8032, 7.9783, 35.9805, 23.8313, 16.384, 23.2279, 24.9661, 17.9903, 48.6095, 20.0547, 60.6614, 69.2456, 72.6736, 22.7951, 30.5835, 28.2309, 27.8032, 27.3882, 29.5969, 35.9805, 43.6907, 22.2425, 10.546, 16.8349, 29.3601, 21.7161, 52.057, 39.8915, 39.0427, 17.6443, 10.607, 28.672, 29.1271, 35.9805, 17.6443, 22.2425, 25.4862, 4.49757 256, 30.6154, 27.7768, 25.7319, 14.0748, 26.2144, 7.88403, 34.6637, 26.5462, 17.4763, 26.5462, 26.8866, 18.8933, 47.127, 19.7845, 62.1378, 59.0747, 70.4925, 20.7639, 32.514, 26.3793, 28.5327, 31.775, 
31.5361, 37.1177, 41.5278, 26.7153, 11.5228, 18.8933, 31.5361, 24.1052, 44.1506, 43.2402, 33.2881, 24.2445, 10.2802, 30.1748, 28.7281, 39.1991, 19.7845, 22.7951, 25.575, 4.55903 512, 30.0547, 30.0547, 27.2751, 14.1275, 24.9661, 7.76084, 36.5782, 27.2751, 18.432, 27.1183, 28.255, 20.1649, 35.4781, 18.0099, 49.152, 42.5098, 36.864, 21.8453, 28.7719, 28.4253, 30.6402, 28.255, 31.4573, 42.5098, 42.8963, 27.4337, 11.4529, 19.826, 32.768, 25.7847, 46.7187, 41.3912, 22.6855, 22.9058, 8.73813, 32.9971, 30.0547, 38.3625, 20.6956, 21.4481, 20.3388, 4.30528 1024, 31.5836, 31.3945, 29.9593, 15.5115, 17.5936, 7.8019, 37.7186, 27.5941, 19.1346, 26.7494, 27.3067, 20.8051, 26.2144, 15.0657, 39.7188, 38.5506, 27.3067, 20.3212, 23.1986, 28.0368, 31.775, 23.8313, 32.3635, 40.3298, 40.96, 26.4792, 8.45626, 16.0825, 33.6082, 25.9549, 44.0578, 39.7188, 18.7246, 25.7004, 7.32246, 33.6082, 30.1315, 36.4089, 22.2156, 22.2156, 20.1649, 3.94795 2048, 18.1357, 18.1357, 14.7122, 13.7314, 16.5723, 7.06761, 27.996, 20.3069, 12.2186, 21.8453, 23.4438, 12.5373, 29.7277, 16.4776, 37.9419, 36.969, 25.0746, 17.7999, 25.5184, 18.0224, 19.2239, 21.8453, 30.3535, 37.9419, 33.1446, 15.6717, 7.54865, 12.5373, 20.597, 17.4763, 32.9552, 30.6764, 17.267, 19.0966, 6.93169, 23.4438, 22.528, 21.5193, 12.9891, 18.2505, 17.9105, 3.79419 4096, 16.2151, 16.7326, 13.6771, 14.2988, 15.2705, 7.02171, 27.8383, 21.1123, 11.826, 21.1123, 23.6521, 12.1927, 28.0869, 15.5729, 38.5979, 36.7921, 26.8866, 18.2891, 25.9978, 17.6726, 18.3961, 21.8453, 19.6608, 21.9981, 20.8326, 15.4202, 7.86432, 13.2173, 20.1649, 17.4763, 32.0993, 30.541, 13.9192, 24.576, 6.60867, 24.0132, 22.153, 19.9097, 12.3848, 18.8367, 17.1898, 3.70958 8192, 15.2137, 16.0749, 12.2585, 12.7159, 14.6891, 6.76165, 25.2435, 19.0384, 10.7844, 21.5688, 20.9072, 11.0645, 22.2737, 13.5233, 34.7742, 36.6438, 23.5026, 16.5431, 20.9072, 15.3508, 15.9246, , 17.9362, 19.5855, 18.3219, 13.6315, 7.28178, 11.6708, 16.7053, 15.3508, 28.8803, 27.9334, 12.3474, 20.5293, 
6.5536, 20.9072, 19.8132, 16.7053, 11.2101, 16.7053, 14.3188, 3.64089 16384, 13.5927, 14.6801, 11.4688, 14.2249, 12.483, 6.5536, 24.3047, 18.9176, 10.309, 21.3373, 21.5883, 10.4858, 21.5883, 12.9226, 32.768, 31.3677, 22.5154, 15.5509, 20.8524, 14.0077, 14.1154, , 17.6443, 19.3159, 18.1684, 12.9226, 7.39923, 10.9227, 15.5509, 14.1154, 27.3882, 26.2144, 11.7629, 24.1448, 6.24152, 19.3159, 18.5354, 15.5509, 10.9881, 15.819, 14.4489, 3.47539 32768, 11.8439, 12.6031, 9.54408, 12.8502, 12.288, 6.30154, 22.0908, 15.9844, 8.69947, 20.9157, 21.0276, 8.77714, 21.8453, 12.9347, 32.4972, 30.0165, 20.8051, 15.2409, 21.0276, 11.8439, 11.7029, , 15.9844, 17.5543, 15.8555, 10.7436, 7.02171, 10.5703, 12.9347, 11.9156, 24.4234, 23.5459, 10.6275, 19.6608, 6.10584, 15.4809, 15.0082, 13.2843, 9.01872, 14.1445, 13.1072, 3.2551 65536, 10.0825, 10.5917, 8.32203, 13.7069, 10.6998, 5.95782, 19.2399, 15.0874, 7.71012, 18.5589, 18.7246, 7.88403, 22.1921, 13.1072, 31.0689, 30.6154, 19.4181, 13.981, 21.6201, 10.3819, 10.1803, , 13.7971, 14.3641, 13.4433, 9.79978, 7.03742, 8.96219, 11.275, 10.6998, 22.0753, 21.1834, 10.0825, 17.1898, 6.0263, 15.3077, 14.6654, 11.7159, 8.2565, 13.8884, 12.1927, 3.15836 131072, 9.52232, 9.77291, 7.47726, 11.8523, 11.0308, 5.83305, 18.7246, 13.7545, 7.00699, 18.4151, 18.5685, 7.05134, 19.7188, 11.9156, 30.316, 30.1111, 19.2088, 13.2632, 19.2088, 9.44163, 9.44163, , 10.6106, 11.1411, 10.8166, 8.63653, 6.79337, 8.77254, 10.2212, 9.77291, 20.8245, 20.0741, 9.05782, 15.2618, 5.95782, 13.9264, 13.423, 10.5105, 7.28178, 12.4482, 10.9227, 3.07766 262144, 9.216, 9.43718, 7.48983, 13.7168, 11.1288, 5.76845, 19.3385, 14.2126, 7.04267, 13.8782, 14.0434, 7.04267, 19.826, 12.099, 30.8405, 28.7719, 18.0099, 13.6375, 20.5156, 9.216, 9.25214, , 10.4858, 10.8225, 10.3478, 8.903, 6.98017, 9.07422, 9.99702, 9.59063, 20.6956, 20.1649, 7.78646, 18.0099, 5.74038, 14.2126, 13.7971, 10.3478, 7.19298, 13.0348, 11.3976, 3.01315 Norm. 
Avg., 0.417708, 0.423534, 0.30231, 0.258436, 0.316247, 0.135394, 0.516952, 0.374511, 0.232016, 0.418194, 0.41469, 0.25579, 0.571179, 0.301396, 0.86003, 0.835605, 0.73008, 0.384768, 0.468721, 0.332635, 0.343723, 0.594056, 0.444849, 0.518699, 0.517174, 0.31066, 0.157374, 0.247104, 0.3753, 0.312037, 0.721414, 0.618444, 0.418097, 0.401927, 0.155635, 0.418326, 0.4024, 0.424681, 0.250768, 0.334441, 0.32024, 0.0735082 ------------------------------------------------------ @@@@ bench.1d.np2.dat N, Brenner, CWP (min N), CWP (best N), FFTPACK, FFTPACK (f2c), FFTW, FFTW_ESTIMATE, Frigo-old, GSL, Nielsen, Singleton, Singleton (f2c), Temperton, Temperton (f2c), Valkenburg 6, , 12.9484, 5.77528, 24.7914, 17.8324, 96.8046, 101.645, 13.8292, 14.6252, 4.84023, 8.61397, 9.15719, 9.58914, 9.41156, 4.45811 9, 4.21103, 17.1532, 9.84051, 31.4235, 16.2582, 55.8119, 62.3233, 9.53927, 13.953, 6.97648, 15.7117, 14.0579, 12.1409, 15.9803, 4.328 12, , 23.8926, 15.4908, 46.2185, 20.7304, 82.9214, 78.3147, 16.1105, 18.427, 9.21349, 14.239, 18.0726, 17.7316, 18.9217, 4.24598 15, 5.78408, 20.873, 19.2031, 37.6532, 19.013, 54.0934, 49.2388, 10.6095, 18.1162, 9.69856, 14.3307, 16.1371, 18.8266, 17.6176, 3.72154 18, 5.21085, 22.7734, 16.5069, 25.62, 17.6944, 58.56, 53.4679, 11.2822, 19.0661, 8.36572, 18.4926, 21.3871, 17.8226, 19.215, 4.45566 24, , 28.3918, 23.263, 31.9094, 19.8119, 81.0283, 74.3456, 20.3715, 22.8213, 13.1597, 16.6933, 21.8531, 19.7036, 23.1139, 4.41883 36, 6.5157, 29.3206, 29.8955, 40.9308, 19.2997, 76.2336, 74.3743, 13.7358, 22.0967, 12.1974, 24.7914, 25.6248, 22.258, 24.3948, 4.40657 80, 10.5692, 31.3874, 27.9942, 43.1577, 17.8584, 54.1587, 62.3028, 19.0052, 27.0793, 22.038, 30.0227, 26.3894, 28.1846, 25.5749, 4.14314 108, 7.11463, 25.5397, 30.8056, 40.655, 19.1548, 61.6113, 58.0223, 12.1469, 22.4673, 13.4601, 25.3233, 24.6954, 25.3233, 24.493, 4.47327 210, 9.16504, 29.6227, 27.1946, 24.9454, 15.9507, 40.9598, 34.9236, 11.8491, 18.1297, 15.9507, 19.1777, 20.8663, , , 3.51456 504, 
9.73348, 29.5104, 29.5104, 25.7397, 15.0427, 39.9408, 36.1964, 12.9417, 19.3047, 14.6618, 21.8544, 24.2573, , , 3.6888 1000, 8.50414, 23.8434, 30.0146, 22.9842, 14.3328, 33.133, 33.133, 12.1488, 18.0939, 19.6249, 29.6656, 29.3246, 25.5124, 23.4059, 3.4853 1960, 7.79484, 24.2813, 24.2813, 14.1432, 10.6348, 31.3575, 31.1794, 10.888, 16.1399, 12.9424, 18.5391, 19.4595, , , 3.06226 4725, 6.49846, 20.2809, 23.9683, 18.4556, 12.47, 30.5052, 31.0179, 8.1662, 16.6267, 11.6073, 18.2729, 18.642, , , 3.2041 10368, 6.74671, 21.4847, 22.3527, 24.4522, 14.5587, 34.5769, 33.0287, 11.4068, 19.5834, 11.1764, 17.8461, 17.1544, 16.7646, 16.2715, 3.63967 27000, 5.71882, 21.9287, 22.2354, 20.514, 12.5184, 29.4413, 27.6493, 8.41181, 16.9131, 12.4206, 15.7409, 15.5866, 16.5608, 14.9984, 3.15443 75600, 5.72515, 19.2942, 19.1435, 14.5855, 10.0425, 26.7799, 26.348, 7.85372, 14.8507, 9.42447, 12.7623, 12.7623, , , 2.84926 165375, 5.28937, 17.588, 17.1667, 10.0239, 7.79032, 23.8903, 23.4987, 6.25947, 14.6267, 8.38257, 12.1476, 12.2514, , , 2.75128 362880, 5.64148, 9.5744, 9.46621, 14.4441, 9.71316, 26.8083, 25.3867, 7.82953, 16.5893, 8.31523, 10.88, 10.88, , , 2.80657 Norm. 
Avg., 0.178204, 0.521511, 0.511102, 0.55908, 0.330769, 0.984245, 0.956033, 0.255758, 0.424764, 0.283225, 0.42348, 0.435343, 0.364271, 0.360895, 0.0833579
------------------------------------------------------
@@@@ bench.3d.p2.dat
Array Dimensions, FFTW, HARM, HARM (f2c), NR (C), NR (F), PDA, PDA (f2c), Singleton, Singleton (f2c), Temperton, Temperton (f2c)
4x4x4, 77.6723, , , 32.0993, 20.9715, 14.4299, 9.70904, 23.8313, 32.4302, 41.3912, 23.6521
8x8x8, 82.7823, 39.3216, 30.0547, 37.4491, 28.255, 22.6855, 13.7971, 35.2134, 42.8963, 54.2367, 36.5782
16x16x16, 46.6034, 25.9978, 20.8326, 14.9797, 13.3294, 22.9615, 14.0434, 15.1237, 17.9756, 31.775, 21.8453
32x32x32, 39.7188, 21.0276, 16.384, 11.5652, 10.7436, 19.0882, 12.6031, 12.4435, 14.0434, 24.4234, 16.6617
64x64x64, 31.249, 19.994, 15.1237, 9.51329, 8.83631, 19.3385, 12.483, 11.2885, 12.9632, 23.8313, 16.0496
256x64x32, 31.5236, 18.8664, 14.4369, 9.39762, 8.70758, 19.6867, 12.6736, 10.7575, 11.9729, 23.3837, 15.9129
16x1024x64, 31.2076, 19.0304, 14.6449, 9.47223, 8.76736, 19.3464, 12.4534, 11.3482, 12.8031, ,
128x128x128, 31.5474, 18.5824, 14.5156, 9.38223, 8.73467, 20.5029, 13.2173, 9.89667, 10.5561, 18.005, 13.0761
Norm. Avg., 1, 0.571344, 0.441988, 0.335215, 0.285579, 0.493287, 0.31623, 0.34373, 0.401813, 0.651419, 0.438214
------------------------------------------------------
@@@@ bench.3d.np2.dat
Array Dimensions, FFTW, PDA, PDA (f2c), Singleton, Singleton (f2c), Temperton, Temperton (f2c)
5x5x5, 32.1305, 15.3728, 10.7424, 56.1651, 50.2321, 27.2251, 27.2251
6x6x6, 58.6413, 15.3148, 11.5118, 31.4726, 32.9857, 31.4726, 29.3206
7x7x7, 27.6458, 7.95188, 7.54617, 23.6648, 28.7194, ,
9x9x9, 42.509, 20.0537, 12.4983, 35.8536, 36.5928, 30.5992, 29.3347
10x10x10, 43.9869, 21.0846, 14.0953, 33.3496, 35.4339, 34.0165, 28.9914
11x11x11, 22.9627, 7.89343, 7.24643, 19.6459, 23.1128, ,
12x12x12, 44.8832, 23.7881, 14.0758, 25.0401, 25.8566, 30.4976, 26.7282
13x13x13, 21.3827, 7.65164, 6.96846, 18.2565, 20.2719, ,
14x14x14, 32.8836, 12.3821, 10.6132, 18.7467, 21.3393, ,
15x15x15, 37.2304, 22.8078, 14.2228, 24.5793, 23.6604, 26.9326, 25.066
24x25x28, 29.7081, 20.0687, 13.5717, 17.6305, 17.9663, ,
48x48x48, 26.4708, 18.7167, 12.6915, 12.9577, 12.6915, 20.032, 18.3461
49x49x49, 23.452, 12.0103, 10.1625, 12.8681, 13.5733, ,
60x60x60, 27.3405, 19.3316, 13.0193, 11.0626, 11.0626, 20.3599, 18.4022
72x60x56, 27.5576, 17.1688, 12.2219, 10.3013, 10.3506, ,
75x75x75, 30.0891, 19.7084, 12.1282, 13.639, 13.2716, 21.1918, 19.2277
80x80x80, 26.1034, 18.3911, 12.6769, 12.2298, 12.5458, 19.421, 17.2172
84x84x84, 25.0358, 15.7427, 11.2093, 9.8324, 10.0944, ,
96x96x96, 28.4656, 19.1643, 13.4033, 10.6702, 10.6055, 17.443, 14.6873
105x105x105, 26.4974, 16.037, 11.0197, 12.0319, 12.1573, ,
112x112x112, 25.6633, 15.5089, 11.287, 11.6917, 11.8169, ,
120x120x120, 27.6703, 19.5444, 13.2318, 9.57874, 9.543, 18.0289, 16.4245
144x144x144, 28.0716, 19.5103, 12.8868, 10.7333, 10.5952, 18.4034, 16.1702
Norm. Avg., 0.979487, 0.542551, 0.383137, 0.573563, 0.596265, 0.675934, 0.611769
@@@@ end