To: benchfft@theory.lcs.mit.edu Subject: SUBMIT ------------------- @@SUBMIT@@ @ submitter = James Daryl Forrest @ submitter email = PRIVATE @ submitter organization = NONE @ computer manufacturer = Apple @ computer model = Power Macintosh 8600/300 @ CPU manufacturer = Motorola @ CPU model = PowerPC 604e @ CPU speed = 300 MHz @ RAM = 128 MB @ L2 cache size = 1 MB @ operating system = Mac OS 8.0 @ C compiler = Metrowerks Codewarrior Pro 2 @ C compiler flags = all @ Fortran compiler = NONE @ Fortran compiler flags = NONE @ remarks = no extensions enabled, VM off @ FFTW version = FFTW V1.2 @ floating-point precision = double @ floating-point size = 8 bytes ------------------------------------------------------ @@@@ bench.1d.p2.log Benchmarking for sizes: 2 (0.000228882 MB) 4 (0.000534058 MB) 8 (0.000839233 MB) 16 (0.00164795 MB) 32 (0.00297546 MB) 64 (0.00616455 MB) 128 (0.0119019 MB) 256 (0.0238037 MB) 512 (0.0476074 MB) 1024 (0.0939941 MB) 2048 (0.189575 MB) 4096 (0.37915 MB) 8192 (0.765991 MB) 16384 (1.51184 MB) 32768 (3.02368 MB) 65536 (6.09973 MB) 131072 (12.1995 MB) 262144 (25.4987 MB) 524288 (50.9973 MB) 1048576 (96 MB) Maximum array size = 1048576 Benchmarking FFTs: 0. Arndt DIF 1. Arndt DIT 2. Arndt Split-Radix 3. Arndt 4-step 4. Beauregard 5. Bergland 6. CWP (min N) 7. CWP (best N) 8. Edelblute 9. FFTPACK (f2c) 10. FFTW 11. FFTW_ESTIMATE 12. Frigo-old 13. Green 14. GSL 15. GSL DIT 16. GSL DIF 17. Krukar 18. Mayer (Buneman) 19. Mayer (simple) 20. Mayer (lookup) 21. NAPACK (f2c) 22. Nielsen 23. NR (C) 24. Ooura (C) 25. QFT 26. Ransom 27. Singleton (f2c) 28. Temperton (f2c) 29. Valkenburg Computing normalized averages (30 transforms). Benchmarking for array size = 2 (power of 2): 0. Arndt DIF: elapsed time t=1.46649 s, 4194304 iters, t-(init.)=1.19853 s t(norm)=0.142876, mflops=34.9955 (err=1.7e-17) 1. Arndt DIT: elapsed time t=1.52322 s, 4194304 iters, t-(init.)=1.25541 s t(norm)=0.149656, mflops=33.41 (err=1.7e-17) 2. Arndt Split-Radix: elapsed time t=1.19154 s, 2097152 iters, t-(init.)=1.05748 s t(norm)=0.252122, mflops=19.8316 (err=1.7e-17) 3. Arndt 4-step: elapsed time t=1.87441 s, 262144 iters, t-(init.)=1.8576 s t(norm)=3.54308, mflops=1.4112 (err=1.7e-17) 4. Beauregard: elapsed time t=1.08605 s, 524288 iters, t-(init.)=1.05257 s t(norm)=1.00381, mflops=4.98101 (err=1.7e-17) 5. Bergland: elapsed time t=1.91183 s, 1048576 iters, t-(init.)=1.84485 s t(norm)=0.879694, mflops=5.68379 (err=1.7e-17) 6. CWP (min N): elapsed time t=1.08779 s, 524288 iters, t-(init.)=1.05425 s t(norm)=1.00541, mflops=4.97309 7. CWP (best N) (N=3): elapsed time t=1.15999 s, 524288 iters, t-(init.)=1.11941 s t(norm)=1.06755, mflops=4.68362 8. Skipping fft (Edelblute can't handle N <= 2). 9. FFTPACK (f2c): elapsed time t=1.48785 s, 1048576 iters, t-(init.)=1.41738 s t(norm)=0.67586, mflops=7.39798 (err=1.7e-17) FFTW_MEASURE plan: (cost = 3.225899e-07) FFTW_NOTW 2 10. FFTW: elapsed time t=1.46674 s, 4194304 iters, t-(init.)=1.18484 s t(norm)=0.141244, mflops=35.3998 (err=1.7e-17) FFTW_ESTIMATE plan: (cost = 1.820000e+02) FFTW_NOTW 2 11. FFTW_ESTIMATE: elapsed time t=1.43852 s, 4194304 iters, t-(init.)=1.17061 s t(norm)=0.139548, mflops=35.83 (err=1.7e-17) 12. Frigo-old: elapsed time t=1.04369 s, 4194304 iters, t-(init.)=0.77589 s t(norm)=0.0924933, mflops=54.058 (err=1.7e-17) 13. Skipping fft (Green can't handle this size.). 14. GSL: elapsed time t=1.69248 s, 2097152 iters, t-(init.)=1.55857 s t(norm)=0.371593, mflops=13.4556 (err=1.7e-17) 15. GSL DIT: elapsed time t=1.24791 s, 1048576 iters, t-(init.)=1.17725 s t(norm)=0.561357, mflops=8.90698 (err=1.7e-17) 16. GSL DIF: elapsed time t=1.29057 s, 1048576 iters, t-(init.)=1.22357 s t(norm)=0.583443, mflops=8.56982 (err=1.7e-17) 17. Krukar: elapsed time t=1.67842 s, 4194304 iters, t-(init.)=1.41061 s t(norm)=0.168158, mflops=29.7339 (err=1.7e-17) 18. Skipping fft (Mayer can't handle N <= 2). 19. Skipping fft (Mayer can't handle N <= 2). 20. Skipping fft (Mayer can't handle N <= 2). 21. NAPACK (f2c): elapsed time t=1.59944 s, 524288 iters, t-(init.)=1.56582 s t(norm)=1.49328, mflops=3.34833 (err=1.7e-17) 22. Nielsen: elapsed time t=1.05438 s, 262144 iters, t-(init.)=1.03762 s t(norm)=1.9791, mflops=2.5264 (err=1.7e-17) 23. NR (C): elapsed time t=1.06211 s, 1048576 iters, t-(init.)=0.995131 s t(norm)=0.474515, mflops=10.5371 (err=1.7e-17) 24. Ooura (C): elapsed time t=1.53706 s, 4194304 iters, t-(init.)=1.26889 s t(norm)=0.151264, mflops=33.0549 (err=1.7e-17) 25. Skipping fft (QFT requires N >= 16). 26. Skipping fft (Ransom doesn't work for N=2). 27. Singleton (f2c): elapsed time t=1.517 s, 1048576 iters, t-(init.)=1.45001 s t(norm)=0.69142, mflops=7.23149 (err=1.7e-17) 28. Temperton (f2c): elapsed time t=1.61122 s, 524288 iters, t-(init.)=1.5777 s t(norm)=1.50462, mflops=3.32311 (err=1.7e-17) 29. Valkenburg: elapsed time t=1.28007 s, 1048576 iters, t-(init.)=1.20954 s t(norm)=0.576755, mflops=8.66919 (err=1.7e-17) Top mflops for N=2 = 54.058 Normalized results and averages for N=2: fft 0: mflops = 34.9955 (norm. = 0.64737), norm. avg. (of 1) = 0.64737 fft 1: mflops = 33.41 (norm. = 0.61804), norm. avg. (of 1) = 0.61804 fft 2: mflops = 19.8316 (norm. = 0.366859), norm. avg. (of 1) = 0.366859 fft 3: mflops = 1.4112 (norm. = 0.0261053), norm. avg. (of 1) = 0.0261053 fft 4: mflops = 4.98101 (norm. = 0.092142), norm. avg. (of 1) = 0.092142 fft 5: mflops = 5.68379 (norm. = 0.105143), norm. avg. (of 1) = 0.105143 fft 6: mflops = 4.97309 (norm. = 0.0919954), norm. avg. (of 1) = 0.0919954 fft 7: mflops = 4.68362 (norm. = 0.0866407), norm. avg. (of 1) = 0.0866407 fft 8: mflops = -1 (norm. = -0.0184987), norm. avg. (of 0) = -1 fft 9: mflops = 7.39798 (norm. = 0.136853), norm. avg. (of 1) = 0.136853 fft 10: mflops = 35.3998 (norm. = 0.65485), norm. avg. (of 1) = 0.65485 fft 11: mflops = 35.83 (norm. = 0.662807), norm. avg. (of 1) = 0.662807 fft 12: mflops = 54.058 (norm. = 1), norm. avg. (of 1) = 1 fft 13: mflops = -1 (norm. = -0.0184987), norm. avg. (of 0) = -1 fft 14: mflops = 13.4556 (norm. = 0.248911), norm. avg. (of 1) = 0.248911 fft 15: mflops = 8.90698 (norm. = 0.164767), norm. avg. (of 1) = 0.164767 fft 16: mflops = 8.56982 (norm. = 0.15853), norm. avg. (of 1) = 0.15853 fft 17: mflops = 29.7339 (norm. = 0.550037), norm. avg. (of 1) = 0.550037 fft 18: mflops = -1 (norm. = -0.0184987), norm. avg. (of 0) = -1 fft 19: mflops = -1 (norm. = -0.0184987), norm. avg. (of 0) = -1 fft 20: mflops = -1 (norm. = -0.0184987), norm. avg. (of 0) = -1 fft 21: mflops = 3.34833 (norm. = 0.0619397), norm. avg. (of 1) = 0.0619397 fft 22: mflops = 2.5264 (norm. = 0.046735), norm. avg. (of 1) = 0.046735 fft 23: mflops = 10.5371 (norm. = 0.194922), norm. avg. (of 1) = 0.194922 fft 24: mflops = 33.0549 (norm. = 0.61147), norm. avg. (of 1) = 0.61147 fft 25: mflops = -1 (norm. = -0.0184987), norm. avg. (of 0) = -1 fft 26: mflops = -1 (norm. = -0.0184987), norm. avg. (of 0) = -1 fft 27: mflops = 7.23149 (norm. = 0.133773), norm. avg. (of 1) = 0.133773 fft 28: mflops = 3.32311 (norm. = 0.061473), norm. avg. (of 1) = 0.061473 fft 29: mflops = 8.66919 (norm. = 0.160368), norm. avg. (of 1) = 0.160368 Benchmarking for array size = 4 (power of 2): 0. Arndt DIF: elapsed time t=1.37541 s, 2097152 iters, t-(init.)=1.1851 s t(norm)=0.0706376, mflops=70.7838 (err=1.3e-16) 1. Arndt DIT: elapsed time t=1.35476 s, 2097152 iters, t-(init.)=1.16443 s t(norm)=0.0694057, mflops=72.0402 (err=1.3e-16) 2. Arndt Split-Radix: elapsed time t=1.41541 s, 1048576 iters, t-(init.)=1.32023 s t(norm)=0.157384, mflops=31.7694 (err=1.3e-16) 3. Arndt 4-step: elapsed time t=1.62427 s, 262144 iters, t-(init.)=1.60048 s t(norm)=0.76317, mflops=6.55162 (err=1.3e-16) 4. Beauregard: elapsed time t=1.85332 s, 524288 iters, t-(init.)=1.80577 s t(norm)=0.430529, mflops=11.6136 (err=5.3e-17) 5. Bergland: elapsed time t=1.08036 s, 524288 iters, t-(init.)=1.03274 s t(norm)=0.246224, mflops=20.3067 (err=5.3e-17) 6. CWP (min N): elapsed time t=1.14767 s, 524288 iters, t-(init.)=1.10004 s t(norm)=0.26227, mflops=19.0643 7. CWP (best N) (N=15): elapsed time t=1.16458 s, 262144 iters, t-(init.)=1.10201 s t(norm)=0.525477, mflops=9.51516 8. Edelblute: elapsed time t=1.51523 s, 1048576 iters, t-(init.)=1.42003 s t(norm)=0.169281, mflops=29.5366 (err=1.3e-16) 9. FFTPACK (f2c): elapsed time t=1.16545 s, 524288 iters, t-(init.)=1.11607 s t(norm)=0.266091, mflops=18.7906 (err=5.3e-17) FFTW_MEASURE plan: (cost = 4.071503e-07) FFTW_NOTW 4 10. FFTW: elapsed time t=1.8473 s, 4194304 iters, t-(init.)=1.4523 s t(norm)=0.0432819, mflops=115.522 (err=5.3e-17) FFTW_ESTIMATE plan: (cost = 3.176000e+02) FFTW_NOTW 4 11. FFTW_ESTIMATE: elapsed time t=1.81891 s, 4194304 iters, t-(init.)=1.438 s t(norm)=0.0428557, mflops=116.671 (err=5.3e-17) 12. Frigo-old: elapsed time t=1.35377 s, 4194304 iters, t-(init.)=0.972778 s t(norm)=0.028991, mflops=172.467 (err=5.3e-17) 13. Skipping fft (Green can't handle this size.). 14. GSL: elapsed time t=1.30136 s, 1048576 iters, t-(init.)=1.20584 s t(norm)=0.143747, mflops=34.7833 (err=5.3e-17) 15. GSL DIT: elapsed time t=1.19875 s, 524288 iters, t-(init.)=1.14927 s t(norm)=0.274007, mflops=18.2477 (err=6.4e-17) 16. GSL DIF: elapsed time t=1.23602 s, 524288 iters, t-(init.)=1.18831 s t(norm)=0.283315, mflops=17.6482 (err=6.4e-17) 17. Krukar: elapsed time t=1.19865 s, 2097152 iters, t-(init.)=1.00821 s t(norm)=0.0600942, mflops=83.2027 (err=5.3e-17) 18. Mayer (Buneman): elapsed time t=1.26951 s, 1048576 iters, t-(init.)=1.17433 s t(norm)=0.139991, mflops=35.7165 (err=1.3e-16) 19. Mayer (simple): elapsed time t=1.24451 s, 1048576 iters, t-(init.)=1.14939 s t(norm)=0.137018, mflops=36.4916 20. Mayer (lookup): elapsed time t=1.33027 s, 1048576 iters, t-(init.)=1.2351 s t(norm)=0.147236, mflops=33.9592 (err=1.3e-16) 21. NAPACK (f2c): elapsed time t=1.42925 s, 262144 iters, t-(init.)=1.40538 s t(norm)=0.670138, mflops=7.46115 (err=5.3e-17) 22. Nielsen: elapsed time t=1.16287 s, 262144 iters, t-(init.)=1.13901 s t(norm)=0.543121, mflops=9.20605 (err=1.3e-16) 23. NR (C): elapsed time t=1.13513 s, 524288 iters, t-(init.)=1.08753 s t(norm)=0.259288, mflops=19.2836 (err=6.4e-17) 24. Ooura (C): elapsed time t=1.50211 s, 2097152 iters, t-(init.)=1.31175 s t(norm)=0.0781865, mflops=63.9496 (err=5.3e-17) 25. Skipping fft (QFT requires N >= 16). 26. Ransom: elapsed time t=1.97389 s, 262144 iters, t-(init.)=1.95008 s t(norm)=0.92987, mflops=5.37709 (err=2.4e-16) 27. Singleton (f2c): elapsed time t=1.7464 s, 1048576 iters, t-(init.)=1.65111 s t(norm)=0.196827, mflops=25.403 (err=5.3e-17) 28. Temperton (f2c): elapsed time t=1.02817 s, 262144 iters, t-(init.)=1.00434 s t(norm)=0.478905, mflops=10.4405 (err=5.3e-17) 29. Valkenburg: elapsed time t=1.16 s, 262144 iters, t-(init.)=1.13532 s t(norm)=0.541365, mflops=9.23592 (err=5.3e-17) Top mflops for N=4 = 172.467 Normalized results and averages for N=4: fft 0: mflops = 70.7838 (norm. = 0.41042), norm. avg. (of 2) = 0.528895 fft 1: mflops = 72.0402 (norm. = 0.417704), norm. avg. (of 2) = 0.517872 fft 2: mflops = 31.7694 (norm. = 0.184206), norm. avg. (of 2) = 0.275532 fft 3: mflops = 6.55162 (norm. = 0.0379877), norm. avg. (of 2) = 0.0320465 fft 4: mflops = 11.6136 (norm. = 0.0673381), norm. avg. (of 2) = 0.0797401 fft 5: mflops = 20.3067 (norm. = 0.117743), norm. avg. (of 2) = 0.111443 fft 6: mflops = 19.0643 (norm. = 0.110539), norm. avg. (of 2) = 0.101267 fft 7: mflops = 9.51516 (norm. = 0.0551709), norm. avg. (of 2) = 0.0709058 fft 8: mflops = 29.5366 (norm. = 0.17126), norm. avg. (of 1) = 0.17126 fft 9: mflops = 18.7906 (norm. = 0.108952), norm. avg. (of 2) = 0.122902 fft 10: mflops = 115.522 (norm. = 0.66982), norm. avg. (of 2) = 0.662335 fft 11: mflops = 116.671 (norm. = 0.676481), norm. avg. (of 2) = 0.669644 fft 12: mflops = 172.467 (norm. = 1), norm. avg. (of 2) = 1 fft 13: mflops = -1 (norm. = -0.00579821), norm. avg. (of 0) = -1 fft 14: mflops = 34.7833 (norm. = 0.201681), norm. avg. (of 2) = 0.225296 fft 15: mflops = 18.2477 (norm. = 0.105804), norm. avg. (of 2) = 0.135286 fft 16: mflops = 17.6482 (norm. = 0.102328), norm. avg. (of 2) = 0.130429 fft 17: mflops = 83.2027 (norm. = 0.482427), norm. avg. (of 2) = 0.516232 fft 18: mflops = 35.7165 (norm. = 0.207092), norm. avg. (of 1) = 0.207092 fft 19: mflops = 36.4916 (norm. = 0.211586), norm. avg. (of 1) = 0.211586 fft 20: mflops = 33.9592 (norm. = 0.196903), norm. avg. (of 1) = 0.196903 fft 21: mflops = 7.46115 (norm. = 0.0432613), norm. avg. (of 2) = 0.0526005 fft 22: mflops = 9.20605 (norm. = 0.0533786), norm. avg. (of 2) = 0.0500568 fft 23: mflops = 19.2836 (norm. = 0.11181), norm. avg. (of 2) = 0.153366 fft 24: mflops = 63.9496 (norm. = 0.370793), norm. avg. (of 2) = 0.491132 fft 25: mflops = -1 (norm. = -0.00579821), norm. avg. (of 0) = -1 fft 26: mflops = 5.37709 (norm. = 0.0311775), norm. avg. (of 1) = 0.0311775 fft 27: mflops = 25.403 (norm. = 0.147292), norm. avg. (of 2) = 0.140532 fft 28: mflops = 10.4405 (norm. = 0.0605361), norm. avg. (of 2) = 0.0610045 fft 29: mflops = 9.23592 (norm. = 0.0535518), norm. avg. (of 2) = 0.10696 Benchmarking for array size = 8 (power of 2): 0. Arndt DIF: elapsed time t=1.28032 s, 1048576 iters, t-(init.)=1.12877 s t(norm)=0.0448535, mflops=111.474 (err=1.1e-16) 1. Arndt DIT: elapsed time t=1.28018 s, 1048576 iters, t-(init.)=1.12828 s t(norm)=0.0448338, mflops=111.523 (err=1.1e-16) 2. Arndt Split-Radix: elapsed time t=1.60686 s, 524288 iters, t-(init.)=1.5311 s t(norm)=0.121681, mflops=41.091 (err=7.7e-17) 3. Arndt 4-step: elapsed time t=1.80393 s, 131072 iters, t-(init.)=1.78472 s t(norm)=0.567348, mflops=8.81294 (err=9.0e-17) 4. Beauregard: elapsed time t=1.95397 s, 262144 iters, t-(init.)=1.91606 s t(norm)=0.30455, mflops=16.4177 (err=1.5e-16) 5. Bergland: elapsed time t=1.81517 s, 524288 iters, t-(init.)=1.73935 s t(norm)=0.138231, mflops=36.1713 (err=1.6e-16) 6. CWP (min N): elapsed time t=1.42858 s, 524288 iters, t-(init.)=1.35283 s t(norm)=0.107513, mflops=46.5059 7. CWP (best N) (N=15): elapsed time t=1.16334 s, 262144 iters, t-(init.)=1.10043 s t(norm)=0.174909, mflops=28.5862 8. Edelblute: elapsed time t=1.99244 s, 524288 iters, t-(init.)=1.91656 s t(norm)=0.152314, mflops=32.8268 (err=8.3e-17) 9. FFTPACK (f2c): elapsed time t=1.04544 s, 262144 iters, t-(init.)=1.00668 s t(norm)=0.160007, mflops=31.2487 (err=1.5e-16) FFTW_MEASURE plan: (cost = 7.195587e-07) FFTW_NOTW 8 10. FFTW: elapsed time t=1.60798 s, 2097152 iters, t-(init.)=1.29779 s t(norm)=0.0257847, mflops=193.913 (err=1.4e-16) FFTW_ESTIMATE plan: (cost = 4.688000e+02) FFTW_NOTW 8 11. FFTW_ESTIMATE: elapsed time t=1.59364 s, 2097152 iters, t-(init.)=1.29021 s t(norm)=0.0256341, mflops=195.053 (err=1.4e-16) 12. Frigo-old: elapsed time t=1.27673 s, 2097152 iters, t-(init.)=0.973303 s t(norm)=0.0193378, mflops=258.561 (err=1.4e-16) 13. Green: elapsed time t=1.66409 s, 1048576 iters, t-(init.)=1.51244 s t(norm)=0.060099, mflops=83.1961 (err=1.4e-16) 14. GSL: elapsed time t=1.34986 s, 524288 iters, t-(init.)=1.27404 s t(norm)=0.101251, mflops=49.3821 (err=1.4e-16) 15. GSL DIT: elapsed time t=1.02961 s, 262144 iters, t-(init.)=0.990765 s t(norm)=0.157478, mflops=31.7505 (err=1.5e-16) 16. GSL DIF: elapsed time t=1.05696 s, 262144 iters, t-(init.)=1.01899 s t(norm)=0.161964, mflops=30.871 (err=1.6e-16) 17. Krukar: elapsed time t=1.24851 s, 1048576 iters, t-(init.)=1.09699 s t(norm)=0.0435906, mflops=114.704 (err=1.5e-16) 18. Mayer (Buneman): elapsed time t=1.12255 s, 524288 iters, t-(init.)=1.04668 s t(norm)=0.0831825, mflops=60.1088 (err=1.1e-16) 19. Mayer (simple): elapsed time t=1.12051 s, 524288 iters, t-(init.)=1.04471 s t(norm)=0.0830262, mflops=60.222 20. Mayer (lookup): elapsed time t=1.15951 s, 524288 iters, t-(init.)=1.08367 s t(norm)=0.0861227, mflops=58.0567 (err=1.1e-16) 21. NAPACK (f2c): elapsed time t=1.42417 s, 131072 iters, t-(init.)=1.40519 s t(norm)=0.446698, mflops=11.1932 (err=1.7e-16) 22. Nielsen: elapsed time t=1.58319 s, 262144 iters, t-(init.)=1.54528 s t(norm)=0.245615, mflops=20.357 (err=7.5e-16) 23. NR (C): elapsed time t=1.99013 s, 524288 iters, t-(init.)=1.91438 s t(norm)=0.152142, mflops=32.8641 (err=1.6e-16) 24. Ooura (C): elapsed time t=1.21639 s, 1048576 iters, t-(init.)=1.06442 s t(norm)=0.0422962, mflops=118.214 (err=1.5e-16) 25. Skipping fft (QFT requires N >= 16). 26. Ransom: elapsed time t=1.26027 s, 65536 iters, t-(init.)=1.25079 s t(norm)=0.795228, mflops=6.2875 (err=3.1e-16) 27. Singleton (f2c): elapsed time t=1.13403 s, 262144 iters, t-(init.)=1.09612 s t(norm)=0.174224, mflops=28.6987 (err=1.4e-16) 28. Temperton (f2c): elapsed time t=1.08385 s, 131072 iters, t-(init.)=1.0649 s t(norm)=0.338522, mflops=14.7701 (err=1.4e-16) 29. Valkenburg: elapsed time t=1.63856 s, 131072 iters, t-(init.)=1.6193 s t(norm)=0.514763, mflops=9.7132 (err=1.4e-16) Top mflops for N=8 = 258.561 Normalized results and averages for N=8: fft 0: mflops = 111.474 (norm. = 0.431132), norm. avg. (of 3) = 0.496307 fft 1: mflops = 111.523 (norm. = 0.431322), norm. avg. (of 3) = 0.489022 fft 2: mflops = 41.091 (norm. = 0.158922), norm. avg. (of 3) = 0.236662 fft 3: mflops = 8.81294 (norm. = 0.0340845), norm. avg. (of 3) = 0.0327258 fft 4: mflops = 16.4177 (norm. = 0.0634963), norm. avg. (of 3) = 0.0743255 fft 5: mflops = 36.1713 (norm. = 0.139895), norm. avg. (of 3) = 0.120927 fft 6: mflops = 46.5059 (norm. = 0.179864), norm. avg. (of 3) = 0.127466 fft 7: mflops = 28.5862 (norm. = 0.110559), norm. avg. (of 3) = 0.0841235 fft 8: mflops = 32.8268 (norm. = 0.12696), norm. avg. (of 2) = 0.14911 fft 9: mflops = 31.2487 (norm. = 0.120856), norm. avg. (of 3) = 0.12222 fft 10: mflops = 193.913 (norm. = 0.749971), norm. avg. (of 3) = 0.691547 fft 11: mflops = 195.053 (norm. = 0.754378), norm. avg. (of 3) = 0.697889 fft 12: mflops = 258.561 (norm. = 1), norm. avg. (of 3) = 1 fft 13: mflops = 83.1961 (norm. = 0.321766), norm. avg. (of 1) = 0.321766 fft 14: mflops = 49.3821 (norm. = 0.190988), norm. avg. (of 3) = 0.21386 fft 15: mflops = 31.7505 (norm. = 0.122797), norm. avg. (of 3) = 0.131123 fft 16: mflops = 30.871 (norm. = 0.119395), norm. avg. (of 3) = 0.126751 fft 17: mflops = 114.704 (norm. = 0.443623), norm. avg. (of 3) = 0.492029 fft 18: mflops = 60.1088 (norm. = 0.232474), norm. avg. (of 2) = 0.219783 fft 19: mflops = 60.222 (norm. = 0.232912), norm. avg. (of 2) = 0.222249 fft 20: mflops = 58.0567 (norm. = 0.224538), norm. avg. (of 2) = 0.21072 fft 21: mflops = 11.1932 (norm. = 0.0432905), norm. avg. (of 3) = 0.0494972 fft 22: mflops = 20.357 (norm. = 0.0787321), norm. avg. (of 3) = 0.0596152 fft 23: mflops = 32.8641 (norm. = 0.127104), norm. avg. (of 3) = 0.144612 fft 24: mflops = 118.214 (norm. = 0.457199), norm. avg. (of 3) = 0.479821 fft 25: mflops = -1 (norm. = -0.00386756), norm. avg. (of 0) = -1 fft 26: mflops = 6.2875 (norm. = 0.0243173), norm. avg. (of 2) = 0.0277474 fft 27: mflops = 28.6987 (norm. = 0.110994), norm. avg. (of 3) = 0.130686 fft 28: mflops = 14.7701 (norm. = 0.0571241), norm. avg. (of 3) = 0.0597111 fft 29: mflops = 9.7132 (norm. = 0.0375664), norm. avg. (of 3) = 0.0838289 Benchmarking for array size = 16 (power of 2): 0. Arndt DIF: elapsed time t=1.20694 s, 262144 iters, t-(init.)=1.14055 s t(norm)=0.0679819, mflops=73.549 (err=1.9e-16) 1. Arndt DIT: elapsed time t=1.19935 s, 262144 iters, t-(init.)=1.13327 s t(norm)=0.0675482, mflops=74.0212 (err=1.9e-16) 2. Arndt Split-Radix: elapsed time t=1.74688 s, 262144 iters, t-(init.)=1.68078 s t(norm)=0.100182, mflops=49.909 (err=1.5e-16) 3. Arndt 4-step: elapsed time t=1.24059 s, 65536 iters, t-(init.)=1.22403 s t(norm)=0.291831, mflops=17.1332 (err=2.0e-16) 4. Beauregard: elapsed time t=1.13941 s, 65536 iters, t-(init.)=1.12288 s t(norm)=0.267715, mflops=18.6766 (err=2.3e-16) 5. Bergland: elapsed time t=1.46042 s, 262144 iters, t-(init.)=1.3943 s t(norm)=0.0831069, mflops=60.1635 (err=2.6e-16) 6. CWP (min N): elapsed time t=1.07869 s, 262144 iters, t-(init.)=1.01258 s t(norm)=0.0603547, mflops=82.8437 7. CWP (best N) (N=28): elapsed time t=1.49977 s, 262144 iters, t-(init.)=1.39139 s t(norm)=0.0829335, mflops=60.2893 8. Edelblute: elapsed time t=1.15813 s, 131072 iters, t-(init.)=1.12502 s t(norm)=0.134113, mflops=37.2819 (err=1.6e-16) 9. FFTPACK (f2c): elapsed time t=1.8753 s, 262144 iters, t-(init.)=1.80821 s t(norm)=0.107777, mflops=46.3919 (err=2.1e-16) FFTW_MEASURE plan: (cost = 1.401611e-06) FFTW_NOTW 16 10. FFTW: elapsed time t=1.54777 s, 1048576 iters, t-(init.)=1.27996 s t(norm)=0.0190729, mflops=262.152 (err=2.2e-16) FFTW_ESTIMATE plan: (cost = 4.256000e+02) FFTW_NOTW 16 11. FFTW_ESTIMATE: elapsed time t=1.54096 s, 1048576 iters, t-(init.)=1.2764 s t(norm)=0.0190198, mflops=262.884 (err=2.2e-16) 12. Frigo-old: elapsed time t=1.47779 s, 1048576 iters, t-(init.)=1.21348 s t(norm)=0.0180822, mflops=276.514 (err=2.2e-16) 13. Green: elapsed time t=1.40706 s, 524288 iters, t-(init.)=1.27489 s t(norm)=0.0379948, mflops=131.597 (err=2.6e-16) 14. GSL: elapsed time t=1.19625 s, 262144 iters, t-(init.)=1.13015 s t(norm)=0.0673623, mflops=74.2255 (err=2.1e-16) 15. GSL DIT: elapsed time t=1.87594 s, 262144 iters, t-(init.)=1.80897 s t(norm)=0.107823, mflops=46.3724 (err=3.1e-16) 16. GSL DIF: elapsed time t=1.90053 s, 262144 iters, t-(init.)=1.83443 s t(norm)=0.109341, mflops=45.7286 (err=2.5e-16) 17. Krukar: elapsed time t=1.39477 s, 524288 iters, t-(init.)=1.2625 s t(norm)=0.0376253, mflops=132.889 (err=1.7e-16) 18. Mayer (Buneman): elapsed time t=1.47781 s, 262144 iters, t-(init.)=1.41171 s t(norm)=0.0841443, mflops=59.4217 (err=2.3e-16) 19. Mayer (simple): elapsed time t=1.26187 s, 262144 iters, t-(init.)=1.19577 s t(norm)=0.0712732, mflops=70.1526 20. Mayer (lookup): elapsed time t=1.25872 s, 262144 iters, t-(init.)=1.19262 s t(norm)=0.0710856, mflops=70.3378 (err=2.1e-16) 21. NAPACK (f2c): elapsed time t=1.34629 s, 65536 iters, t-(init.)=1.32975 s t(norm)=0.317036, mflops=15.7711 (err=2.7e-16) 22. Nielsen: elapsed time t=1.09164 s, 65536 iters, t-(init.)=1.07511 s t(norm)=0.256326, mflops=19.5064 (err=1.8e-16) 23. NR (C): elapsed time t=1.81996 s, 262144 iters, t-(init.)=1.75385 s t(norm)=0.104538, mflops=47.8296 (err=2.9e-16) 24. Ooura (C): elapsed time t=1.22409 s, 524288 iters, t-(init.)=1.09152 s t(norm)=0.0325298, mflops=153.705 (err=2.5e-16) 25. QFT: elapsed time t=1.98508 s, 524288 iters, t-(init.)=1.85105 s t(norm)=0.0551657, mflops=90.6361 (err=1.4e-16) 26. Ransom: elapsed time t=1.02833 s, 65536 iters, t-(init.)=1.01179 s t(norm)=0.241229, mflops=20.7272 (err=5.0e-16) 27. Singleton (f2c): elapsed time t=1.1793 s, 262144 iters, t-(init.)=1.11319 s t(norm)=0.0663512, mflops=75.3566 (err=2.0e-16) 28. Temperton (f2c): elapsed time t=1.00948 s, 65536 iters, t-(init.)=0.992901 s t(norm)=0.236726, mflops=21.1215 (err=2.1e-16) 29. Valkenburg: elapsed time t=1.04624 s, 32768 iters, t-(init.)=1.03788 s t(norm)=0.494901, mflops=10.103 (err=2.5e-16) Top mflops for N=16 = 276.514 Normalized results and averages for N=16: fft 0: mflops = 73.549 (norm. = 0.265986), norm. avg. (of 4) = 0.438727 fft 1: mflops = 74.0212 (norm. = 0.267694), norm. avg. (of 4) = 0.43369 fft 2: mflops = 49.909 (norm. = 0.180493), norm. avg. (of 4) = 0.22262 fft 3: mflops = 17.1332 (norm. = 0.0619614), norm. avg. (of 4) = 0.0400347 fft 4: mflops = 18.6766 (norm. = 0.0675429), norm. avg. (of 4) = 0.0726298 fft 5: mflops = 60.1635 (norm. = 0.217578), norm. avg. (of 4) = 0.14509 fft 6: mflops = 82.8437 (norm. = 0.2996), norm. avg. (of 4) = 0.1705 fft 7: mflops = 60.2893 (norm. = 0.218033), norm. avg. (of 4) = 0.117601 fft 8: mflops = 37.2819 (norm. = 0.134828), norm. avg. (of 3) = 0.144349 fft 9: mflops = 46.3919 (norm. = 0.167774), norm. avg. (of 4) = 0.133609 fft 10: mflops = 262.152 (norm. = 0.948059), norm. avg. (of 4) = 0.755675 fft 11: mflops = 262.884 (norm. = 0.950706), norm. avg. (of 4) = 0.761093 fft 12: mflops = 276.514 (norm. = 1), norm. avg. (of 4) = 1 fft 13: mflops = 131.597 (norm. = 0.475914), norm. avg. (of 2) = 0.39884 fft 14: mflops = 74.2255 (norm. = 0.268433), norm. avg. (of 4) = 0.227503 fft 15: mflops = 46.3724 (norm. = 0.167703), norm. avg. (of 4) = 0.140268 fft 16: mflops = 45.7286 (norm. = 0.165375), norm. avg. (of 4) = 0.136407 fft 17: mflops = 132.889 (norm. = 0.480587), norm. avg. (of 4) = 0.489169 fft 18: mflops = 59.4217 (norm. = 0.214896), norm. avg. (of 3) = 0.218154 fft 19: mflops = 70.1526 (norm. = 0.253703), norm. avg. (of 3) = 0.232734 fft 20: mflops = 70.3378 (norm. = 0.254373), norm. avg. (of 3) = 0.225271 fft 21: mflops = 15.7711 (norm. = 0.0570352), norm. avg. (of 4) = 0.0513817 fft 22: mflops = 19.5064 (norm. = 0.070544), norm. avg. (of 4) = 0.0623474 fft 23: mflops = 47.8296 (norm. = 0.172973), norm. avg. (of 4) = 0.151702 fft 24: mflops = 153.705 (norm. = 0.555867), norm. avg. (of 4) = 0.498833 fft 25: mflops = 90.6361 (norm. = 0.327781), norm. avg. (of 1) = 0.327781 fft 26: mflops = 20.7272 (norm. = 0.0749589), norm. avg. (of 3) = 0.0434846 fft 27: mflops = 75.3566 (norm. = 0.272523), norm. avg. (of 4) = 0.166146 fft 28: mflops = 21.1215 (norm. = 0.0763847), norm. avg. (of 4) = 0.0638795 fft 29: mflops = 10.103 (norm. = 0.0365371), norm. avg. (of 4) = 0.0720059 Benchmarking for array size = 32 (power of 2): 0. Arndt DIF: elapsed time t=1.24333 s, 131072 iters, t-(init.)=1.18207 s t(norm)=0.0563655, mflops=88.7067 (err=2.4e-16) 1. Arndt DIT: elapsed time t=1.23483 s, 131072 iters, t-(init.)=1.17362 s t(norm)=0.0559624, mflops=89.3457 (err=2.7e-16) 2. Arndt Split-Radix: elapsed time t=1.84513 s, 131072 iters, t-(init.)=1.78381 s t(norm)=0.0850586, mflops=58.783 (err=3.0e-16) 3. Arndt 4-step: elapsed time t=1.29092 s, 32768 iters, t-(init.)=1.27558 s t(norm)=0.243298, mflops=20.551 (err=2.4e-16) 4. Beauregard: elapsed time t=1.34232 s, 32768 iters, t-(init.)=1.32695 s t(norm)=0.253095, mflops=19.7554 (err=2.5e-16) 5. Bergland: elapsed time t=1.21614 s, 131072 iters, t-(init.)=1.15489 s t(norm)=0.0550696, mflops=90.7942 (err=2.6e-16) 6. CWP (min N) (N=33): elapsed time t=1.25305 s, 131072 iters, t-(init.)=1.19003 s t(norm)=0.0567452, mflops=88.1131 7. CWP (best N) (N=35): elapsed time t=1.99182 s, 262144 iters, t-(init.)=1.85877 s t(norm)=0.0443165, mflops=112.825 8. Edelblute: elapsed time t=1.26789 s, 65536 iters, t-(init.)=1.23736 s t(norm)=0.118004, mflops=42.3716 (err=2.9e-16) 9. FFTPACK (f2c): elapsed time t=1.39422 s, 65536 iters, t-(init.)=1.36333 s t(norm)=0.130018, mflops=38.4563 (err=2.3e-16) FFTW_MEASURE plan: (cost = 2.905121e-06) FFTW_NOTW 32 10. FFTW: elapsed time t=1.5874 s, 524288 iters, t-(init.)=1.34074 s t(norm)=0.0159828, mflops=312.836 (err=2.4e-16) FFTW_ESTIMATE plan: (cost = 3.200000e+01) FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.58425 s, 524288 iters, t-(init.)=1.33915 s t(norm)=0.0159639, mflops=313.206 (err=2.4e-16) 12. Frigo-old: elapsed time t=1.57212 s, 524288 iters, t-(init.)=1.3272 s t(norm)=0.0158215, mflops=316.026 (err=2.1e-16) 13. Green: elapsed time t=1.27824 s, 262144 iters, t-(init.)=1.15571 s t(norm)=0.0275542, mflops=181.461 (err=2.4e-16) 14. GSL: elapsed time t=1.57714 s, 131072 iters, t-(init.)=1.51587 s t(norm)=0.0722821, mflops=69.1734 (err=2.3e-16) 15. GSL DIT: elapsed time t=1.78821 s, 131072 iters, t-(init.)=1.72656 s t(norm)=0.082329, mflops=60.732 (err=3.1e-16) 16. GSL DIF: elapsed time t=1.76801 s, 131072 iters, t-(init.)=1.70659 s t(norm)=0.0813763, mflops=61.4429 (err=3.2e-16) 17. Krukar: elapsed time t=1.65654 s, 262144 iters, t-(init.)=1.53403 s t(norm)=0.0365741, mflops=136.709 (err=2.7e-16) 18. Mayer (Buneman): elapsed time t=1.57309 s, 131072 iters, t-(init.)=1.51179 s t(norm)=0.0720878, mflops=69.3599 (err=2.8e-16) 19. Mayer (simple): elapsed time t=1.27549 s, 131072 iters, t-(init.)=1.21427 s t(norm)=0.057901, mflops=86.3543 20. Mayer (lookup): elapsed time t=1.25001 s, 131072 iters, t-(init.)=1.18871 s t(norm)=0.0566822, mflops=88.2111 (err=2.6e-16) 21. NAPACK (f2c): elapsed time t=1.47061 s, 32768 iters, t-(init.)=1.45527 s t(norm)=0.27757, mflops=18.0135 (err=6.4e-16) 22. Nielsen: elapsed time t=1.90421 s, 65536 iters, t-(init.)=1.87357 s t(norm)=0.178678, mflops=27.9833 (err=1.1e-15) 23. NR (C): elapsed time t=1.7109 s, 131072 iters, t-(init.)=1.64963 s t(norm)=0.0786603, mflops=63.5644 (err=2.9e-16) 24. Ooura (C): elapsed time t=1.32046 s, 262144 iters, t-(init.)=1.19782 s t(norm)=0.0285583, mflops=175.081 (err=2.5e-16) 25. QFT: elapsed time t=1.32905 s, 131072 iters, t-(init.)=1.26732 s t(norm)=0.0604307, mflops=82.7394 (err=2.8e-16) 26. Ransom: elapsed time t=1.26916 s, 32768 iters, t-(init.)=1.25382 s t(norm)=0.239147, mflops=20.9077 (err=7.4e-16) 27. Singleton (f2c): elapsed time t=1.12253 s, 131072 iters, t-(init.)=1.06122 s t(norm)=0.050603, mflops=98.8084 (err=2.3e-16) 28. Temperton (f2c): elapsed time t=1.34394 s, 32768 iters, t-(init.)=1.32859 s t(norm)=0.253409, mflops=19.7309 (err=2.6e-16) 29. Valkenburg: elapsed time t=1.26715 s, 16384 iters, t-(init.)=1.25939 s t(norm)=0.480419, mflops=10.4076 (err=2.8e-16) Top mflops for N=32 = 316.026 Normalized results and averages for N=32: fft 0: mflops = 88.7067 (norm. = 0.280695), norm. avg. (of 5) = 0.40712 fft 1: mflops = 89.3457 (norm. = 0.282717), norm. avg. (of 5) = 0.403495 fft 2: mflops = 58.783 (norm. = 0.186007), norm. avg. (of 5) = 0.215297 fft 3: mflops = 20.551 (norm. = 0.0650294), norm. avg. (of 5) = 0.0450337 fft 4: mflops = 19.7554 (norm. = 0.062512), norm. avg. (of 5) = 0.0706063 fft 5: mflops = 90.7942 (norm. = 0.2873), norm. avg. (of 5) = 0.173532 fft 6: mflops = 88.1131 (norm. = 0.278816), norm. avg. (of 5) = 0.192163 fft 7: mflops = 112.825 (norm. = 0.357012), norm. avg. (of 5) = 0.165483 fft 8: mflops = 42.3716 (norm. = 0.134077), norm. avg. (of 4) = 0.141781 fft 9: mflops = 38.4563 (norm. = 0.121687), norm. avg. (of 5) = 0.131224 fft 10: mflops = 312.836 (norm. = 0.989908), norm. avg. (of 5) = 0.802521 fft 11: mflops = 313.206 (norm. = 0.991077), norm. avg. (of 5) = 0.80709 fft 12: mflops = 316.026 (norm. = 1), norm. avg. (of 5) = 1 fft 13: mflops = 181.461 (norm. = 0.574196), norm. avg. (of 3) = 0.457292 fft 14: mflops = 69.1734 (norm. = 0.218886), norm. avg. (of 5) = 0.22578 fft 15: mflops = 60.732 (norm. = 0.192174), norm. avg. (of 5) = 0.150649 fft 16: mflops = 61.4429 (norm. = 0.194424), norm. avg. (of 5) = 0.148011 fft 17: mflops = 136.709 (norm. = 0.432588), norm. avg. (of 5) = 0.477853 fft 18: mflops = 69.3599 (norm. = 0.219475), norm. avg. (of 4) = 0.218484 fft 19: mflops = 86.3543 (norm. = 0.273251), norm. avg. (of 4) = 0.242863 fft 20: mflops = 88.2111 (norm. = 0.279126), norm. avg. (of 4) = 0.238735 fft 21: mflops = 18.0135 (norm. = 0.057), norm. avg. (of 5) = 0.0525054 fft 22: mflops = 27.9833 (norm. = 0.0885476), norm. avg. (of 5) = 0.0675874 fft 23: mflops = 63.5644 (norm. = 0.201137), norm. avg. (of 5) = 0.161589 fft 24: mflops = 175.081 (norm. = 0.554008), norm. avg. (of 5) = 0.509868 fft 25: mflops = 82.7394 (norm. = 0.261812), norm. avg. (of 2) = 0.294797 fft 26: mflops = 20.9077 (norm. = 0.0661582), norm. avg. (of 4) = 0.049153 fft 27: mflops = 98.8084 (norm. = 0.312659), norm. avg. (of 5) = 0.195448 fft 28: mflops = 19.7309 (norm. = 0.0624347), norm. avg. (of 5) = 0.0635905 fft 29: mflops = 10.4076 (norm. = 0.0329327), norm. avg. (of 5) = 0.0641913 Benchmarking for array size = 64 (power of 2): 0. Arndt DIF: elapsed time t=1.43743 s, 65536 iters, t-(init.)=1.37865 s t(norm)=0.0547825, mflops=91.2701 (err=5.0e-16) 1. Arndt DIT: elapsed time t=1.43151 s, 65536 iters, t-(init.)=1.37264 s t(norm)=0.054544, mflops=91.6691 (err=4.9e-16) 2. Arndt Split-Radix: elapsed time t=1.93589 s, 65536 iters, t-(init.)=1.87709 s t(norm)=0.0745887, mflops=67.0343 (err=4.5e-16) 3. Arndt 4-step: elapsed time t=1.04855 s, 16384 iters, t-(init.)=1.0338 s t(norm)=0.164319, mflops=30.4287 (err=4.9e-16) 4. Beauregard: elapsed time t=1.58045 s, 16384 iters, t-(init.)=1.56571 s t(norm)=0.248863, mflops=20.0914 (err=4.5e-16) 5. Bergland: elapsed time t=1.19649 s, 65536 iters, t-(init.)=1.13761 s t(norm)=0.0452047, mflops=110.608 (err=5.5e-16) 6. CWP (min N) (N=65): elapsed time t=1.30254 s, 65536 iters, t-(init.)=1.24279 s t(norm)=0.049384, mflops=101.247 7. CWP (best N) (N=84): elapsed time t=1.02498 s, 65536 iters, t-(init.)=0.948476 s t(norm)=0.0376891, mflops=132.665 8. Edelblute: elapsed time t=1.34834 s, 32768 iters, t-(init.)=1.31896 s t(norm)=0.104821, mflops=47.7002 (err=4.6e-16) 9. FFTPACK (f2c): elapsed time t=1.40359 s, 32768 iters, t-(init.)=1.37404 s t(norm)=0.109199, mflops=45.7881 (err=4.4e-16) FFTW_MEASURE plan: (cost = 6.343994e-06) FFTW_NOTW 64 10. FFTW: elapsed time t=1.72289 s, 262144 iters, t-(init.)=1.48652 s t(norm)=0.0147672, mflops=338.588 (err=4.4e-16) FFTW_ESTIMATE plan: (cost = 7.680000e+02) FFTW_TWIDDLE 2 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.06027 s, 131072 iters, t-(init.)=0.942646 s t(norm)=0.0187287, mflops=266.97 (err=4.7e-16) 12. Frigo-old: elapsed time t=1.36945 s, 131072 iters, t-(init.)=1.25179 s t(norm)=0.0248709, mflops=201.039 (err=4.5e-16) 13. Green: elapsed time t=1.10833 s, 131072 iters, t-(init.)=0.990743 s t(norm)=0.0196843, mflops=254.01 (err=4.6e-16) 14. GSL: elapsed time t=1.56148 s, 65536 iters, t-(init.)=1.50257 s t(norm)=0.0597068, mflops=83.7425 (err=4.4e-16) 15. GSL DIT: elapsed time t=1.8306 s, 65536 iters, t-(init.)=1.7716 s t(norm)=0.070397, mflops=71.0258 (err=4.6e-16) 16. GSL DIF: elapsed time t=1.76959 s, 65536 iters, t-(init.)=1.71076 s t(norm)=0.0679795, mflops=73.5516 (err=4.9e-16) 17. Krukar: elapsed time t=1.95406 s, 131072 iters, t-(init.)=1.83643 s t(norm)=0.0364866, mflops=137.037 (err=5.2e-16) 18. Mayer (Buneman): elapsed time t=1.78357 s, 65536 iters, t-(init.)=1.72497 s t(norm)=0.0685443, mflops=72.9456 (err=4.8e-16) 19. Mayer (simple): elapsed time t=1.39031 s, 65536 iters, t-(init.)=1.33141 s t(norm)=0.0529056, mflops=94.508 20. Mayer (lookup): elapsed time t=1.34487 s, 65536 iters, t-(init.)=1.28597 s t(norm)=0.0510997, mflops=97.8479 (err=4.5e-16) 21. NAPACK (f2c): elapsed time t=1.51827 s, 16384 iters, t-(init.)=1.5035 s t(norm)=0.238975, mflops=20.9227 (err=1.1e-15) 22. Nielsen: elapsed time t=1.77825 s, 32768 iters, t-(init.)=1.7488 s t(norm)=0.138982, mflops=35.9758 (err=1.9e-15) 23. NR (C): elapsed time t=1.71293 s, 65536 iters, t-(init.)=1.65415 s t(norm)=0.0657301, mflops=76.0687 (err=4.4e-16) 24. Ooura (C): elapsed time t=1.39592 s, 131072 iters, t-(init.)=1.27817 s t(norm)=0.0253949, mflops=196.89 (err=5.4e-16) 25. QFT: elapsed time t=1.64949 s, 65536 iters, t-(init.)=1.5901 s t(norm)=0.0631847, mflops=79.133 (err=4.9e-16) 26. Ransom: elapsed time t=1.52607 s, 32768 iters, t-(init.)=1.49663 s t(norm)=0.118942, mflops=42.0373 (err=9.1e-16) 27. Singleton (f2c): elapsed time t=1.91736 s, 131072 iters, t-(init.)=1.79946 s t(norm)=0.035752, mflops=139.852 (err=6.5e-16) 28. Temperton (f2c): elapsed time t=1.23249 s, 16384 iters, t-(init.)=1.21772 s t(norm)=0.193551, mflops=25.833 (err=4.7e-16) 29. Valkenburg: elapsed time t=1.48632 s, 8192 iters, t-(init.)=1.47893 s t(norm)=0.470139, mflops=10.6352 (err=6.0e-16) Top mflops for N=64 = 338.588 Normalized results and averages for N=64: fft 0: mflops = 91.2701 (norm. = 0.269561), norm. avg. (of 6) = 0.384194 fft 1: mflops = 91.6691 (norm. = 0.270739), norm. avg. (of 6) = 0.381369 fft 2: mflops = 67.0343 (norm. = 0.197982), norm. avg. (of 6) = 0.212411 fft 3: mflops = 30.4287 (norm. = 0.0898694), norm. avg. (of 6) = 0.0525063 fft 4: mflops = 20.0914 (norm. = 0.0593387), norm. avg. (of 6) = 0.0687283 fft 5: mflops = 110.608 (norm. = 0.326674), norm. avg. (of 6) = 0.199055 fft 6: mflops = 101.247 (norm. = 0.299028), norm. avg. (of 6) = 0.209974 fft 7: mflops = 132.665 (norm. = 0.391817), norm. avg. (of 6) = 0.203205 fft 8: mflops = 47.7002 (norm. = 0.14088), norm. avg. (of 5) = 0.141601 fft 9: mflops = 45.7881 (norm. = 0.135233), norm. avg. (of 6) = 0.131892 fft 10: mflops = 338.588 (norm. = 1), norm. avg. (of 6) = 0.835435 fft 11: mflops = 266.97 (norm. = 0.788481), norm. avg. (of 6) = 0.803988 fft 12: mflops = 201.039 (norm. = 0.593756), norm. avg. (of 6) = 0.932293 fft 13: mflops = 254.01 (norm. = 0.750203), norm. avg. (of 4) = 0.53052 fft 14: mflops = 83.7425 (norm. = 0.247329), norm. avg. (of 6) = 0.229371 fft 15: mflops = 71.0258 (norm. = 0.209771), norm. avg. (of 6) = 0.160503 fft 16: mflops = 73.5516 (norm. = 0.21723), norm. avg. (of 6) = 0.159547 fft 17: mflops = 137.037 (norm. = 0.40473), norm. avg. (of 6) = 0.465665 fft 18: mflops = 72.9456 (norm. = 0.21544), norm. avg. (of 5) = 0.217876 fft 19: mflops = 94.508 (norm. = 0.279124), norm. avg. (of 5) = 0.250115 fft 20: mflops = 97.8479 (norm. = 0.288988), norm. avg. (of 5) = 0.248786 fft 21: mflops = 20.9227 (norm. = 0.0617939), norm. avg. (of 6) = 0.0540534 fft 22: mflops = 35.9758 (norm. = 0.106253), norm. avg. (of 6) = 0.0740316 fft 23: mflops = 76.0687 (norm. = 0.224664), norm. avg. (of 6) = 0.172102 fft 24: mflops = 196.89 (norm. = 0.581503), norm. avg. (of 6) = 0.521807 fft 25: mflops = 79.133 (norm. = 0.233715), norm. avg. (of 3) = 0.274436 fft 26: mflops = 42.0373 (norm. = 0.124155), norm. avg. (of 5) = 0.0641533 fft 27: mflops = 139.852 (norm. = 0.413046), norm. avg. (of 6) = 0.231715 fft 28: mflops = 25.833 (norm. = 0.0762962), norm. avg. (of 6) = 0.0657081 fft 29: mflops = 10.6352 (norm. = 0.0314103), norm. avg. (of 6) = 0.0587278 Benchmarking for array size = 128 (power of 2): 0. Arndt DIF: elapsed time t=1.50372 s, 32768 iters, t-(init.)=1.44604 s t(norm)=0.0492519, mflops=101.519 (err=4.0e-16) 1. Arndt DIT: elapsed time t=1.49745 s, 32768 iters, t-(init.)=1.43986 s t(norm)=0.0490412, mflops=101.955 (err=4.1e-16) 2. Arndt Split-Radix: elapsed time t=1.01192 s, 16384 iters, t-(init.)=0.98311 s t(norm)=0.0669691, mflops=74.6614 (err=4.4e-16) 3. Arndt 4-step: elapsed time t=1.21117 s, 8192 iters, t-(init.)=1.1968 s t(norm)=0.163051, mflops=30.6652 (err=4.0e-16) 4. Beauregard: elapsed time t=1.83818 s, 8192 iters, t-(init.)=1.82374 s t(norm)=0.248465, mflops=20.1235 (err=4.1e-16) 5. Bergland: elapsed time t=1.23401 s, 32768 iters, t-(init.)=1.1764 s t(norm)=0.0400678, mflops=124.788 (err=4.3e-16) 6. CWP (min N) (N=130): elapsed time t=1.28308 s, 32768 iters, t-(init.)=1.22453 s t(norm)=0.0417072, mflops=119.883 7. CWP (best N) (N=140): elapsed time t=1.70242 s, 65536 iters, t-(init.)=1.57647 s t(norm)=0.0268472, mflops=186.239 8. Edelblute: elapsed time t=1.40841 s, 16384 iters, t-(init.)=1.37946 s t(norm)=0.0939684, mflops=53.2094 (err=4.1e-16) 9. FFTPACK (f2c): elapsed time t=1.45274 s, 16384 iters, t-(init.)=1.42384 s t(norm)=0.0969912, mflops=51.5511 (err=4.1e-16) FFTW_MEASURE plan: (cost = 1.647314e-05) FFTW_TWIDDLE 4 FFTW_NOTW 32 10. FFTW: elapsed time t=1.10795 s, 65536 iters, t-(init.)=0.992577 s t(norm)=0.0169035, mflops=295.797 (err=4.2e-16) FFTW_ESTIMATE plan: (cost = 1.075200e+03) FFTW_TWIDDLE 4 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.10752 s, 65536 iters, t-(init.)=0.992058 s t(norm)=0.0168946, mflops=295.952 (err=4.2e-16) 12. Frigo-old: elapsed time t=1.42984 s, 65536 iters, t-(init.)=1.31429 s t(norm)=0.0223823, mflops=223.391 (err=4.4e-16) 13. Green: elapsed time t=1.16733 s, 65536 iters, t-(init.)=1.05211 s t(norm)=0.0179174, mflops=279.058 (err=4.4e-16) 14. GSL: elapsed time t=1.76307 s, 32768 iters, t-(init.)=1.70519 s t(norm)=0.0580784, mflops=86.0906 (err=4.2e-16) 15. GSL DIT: elapsed time t=1.92483 s, 32768 iters, t-(init.)=1.86695 s t(norm)=0.063588, mflops=78.6312 (err=4.3e-16) 16. GSL DIF: elapsed time t=1.82492 s, 32768 iters, t-(init.)=1.76732 s t(norm)=0.0601945, mflops=83.0641 (err=4.6e-16) 17. Krukar: elapsed time t=1.13731 s, 16384 iters, t-(init.)=1.10829 s t(norm)=0.0754961, mflops=66.2286 (err=4.6e-16) 18. Mayer (Buneman): elapsed time t=1.8929 s, 32768 iters, t-(init.)=1.83526 s t(norm)=0.0625087, mflops=79.9889 (err=4.0e-16) 19. Mayer (simple): elapsed time t=1.43957 s, 32768 iters, t-(init.)=1.38196 s t(norm)=0.0470694, mflops=106.226 20. Mayer (lookup): elapsed time t=1.39314 s, 32768 iters, t-(init.)=1.33547 s t(norm)=0.0454858, mflops=109.924 (err=4.3e-16) 21. NAPACK (f2c): elapsed time t=1.72643 s, 8192 iters, t-(init.)=1.71198 s t(norm)=0.233239, mflops=21.4372 (err=1.2e-15) 22. Nielsen: elapsed time t=1.13552 s, 8192 iters, t-(init.)=1.12108 s t(norm)=0.152735, mflops=32.7365 (err=1.3e-15) 23. NR (C): elapsed time t=1.75863 s, 32768 iters, t-(init.)=1.70087 s t(norm)=0.0579313, mflops=86.3092 (err=4.4e-16) 24. Ooura (C): elapsed time t=1.57783 s, 65536 iters, t-(init.)=1.46259 s t(norm)=0.0249077, mflops=200.741 (err=4.1e-16) 25. QFT: elapsed time t=1.96473 s, 32768 iters, t-(init.)=1.90695 s t(norm)=0.0649505, mflops=76.9817 (err=4.6e-16) 26. Ransom: elapsed time t=1.88367 s, 16384 iters, t-(init.)=1.85482 s t(norm)=0.12635, mflops=39.5727 (err=1.1e-15) 27. Singleton (f2c): elapsed time t=1.17789 s, 32768 iters, t-(init.)=1.12018 s t(norm)=0.0381533, mflops=131.05 (err=5.3e-16) 28. Temperton (f2c): elapsed time t=1.49835 s, 8192 iters, t-(init.)=1.48388 s t(norm)=0.202163, mflops=24.7325 (err=4.4e-16) 29. Valkenburg: elapsed time t=1.70761 s, 4096 iters, t-(init.)=1.70039 s t(norm)=0.46332, mflops=10.7917 (err=4.8e-16) Top mflops for N=128 = 295.952 Normalized results and averages for N=128: fft 0: mflops = 101.519 (norm. = 0.343025), norm. avg. (of 7) = 0.378313 fft 1: mflops = 101.955 (norm. = 0.344499), norm. avg. (of 7) = 0.376102 fft 2: mflops = 74.6614 (norm. = 0.252275), norm. avg. (of 7) = 0.218106 fft 3: mflops = 30.6652 (norm. = 0.103616), norm. avg. (of 7) = 0.0598076 fft 4: mflops = 20.1235 (norm. = 0.067996), norm. avg. (of 7) = 0.0686237 fft 5: mflops = 124.788 (norm. = 0.421651), norm. avg. (of 7) = 0.230855 fft 6: mflops = 119.883 (norm. = 0.405077), norm. avg. (of 7) = 0.237846 fft 7: mflops = 186.239 (norm. = 0.629289), norm. avg. (of 7) = 0.264074 fft 8: mflops = 53.2094 (norm. = 0.179791), norm. avg. (of 6) = 0.147966 fft 9: mflops = 51.5511 (norm. = 0.174187), norm. avg. (of 7) = 0.137935 fft 10: mflops = 295.797 (norm. = 0.999477), norm. avg. (of 7) = 0.858869 fft 11: mflops = 295.952 (norm. = 1), norm. avg. (of 7) = 0.83199 fft 12: mflops = 223.391 (norm. = 0.754823), norm. avg. (of 7) = 0.90694 fft 13: mflops = 279.058 (norm. = 0.942918), norm. avg. (of 5) = 0.612999 fft 14: mflops = 86.0906 (norm. = 0.290894), norm. avg. (of 7) = 0.23816 fft 15: mflops = 78.6312 (norm. = 0.265689), norm. avg. (of 7) = 0.175529 fft 16: mflops = 83.0641 (norm. = 0.280668), norm. avg. (of 7) = 0.17685 fft 17: mflops = 66.2286 (norm. = 0.223782), norm. avg. (of 7) = 0.431111 fft 18: mflops = 79.9889 (norm. = 0.270277), norm. avg. (of 6) = 0.226609 fft 19: mflops = 106.226 (norm. = 0.35893), norm. avg. (of 6) = 0.268251 fft 20: mflops = 109.924 (norm. = 0.371427), norm. avg. (of 6) = 0.269226 fft 21: mflops = 21.4372 (norm. = 0.0724349), norm. avg. (of 7) = 0.0566794 fft 22: mflops = 32.7365 (norm. = 0.110614), norm. avg. (of 7) = 0.0792577 fft 23: mflops = 86.3092 (norm. = 0.291633), norm. avg. (of 7) = 0.189178 fft 24: mflops = 200.741 (norm. = 0.67829), norm. avg. (of 7) = 0.544162 fft 25: mflops = 76.9817 (norm. = 0.260116), norm. avg. (of 4) = 0.270856 fft 26: mflops = 39.5727 (norm. = 0.133713), norm. avg. (of 6) = 0.0757467 fft 27: mflops = 131.05 (norm. = 0.44281), norm. avg. (of 7) = 0.261871 fft 28: mflops = 24.7325 (norm. = 0.0835693), norm. avg. (of 7) = 0.0682597 fft 29: mflops = 10.7917 (norm. = 0.0364644), norm. avg. (of 7) = 0.0555473 Benchmarking for array size = 256 (power of 2): 0. Arndt DIF: elapsed time t=1.61575 s, 16384 iters, t-(init.)=1.55877 s t(norm)=0.0464548, mflops=107.631 (err=6.7e-16) 1. Arndt DIT: elapsed time t=1.61406 s, 16384 iters, t-(init.)=1.55698 s t(norm)=0.0464017, mflops=107.755 (err=7.1e-16) 2. Arndt Split-Radix: elapsed time t=1.05743 s, 8192 iters, t-(init.)=1.02889 s t(norm)=0.0613267, mflops=81.5305 (err=7.4e-16) 3. Arndt 4-step: elapsed time t=1.20178 s, 4096 iters, t-(init.)=1.18747 s t(norm)=0.141557, mflops=35.3214 (err=7.2e-16) 4. Beauregard: elapsed time t=1.04788 s, 2048 iters, t-(init.)=1.04075 s t(norm)=0.248134, mflops=20.1504 (err=7.8e-16) 5. Bergland: elapsed time t=1.23668 s, 16384 iters, t-(init.)=1.17947 s t(norm)=0.035151, mflops=142.243 (err=8.3e-16) 6. CWP (min N) (N=260): elapsed time t=1.24071 s, 16384 iters, t-(init.)=1.18285 s t(norm)=0.0352517, mflops=141.837 7. CWP (best N) (N=280): elapsed time t=1.86264 s, 32768 iters, t-(init.)=1.73779 s t(norm)=0.0258951, mflops=193.087 8. Edelblute: elapsed time t=1.46587 s, 8192 iters, t-(init.)=1.43737 s t(norm)=0.0856739, mflops=58.3608 (err=7.0e-16) 9. FFTPACK (f2c): elapsed time t=1.52898 s, 8192 iters, t-(init.)=1.50031 s t(norm)=0.0894257, mflops=55.9123 (err=7.8e-16) FFTW_MEASURE plan: (cost = 3.501318e-05) FFTW_TWIDDLE 4 FFTW_NOTW 64 10. FFTW: elapsed time t=1.17673 s, 32768 iters, t-(init.)=1.06234 s t(norm)=0.0158301, mflops=315.854 (err=8.0e-16) FFTW_ESTIMATE plan: (cost = 9.216000e+02) FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.19545 s, 32768 iters, t-(init.)=1.08114 s t(norm)=0.0161102, mflops=310.362 (err=8.1e-16) 12. Frigo-old: elapsed time t=1.45791 s, 32768 iters, t-(init.)=1.34387 s t(norm)=0.0200252, mflops=249.685 (err=8.0e-16) 13. Green: elapsed time t=1.23826 s, 32768 iters, t-(init.)=1.12428 s t(norm)=0.0167531, mflops=298.452 (err=7.6e-16) 14. GSL: elapsed time t=1.80087 s, 16384 iters, t-(init.)=1.7438 s t(norm)=0.0519693, mflops=96.2106 (err=7.8e-16) 15. GSL DIT: elapsed time t=1.03622 s, 8192 iters, t-(init.)=1.00766 s t(norm)=0.0600613, mflops=83.2482 (err=7.7e-16) 16. GSL DIF: elapsed time t=1.93477 s, 16384 iters, t-(init.)=1.87769 s t(norm)=0.0559596, mflops=89.3501 (err=8.3e-16) 17. Krukar: elapsed time t=1.27842 s, 8192 iters, t-(init.)=1.24984 s t(norm)=0.0744965, mflops=67.1172 (err=7.7e-16) 18. Mayer (Buneman): elapsed time t=1.02161 s, 8192 iters, t-(init.)=0.9931 s t(norm)=0.0591934, mflops=84.4689 (err=7.0e-16) 19. Mayer (simple): elapsed time t=1.55288 s, 16384 iters, t-(init.)=1.49588 s t(norm)=0.0445806, mflops=112.156 20. Mayer (lookup): elapsed time t=1.49581 s, 16384 iters, t-(init.)=1.43882 s t(norm)=0.0428802, mflops=116.604 (err=7.1e-16) 21. NAPACK (f2c): elapsed time t=1.82959 s, 4096 iters, t-(init.)=1.81529 s t(norm)=0.216399, mflops=23.1054 (err=3.6e-15) 22. Nielsen: elapsed time t=1.1416 s, 4096 iters, t-(init.)=1.12731 s t(norm)=0.134386, mflops=37.2063 (err=3.4e-15) 23. NR (C): elapsed time t=1.85366 s, 16384 iters, t-(init.)=1.79662 s t(norm)=0.0535436, mflops=93.3818 (err=8.6e-16) 24. Ooura (C): elapsed time t=1.67517 s, 32768 iters, t-(init.)=1.56115 s t(norm)=0.0232629, mflops=214.934 (err=7.9e-16) 25. QFT: elapsed time t=1.14291 s, 8192 iters, t-(init.)=1.11437 s t(norm)=0.0664218, mflops=75.2765 (err=9.5e-16) 26. Ransom: elapsed time t=1.42897 s, 8192 iters, t-(init.)=1.40043 s t(norm)=0.083472, mflops=59.9004 (err=1.7e-15) 27. Singleton (f2c): elapsed time t=1.00797 s, 16384 iters, t-(init.)=0.950985 s t(norm)=0.0283416, mflops=176.419 (err=1.3e-15) 28. Temperton (f2c): elapsed time t=1.5507 s, 4096 iters, t-(init.)=1.53639 s t(norm)=0.183152, mflops=27.2997 (err=7.5e-16) 29. Valkenburg: elapsed time t=1.92713 s, 2048 iters, t-(init.)=1.91999 s t(norm)=0.457762, mflops=10.9227 (err=7.4e-16) Top mflops for N=256 = 315.854 Normalized results and averages for N=256: fft 0: mflops = 107.631 (norm. = 0.340763), norm. avg. (of 8) = 0.373619 fft 1: mflops = 107.755 (norm. = 0.341154), norm. avg. (of 8) = 0.371733 fft 2: mflops = 81.5305 (norm. = 0.258127), norm. avg. (of 8) = 0.223109 fft 3: mflops = 35.3214 (norm. = 0.111828), norm. avg. (of 8) = 0.0663102 fft 4: mflops = 20.1504 (norm. = 0.0637965), norm. avg. (of 8) = 0.0680203 fft 5: mflops = 142.243 (norm. = 0.450346), norm. avg. (of 8) = 0.258291 fft 6: mflops = 141.837 (norm. = 0.449059), norm. avg. (of 8) = 0.264247 fft 7: mflops = 193.087 (norm. = 0.611317), norm. avg. (of 8) = 0.30748 fft 8: mflops = 58.3608 (norm. = 0.184771), norm. avg. (of 7) = 0.153224 fft 9: mflops = 55.9123 (norm. = 0.17702), norm. avg. (of 8) = 0.14282 fft 10: mflops = 315.854 (norm. = 1), norm. avg. (of 8) = 0.876511 fft 11: mflops = 310.362 (norm. = 0.982614), norm. avg. (of 8) = 0.850818 fft 12: mflops = 249.685 (norm. = 0.790507), norm. avg. (of 8) = 0.892386 fft 13: mflops = 298.452 (norm. = 0.944906), norm. avg. (of 6) = 0.668317 fft 14: mflops = 96.2106 (norm. = 0.304605), norm. avg. (of 8) = 0.246466 fft 15: mflops = 83.2482 (norm. = 0.263566), norm. avg. (of 8) = 0.186534 fft 16: mflops = 89.3501 (norm. = 0.282884), norm. avg. (of 8) = 0.190104 fft 17: mflops = 67.1172 (norm. = 0.212495), norm. avg. (of 8) = 0.403784 fft 18: mflops = 84.4689 (norm. = 0.26743), norm. avg. (of 7) = 0.232441 fft 19: mflops = 112.156 (norm. = 0.35509), norm. avg. (of 7) = 0.280657 fft 20: mflops = 116.604 (norm. = 0.36917), norm. avg. (of 7) = 0.283504 fft 21: mflops = 23.1054 (norm. = 0.0731523), norm. avg. (of 8) = 0.0587385 fft 22: mflops = 37.2063 (norm. = 0.117796), norm. avg. (of 8) = 0.084075 fft 23: mflops = 93.3818 (norm. = 0.295649), norm. avg. (of 8) = 0.202487 fft 24: mflops = 214.934 (norm. = 0.680487), norm. avg. (of 8) = 0.561202 fft 25: mflops = 75.2765 (norm. = 0.238327), norm. avg. (of 5) = 0.26435 fft 26: mflops = 59.9004 (norm. = 0.189646), norm. avg. (of 7) = 0.092018 fft 27: mflops = 176.419 (norm. = 0.558547), norm. avg. (of 8) = 0.298956 fft 28: mflops = 27.2997 (norm. = 0.0864314), norm. avg. (of 8) = 0.0705312 fft 29: mflops = 10.9227 (norm. = 0.0345815), norm. avg. (of 8) = 0.0529266 Benchmarking for array size = 512 (power of 2): 0. Arndt DIF: elapsed time t=1.72444 s, 8192 iters, t-(init.)=1.66762 s t(norm)=0.0441769, mflops=113.181 (err=6.7e-16) 1. Arndt DIT: elapsed time t=1.72368 s, 8192 iters, t-(init.)=1.66699 s t(norm)=0.0441601, mflops=113.224 (err=6.2e-16) 2. Arndt Split-Radix: elapsed time t=1.2036 s, 4096 iters, t-(init.)=1.17523 s t(norm)=0.0622658, mflops=80.3009 (err=6.5e-16) 3. Arndt 4-step: elapsed time t=1.27384 s, 2048 iters, t-(init.)=1.25961 s t(norm)=0.133474, mflops=37.4606 (err=6.3e-16) 4. Beauregard: elapsed time t=1.17545 s, 1024 iters, t-(init.)=1.1683 s t(norm)=0.247594, mflops=20.1943 (err=6.8e-16) 5. Bergland: elapsed time t=1.27861 s, 8192 iters, t-(init.)=1.22184 s t(norm)=0.0323677, mflops=154.475 (err=7.2e-16) 6. CWP (min N) (N=520): elapsed time t=1.35086 s, 8192 iters, t-(init.)=1.29329 s t(norm)=0.0342605, mflops=145.94 7. CWP (best N) (N=560): elapsed time t=1.09476 s, 8192 iters, t-(init.)=1.03279 s t(norm)=0.0273595, mflops=182.752 8. Edelblute: elapsed time t=1.62434 s, 4096 iters, t-(init.)=1.59586 s t(norm)=0.0845515, mflops=59.1356 (err=6.2e-16) 9. FFTPACK (f2c): elapsed time t=1.03272 s, 2048 iters, t-(init.)=1.01848 s t(norm)=0.107922, mflops=46.3298 (err=6.4e-16) FFTW_MEASURE plan: (cost = 7.930078e-05) FFTW_TWIDDLE 8 FFTW_NOTW 64 10. FFTW: elapsed time t=1.30149 s, 16384 iters, t-(init.)=1.18806 s t(norm)=0.0157364, mflops=317.734 (err=6.4e-16) FFTW_ESTIMATE plan: (cost = 1.843200e+03) FFTW_TWIDDLE 16 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.39047 s, 16384 iters, t-(init.)=1.27701 s t(norm)=0.0169146, mflops=295.603 (err=6.5e-16) 12. Frigo-old: elapsed time t=1.68231 s, 16384 iters, t-(init.)=1.56888 s t(norm)=0.0207805, mflops=240.61 (err=6.3e-16) 13. Green: elapsed time t=1.28137 s, 16384 iters, t-(init.)=1.16788 s t(norm)=0.0154691, mflops=323.226 (err=6.2e-16) 14. GSL: elapsed time t=1.1323 s, 4096 iters, t-(init.)=1.10389 s t(norm)=0.0584862, mflops=85.4903 (err=6.4e-16) 15. GSL DIT: elapsed time t=1.15344 s, 4096 iters, t-(init.)=1.12504 s t(norm)=0.0596067, mflops=83.8832 (err=9.0e-16) 16. GSL DIF: elapsed time t=1.0398 s, 4096 iters, t-(init.)=1.01144 s t(norm)=0.0535881, mflops=93.3043 (err=7.8e-16) 17. Krukar: elapsed time t=1.36186 s, 4096 iters, t-(init.)=1.33342 s t(norm)=0.0706473, mflops=70.7741 (err=6.9e-16) 18. Mayer (Buneman): elapsed time t=1.0652 s, 4096 iters, t-(init.)=1.03687 s t(norm)=0.0549351, mflops=91.0164 (err=6.5e-16) 19. Mayer (simple): elapsed time t=1.61135 s, 8192 iters, t-(init.)=1.55465 s t(norm)=0.0411842, mflops=121.406 20. Mayer (lookup): elapsed time t=1.5585 s, 8192 iters, t-(init.)=1.50169 s t(norm)=0.0397812, mflops=125.688 (err=6.5e-16) 21. NAPACK (f2c): elapsed time t=1.0361 s, 1024 iters, t-(init.)=1.02897 s t(norm)=0.218068, mflops=22.9286 (err=6.7e-15) 22. Nielsen: elapsed time t=1.15015 s, 2048 iters, t-(init.)=1.13596 s t(norm)=0.120371, mflops=41.5382 (err=3.2e-15) 23. NR (C): elapsed time t=1.02203 s, 4096 iters, t-(init.)=0.993653 s t(norm)=0.0526456, mflops=94.9746 (err=7.1e-16) 24. Ooura (C): elapsed time t=1.88024 s, 16384 iters, t-(init.)=1.7668 s t(norm)=0.0234021, mflops=213.656 (err=6.9e-16) 25. QFT: elapsed time t=1.37044 s, 4096 iters, t-(init.)=1.34198 s t(norm)=0.0711005, mflops=70.323 (err=9.5e-16) 26. Ransom: elapsed time t=1.73437 s, 4096 iters, t-(init.)=1.70601 s t(norm)=0.0903878, mflops=55.3172 (err=1.5e-15) 27. Singleton (f2c): elapsed time t=1.11825 s, 8192 iters, t-(init.)=1.06156 s t(norm)=0.0281217, mflops=177.798 (err=8.4e-16) 28. Temperton (f2c): elapsed time t=1.95023 s, 2048 iters, t-(init.)=1.93607 s t(norm)=0.205153, mflops=24.3721 (err=6.4e-16) 29. Valkenburg: elapsed time t=1.07772 s, 512 iters, t-(init.)=1.07411 s t(norm)=0.455266, mflops=10.9826 (err=7.4e-16) Top mflops for N=512 = 323.226 Normalized results and averages for N=512: fft 0: mflops = 113.181 (norm. = 0.350162), norm. avg. (of 9) = 0.371013 fft 1: mflops = 113.224 (norm. = 0.350295), norm. avg. (of 9) = 0.369351 fft 2: mflops = 80.3009 (norm. = 0.248436), norm. avg. (of 9) = 0.225923 fft 3: mflops = 37.4606 (norm. = 0.115896), norm. avg. (of 9) = 0.0718198 fft 4: mflops = 20.1943 (norm. = 0.0624775), norm. avg. (of 9) = 0.0674044 fft 5: mflops = 154.475 (norm. = 0.477917), norm. avg. (of 9) = 0.282694 fft 6: mflops = 145.94 (norm. = 0.451513), norm. avg. (of 9) = 0.285055 fft 7: mflops = 182.752 (norm. = 0.5654), norm. avg. (of 9) = 0.336138 fft 8: mflops = 59.1356 (norm. = 0.182954), norm. avg. (of 8) = 0.15694 fft 9: mflops = 46.3298 (norm. = 0.143336), norm. avg. (of 9) = 0.142877 fft 10: mflops = 317.734 (norm. = 0.983011), norm. avg. (of 9) = 0.888344 fft 11: mflops = 295.603 (norm. = 0.914542), norm. avg. (of 9) = 0.857898 fft 12: mflops = 240.61 (norm. = 0.744403), norm. avg. (of 9) = 0.875943 fft 13: mflops = 323.226 (norm. = 1), norm. avg. (of 7) = 0.7157 fft 14: mflops = 85.4903 (norm. = 0.264491), norm. avg. (of 9) = 0.248468 fft 15: mflops = 83.8832 (norm. = 0.259519), norm. avg. (of 9) = 0.194643 fft 16: mflops = 93.3043 (norm. = 0.288666), norm. avg. (of 9) = 0.201056 fft 17: mflops = 70.7741 (norm. = 0.218962), norm. avg. (of 9) = 0.383248 fft 18: mflops = 91.0164 (norm. = 0.281588), norm. avg. (of 8) = 0.238584 fft 19: mflops = 121.406 (norm. = 0.375607), norm. avg. (of 8) = 0.292525 fft 20: mflops = 125.688 (norm. = 0.388854), norm. avg. (of 8) = 0.296672 fft 21: mflops = 22.9286 (norm. = 0.0709369), norm. avg. (of 9) = 0.0600939 fft 22: mflops = 41.5382 (norm. = 0.128512), norm. avg. (of 9) = 0.0890124 fft 23: mflops = 94.9746 (norm. = 0.293834), norm. avg. (of 9) = 0.212636 fft 24: mflops = 213.656 (norm. = 0.661011), norm. avg. (of 9) = 0.572292 fft 25: mflops = 70.323 (norm. = 0.217566), norm. avg. (of 6) = 0.256553 fft 26: mflops = 55.3172 (norm. = 0.171141), norm. avg. (of 8) = 0.101908 fft 27: mflops = 177.798 (norm. = 0.550075), norm. avg. (of 9) = 0.326858 fft 28: mflops = 24.3721 (norm. = 0.0754026), norm. avg. (of 9) = 0.0710725 fft 29: mflops = 10.9826 (norm. = 0.0339781), norm. avg. (of 9) = 0.0508212 Benchmarking for array size = 1024 (power of 2): 0. Arndt DIF: elapsed time t=1.81602 s, 4096 iters, t-(init.)=1.75934 s t(norm)=0.0419458, mflops=119.201 (err=1.0e-15) 1. Arndt DIT: elapsed time t=1.82163 s, 4096 iters, t-(init.)=1.76506 s t(norm)=0.0420822, mflops=118.815 (err=1.0e-15) 2. Arndt Split-Radix: elapsed time t=1.25908 s, 2048 iters, t-(init.)=1.23059 s t(norm)=0.0586792, mflops=85.209 (err=1.0e-15) 3. Arndt 4-step: elapsed time t=1.21199 s, 1024 iters, t-(init.)=1.1978 s t(norm)=0.114231, mflops=43.7709 (err=1.0e-15) 4. Beauregard: elapsed time t=1.30831 s, 512 iters, t-(init.)=1.30124 s t(norm)=0.248192, mflops=20.1457 (err=1.1e-15) 5. Bergland: elapsed time t=1.38249 s, 4096 iters, t-(init.)=1.32585 s t(norm)=0.0316107, mflops=158.174 (err=1.1e-15) 6. CWP (min N) (N=1040): elapsed time t=1.50719 s, 4096 iters, t-(init.)=1.44953 s t(norm)=0.0345595, mflops=144.678 7. CWP (best N) (N=1040): elapsed time t=1.50729 s, 4096 iters, t-(init.)=1.44972 s t(norm)=0.034564, mflops=144.659 8. Edelblute: elapsed time t=1.68392 s, 2048 iters, t-(init.)=1.65562 s t(norm)=0.078946, mflops=63.3344 (err=1.0e-15) 9. FFTPACK (f2c): elapsed time t=1.91127 s, 1024 iters, t-(init.)=1.89698 s t(norm)=0.18091, mflops=27.6381 (err=1.1e-15) FFTW_MEASURE plan: (cost = 4.015781e-04) FFTW_TWIDDLE 16 FFTW_NOTW 64 10. FFTW: elapsed time t=1.5994 s, 4096 iters, t-(init.)=1.54256 s t(norm)=0.0367775, mflops=135.953 (err=1.1e-15) FFTW_ESTIMATE plan: (cost = 1.126400e+04) FFTW_TWIDDLE 8 FFTW_TWIDDLE 4 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.61519 s, 4096 iters, t-(init.)=1.55845 s t(norm)=0.0371562, mflops=134.567 (err=1.1e-15) 12. Frigo-old: elapsed time t=1.75717 s, 2048 iters, t-(init.)=1.72875 s t(norm)=0.0824334, mflops=60.655 (err=1.1e-15) 13. Green: elapsed time t=1.37874 s, 8192 iters, t-(init.)=1.26538 s t(norm)=0.0150846, mflops=331.465 (err=1.1e-15) 14. GSL: elapsed time t=1.23908 s, 1024 iters, t-(init.)=1.22484 s t(norm)=0.11681, mflops=42.8045 (err=1.1e-15) 15. GSL DIT: elapsed time t=1.29019 s, 2048 iters, t-(init.)=1.26184 s t(norm)=0.0601692, mflops=83.099 (err=1.3e-15) 16. GSL DIF: elapsed time t=1.12765 s, 2048 iters, t-(init.)=1.09932 s t(norm)=0.0524197, mflops=95.3841 (err=1.4e-15) 17. Krukar: elapsed time t=1.90191 s, 2048 iters, t-(init.)=1.87349 s t(norm)=0.0893349, mflops=55.9692 (err=1.1e-15) 18. Mayer (Buneman): elapsed time t=1.12684 s, 2048 iters, t-(init.)=1.09856 s t(norm)=0.0523835, mflops=95.4499 (err=1.0e-15) 19. Mayer (simple): elapsed time t=1.7248 s, 4096 iters, t-(init.)=1.66813 s t(norm)=0.0397713, mflops=125.719 20. Mayer (lookup): elapsed time t=1.66841 s, 4096 iters, t-(init.)=1.61165 s t(norm)=0.0384248, mflops=130.124 (err=1.0e-15) 21. NAPACK (f2c): elapsed time t=1.30821 s, 512 iters, t-(init.)=1.30102 s t(norm)=0.248149, mflops=20.1491 (err=1.6e-14) 22. Nielsen: elapsed time t=1.57366 s, 1024 iters, t-(init.)=1.55909 s t(norm)=0.148687, mflops=33.6278 (err=7.2e-15) 23. NR (C): elapsed time t=1.13681 s, 2048 iters, t-(init.)=1.10842 s t(norm)=0.0528537, mflops=94.6007 (err=1.2e-15) 24. Ooura (C): elapsed time t=1.9951 s, 8192 iters, t-(init.)=1.88197 s t(norm)=0.0224349, mflops=222.868 (err=1.1e-15) 25. QFT: elapsed time t=1.05458 s, 512 iters, t-(init.)=1.04714 s t(norm)=0.199727, mflops=25.0342 (err=1.4e-15) 26. Ransom: elapsed time t=1.45267 s, 2048 iters, t-(init.)=1.42427 s t(norm)=0.0679146, mflops=73.6218 (err=2.1e-15) 27. Singleton (f2c): elapsed time t=1.14153 s, 4096 iters, t-(init.)=1.08495 s t(norm)=0.0258673, mflops=193.294 (err=1.6e-15) 28. Temperton (f2c): elapsed time t=1.91545 s, 1024 iters, t-(init.)=1.90129 s t(norm)=0.181321, mflops=27.5754 (err=1.1e-15) 29. Valkenburg: elapsed time t=1.41823 s, 256 iters, t-(init.)=1.4147 s t(norm)=0.539665, mflops=9.265 (err=1.1e-15) Top mflops for N=1024 = 331.465 Normalized results and averages for N=1024: fft 0: mflops = 119.201 (norm. = 0.35962), norm. avg. (of 10) = 0.369873 fft 1: mflops = 118.815 (norm. = 0.358454), norm. avg. (of 10) = 0.368262 fft 2: mflops = 85.209 (norm. = 0.257068), norm. avg. (of 10) = 0.229037 fft 3: mflops = 43.7709 (norm. = 0.132053), norm. avg. (of 10) = 0.0778431 fft 4: mflops = 20.1457 (norm. = 0.0607777), norm. avg. (of 10) = 0.0667418 fft 5: mflops = 158.174 (norm. = 0.477197), norm. avg. (of 10) = 0.302144 fft 6: mflops = 144.678 (norm. = 0.436481), norm. avg. (of 10) = 0.300197 fft 7: mflops = 144.659 (norm. = 0.436424), norm. avg. (of 10) = 0.346166 fft 8: mflops = 63.3344 (norm. = 0.191074), norm. avg. (of 9) = 0.160733 fft 9: mflops = 27.6381 (norm. = 0.0833816), norm. avg. (of 10) = 0.136928 fft 10: mflops = 135.953 (norm. = 0.410157), norm. avg. (of 10) = 0.840525 fft 11: mflops = 134.567 (norm. = 0.405976), norm. avg. (of 10) = 0.812706 fft 12: mflops = 60.655 (norm. = 0.182991), norm. avg. (of 10) = 0.806648 fft 13: mflops = 331.465 (norm. = 1), norm. avg. (of 8) = 0.751238 fft 14: mflops = 42.8045 (norm. = 0.129137), norm. avg. (of 10) = 0.236535 fft 15: mflops = 83.099 (norm. = 0.250702), norm. avg. (of 10) = 0.200249 fft 16: mflops = 95.3841 (norm. = 0.287765), norm. avg. (of 10) = 0.209727 fft 17: mflops = 55.9692 (norm. = 0.168854), norm. avg. (of 10) = 0.361808 fft 18: mflops = 95.4499 (norm. = 0.287964), norm. avg. (of 9) = 0.244071 fft 19: mflops = 125.719 (norm. = 0.379282), norm. avg. (of 9) = 0.302165 fft 20: mflops = 130.124 (norm. = 0.392573), norm. avg. (of 9) = 0.307328 fft 21: mflops = 20.1491 (norm. = 0.0607882), norm. avg. (of 10) = 0.0601633 fft 22: mflops = 33.6278 (norm. = 0.101452), norm. avg. (of 10) = 0.0902563 fft 23: mflops = 94.6007 (norm. = 0.285402), norm. avg. (of 10) = 0.219913 fft 24: mflops = 222.868 (norm. = 0.672371), norm. avg. (of 10) = 0.5823 fft 25: mflops = 25.0342 (norm. = 0.075526), norm. avg. (of 7) = 0.230692 fft 26: mflops = 73.6218 (norm. = 0.222111), norm. avg. (of 9) = 0.115264 fft 27: mflops = 193.294 (norm. = 0.583152), norm. avg. (of 10) = 0.352487 fft 28: mflops = 27.5754 (norm. = 0.0831926), norm. avg. (of 10) = 0.0722845 fft 29: mflops = 9.265 (norm. = 0.0279517), norm. avg. (of 10) = 0.0485342 Benchmarking for array size = 2048 (power of 2): 0. Arndt DIF: elapsed time t=1.05783 s, 1024 iters, t-(init.)=1.02769 s t(norm)=0.0445493, mflops=112.235 (err=1.4e-15) 1. Arndt DIT: elapsed time t=1.08902 s, 1024 iters, t-(init.)=1.0588 s t(norm)=0.0458977, mflops=108.938 (err=1.4e-15) 2. Arndt Split-Radix: elapsed time t=1.56031 s, 1024 iters, t-(init.)=1.52978 s t(norm)=0.0663144, mflops=75.3985 (err=1.4e-15) 3. Arndt 4-step: elapsed time t=1.53224 s, 512 iters, t-(init.)=1.51689 s t(norm)=0.131511, mflops=38.0197 (err=1.4e-15) 4. Beauregard: elapsed time t=1.50049 s, 256 iters, t-(init.)=1.49311 s t(norm)=0.258898, mflops=19.3126 (err=1.5e-15) 5. Bergland: elapsed time t=1.71333 s, 2048 iters, t-(init.)=1.65276 s t(norm)=0.0358226, mflops=139.577 (err=1.5e-15) 6. CWP (min N) (N=2145): elapsed time t=1.20577 s, 1024 iters, t-(init.)=1.12815 s t(norm)=0.048904, mflops=102.241 7. CWP (best N) (N=2184): elapsed time t=1.09243 s, 1024 iters, t-(init.)=0.996309 s t(norm)=0.0431888, mflops=115.771 8. Edelblute: elapsed time t=1.99185 s, 1024 iters, t-(init.)=1.96133 s t(norm)=0.0850212, mflops=58.8088 (err=1.4e-15) 9. FFTPACK (f2c): elapsed time t=1.84189 s, 256 iters, t-(init.)=1.83379 s t(norm)=0.317971, mflops=15.7247 (err=1.5e-15) FFTW_MEASURE plan: (cost = 1.111844e-03) FFTW_TWIDDLE 32 FFTW_NOTW 64 10. FFTW: elapsed time t=1.3544 s, 512 iters, t-(init.)=1.33869 s t(norm)=0.116062, mflops=43.0806 (err=1.5e-15) FFTW_ESTIMATE plan: (cost = 1.269760e+04) FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.36131 s, 512 iters, t-(init.)=1.34562 s t(norm)=0.116662, mflops=42.8589 (err=1.5e-15) 12. Frigo-old: elapsed time t=1.09922 s, 256 iters, t-(init.)=1.09128 s t(norm)=0.189223, mflops=26.4238 (err=1.5e-15) 13. Green: elapsed time t=1.68733 s, 2048 iters, t-(init.)=1.62667 s t(norm)=0.0352571, mflops=141.815 (err=1.5e-15) 14. GSL: elapsed time t=1.4773 s, 256 iters, t-(init.)=1.46944 s t(norm)=0.254794, mflops=19.6237 (err=1.5e-15) 15. GSL DIT: elapsed time t=1.59107 s, 1024 iters, t-(init.)=1.56071 s t(norm)=0.067655, mflops=73.9043 (err=2.1e-15) 16. GSL DIF: elapsed time t=1.37796 s, 1024 iters, t-(init.)=1.34768 s t(norm)=0.0584203, mflops=85.5867 (err=2.2e-15) 17. Krukar: elapsed time t=1.24415 s, 512 iters, t-(init.)=1.22883 s t(norm)=0.106536, mflops=46.9324 (err=1.5e-15) 18. Mayer (Buneman): elapsed time t=1.24664 s, 1024 iters, t-(init.)=1.21623 s t(norm)=0.0527222, mflops=94.8366 (err=1.4e-15) 19. Mayer (simple): elapsed time t=1.91962 s, 2048 iters, t-(init.)=1.85891 s t(norm)=0.0402907, mflops=124.098 20. Mayer (lookup): elapsed time t=1.25257 s, 1024 iters, t-(init.)=1.22209 s t(norm)=0.0529762, mflops=94.3821 (err=1.4e-15) 21. NAPACK (f2c): elapsed time t=1.33021 s, 64 iters, t-(init.)=1.32818 s t(norm)=0.921203, mflops=5.42769 (err=1.5e-14) 22. Nielsen: elapsed time t=1.30163 s, 256 iters, t-(init.)=1.29312 s t(norm)=0.224222, mflops=22.2994 (err=1.2e-14) 23. NR (C): elapsed time t=1.42834 s, 1024 iters, t-(init.)=1.39809 s t(norm)=0.0606054, mflops=82.5008 (err=1.6e-15) 24. Ooura (C): elapsed time t=1.10506 s, 1024 iters, t-(init.)=1.07477 s t(norm)=0.0465899, mflops=107.319 (err=1.4e-15) 25. QFT: elapsed time t=1.24577 s, 256 iters, t-(init.)=1.2376 s t(norm)=0.214594, mflops=23.2998 (err=1.9e-15) 26. Ransom: elapsed time t=1.0598 s, 512 iters, t-(init.)=1.04453 s t(norm)=0.090558, mflops=55.2133 (err=2.6e-15) 27. Singleton (f2c): elapsed time t=1.68156 s, 2048 iters, t-(init.)=1.62103 s t(norm)=0.0351349, mflops=142.309 (err=2.0e-15) 28. Temperton (f2c): elapsed time t=1.4404 s, 256 iters, t-(init.)=1.43288 s t(norm)=0.248454, mflops=20.1244 (err=1.5e-15) 29. Valkenburg: elapsed time t=1.16633 s, 64 iters, t-(init.)=1.16433 s t(norm)=0.807558, mflops=6.1915 (err=1.5e-15) Top mflops for N=2048 = 142.309 Normalized results and averages for N=2048: fft 0: mflops = 112.235 (norm. = 0.788674), norm. avg. (of 11) = 0.407946 fft 1: mflops = 108.938 (norm. = 0.765504), norm. avg. (of 11) = 0.404375 fft 2: mflops = 75.3985 (norm. = 0.529823), norm. avg. (of 11) = 0.256382 fft 3: mflops = 38.0197 (norm. = 0.267163), norm. avg. (of 11) = 0.095054 fft 4: mflops = 19.3126 (norm. = 0.135709), norm. avg. (of 11) = 0.0730115 fft 5: mflops = 139.577 (norm. = 0.980801), norm. avg. (of 11) = 0.36384 fft 6: mflops = 102.241 (norm. = 0.718445), norm. avg. (of 11) = 0.33822 fft 7: mflops = 115.771 (norm. = 0.813517), norm. avg. (of 11) = 0.388653 fft 8: mflops = 58.8088 (norm. = 0.413248), norm. avg. (of 10) = 0.185984 fft 9: mflops = 15.7247 (norm. = 0.110497), norm. avg. (of 11) = 0.134525 fft 10: mflops = 43.0806 (norm. = 0.302726), norm. avg. (of 11) = 0.791634 fft 11: mflops = 42.8589 (norm. = 0.301169), norm. avg. (of 11) = 0.766203 fft 12: mflops = 26.4238 (norm. = 0.18568), norm. avg. (of 11) = 0.750196 fft 13: mflops = 141.815 (norm. = 0.996532), norm. avg. (of 9) = 0.778493 fft 14: mflops = 19.6237 (norm. = 0.137895), norm. avg. (of 11) = 0.227568 fft 15: mflops = 73.9043 (norm. = 0.519324), norm. avg. (of 11) = 0.229256 fft 16: mflops = 85.5867 (norm. = 0.601416), norm. avg. (of 11) = 0.245335 fft 17: mflops = 46.9324 (norm. = 0.329792), norm. avg. (of 11) = 0.358898 fft 18: mflops = 94.8366 (norm. = 0.666414), norm. avg. (of 10) = 0.286305 fft 19: mflops = 124.098 (norm. = 0.872034), norm. avg. (of 10) = 0.359152 fft 20: mflops = 94.3821 (norm. = 0.66322), norm. avg. (of 10) = 0.342917 fft 21: mflops = 5.42769 (norm. = 0.0381402), norm. avg. (of 11) = 0.0581612 fft 22: mflops = 22.2994 (norm. = 0.156697), norm. avg. (of 11) = 0.0962964 fft 23: mflops = 82.5008 (norm. = 0.579731), norm. avg. (of 11) = 0.252624 fft 24: mflops = 107.319 (norm. = 0.75413), norm. avg. (of 11) = 0.597921 fft 25: mflops = 23.2998 (norm. = 0.163727), norm. avg. (of 8) = 0.222321 fft 26: mflops = 55.2133 (norm. = 0.387982), norm. avg. (of 10) = 0.142536 fft 27: mflops = 142.309 (norm. = 1), norm. avg. (of 11) = 0.411352 fft 28: mflops = 20.1244 (norm. = 0.141414), norm. avg. (of 11) = 0.078569 fft 29: mflops = 6.1915 (norm. = 0.0435075), norm. avg. (of 11) = 0.0480773 Benchmarking for array size = 4096 (power of 2): 0. Arndt DIF: elapsed time t=1.4379 s, 128 iters, t-(init.)=1.38207 s t(norm)=0.219674, mflops=22.761 (err=2.5e-15) 1. Arndt DIT: elapsed time t=1.51944 s, 128 iters, t-(init.)=1.46404 s t(norm)=0.232702, mflops=21.4867 (err=2.5e-15) 2. Arndt Split-Radix: elapsed time t=1.80081 s, 128 iters, t-(init.)=1.74515 s t(norm)=0.277383, mflops=18.0256 (err=2.5e-15) 3. Arndt 4-step: elapsed time t=1.14121 s, 128 iters, t-(init.)=1.08546 s t(norm)=0.172529, mflops=28.9807 (err=2.5e-15) 4. Beauregard: elapsed time t=1.994 s, 128 iters, t-(init.)=1.93826 s t(norm)=0.308078, mflops=16.2296 (err=2.6e-15) 5. Bergland: elapsed time t=1.31886 s, 256 iters, t-(init.)=1.20813 s t(norm)=0.0960139, mflops=52.0758 (err=2.5e-15) 6. CWP (min N) (N=4290): elapsed time t=1.96875 s, 512 iters, t-(init.)=1.73638 s t(norm)=0.0689974, mflops=72.4665 7. CWP (best N) (N=4368): elapsed time t=1.68496 s, 512 iters, t-(init.)=1.44848 s t(norm)=0.0575574, mflops=86.8698 8. Edelblute: elapsed time t=1.89028 s, 128 iters, t-(init.)=1.83489 s t(norm)=0.291647, mflops=17.144 (err=2.5e-15) 9. FFTPACK (f2c): elapsed time t=1.90091 s, 128 iters, t-(init.)=1.84502 s t(norm)=0.293259, mflops=17.0498 (err=2.6e-15) FFTW_MEASURE plan: (cost = 2.563375e-03) FFTW_TWIDDLE 4 FFTW_TWIDDLE 16 FFTW_NOTW 64 10. FFTW: elapsed time t=1.5073 s, 256 iters, t-(init.)=1.39542 s t(norm)=0.110898, mflops=45.0866 (err=2.6e-15) FFTW_ESTIMATE plan: (cost = 2.539520e+04) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.59947 s, 256 iters, t-(init.)=1.48778 s t(norm)=0.118238, mflops=42.2877 (err=2.6e-15) 12. Frigo-old: elapsed time t=1.20145 s, 128 iters, t-(init.)=1.14609 s t(norm)=0.182166, mflops=27.4475 (err=2.6e-15) 13. Green: elapsed time t=1.26866 s, 256 iters, t-(init.)=1.15741 s t(norm)=0.0919829, mflops=54.3579 (err=2.6e-15) 14. GSL: elapsed time t=1.44447 s, 128 iters, t-(init.)=1.38902 s t(norm)=0.220779, mflops=22.6471 (err=2.6e-15) 15. GSL DIT: elapsed time t=1.4498 s, 128 iters, t-(init.)=1.39423 s t(norm)=0.221608, mflops=22.5624 (err=3.0e-15) 16. GSL DIF: elapsed time t=1.43601 s, 128 iters, t-(init.)=1.38084 s t(norm)=0.219479, mflops=22.7812 (err=3.1e-15) 17. Krukar: elapsed time t=1.58406 s, 256 iters, t-(init.)=1.47296 s t(norm)=0.11706, mflops=42.713 (err=2.6e-15) 18. Mayer (Buneman): elapsed time t=1.72403 s, 512 iters, t-(init.)=1.50237 s t(norm)=0.0596989, mflops=83.7536 (err=2.5e-15) 19. Mayer (simple): elapsed time t=1.38075 s, 512 iters, t-(init.)=1.15907 s t(norm)=0.0460573, mflops=108.561 20. Mayer (lookup): elapsed time t=1.21896 s, 256 iters, t-(init.)=1.10869 s t(norm)=0.0881104, mflops=56.747 (err=2.5e-15) 21. NAPACK (f2c): elapsed time t=1.39443 s, 32 iters, t-(init.)=1.37976 s t(norm)=0.877229, mflops=5.69977 (err=4.7e-14) 22. Nielsen: elapsed time t=1.84809 s, 128 iters, t-(init.)=1.79133 s t(norm)=0.284725, mflops=17.5608 (err=2.2e-14) 23. NR (C): elapsed time t=1.42647 s, 128 iters, t-(init.)=1.37085 s t(norm)=0.217891, mflops=22.9473 (err=2.6e-15) 24. Ooura (C): elapsed time t=1.17554 s, 256 iters, t-(init.)=1.06455 s t(norm)=0.084603, mflops=59.0996 (err=2.5e-15) 25. QFT: elapsed time t=1.50038 s, 128 iters, t-(init.)=1.44357 s t(norm)=0.229449, mflops=21.7914 (err=3.1e-15) 26. Ransom: elapsed time t=1.27631 s, 256 iters, t-(init.)=1.16555 s t(norm)=0.0926296, mflops=53.9784 (err=3.1e-15) 27. Singleton (f2c): elapsed time t=1.50264 s, 256 iters, t-(init.)=1.39174 s t(norm)=0.110605, mflops=45.2057 (err=3.8e-15) 28. Temperton (f2c): elapsed time t=1.70124 s, 128 iters, t-(init.)=1.64566 s t(norm)=0.26157, mflops=19.1153 (err=2.6e-15) 29. Valkenburg: elapsed time t=1.39236 s, 32 iters, t-(init.)=1.37841 s t(norm)=0.87637, mflops=5.70535 (err=2.5e-15) Top mflops for N=4096 = 108.561 Normalized results and averages for N=4096: fft 0: mflops = 22.761 (norm. = 0.209662), norm. avg. (of 12) = 0.391422 fft 1: mflops = 21.4867 (norm. = 0.197924), norm. avg. (of 12) = 0.38717 fft 2: mflops = 18.0256 (norm. = 0.166042), norm. avg. (of 12) = 0.248853 fft 3: mflops = 28.9807 (norm. = 0.266954), norm. avg. (of 12) = 0.109379 fft 4: mflops = 16.2296 (norm. = 0.149498), norm. avg. (of 12) = 0.0793855 fft 5: mflops = 52.0758 (norm. = 0.479694), norm. avg. (of 12) = 0.373495 fft 6: mflops = 72.4665 (norm. = 0.667522), norm. avg. (of 12) = 0.365662 fft 7: mflops = 86.8698 (norm. = 0.800197), norm. avg. (of 12) = 0.422948 fft 8: mflops = 17.144 (norm. = 0.157921), norm. avg. (of 11) = 0.183433 fft 9: mflops = 17.0498 (norm. = 0.157053), norm. avg. (of 12) = 0.136402 fft 10: mflops = 45.0866 (norm. = 0.415313), norm. avg. (of 12) = 0.760274 fft 11: mflops = 42.2877 (norm. = 0.389531), norm. avg. (of 12) = 0.734813 fft 12: mflops = 27.4475 (norm. = 0.252832), norm. avg. (of 12) = 0.708749 fft 13: mflops = 54.3579 (norm. = 0.500715), norm. avg. (of 10) = 0.750715 fft 14: mflops = 22.6471 (norm. = 0.208612), norm. avg. (of 12) = 0.225988 fft 15: mflops = 22.5624 (norm. = 0.207833), norm. avg. (of 12) = 0.227471 fft 16: mflops = 22.7812 (norm. = 0.209848), norm. avg. (of 12) = 0.242378 fft 17: mflops = 42.713 (norm. = 0.393449), norm. avg. (of 12) = 0.361777 fft 18: mflops = 83.7536 (norm. = 0.771492), norm. avg. (of 11) = 0.330413 fft 19: mflops = 108.561 (norm. = 1), norm. avg. (of 11) = 0.417411 fft 20: mflops = 56.747 (norm. = 0.522722), norm. avg. (of 11) = 0.359263 fft 21: mflops = 5.69977 (norm. = 0.0525031), norm. avg. (of 12) = 0.0576897 fft 22: mflops = 17.5608 (norm. = 0.161761), norm. avg. (of 12) = 0.101752 fft 23: mflops = 22.9473 (norm. = 0.211378), norm. avg. (of 12) = 0.249186 fft 24: mflops = 59.0996 (norm. = 0.544393), norm. avg. (of 12) = 0.59346 fft 25: mflops = 21.7914 (norm. = 0.20073), norm. avg. (of 9) = 0.219922 fft 26: mflops = 53.9784 (norm. = 0.49722), norm. avg. (of 11) = 0.17478 fft 27: mflops = 45.2057 (norm. = 0.41641), norm. avg. (of 12) = 0.411774 fft 28: mflops = 19.1153 (norm. = 0.17608), norm. avg. (of 12) = 0.0866949 fft 29: mflops = 5.70535 (norm. = 0.0525546), norm. avg. (of 12) = 0.0484504 Benchmarking for array size = 8192 (power of 2): 0. Arndt DIF: elapsed time t=1.45907 s, 64 iters, t-(init.)=1.40309 s t(norm)=0.205861, mflops=24.2882 (err=3.0e-15) 1. Arndt DIT: elapsed time t=1.54525 s, 64 iters, t-(init.)=1.48952 s t(norm)=0.218542, mflops=22.8789 (err=3.0e-15) 2. Arndt Split-Radix: elapsed time t=1.03777 s, 32 iters, t-(init.)=1.01004 s t(norm)=0.296383, mflops=16.8701 (err=3.0e-15) 3. Arndt 4-step: elapsed time t=1.4618 s, 64 iters, t-(init.)=1.40634 s t(norm)=0.206337, mflops=24.2322 (err=2.9e-15) 4. Beauregard: elapsed time t=1.08554 s, 32 iters, t-(init.)=1.05746 s t(norm)=0.310298, mflops=16.1135 (err=2.9e-15) 5. Bergland: elapsed time t=1.5606 s, 128 iters, t-(init.)=1.44927 s t(norm)=0.106318, mflops=47.0286 (err=2.9e-15) 6. CWP (min N) (N=8580): elapsed time t=1.04581 s, 128 iters, t-(init.)=0.929636 s t(norm)=0.0681977, mflops=73.3163 7. CWP (best N) (N=9240): elapsed time t=1.95084 s, 256 iters, t-(init.)=1.70044 s t(norm)=0.0623718, mflops=80.1645 8. Edelblute: elapsed time t=1.07825 s, 32 iters, t-(init.)=1.05036 s t(norm)=0.308215, mflops=16.2224 (err=3.0e-15) 9. FFTPACK (f2c): elapsed time t=1.20721 s, 32 iters, t-(init.)=1.17934 s t(norm)=0.346062, mflops=14.4483 (err=2.9e-15) FFTW_MEASURE plan: (cost = 6.030625e-03) FFTW_TWIDDLE 4 FFTW_TWIDDLE 16 FFTW_TWIDDLE 4 FFTW_NOTW 32 10. FFTW: elapsed time t=1.61597 s, 128 iters, t-(init.)=1.50264 s t(norm)=0.110233, mflops=45.3585 (err=2.9e-15) FFTW_ESTIMATE plan: (cost = 5.079040e+04) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.61698 s, 128 iters, t-(init.)=1.50411 s t(norm)=0.11034, mflops=45.3143 (err=2.9e-15) 12. Frigo-old: elapsed time t=1.22837 s, 64 iters, t-(init.)=1.17256 s t(norm)=0.172037, mflops=29.0635 (err=2.9e-15) 13. Green: elapsed time t=1.39134 s, 128 iters, t-(init.)=1.27991 s t(norm)=0.093894, mflops=53.2515 (err=2.9e-15) 14. GSL: elapsed time t=1.98796 s, 64 iters, t-(init.)=1.93212 s t(norm)=0.283479, mflops=17.638 (err=2.9e-15) 15. GSL DIT: elapsed time t=1.58504 s, 64 iters, t-(init.)=1.52954 s t(norm)=0.224412, mflops=22.2804 (err=3.6e-15) 16. GSL DIF: elapsed time t=1.56841 s, 64 iters, t-(init.)=1.51271 s t(norm)=0.221943, mflops=22.5283 (err=3.6e-15) 17. Skipping fft (Krukar can't handle N > 4096). 18. Mayer (Buneman): elapsed time t=1.38305 s, 64 iters, t-(init.)=1.32776 s t(norm)=0.194808, mflops=25.6663 (err=2.9e-15) 19. Mayer (simple): elapsed time t=1.31214 s, 64 iters, t-(init.)=1.25667 s t(norm)=0.184377, mflops=27.1183 20. Mayer (lookup): elapsed time t=1.5482 s, 64 iters, t-(init.)=1.49257 s t(norm)=0.218989, mflops=22.8322 (err=3.0e-15) 21. NAPACK (f2c): elapsed time t=1.55661 s, 16 iters, t-(init.)=1.54254 s t(norm)=0.90528, mflops=5.52315 (err=4.3e-14) 22. Nielsen: elapsed time t=1.07139 s, 32 iters, t-(init.)=1.04159 s t(norm)=0.305641, mflops=16.359 (err=1.1e-14) 23. NR (C): elapsed time t=1.55942 s, 64 iters, t-(init.)=1.50388 s t(norm)=0.220648, mflops=22.6605 (err=3.0e-15) 24. Ooura (C): elapsed time t=1.30803 s, 128 iters, t-(init.)=1.19663 s t(norm)=0.087784, mflops=56.958 (err=2.9e-15) 25. QFT: elapsed time t=1.79928 s, 64 iters, t-(init.)=1.74142 s t(norm)=0.2555, mflops=19.5695 (err=4.0e-15) 26. Ransom: elapsed time t=1.64102 s, 128 iters, t-(init.)=1.52948 s t(norm)=0.112202, mflops=44.5624 (err=4.1e-15) 27. Singleton (f2c): elapsed time t=1.76322 s, 128 iters, t-(init.)=1.6523 s t(norm)=0.121212, mflops=41.25 (err=4.4e-15) 28. Temperton (f2c): elapsed time t=1.96627 s, 64 iters, t-(init.)=1.91092 s t(norm)=0.280369, mflops=17.8337 (err=2.9e-15) 29. Valkenburg: elapsed time t=1.5859 s, 16 iters, t-(init.)=1.57193 s t(norm)=0.922527, mflops=5.4199 (err=2.9e-15) Top mflops for N=8192 = 80.1645 Normalized results and averages for N=8192: fft 0: mflops = 24.2882 (norm. = 0.30298), norm. avg. (of 13) = 0.384619 fft 1: mflops = 22.8789 (norm. = 0.2854), norm. avg. (of 13) = 0.379342 fft 2: mflops = 16.8701 (norm. = 0.210443), norm. avg. (of 13) = 0.245899 fft 3: mflops = 24.2322 (norm. = 0.30228), norm. avg. (of 13) = 0.124218 fft 4: mflops = 16.1135 (norm. = 0.201006), norm. avg. (of 13) = 0.0887409 fft 5: mflops = 47.0286 (norm. = 0.586652), norm. avg. (of 13) = 0.389892 fft 6: mflops = 73.3163 (norm. = 0.914573), norm. avg. (of 13) = 0.407886 fft 7: mflops = 80.1645 (norm. = 1), norm. avg. (of 13) = 0.467337 fft 8: mflops = 16.2224 (norm. = 0.202364), norm. avg. (of 12) = 0.185011 fft 9: mflops = 14.4483 (norm. = 0.180233), norm. avg. (of 13) = 0.139774 fft 10: mflops = 45.3585 (norm. = 0.565818), norm. avg. (of 13) = 0.745316 fft 11: mflops = 45.3143 (norm. = 0.565266), norm. avg. (of 13) = 0.721771 fft 12: mflops = 29.0635 (norm. = 0.362548), norm. avg. (of 13) = 0.682118 fft 13: mflops = 53.2515 (norm. = 0.664278), norm. avg. (of 11) = 0.742857 fft 14: mflops = 17.638 (norm. = 0.220022), norm. avg. (of 13) = 0.225529 fft 15: mflops = 22.2804 (norm. = 0.277934), norm. avg. (of 13) = 0.231352 fft 16: mflops = 22.5283 (norm. = 0.281026), norm. avg. (of 13) = 0.24535 fft 17: mflops = -1 (norm. = -0.0124744), norm. avg. (of 12) = 0.361777 fft 18: mflops = 25.6663 (norm. = 0.32017), norm. avg. (of 12) = 0.329559 fft 19: mflops = 27.1183 (norm. = 0.338283), norm. avg. (of 12) = 0.410817 fft 20: mflops = 22.8322 (norm. = 0.284817), norm. avg. (of 12) = 0.353059 fft 21: mflops = 5.52315 (norm. = 0.0688978), norm. avg. (of 13) = 0.0585518 fft 22: mflops = 16.359 (norm. = 0.204068), norm. avg. (of 13) = 0.109622 fft 23: mflops = 22.6605 (norm. = 0.282675), norm. avg. (of 13) = 0.251762 fft 24: mflops = 56.958 (norm. = 0.710514), norm. avg. (of 13) = 0.602464 fft 25: mflops = 19.5695 (norm. = 0.244116), norm. avg. (of 10) = 0.222342 fft 26: mflops = 44.5624 (norm. = 0.555887), norm. avg. (of 12) = 0.206539 fft 27: mflops = 41.25 (norm. = 0.514567), norm. avg. (of 13) = 0.419681 fft 28: mflops = 17.8337 (norm. = 0.222463), norm. avg. (of 13) = 0.0971386 fft 29: mflops = 5.4199 (norm. = 0.0676097), norm. avg. (of 13) = 0.0499242 Benchmarking for array size = 16384 (power of 2): 0. Arndt DIF: elapsed time t=1.73714 s, 32 iters, t-(init.)=1.68163 s t(norm)=0.229105, mflops=21.8241 (err=5.6e-15) 1. Arndt DIT: elapsed time t=1.83619 s, 32 iters, t-(init.)=1.7808 s t(norm)=0.242615, mflops=20.6088 (err=5.6e-15) 2. Arndt Split-Radix: elapsed time t=1.12235 s, 16 iters, t-(init.)=1.09417 s t(norm)=0.298137, mflops=16.7708 (err=5.6e-15) 3. Arndt 4-step: elapsed time t=1.24272 s, 32 iters, t-(init.)=1.18703 s t(norm)=0.16172, mflops=30.9176 (err=5.6e-15) 4. Beauregard: elapsed time t=1.16676 s, 16 iters, t-(init.)=1.13884 s t(norm)=0.31031, mflops=16.1129 (err=5.7e-15) 5. Bergland: elapsed time t=1.56566 s, 64 iters, t-(init.)=1.45447 s t(norm)=0.0990782, mflops=50.4652 (err=5.7e-15) 6. CWP (min N) (N=17160): elapsed time t=1.04158 s, 64 iters, t-(init.)=0.924754 s t(norm)=0.0629939, mflops=79.3728 7. CWP (best N) (N=17160): elapsed time t=1.04237 s, 64 iters, t-(init.)=0.92549 s t(norm)=0.063044, mflops=79.3097 8. Edelblute: elapsed time t=1.1628 s, 16 iters, t-(init.)=1.13504 s t(norm)=0.309273, mflops=16.1669 (err=5.6e-15) 9. FFTPACK (f2c): elapsed time t=1.2515 s, 16 iters, t-(init.)=1.2237 s t(norm)=0.333433, mflops=14.9955 (err=5.7e-15) FFTW_MEASURE plan: (cost = 1.456450e-02) FFTW_TWIDDLE 64 FFTW_TWIDDLE 4 FFTW_NOTW 64 10. FFTW: elapsed time t=1.80683 s, 64 iters, t-(init.)=1.69144 s t(norm)=0.11522, mflops=43.3952 (err=5.7e-15) FFTW_ESTIMATE plan: (cost = 1.441792e+05) FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.70901 s, 64 iters, t-(init.)=1.5933 s t(norm)=0.108535, mflops=46.0682 (err=5.7e-15) 12. Frigo-old: elapsed time t=1.40622 s, 32 iters, t-(init.)=1.35062 s t(norm)=0.184008, mflops=27.1727 (err=5.7e-15) 13. Green: elapsed time t=1.52941 s, 64 iters, t-(init.)=1.41802 s t(norm)=0.0965949, mflops=51.7625 (err=5.7e-15) 14. GSL: elapsed time t=1.96114 s, 32 iters, t-(init.)=1.90558 s t(norm)=0.259614, mflops=19.2594 (err=5.7e-15) 15. GSL DIT: elapsed time t=1.69964 s, 32 iters, t-(init.)=1.6436 s t(norm)=0.223923, mflops=22.3291 (err=6.3e-15) 16. GSL DIF: elapsed time t=1.68398 s, 32 iters, t-(init.)=1.62842 s t(norm)=0.221855, mflops=22.5372 (err=6.4e-15) 17. Skipping fft (Krukar can't handle N > 4096). 18. Mayer (Buneman): elapsed time t=1.52977 s, 32 iters, t-(init.)=1.47387 s t(norm)=0.200799, mflops=24.9005 (err=5.6e-15) 19. Mayer (simple): elapsed time t=1.46238 s, 32 iters, t-(init.)=1.40699 s t(norm)=0.191688, mflops=26.0841 20. Mayer (lookup): elapsed time t=1.6859 s, 32 iters, t-(init.)=1.63034 s t(norm)=0.222117, mflops=22.5107 (err=5.6e-15) 21. NAPACK (f2c): elapsed time t=1.62402 s, 8 iters, t-(init.)=1.60568 s t(norm)=0.875028, mflops=5.7141 (err=2.3e-13) 22. Nielsen: elapsed time t=1.07973 s, 16 iters, t-(init.)=1.04752 s t(norm)=0.285425, mflops=17.5177 (err=1.3e-13) 23. NR (C): elapsed time t=1.68063 s, 32 iters, t-(init.)=1.62514 s t(norm)=0.221408, mflops=22.5828 (err=5.6e-15) 24. Ooura (C): elapsed time t=1.32549 s, 64 iters, t-(init.)=1.21435 s t(norm)=0.082721, mflops=60.4442 (err=5.7e-15) 25. QFT: elapsed time t=1.0639 s, 16 iters, t-(init.)=1.03148 s t(norm)=0.281057, mflops=17.79 (err=7.0e-15) 26. Ransom: elapsed time t=1.3011 s, 64 iters, t-(init.)=1.19015 s t(norm)=0.0810727, mflops=61.6731 (err=6.0e-15) 27. Singleton (f2c): elapsed time t=1.74913 s, 64 iters, t-(init.)=1.63786 s t(norm)=0.11157, mflops=44.8147 (err=8.5e-15) 28. Temperton (f2c): elapsed time t=1.97089 s, 32 iters, t-(init.)=1.91526 s t(norm)=0.260934, mflops=19.1619 (err=5.7e-15) 29. Valkenburg: elapsed time t=1.78215 s, 8 iters, t-(init.)=1.76809 s t(norm)=0.963534, mflops=5.18923 (err=5.7e-15) Top mflops for N=16384 = 79.3728 Normalized results and averages for N=16384: fft 0: mflops = 21.8241 (norm. = 0.274957), norm. avg. (of 14) = 0.376786 fft 1: mflops = 20.6088 (norm. = 0.259646), norm. avg. (of 14) = 0.370792 fft 2: mflops = 16.7708 (norm. = 0.211292), norm. avg. (of 14) = 0.243427 fft 3: mflops = 30.9176 (norm. = 0.389523), norm. avg. (of 14) = 0.143168 fft 4: mflops = 16.1129 (norm. = 0.203003), norm. avg. (of 14) = 0.0969025 fft 5: mflops = 50.4652 (norm. = 0.6358), norm. avg. (of 14) = 0.407456 fft 6: mflops = 79.3728 (norm. = 1), norm. avg. (of 14) = 0.450179 fft 7: mflops = 79.3097 (norm. = 0.999205), norm. avg. (of 14) = 0.505327 fft 8: mflops = 16.1669 (norm. = 0.203684), norm. avg. (of 13) = 0.186447 fft 9: mflops = 14.9955 (norm. = 0.188925), norm. avg. (of 14) = 0.143285 fft 10: mflops = 43.3952 (norm. = 0.546727), norm. avg. (of 14) = 0.731131 fft 11: mflops = 46.0682 (norm. = 0.580403), norm. avg. (of 14) = 0.711674 fft 12: mflops = 27.1727 (norm. = 0.342343), norm. avg. (of 14) = 0.657849 fft 13: mflops = 51.7625 (norm. = 0.652145), norm. avg. (of 12) = 0.735298 fft 14: mflops = 19.2594 (norm. = 0.242644), norm. avg. (of 14) = 0.226752 fft 15: mflops = 22.3291 (norm. = 0.28132), norm. avg. (of 14) = 0.234922 fft 16: mflops = 22.5372 (norm. = 0.283941), norm. avg. (of 14) = 0.248107 fft 17: mflops = -1 (norm. = -0.0125988), norm. avg. (of 12) = 0.361777 fft 18: mflops = 24.9005 (norm. = 0.313716), norm. avg. (of 13) = 0.328341 fft 19: mflops = 26.0841 (norm. = 0.328628), norm. avg. (of 13) = 0.404495 fft 20: mflops = 22.5107 (norm. = 0.283607), norm. avg. (of 13) = 0.347717 fft 21: mflops = 5.7141 (norm. = 0.0719907), norm. avg. (of 14) = 0.0595118 fft 22: mflops = 17.5177 (norm. = 0.220702), norm. avg. (of 14) = 0.117557 fft 23: mflops = 22.5828 (norm. = 0.284515), norm. avg. (of 14) = 0.254102 fft 24: mflops = 60.4442 (norm. = 0.761522), norm. avg. (of 14) = 0.613826 fft 25: mflops = 17.79 (norm. = 0.224132), norm. avg. (of 11) = 0.222504 fft 26: mflops = 61.6731 (norm. = 0.777005), norm. avg. (of 13) = 0.250421 fft 27: mflops = 44.8147 (norm. = 0.56461), norm. avg. (of 14) = 0.430033 fft 28: mflops = 19.1619 (norm. = 0.241417), norm. avg. (of 14) = 0.107444 fft 29: mflops = 5.18923 (norm. = 0.065378), norm. avg. (of 14) = 0.051028 Benchmarking for array size = 32768 (power of 2): 0. Arndt DIF: elapsed time t=1.75045 s, 16 iters, t-(init.)=1.69425 s t(norm)=0.215435, mflops=23.2089 (err=5.2e-15) 1. Arndt DIT: elapsed time t=1.84393 s, 16 iters, t-(init.)=1.7875 s t(norm)=0.227293, mflops=21.9981 (err=5.2e-15) 2. Arndt Split-Radix: elapsed time t=1.21118 s, 8 iters, t-(init.)=1.18296 s t(norm)=0.300842, mflops=16.62 (err=5.2e-15) 3. Arndt 4-step: elapsed time t=1.61205 s, 16 iters, t-(init.)=1.5559 s t(norm)=0.197843, mflops=25.2725 (err=5.2e-15) 4. Beauregard: elapsed time t=1.26164 s, 8 iters, t-(init.)=1.2333 s t(norm)=0.313646, mflops=15.9416 (err=5.2e-15) 5. Bergland: elapsed time t=1.69804 s, 32 iters, t-(init.)=1.58504 s t(norm)=0.100774, mflops=49.6159 (err=5.2e-15) 6. CWP (min N) (N=34320): elapsed time t=1.12753 s, 32 iters, t-(init.)=1.01003 s t(norm)=0.0642158, mflops=77.8625 7. CWP (best N) (N=34320): elapsed time t=1.1264 s, 32 iters, t-(init.)=1.00841 s t(norm)=0.0641127, mflops=77.9876 8. Edelblute: elapsed time t=1.25185 s, 8 iters, t-(init.)=1.22392 s t(norm)=0.31126, mflops=16.0638 (err=5.2e-15) 9. FFTPACK (f2c): elapsed time t=1.29778 s, 8 iters, t-(init.)=1.26493 s t(norm)=0.321688, mflops=15.543 (err=5.2e-15) FFTW_MEASURE plan: (cost = 4.509850e-02) FFTW_TWIDDLE 32 FFTW_TWIDDLE 16 FFTW_NOTW 64 10. FFTW: elapsed time t=1.81421 s, 32 iters, t-(init.)=1.69277 s t(norm)=0.107624, mflops=46.4582 (err=5.2e-15) FFTW_ESTIMATE plan: (cost = 2.883584e+05) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.83979 s, 32 iters, t-(init.)=1.71801 s t(norm)=0.109228, mflops=45.7758 (err=5.2e-15) 12. Frigo-old: elapsed time t=1.88815 s, 16 iters, t-(init.)=1.83144 s t(norm)=0.23288, mflops=21.4703 (err=5.2e-15) 13. Green: elapsed time t=1.65442 s, 32 iters, t-(init.)=1.54065 s t(norm)=0.0979522, mflops=51.0453 (err=5.2e-15) 14. GSL: elapsed time t=1.93952 s, 16 iters, t-(init.)=1.88348 s t(norm)=0.239497, mflops=20.8771 (err=5.2e-15) 15. GSL DIT: elapsed time t=1.83081 s, 16 iters, t-(init.)=1.77461 s t(norm)=0.225654, mflops=22.1578 (err=5.9e-15) 16. GSL DIF: elapsed time t=1.81426 s, 16 iters, t-(init.)=1.75791 s t(norm)=0.22353, mflops=22.3684 (err=6.0e-15) 17. Skipping fft (Krukar can't handle N > 4096). 18. Mayer (Buneman): elapsed time t=1.64714 s, 16 iters, t-(init.)=1.59079 s t(norm)=0.20228, mflops=24.7183 (err=5.2e-15) 19. Mayer (simple): elapsed time t=1.58207 s, 16 iters, t-(init.)=1.52555 s t(norm)=0.193983, mflops=25.7754 20. Mayer (lookup): elapsed time t=1.80698 s, 16 iters, t-(init.)=1.75008 s t(norm)=0.222534, mflops=22.4685 (err=5.2e-15) 21. NAPACK (f2c): elapsed time t=1.79736 s, 4 iters, t-(init.)=1.78306 s t(norm)=0.90691, mflops=5.51323 (err=5.6e-13) 22. Nielsen: elapsed time t=1.13682 s, 8 iters, t-(init.)=1.10004 s t(norm)=0.279756, mflops=17.8727 (err=2.3e-13) 23. NR (C): elapsed time t=1.81812 s, 16 iters, t-(init.)=1.76128 s t(norm)=0.223958, mflops=22.3256 (err=5.3e-15) 24. Ooura (C): elapsed time t=1.46234 s, 32 iters, t-(init.)=1.34991 s t(norm)=0.0858248, mflops=58.2582 (err=5.2e-15) 25. QFT: elapsed time t=1.28349 s, 8 iters, t-(init.)=1.24624 s t(norm)=0.316934, mflops=15.7761 (err=7.5e-15) 26. Ransom: elapsed time t=1.69697 s, 32 iters, t-(init.)=1.58478 s t(norm)=0.100757, mflops=49.6241 (err=6.4e-15) 27. Singleton (f2c): elapsed time t=1.11715 s, 16 iters, t-(init.)=1.06076 s t(norm)=0.134883, mflops=37.0693 (err=7.2e-15) 28. Temperton (f2c): elapsed time t=1.11491 s, 8 iters, t-(init.)=1.08654 s t(norm)=0.276321, mflops=18.0949 (err=5.2e-15) 29. Valkenburg: elapsed time t=1.01509 s, 2 iters, t-(init.)=1.00778 s t(norm)=1.02517, mflops=4.87724 (err=5.2e-15) Top mflops for N=32768 = 77.9876 Normalized results and averages for N=32768: fft 0: mflops = 23.2089 (norm. = 0.297597), norm. avg. (of 15) = 0.371507 fft 1: mflops = 21.9981 (norm. = 0.282071), norm. avg. (of 15) = 0.364877 fft 2: mflops = 16.62 (norm. = 0.213111), norm. avg. (of 15) = 0.241406 fft 3: mflops = 25.2725 (norm. = 0.324058), norm. avg. (of 15) = 0.155227 fft 4: mflops = 15.9416 (norm. = 0.204411), norm. avg. (of 15) = 0.10407 fft 5: mflops = 49.6159 (norm. = 0.636202), norm. avg. (of 15) = 0.422706 fft 6: mflops = 77.8625 (norm. = 0.998395), norm. avg. (of 15) = 0.486727 fft 7: mflops = 77.9876 (norm. = 1), norm. avg. (of 15) = 0.538305 fft 8: mflops = 16.0638 (norm. = 0.205978), norm. avg. (of 14) = 0.187842 fft 9: mflops = 15.543 (norm. = 0.199301), norm. avg. (of 15) = 0.147019 fft 10: mflops = 46.4582 (norm. = 0.595712), norm. avg. (of 15) = 0.722103 fft 11: mflops = 45.7758 (norm. = 0.586962), norm. avg. (of 15) = 0.703359 fft 12: mflops = 21.4703 (norm. = 0.275304), norm. avg. (of 15) = 0.632346 fft 13: mflops = 51.0453 (norm. = 0.654531), norm. avg. (of 13) = 0.729085 fft 14: mflops = 20.8771 (norm. = 0.267697), norm. avg. (of 15) = 0.229482 fft 15: mflops = 22.1578 (norm. = 0.28412), norm. avg. (of 15) = 0.238201 fft 16: mflops = 22.3684 (norm. = 0.28682), norm. avg. (of 15) = 0.250688 fft 17: mflops = -1 (norm. = -0.0128225), norm. avg. (of 12) = 0.361777 fft 18: mflops = 24.7183 (norm. = 0.316951), norm. avg. (of 14) = 0.327527 fft 19: mflops = 25.7754 (norm. = 0.330507), norm. avg. (of 14) = 0.39921 fft 20: mflops = 22.4685 (norm. = 0.288103), norm. avg. (of 14) = 0.343459 fft 21: mflops = 5.51323 (norm. = 0.0706936), norm. avg. (of 15) = 0.0602572 fft 22: mflops = 17.8727 (norm. = 0.229174), norm. avg. (of 15) = 0.124998 fft 23: mflops = 22.3256 (norm. = 0.286271), norm. avg. (of 15) = 0.256247 fft 24: mflops = 58.2582 (norm. = 0.747018), norm. avg. (of 15) = 0.622705 fft 25: mflops = 15.7761 (norm. = 0.20229), norm. avg. (of 12) = 0.22082 fft 26: mflops = 49.6241 (norm. = 0.636308), norm. avg. (of 14) = 0.277984 fft 27: mflops = 37.0693 (norm. = 0.475322), norm. avg. (of 15) = 0.433052 fft 28: mflops = 18.0949 (norm. = 0.232023), norm. avg. (of 15) = 0.115749 fft 29: mflops = 4.87724 (norm. = 0.0625386), norm. avg. (of 15) = 0.0517954 Benchmarking for array size = 65536 (power of 2): 0. Arndt DIF: elapsed time t=1.06979 s, 4 iters, t-(init.)=1.03876 s t(norm)=0.24766, mflops=20.189 (err=1.6e-14) 1. Arndt DIT: elapsed time t=1.12443 s, 4 iters, t-(init.)=1.09386 s t(norm)=0.260798, mflops=19.1719 (err=1.6e-14) 2. Arndt Split-Radix: elapsed time t=1.34987 s, 4 iters, t-(init.)=1.31934 s t(norm)=0.314554, mflops=15.8955 (err=1.6e-14) 3. Arndt 4-step: elapsed time t=1.33886 s, 8 iters, t-(init.)=1.2781 s t(norm)=0.152361, mflops=32.8168 (err=1.6e-14) 4. Beauregard: elapsed time t=1.38481 s, 4 iters, t-(init.)=1.354 s t(norm)=0.322819, mflops=15.4886 (err=1.6e-14) 5. Bergland: elapsed time t=1.00975 s, 8 iters, t-(init.)=0.949157 s t(norm)=0.113148, mflops=44.1898 (err=1.6e-14) 6. CWP (min N) (N=72072): elapsed time t=1.54526 s, 16 iters, t-(init.)=1.354 s t(norm)=0.080705, mflops=61.954 7. CWP (best N) (N=72072): elapsed time t=1.54532 s, 16 iters, t-(init.)=1.35345 s t(norm)=0.080672, mflops=61.9794 8. Edelblute: elapsed time t=1.39512 s, 4 iters, t-(init.)=1.3643 s t(norm)=0.325274, mflops=15.3716 (err=1.6e-14) 9. FFTPACK (f2c): elapsed time t=1.64961 s, 4 iters, t-(init.)=1.61009 s t(norm)=0.383876, mflops=13.025 (err=1.6e-14) FFTW_MEASURE plan: (cost = 1.297020e-01) FFTW_TWIDDLE 64 FFTW_TWIDDLE 8 FFTW_TWIDDLE 4 FFTW_NOTW 32 10. FFTW: elapsed time t=1.02014 s, 8 iters, t-(init.)=0.941946 s t(norm)=0.112289, mflops=44.5281 (err=1.6e-14) FFTW_ESTIMATE plan: (cost = 5.767168e+05) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.21929 s, 8 iters, t-(init.)=1.14154 s t(norm)=0.136082, mflops=36.7426 (err=1.6e-14) 12. Frigo-old: elapsed time t=1.14763 s, 4 iters, t-(init.)=1.10781 s t(norm)=0.264122, mflops=18.9307 (err=1.6e-14) 13. Green: elapsed time t=1.85762 s, 16 iters, t-(init.)=1.73469 s t(norm)=0.103395, mflops=48.3581 (err=1.6e-14) 14. GSL: elapsed time t=1.06968 s, 4 iters, t-(init.)=1.0389 s t(norm)=0.247692, mflops=20.1864 (err=1.6e-14) 15. GSL DIT: elapsed time t=1.04087 s, 4 iters, t-(init.)=1.01004 s t(norm)=0.240813, mflops=20.763 (err=1.7e-14) 16. GSL DIF: elapsed time t=1.02678 s, 4 iters, t-(init.)=0.996133 s t(norm)=0.237497, mflops=21.0529 (err=1.8e-14) 17. Skipping fft (Krukar can't handle N > 4096). 18. Mayer (Buneman): elapsed time t=1.8037 s, 8 iters, t-(init.)=1.74269 s t(norm)=0.207744, mflops=24.068 (err=1.6e-14) 19. Mayer (simple): elapsed time t=1.73365 s, 8 iters, t-(init.)=1.67228 s t(norm)=0.199351, mflops=25.0813 20. Mayer (lookup): elapsed time t=1.98988 s, 8 iters, t-(init.)=1.92943 s t(norm)=0.230007, mflops=21.7385 (err=1.6e-14) 21. NAPACK (f2c): elapsed time t=1.87975 s, 2 iters, t-(init.)=1.84704 s t(norm)=0.880739, mflops=5.67705 (err=8.7e-13) 22. Nielsen: elapsed time t=1.36116 s, 4 iters, t-(init.)=1.31345 s t(norm)=0.313151, mflops=15.9667 (err=2.6e-13) 23. NR (C): elapsed time t=1.01287 s, 4 iters, t-(init.)=0.982357 s t(norm)=0.234212, mflops=21.3482 (err=1.6e-14) 24. Ooura (C): elapsed time t=1.6578 s, 16 iters, t-(init.)=1.53627 s t(norm)=0.0915688, mflops=54.6037 (err=1.6e-14) 25. QFT: elapsed time t=1.91377 s, 4 iters, t-(init.)=1.86521 s t(norm)=0.4447, mflops=11.2435 (err=1.9e-14) 26. Ransom: elapsed time t=1.09427 s, 8 iters, t-(init.)=1.03216 s t(norm)=0.123043, mflops=40.6363 (err=1.7e-14) 27. Singleton (f2c): elapsed time t=1.10584 s, 8 iters, t-(init.)=1.04449 s t(norm)=0.124513, mflops=40.1563 (err=2.4e-14) 28. Temperton (f2c): elapsed time t=1.19156 s, 4 iters, t-(init.)=1.1607 s t(norm)=0.276733, mflops=18.068 (err=1.6e-14) 29. Valkenburg: elapsed time t=1.1687 s, 1 iters, t-(init.)=1.1607 s t(norm)=1.10693, mflops=4.51699 (err=1.6e-14) Top mflops for N=65536 = 61.9794 Normalized results and averages for N=65536: fft 0: mflops = 20.189 (norm. = 0.325737), norm. avg. (of 16) = 0.368646 fft 1: mflops = 19.1719 (norm. = 0.309328), norm. avg. (of 16) = 0.361406 fft 2: mflops = 15.8955 (norm. = 0.256465), norm. avg. (of 16) = 0.242347 fft 3: mflops = 32.8168 (norm. = 0.52948), norm. avg. (of 16) = 0.178618 fft 4: mflops = 15.4886 (norm. = 0.249899), norm. avg. (of 16) = 0.113184 fft 5: mflops = 44.1898 (norm. = 0.712976), norm. avg. (of 16) = 0.440848 fft 6: mflops = 61.954 (norm. = 0.999592), norm. avg. (of 16) = 0.518781 fft 7: mflops = 61.9794 (norm. = 1), norm. avg. (of 16) = 0.567161 fft 8: mflops = 15.3716 (norm. = 0.248012), norm. avg. (of 15) = 0.191853 fft 9: mflops = 13.025 (norm. = 0.210151), norm. avg. (of 16) = 0.150965 fft 10: mflops = 44.5281 (norm. = 0.718434), norm. avg. (of 16) = 0.721874 fft 11: mflops = 36.7426 (norm. = 0.59282), norm. avg. (of 16) = 0.696451 fft 12: mflops = 18.9307 (norm. = 0.305435), norm. avg. (of 16) = 0.611914 fft 13: mflops = 48.3581 (norm. = 0.780229), norm. avg. (of 14) = 0.732738 fft 14: mflops = 20.1864 (norm. = 0.325695), norm. avg. (of 16) = 0.235495 fft 15: mflops = 20.763 (norm. = 0.334999), norm. avg. (of 16) = 0.244251 fft 16: mflops = 21.0529 (norm. = 0.339677), norm. avg. (of 16) = 0.25625 fft 17: mflops = -1 (norm. = -0.0161344), norm. avg. (of 12) = 0.361777 fft 18: mflops = 24.068 (norm. = 0.388324), norm. avg. (of 15) = 0.33158 fft 19: mflops = 25.0813 (norm. = 0.404672), norm. avg. (of 15) = 0.399574 fft 20: mflops = 21.7385 (norm. = 0.350738), norm. avg. (of 15) = 0.343944 fft 21: mflops = 5.67705 (norm. = 0.0915958), norm. avg. (of 16) = 0.0622159 fft 22: mflops = 15.9667 (norm. = 0.257614), norm. avg. (of 16) = 0.133286 fft 23: mflops = 21.3482 (norm. = 0.34444), norm. avg. (of 16) = 0.261759 fft 24: mflops = 54.6037 (norm. = 0.880999), norm. avg. (of 16) = 0.638849 fft 25: mflops = 11.2435 (norm. = 0.181408), norm. avg. (of 13) = 0.217788 fft 26: mflops = 40.6363 (norm. = 0.655643), norm. avg. (of 15) = 0.303161 fft 27: mflops = 40.1563 (norm. = 0.647898), norm. avg. (of 16) = 0.44648 fft 28: mflops = 18.068 (norm. = 0.291516), norm. avg. (of 16) = 0.126735 fft 29: mflops = 4.51699 (norm. = 0.072879), norm. avg. (of 16) = 0.0531131 Benchmarking for array size = 131072 (power of 2): 0. Arndt DIF: elapsed time t=1.74871 s, 1 iters, t-(init.)=1.69836 s t(norm)=0.762203, mflops=6.55994 (err=3.9e-14) 1. Arndt DIT: elapsed time t=1.85416 s, 1 iters, t-(init.)=1.80399 s t(norm)=0.809611, mflops=6.1758 (err=3.9e-14) 2. Arndt Split-Radix: elapsed time t=2.48572 s, 1 iters, t-(init.)=2.43518 s t(norm)=1.09288, mflops=4.57506 (err=3.9e-14) 3. Arndt 4-step: elapsed time t=1.61913 s, 2 iters, t-(init.)=1.51878 s t(norm)=0.340804, mflops=14.6712 (err=3.9e-14) 4. Beauregard: elapsed time t=1.13442 s, 1 iters, t-(init.)=1.08395 s t(norm)=0.486465, mflops=10.2782 (err=3.8e-14) 5. Bergland: elapsed time t=1.41377 s, 2 iters, t-(init.)=1.31331 s t(norm)=0.294699, mflops=16.9665 (err=3.9e-14) 6. CWP (min N) (N=144144): elapsed time t=1.35361 s, 4 iters, t-(init.)=1.13519 s t(norm)=0.127364, mflops=39.2574 7. CWP (best N) (N=144144): elapsed time t=1.35398 s, 4 iters, t-(init.)=1.13539 s t(norm)=0.127387, mflops=39.2504 8. Edelblute: elapsed time t=2.48599 s, 1 iters, t-(init.)=2.43567 s t(norm)=1.0931, mflops=4.57416 (err=3.9e-14) 9. FFTPACK (f2c): elapsed time t=1.26739 s, 1 iters, t-(init.)=1.21714 s t(norm)=0.546237, mflops=9.15353 (err=3.8e-14) FFTW_MEASURE plan: (cost = 3.643840e-01) FFTW_TWIDDLE 2 FFTW_TWIDDLE 64 FFTW_TWIDDLE 16 FFTW_NOTW 64 10. FFTW: elapsed time t=1.48244 s, 4 iters, t-(init.)=1.28127 s t(norm)=0.143754, mflops=34.7816 (err=3.8e-14) FFTW_ESTIMATE plan: (cost = 1.153434e+06) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.81078 s, 4 iters, t-(init.)=1.60971 s t(norm)=0.180605, mflops=27.6848 (err=3.8e-14) 12. Frigo-old: elapsed time t=1.52271 s, 2 iters, t-(init.)=1.42319 s t(norm)=0.319355, mflops=15.6566 (err=3.8e-14) 13. Green: elapsed time t=1.35287 s, 2 iters, t-(init.)=1.2609 s t(norm)=0.282939, mflops=17.6717 (err=3.8e-14) 14. GSL: elapsed time t=1.85324 s, 2 iters, t-(init.)=1.75277 s t(norm)=0.393311, mflops=12.7126 (err=3.8e-14) 15. GSL DIT: elapsed time t=1.68185 s, 1 iters, t-(init.)=1.63154 s t(norm)=0.732215, mflops=6.8286 (err=4.0e-14) 16. GSL DIF: elapsed time t=1.62967 s, 1 iters, t-(init.)=1.57909 s t(norm)=0.708679, mflops=7.05538 (err=4.2e-14) 17. Skipping fft (Krukar can't handle N > 4096). 18. Mayer (Buneman): elapsed time t=1.22129 s, 2 iters, t-(init.)=1.12078 s t(norm)=0.251496, mflops=19.881 (err=3.9e-14) 19. Mayer (simple): elapsed time t=1.1836 s, 2 iters, t-(init.)=1.08296 s t(norm)=0.243011, mflops=20.5752 20. Mayer (lookup): elapsed time t=1.33137 s, 2 iters, t-(init.)=1.23085 s t(norm)=0.276196, mflops=18.1031 (err=3.9e-14) 21. NAPACK (f2c): elapsed time t=2.1083 s, 1 iters, t-(init.)=2.05808 s t(norm)=0.923639, mflops=5.41337 (err=2.0e-12) 22. Nielsen: elapsed time t=1.3963 s, 1 iters, t-(init.)=1.346 s t(norm)=0.60407, mflops=8.27718 (err=9.2e-13) 23. NR (C): elapsed time t=1.67906 s, 1 iters, t-(init.)=1.62892 s t(norm)=0.731038, mflops=6.83959 (err=3.9e-14) 24. Ooura (C): elapsed time t=1.14508 s, 2 iters, t-(init.)=1.04451 s t(norm)=0.234382, mflops=21.3327 (err=3.9e-14) 25. QFT: elapsed time t=1.1364 s, 1 iters, t-(init.)=1.08585 s t(norm)=0.487317, mflops=10.2603 (err=4.1e-14) 26. Ransom: elapsed time t=1.2009 s, 2 iters, t-(init.)=1.10011 s t(norm)=0.246858, mflops=20.2546 (err=3.9e-14) 27. Singleton (f2c): elapsed time t=1.77931 s, 2 iters, t-(init.)=1.67856 s t(norm)=0.376658, mflops=13.2746 (err=5.7e-14) 28. Temperton (f2c): elapsed time t=1.07842 s, 1 iters, t-(init.)=1.02856 s t(norm)=0.461608, mflops=10.8317 (err=3.8e-14) 29. Valkenburg: elapsed time t=3.29816 s, 1 iters, t-(init.)=3.24806 s t(norm)=1.45769, mflops=3.43008 (err=3.9e-14) Top mflops for N=131072 = 39.2574 Normalized results and averages for N=131072: fft 0: mflops = 6.55994 (norm. = 0.167101), norm. avg. (of 17) = 0.356791 fft 1: mflops = 6.1758 (norm. = 0.157316), norm. avg. (of 17) = 0.3494 fft 2: mflops = 4.57506 (norm. = 0.11654), norm. avg. (of 17) = 0.234946 fft 3: mflops = 14.6712 (norm. = 0.373717), norm. avg. (of 17) = 0.190095 fft 4: mflops = 10.2782 (norm. = 0.261816), norm. avg. (of 17) = 0.121927 fft 5: mflops = 16.9665 (norm. = 0.432185), norm. avg. (of 17) = 0.440338 fft 6: mflops = 39.2574 (norm. = 1), norm. avg. (of 17) = 0.547088 fft 7: mflops = 39.2504 (norm. = 0.999822), norm. avg. (of 17) = 0.592612 fft 8: mflops = 4.57416 (norm. = 0.116517), norm. avg. (of 16) = 0.187145 fft 9: mflops = 9.15353 (norm. = 0.233167), norm. avg. (of 17) = 0.1558 fft 10: mflops = 34.7816 (norm. = 0.885987), norm. avg. (of 17) = 0.731528 fft 11: mflops = 27.6848 (norm. = 0.705211), norm. avg. (of 17) = 0.696966 fft 12: mflops = 15.6566 (norm. = 0.398818), norm. avg. (of 17) = 0.599379 fft 13: mflops = 17.6717 (norm. = 0.450148), norm. avg. (of 15) = 0.713899 fft 14: mflops = 12.7126 (norm. = 0.323826), norm. avg. (of 17) = 0.240691 fft 15: mflops = 6.8286 (norm. = 0.173944), norm. avg. (of 17) = 0.240116 fft 16: mflops = 7.05538 (norm. = 0.179721), norm. avg. (of 17) = 0.251748 fft 17: mflops = -1 (norm. = -0.0254729), norm. avg. (of 12) = 0.361777 fft 18: mflops = 19.881 (norm. = 0.506426), norm. avg. (of 16) = 0.342508 fft 19: mflops = 20.5752 (norm. = 0.52411), norm. avg. (of 16) = 0.407357 fft 20: mflops = 18.1031 (norm. = 0.461138), norm. avg. (of 16) = 0.351269 fft 21: mflops = 5.41337 (norm. = 0.137894), norm. avg. (of 17) = 0.0666675 fft 22: mflops = 8.27718 (norm. = 0.210844), norm. avg. (of 17) = 0.137848 fft 23: mflops = 6.83959 (norm. = 0.174224), norm. avg. (of 17) = 0.25661 fft 24: mflops = 21.3327 (norm. = 0.543405), norm. avg. (of 17) = 0.633234 fft 25: mflops = 10.2603 (norm. = 0.261359), norm. avg. (of 14) = 0.2209 fft 26: mflops = 20.2546 (norm. = 0.515943), norm. avg. (of 16) = 0.31646 fft 27: mflops = 13.2746 (norm. = 0.338144), norm. avg. (of 17) = 0.440107 fft 28: mflops = 10.8317 (norm. = 0.275915), norm. avg. (of 17) = 0.13551 fft 29: mflops = 3.43008 (norm. = 0.087374), norm. avg. (of 17) = 0.0551285 Benchmarking for array size = 262144 (power of 2): 0. Arndt DIF: elapsed time t=4.64717 s, 1 iters, t-(init.)=4.54664 s t(norm)=0.963559, mflops=5.18909 (err=4.3e-14) 1. Arndt DIT: elapsed time t=4.67082 s, 1 iters, t-(init.)=4.57034 s t(norm)=0.968582, mflops=5.16219 (err=4.3e-14) 2. Arndt Split-Radix: elapsed time t=5.64649 s, 1 iters, t-(init.)=5.54592 s t(norm)=1.17533, mflops=4.25411 (err=4.3e-14) 3. Arndt 4-step: elapsed time t=1.54051 s, 1 iters, t-(init.)=1.43989 s t(norm)=0.305153, mflops=16.3852 (err=4.3e-14) 4. Beauregard: elapsed time t=2.40468 s, 1 iters, t-(init.)=2.30396 s t(norm)=0.488274, mflops=10.2402 (err=4.3e-14) 5. Bergland: elapsed time t=1.55686 s, 1 iters, t-(init.)=1.45646 s t(norm)=0.308664, mflops=16.1989 (err=4.3e-14) 6. CWP (min N) (N=360360): elapsed time t=1.08737 s, 1 iters, t-(init.)=0.957292 s t(norm)=0.202877, mflops=24.6455 7. CWP (best N) (N=360360): elapsed time t=1.08803 s, 1 iters, t-(init.)=0.957704 s t(norm)=0.202964, mflops=24.6349 8. Edelblute: elapsed time t=5.6679 s, 1 iters, t-(init.)=5.56732 s t(norm)=1.17987, mflops=4.23776 (err=4.3e-14) 9. FFTPACK (f2c): elapsed time t=2.70648 s, 1 iters, t-(init.)=2.60575 s t(norm)=0.55223, mflops=9.0542 (err=4.3e-14) FFTW_MEASURE plan: (cost = 7.722530e-01) FFTW_TWIDDLE 4 FFTW_TWIDDLE 64 FFTW_TWIDDLE 8 FFTW_TWIDDLE 4 FFTW_NOTW 32 10. FFTW: elapsed time t=1.52733 s, 2 iters, t-(init.)=1.32637 s t(norm)=0.140547, mflops=35.5753 (err=4.3e-14) FFTW_ESTIMATE plan: (cost = 2.988442e+06) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=1.98232 s, 2 iters, t-(init.)=1.78151 s t(norm)=0.188775, mflops=26.4865 (err=4.3e-14) 12. Frigo-old: elapsed time t=1.78503 s, 1 iters, t-(init.)=1.68492 s t(norm)=0.357082, mflops=14.0024 (err=4.3e-14) 13. Green: elapsed time t=1.51774 s, 1 iters, t-(init.)=1.41732 s t(norm)=0.300369, mflops=16.6462 (err=4.3e-14) 14. GSL: elapsed time t=1.92236 s, 1 iters, t-(init.)=1.82191 s t(norm)=0.386112, mflops=12.9496 (err=4.3e-14) 15. GSL DIT: elapsed time t=3.67167 s, 1 iters, t-(init.)=3.5709 s t(norm)=0.756772, mflops=6.60701 (err=4.5e-14) 16. GSL DIF: elapsed time t=3.74183 s, 1 iters, t-(init.)=3.6415 s t(norm)=0.771734, mflops=6.47892 (err=4.7e-14) 17. Skipping fft (Krukar can't handle N > 4096). 18. Mayer (Buneman): elapsed time t=3.28065 s, 1 iters, t-(init.)=3.18012 s t(norm)=0.673956, mflops=7.41888 (err=4.3e-14) 19. Mayer (simple): elapsed time t=3.25106 s, 1 iters, t-(init.)=3.15058 s t(norm)=0.667694, mflops=7.48846 20. Mayer (lookup): elapsed time t=3.39122 s, 1 iters, t-(init.)=3.29066 s t(norm)=0.697383, mflops=7.16967 (err=4.3e-14) 21. NAPACK (f2c): elapsed time t=4.28708 s, 1 iters, t-(init.)=4.18675 s t(norm)=0.887288, mflops=5.63515 (err=3.7e-12) 22. Nielsen: elapsed time t=3.52158 s, 1 iters, t-(init.)=3.42107 s t(norm)=0.72502, mflops=6.89636 (err=2.1e-12) 23. NR (C): elapsed time t=3.76475 s, 1 iters, t-(init.)=3.66423 s t(norm)=0.776551, mflops=6.43873 (err=4.3e-14) 24. Ooura (C): elapsed time t=1.15703 s, 1 iters, t-(init.)=1.05647 s t(norm)=0.223896, mflops=22.3318 (err=4.3e-14) 25. QFT: elapsed time t=2.9645 s, 1 iters, t-(init.)=2.86349 s t(norm)=0.606852, mflops=8.23924 (err=4.7e-14) 26. Ransom: elapsed time t=1.83193 s, 2 iters, t-(init.)=1.6307 s t(norm)=0.172796, mflops=28.9359 (err=4.3e-14) 27. Singleton (f2c): elapsed time t=1.91447 s, 1 iters, t-(init.)=1.81394 s t(norm)=0.384425, mflops=13.0064 (err=5.9e-14) 28. Temperton (f2c): elapsed time t=2.38743 s, 1 iters, t-(init.)=2.2867 s t(norm)=0.484616, mflops=10.3175 (err=4.3e-14) 29. Valkenburg: elapsed time t=7.24801 s, 1 iters, t-(init.)=7.14738 s t(norm)=1.51473, mflops=3.30092 (err=4.3e-14) Top mflops for N=262144 = 35.5753 Normalized results and averages for N=262144: fft 0: mflops = 5.18909 (norm. = 0.145862), norm. avg. (of 18) = 0.345072 fft 1: mflops = 5.16219 (norm. = 0.145106), norm. avg. (of 18) = 0.338051 fft 2: mflops = 4.25411 (norm. = 0.11958), norm. avg. (of 18) = 0.228537 fft 3: mflops = 16.3852 (norm. = 0.460579), norm. avg. (of 18) = 0.205121 fft 4: mflops = 10.2402 (norm. = 0.287845), norm. avg. (of 18) = 0.131145 fft 5: mflops = 16.1989 (norm. = 0.45534), norm. avg. (of 18) = 0.441172 fft 6: mflops = 24.6455 (norm. = 0.692771), norm. avg. (of 18) = 0.555182 fft 7: mflops = 24.6349 (norm. = 0.692473), norm. avg. (of 18) = 0.59816 fft 8: mflops = 4.23776 (norm. = 0.119121), norm. avg. (of 17) = 0.183144 fft 9: mflops = 9.0542 (norm. = 0.254508), norm. avg. (of 18) = 0.161284 fft 10: mflops = 35.5753 (norm. = 1), norm. avg. (of 18) = 0.746443 fft 11: mflops = 26.4865 (norm. = 0.744521), norm. avg. (of 18) = 0.699608 fft 12: mflops = 14.0024 (norm. = 0.393599), norm. avg. (of 18) = 0.587947 fft 13: mflops = 16.6462 (norm. = 0.467915), norm. avg. (of 16) = 0.698525 fft 14: mflops = 12.9496 (norm. = 0.364006), norm. avg. (of 18) = 0.247542 fft 15: mflops = 6.60701 (norm. = 0.185719), norm. avg. (of 18) = 0.237094 fft 16: mflops = 6.47892 (norm. = 0.182119), norm. avg. (of 18) = 0.24788 fft 17: mflops = -1 (norm. = -0.0281094), norm. avg. (of 12) = 0.361777 fft 18: mflops = 7.41888 (norm. = 0.20854), norm. avg. (of 17) = 0.334628 fft 19: mflops = 7.48846 (norm. = 0.210496), norm. avg. (of 17) = 0.395777 fft 20: mflops = 7.16967 (norm. = 0.201535), norm. avg. (of 17) = 0.342461 fft 21: mflops = 5.63515 (norm. = 0.158401), norm. avg. (of 18) = 0.0717638 fft 22: mflops = 6.89636 (norm. = 0.193853), norm. avg. (of 18) = 0.14096 fft 23: mflops = 6.43873 (norm. = 0.180989), norm. avg. (of 18) = 0.252408 fft 24: mflops = 22.3318 (norm. = 0.627735), norm. avg. (of 18) = 0.632929 fft 25: mflops = 8.23924 (norm. = 0.2316), norm. avg. (of 15) = 0.221614 fft 26: mflops = 28.9359 (norm. = 0.813372), norm. avg. (of 17) = 0.34569 fft 27: mflops = 13.0064 (norm. = 0.365603), norm. avg. (of 18) = 0.435968 fft 28: mflops = 10.3175 (norm. = 0.290017), norm. avg. (of 18) = 0.144094 fft 29: mflops = 3.30092 (norm. = 0.092787), norm. avg. (of 18) = 0.0572206 Benchmarking for array size = 524288 (power of 2): 0. Arndt DIF: elapsed time t=9.37552 s, 1 iters, t-(init.)=9.17476 s t(norm)=0.921024, mflops=5.42874 (err=1.1e-13) 1. Arndt DIT: elapsed time t=9.45206 s, 1 iters, t-(init.)=9.25097 s t(norm)=0.928676, mflops=5.38401 (err=1.1e-13) 2. Arndt Split-Radix: elapsed time t=12.1472 s, 1 iters, t-(init.)=11.9463 s t(norm)=1.19925, mflops=4.16927 (err=1.1e-13) 3. Arndt 4-step: elapsed time t=5.03809 s, 1 iters, t-(init.)=4.83705 s t(norm)=0.485576, mflops=10.2971 (err=1.1e-13) 4. Beauregard: elapsed time t=5.05532 s, 1 iters, t-(init.)=4.85395 s t(norm)=0.487272, mflops=10.2612 (err=1.1e-13) 5. Bergland: elapsed time t=3.6044 s, 1 iters, t-(init.)=3.40332 s t(norm)=0.341648, mflops=14.6349 (err=1.1e-13) 6. CWP (min N) (N=720720): elapsed time t=2.27585 s, 1 iters, t-(init.)=1.99924 s t(norm)=0.200697, mflops=24.9132 7. CWP (best N) (N=720720): elapsed time t=2.27592 s, 1 iters, t-(init.)=1.99916 s t(norm)=0.200689, mflops=24.9142 8. Edelblute: elapsed time t=12.2114 s, 1 iters, t-(init.)=12.01 s t(norm)=1.20565, mflops=4.14715 (err=1.1e-13) 9. FFTPACK (f2c): elapsed time t=5.51826 s, 1 iters, t-(init.)=5.31713 s t(norm)=0.533769, mflops=9.36735 (err=1.1e-13) FFTW_MEASURE plan: (cost = 1.772352e+00) FFTW_TWIDDLE 4 FFTW_TWIDDLE 2 FFTW_TWIDDLE 64 FFTW_TWIDDLE 8 FFTW_TWIDDLE 4 FFTW_NOTW 32 10. FFTW: elapsed time t=1.79787 s, 1 iters, t-(init.)=1.59676 s t(norm)=0.160294, mflops=31.1927 (err=1.1e-13) FFTW_ESTIMATE plan: (cost = 5.976883e+06) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=2.08039 s, 1 iters, t-(init.)=1.87932 s t(norm)=0.188658, mflops=26.5029 (err=1.1e-13) 12. Frigo-old: elapsed time t=4.19225 s, 1 iters, t-(init.)=3.99123 s t(norm)=0.400667, mflops=12.4792 (err=1.1e-13) 13. Green: elapsed time t=3.24214 s, 1 iters, t-(init.)=3.04131 s t(norm)=0.305308, mflops=16.3769 (err=1.1e-13) 14. GSL: elapsed time t=3.84737 s, 1 iters, t-(init.)=3.64586 s t(norm)=0.365996, mflops=13.6613 (err=1.1e-13) 15. GSL DIT: elapsed time t=7.77342 s, 1 iters, t-(init.)=7.57226 s t(norm)=0.760155, mflops=6.5776 (err=1.1e-13) 16. GSL DIF: elapsed time t=7.93432 s, 1 iters, t-(init.)=7.73321 s t(norm)=0.776312, mflops=6.44071 (err=1.1e-13) 17. Skipping fft (Krukar can't handle N > 4096). 18. Mayer (Buneman): elapsed time t=7.46092 s, 1 iters, t-(init.)=7.25965 s t(norm)=0.728773, mflops=6.86085 (err=1.1e-13) 19. Mayer (simple): elapsed time t=7.38116 s, 1 iters, t-(init.)=7.18002 s t(norm)=0.720779, mflops=6.93694 20. Mayer (lookup): elapsed time t=7.76125 s, 1 iters, t-(init.)=7.5601 s t(norm)=0.758934, mflops=6.58819 (err=1.1e-13) 21. NAPACK (f2c): elapsed time t=9.47708 s, 1 iters, t-(init.)=9.27603 s t(norm)=0.931191, mflops=5.36947 (err=7.9e-12) 22. Nielsen: elapsed time t=7.82734 s, 1 iters, t-(init.)=7.6263 s t(norm)=0.76558, mflops=6.531 (err=4.4e-12) 23. NR (C): elapsed time t=7.97833 s, 1 iters, t-(init.)=7.7771 s t(norm)=0.780718, mflops=6.40436 (err=1.1e-13) 24. Ooura (C): elapsed time t=2.51167 s, 1 iters, t-(init.)=2.31042 s t(norm)=0.231936, mflops=21.5577 (err=1.1e-13) 25. QFT: elapsed time t=8.90384 s, 1 iters, t-(init.)=8.7027 s t(norm)=0.873636, mflops=5.7232 (err=1.2e-13) 26. Ransom: elapsed time t=2.43938 s, 1 iters, t-(init.)=2.23843 s t(norm)=0.224709, mflops=22.251 (err=1.1e-13) 27. Singleton (f2c): elapsed time t=4.74204 s, 1 iters, t-(init.)=4.54103 s t(norm)=0.45586, mflops=10.9683 (err=1.6e-13) 28. Temperton (f2c): elapsed time t=5.36436 s, 1 iters, t-(init.)=5.1631 s t(norm)=0.518307, mflops=9.64679 (err=1.1e-13) 29. Valkenburg: elapsed time t=15.7341 s, 1 iters, t-(init.)=15.5331 s t(norm)=1.55931, mflops=3.20654 (err=1.1e-13) Top mflops for N=524288 = 31.1927 Normalized results and averages for N=524288: fft 0: mflops = 5.42874 (norm. = 0.174039), norm. avg. (of 19) = 0.336071 fft 1: mflops = 5.38401 (norm. = 0.172605), norm. avg. (of 19) = 0.329343 fft 2: mflops = 4.16927 (norm. = 0.133662), norm. avg. (of 19) = 0.223544 fft 3: mflops = 10.2971 (norm. = 0.330111), norm. avg. (of 19) = 0.2117 fft 4: mflops = 10.2612 (norm. = 0.328962), norm. avg. (of 19) = 0.141556 fft 5: mflops = 14.6349 (norm. = 0.469179), norm. avg. (of 19) = 0.442646 fft 6: mflops = 24.9132 (norm. = 0.798687), norm. avg. (of 19) = 0.567998 fft 7: mflops = 24.9142 (norm. = 0.798719), norm. avg. (of 19) = 0.608716 fft 8: mflops = 4.14715 (norm. = 0.132953), norm. avg. (of 18) = 0.180355 fft 9: mflops = 9.36735 (norm. = 0.300306), norm. avg. (of 19) = 0.168601 fft 10: mflops = 31.1927 (norm. = 1), norm. avg. (of 19) = 0.759788 fft 11: mflops = 26.5029 (norm. = 0.849652), norm. avg. (of 19) = 0.707505 fft 12: mflops = 12.4792 (norm. = 0.400068), norm. avg. (of 19) = 0.578058 fft 13: mflops = 16.3769 (norm. = 0.525024), norm. avg. (of 17) = 0.688319 fft 14: mflops = 13.6613 (norm. = 0.437966), norm. avg. (of 19) = 0.257564 fft 15: mflops = 6.5776 (norm. = 0.21087), norm. avg. (of 19) = 0.235713 fft 16: mflops = 6.44071 (norm. = 0.206481), norm. avg. (of 19) = 0.245701 fft 17: mflops = -1 (norm. = -0.0320588), norm. avg. (of 12) = 0.361777 fft 18: mflops = 6.86085 (norm. = 0.219951), norm. avg. (of 18) = 0.328257 fft 19: mflops = 6.93694 (norm. = 0.22239), norm. avg. (of 18) = 0.386145 fft 20: mflops = 6.58819 (norm. = 0.211209), norm. avg. (of 18) = 0.335169 fft 21: mflops = 5.36947 (norm. = 0.172139), norm. avg. (of 19) = 0.0770467 fft 22: mflops = 6.531 (norm. = 0.209376), norm. avg. (of 19) = 0.144561 fft 23: mflops = 6.40436 (norm. = 0.205316), norm. avg. (of 19) = 0.24993 fft 24: mflops = 21.5577 (norm. = 0.691113), norm. avg. (of 19) = 0.635991 fft 25: mflops = 5.7232 (norm. = 0.183479), norm. avg. (of 16) = 0.21923 fft 26: mflops = 22.251 (norm. = 0.71334), norm. avg. (of 18) = 0.366115 fft 27: mflops = 10.9683 (norm. = 0.35163), norm. avg. (of 19) = 0.431529 fft 28: mflops = 9.64679 (norm. = 0.309265), norm. avg. (of 19) = 0.152787 fft 29: mflops = 3.20654 (norm. = 0.102798), norm. avg. (of 19) = 0.0596194 Benchmarking for array size = 1048576 (power of 2): 0. Arndt DIF: elapsed time t=21.4385 s, 1 iters, t-(init.)=21.0362 s t(norm)=1.00309, mflops=4.98462 (err=1.8e-13) 1. Arndt DIT: elapsed time t=21.5527 s, 1 iters, t-(init.)=21.1504 s t(norm)=1.00853, mflops=4.95771 (err=1.8e-13) 2. Arndt Split-Radix: elapsed time t=25.862 s, 1 iters, t-(init.)=25.46 s t(norm)=1.21403, mflops=4.11852 (err=1.8e-13) 3. Arndt 4-step: elapsed time t=6.76295 s, 1 iters, t-(init.)=6.3611 s t(norm)=0.303321, mflops=16.4842 (err=1.8e-13) 4. Beauregard: elapsed time t=10.6047 s, 1 iters, t-(init.)=10.2022 s t(norm)=0.486478, mflops=10.278 (err=1.8e-13) 5. Bergland: elapsed time t=7.68216 s, 1 iters, t-(init.)=7.27976 s t(norm)=0.347126, mflops=14.404 (err=1.8e-13) 6. Skipping fft (this transform size is too big for CWP). 7. Skipping fft (this transform size is too big for CWP). 8. Edelblute: elapsed time t=26.006 s, 1 iters, t-(init.)=25.6037 s t(norm)=1.22088, mflops=4.0954 (err=1.8e-13) 9. FFTPACK (f2c): elapsed time t=11.7176 s, 1 iters, t-(init.)=11.3158 s t(norm)=0.539577, mflops=9.26652 (err=1.8e-13) FFTW_MEASURE plan: (cost = 4.156831e+00) FFTW_TWIDDLE 4 FFTW_TWIDDLE 4 FFTW_TWIDDLE 64 FFTW_TWIDDLE 16 FFTW_NOTW 64 10. FFTW: elapsed time t=3.71361 s, 1 iters, t-(init.)=3.31137 s t(norm)=0.157899, mflops=31.6659 Found big error at 1733: (0.992925,0.498767)*9.53674e-07 = (9.46927e-07,4.75661e-07) != (9.52e-07,4.85e-07) Found big error at 10365: (0.143577,0.466041)*9.53674e-07 = (1.36926e-07,4.44452e-07) != (1.35e-07,4.34e-07) Found big error at 14273: (0.498681,0.242214)*9.53674e-07 = (4.75579e-07,2.30993e-07) != (4.72e-07,2.41e-07) Found big error at 17145: (0.768474,0.643108)*9.53674e-07 = (7.32874e-07,6.13315e-07) != (7.39e-07,6.22e-07) Found big error at 18925: (0.351104,0.148473)*9.53674e-07 = (3.34839e-07,1.41594e-07) != (3.34e-07,1.31e-07) Found big error at 27913: (0.374355,0.710726)*9.53674e-07 = (3.57013e-07,6.77801e-07) != (3.6e-07,6.88e-07) Found big error at 51221: (0.774089,0.703883)*9.53674e-07 = (7.38229e-07,6.71275e-07) != (7.3e-07,6.78e-07) Found big error at 53173: (0.1119,1.19123)*9.53674e-07 = (1.06716e-07,1.13604e-06) != (1.03e-07,1.146e-06) Found big error at 58737: (1.22236,0.0406511)*9.53674e-07 = (1.16574e-06,3.87679e-08) != (1.171e-06,4.8e-08) Found big error at 62213: (0.17316,1.04563)*9.53674e-07 = (1.65138e-07,9.97188e-07) != (1.55e-07,9.94e-07) Found big error at 65393: (0.66551,0.602633)*9.53674e-07 = (6.3468e-07,5.74716e-07) != (6.32e-07,5.85e-07) (err=1.4e-04) FFTW_ESTIMATE plan: (cost = 1.195377e+07) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_NOTW 32 11. FFTW_ESTIMATE: elapsed time t=4.77678 s, 1 iters, t-(init.)=4.37457 s t(norm)=0.208596, mflops=23.9698 (err=1.8e-13) 12. Frigo-old: elapsed time t=9.00228 s, 1 iters, t-(init.)=8.60008 s t(norm)=0.410084, mflops=12.1926 (err=1.8e-13) 13. Skipping fft (Green can't handle this size.). 14. GSL: elapsed time t=7.76314 s, 1 iters, t-(init.)=7.36069 s t(norm)=0.350985, mflops=14.2456 (err=1.8e-13) 15. GSL DIT: elapsed time t=16.3908 s, 1 iters, t-(init.)=15.9884 s t(norm)=0.762387, mflops=6.55835 (err=1.9e-13) 16. GSL DIF: elapsed time t=16.752 s, 1 iters, t-(init.)=16.3497 s t(norm)=0.779613, mflops=6.41344 (err=1.9e-13) 17. Skipping fft (Krukar can't handle N > 4096). 18. Skipping fft (Mayer (Buneman) can't handle N > 2^19). 19. Mayer (simple): elapsed time t=15.7891 s, 1 iters, t-(init.)=15.3867 s t(norm)=0.733696, mflops=6.81481 20. Mayer (lookup): elapsed time t=16.5601 s, 1 iters, t-(init.)=16.1576 s t(norm)=0.770457, mflops=6.48966 (err=1.8e-13) 21. NAPACK (f2c): elapsed time t=19.3964 s, 1 iters, t-(init.)=18.9942 s t(norm)=0.905713, mflops=5.52051 (err=1.5e-11) 22. Nielsen: elapsed time t=15.8677 s, 1 iters, t-(init.)=15.4652 s t(norm)=0.737437, mflops=6.78024 (err=8.1e-12) 23. NR (C): elapsed time t=16.8334 s, 1 iters, t-(init.)=16.4314 s t(norm)=0.78351, mflops=6.38154 (err=1.8e-13) 24. Ooura (C): elapsed time t=5.04565 s, 1 iters, t-(init.)=4.6432 s t(norm)=0.221405, mflops=22.5831 (err=1.8e-13) 25. QFT: elapsed time t=20.7593 s, 1 iters, t-(init.)=20.3566 s t(norm)=0.970679, mflops=5.15103 (err=1.9e-13) 26. Ransom: elapsed time t=3.71463 s, 1 iters, t-(init.)=3.31206 s t(norm)=0.157931, mflops=31.6593 (err=1.8e-13) 27. Singleton (f2c): elapsed time t=8.62825 s, 1 iters, t-(init.)=8.226 s t(norm)=0.392246, mflops=12.7471 (err=2.6e-13) 28. Temperton (f2c): elapsed time t=10.8988 s, 1 iters, t-(init.)=10.4964 s t(norm)=0.500506, mflops=9.9899 (err=1.8e-13) 29. Valkenburg: elapsed time t=34.1992 s, 1 iters, t-(init.)=33.7969 s t(norm)=1.61156, mflops=3.10258 (err=1.8e-13) Top mflops for N=1048576 = 31.6659 Normalized results and averages for N=1048576: fft 0: mflops = 4.98462 (norm. = 0.157413), norm. avg. (of 20) = 0.327138 fft 1: mflops = 4.95771 (norm. = 0.156563), norm. avg. (of 20) = 0.320704 fft 2: mflops = 4.11852 (norm. = 0.130062), norm. avg. (of 20) = 0.21887 fft 3: mflops = 16.4842 (norm. = 0.520566), norm. avg. (of 20) = 0.227143 fft 4: mflops = 10.278 (norm. = 0.324575), norm. avg. (of 20) = 0.150707 fft 5: mflops = 14.404 (norm. = 0.454874), norm. avg. (of 20) = 0.443257 fft 6: mflops = -1 (norm. = -0.0315797), norm. avg. (of 19) = 0.567998 fft 7: mflops = -1 (norm. = -0.0315797), norm. avg. (of 19) = 0.608716 fft 8: mflops = 4.0954 (norm. = 0.129332), norm. avg. (of 19) = 0.17767 fft 9: mflops = 9.26652 (norm. = 0.292634), norm. avg. (of 20) = 0.174803 fft 10: mflops = 31.6659 (norm. = 1), norm. avg. (of 20) = 0.771798 fft 11: mflops = 23.9698 (norm. = 0.75696), norm. avg. (of 20) = 0.709978 fft 12: mflops = 12.1926 (norm. = 0.38504), norm. avg. (of 20) = 0.568407 fft 13: mflops = -1 (norm. = -0.0315797), norm. avg. (of 17) = 0.688319 fft 14: mflops = 14.2456 (norm. = 0.449873), norm. avg. (of 20) = 0.26718 fft 15: mflops = 6.55835 (norm. = 0.207111), norm. avg. (of 20) = 0.234283 fft 16: mflops = 6.41344 (norm. = 0.202535), norm. avg. (of 20) = 0.243542 fft 17: mflops = -1 (norm. = -0.0315797), norm. avg. (of 12) = 0.361777 fft 18: mflops = -1 (norm. = -0.0315797), norm. avg. (of 18) = 0.328257 fft 19: mflops = 6.81481 (norm. = 0.21521), norm. avg. (of 19) = 0.377148 fft 20: mflops = 6.48966 (norm. = 0.204942), norm. avg. (of 19) = 0.328315 fft 21: mflops = 5.52051 (norm. = 0.174336), norm. avg. (of 20) = 0.0819112 fft 22: mflops = 6.78024 (norm. = 0.214118), norm. avg. (of 20) = 0.148038 fft 23: mflops = 6.38154 (norm. = 0.201527), norm. avg. (of 20) = 0.24751 fft 24: mflops = 22.5831 (norm. = 0.713167), norm. avg. (of 20) = 0.63985 fft 25: mflops = 5.15103 (norm. = 0.162668), norm. avg. (of 17) = 0.215903 fft 26: mflops = 31.6593 (norm. = 0.999793), norm. avg. (of 19) = 0.399467 fft 27: mflops = 12.7471 (norm. = 0.40255), norm. avg. (of 20) = 0.43008 fft 28: mflops = 9.9899 (norm. = 0.315478), norm. avg. (of 20) = 0.160922 fft 29: mflops = 3.10258 (norm. = 0.0979787), norm. avg. (of 20) = 0.0615374 ------------------------------------------------------ @@@@ bench.1d.np2.log Benchmarking for sizes: 6 (0.000686646 MB) 9 (0.000915527 MB) 12 (0.00114441 MB) 15 (0.00137329 MB) 18 (0.00180054 MB) 24 (0.0022583 MB) 36 (0.0032959 MB) 80 (0.00738525 MB) 108 (0.00994873 MB) 210 (0.0192261 MB) 504 (0.0461426 MB) 1000 (0.0916748 MB) 1960 (0.179749 MB) 4725 (0.437393 MB) 10368 (0.960205 MB) 27000 (2.48291 MB) 75600 (6.98975 MB) 165375 (15.3664 MB) 362880 (38.6829 MB) Maximum array size = 720720 Benchmarking FFTs: 0. CWP (min N) 1. CWP (best N) 2. FFTPACK (f2c) 3. FFTW 4. FFTW_ESTIMATE 5. Frigo-old 6. GSL 7. NAPACK (f2c) 8. Nielsen 9. Singleton (f2c) 10. Temperton (f2c) 11. Valkenburg Computing normalized averages (12 transforms). Benchmarking for array size = 6: 0. CWP (min N): elapsed time t=1.54267 s, 524288 iters, t-(init.)=1.48101 s t(norm)=0.182131, mflops=27.4528 1. CWP (best N) (N=15): elapsed time t=1.17385 s, 262144 iters, t-(init.)=1.1113 s t(norm)=0.273329, mflops=18.293 2. FFTPACK (f2c): elapsed time t=1.80687 s, 524288 iters, t-(init.)=1.74347 s t(norm)=0.214408, mflops=23.3201 (err=1.7e-16) FFTW_MEASURE plan: (cost = 6.080704e-07) FFTW_NOTW 6 3. FFTW: elapsed time t=1.36092 s, 2097152 iters, t-(init.)=1.10722 s t(norm)=0.0340406, mflops=146.884 (err=1.3e-16) FFTW_ESTIMATE plan: (cost = 4.116000e+02) FFTW_NOTW 6 4. FFTW_ESTIMATE: elapsed time t=1.34685 s, 2097152 iters, t-(init.)=1.10017 s t(norm)=0.0338238, mflops=147.825 (err=1.3e-16) 5. Frigo-old: elapsed time t=1.60309 s, 524288 iters, t-(init.)=1.54136 s t(norm)=0.189553, mflops=26.3779 (err=3.2e-16) 6. GSL: elapsed time t=1.10429 s, 524288 iters, t-(init.)=1.04255 s t(norm)=0.12821, mflops=38.9986 (err=1.3e-16) 7. NAPACK (f2c): elapsed time t=1.03044 s, 131072 iters, t-(init.)=1.01498 s t(norm)=0.499279, mflops=10.0144 (err=2.3e-16) 8. Nielsen: elapsed time t=1.3322 s, 131072 iters, t-(init.)=1.31671 s t(norm)=0.647703, mflops=7.71959 (err=2.7e-16) 9. Singleton (f2c): elapsed time t=1.02502 s, 262144 iters, t-(init.)=0.994128 s t(norm)=0.24451, mflops=20.449 (err=1.3e-16) 10. Temperton (f2c): elapsed time t=1.01137 s, 131072 iters, t-(init.)=0.995891 s t(norm)=0.489888, mflops=10.2064 (err=1.2e-16) 11. Valkenburg: elapsed time t=1.05143 s, 131072 iters, t-(init.)=1.03555 s t(norm)=0.509396, mflops=9.81555 (err=2.1e-16) Top mflops for N=6 = 147.825 Normalized results and averages for N=6: fft 0: mflops = 27.4528 (norm. = 0.185712), norm. avg. (of 1) = 0.185712 fft 1: mflops = 18.293 (norm. = 0.123748), norm. avg. (of 1) = 0.123748 fft 2: mflops = 23.3201 (norm. = 0.157755), norm. avg. (of 1) = 0.157755 fft 3: mflops = 146.884 (norm. = 0.993632), norm. avg. (of 1) = 0.993632 fft 4: mflops = 147.825 (norm. = 1), norm. avg. (of 1) = 1 fft 5: mflops = 26.3779 (norm. = 0.17844), norm. avg. (of 1) = 0.17844 fft 6: mflops = 38.9986 (norm. = 0.263816), norm. avg. (of 1) = 0.263816 fft 7: mflops = 10.0144 (norm. = 0.0677453), norm. avg. (of 1) = 0.0677453 fft 8: mflops = 7.71959 (norm. = 0.0522212), norm. avg. (of 1) = 0.0522212 fft 9: mflops = 20.449 (norm. = 0.138333), norm. avg. (of 1) = 0.138333 fft 10: mflops = 10.2064 (norm. = 0.069044), norm. avg. (of 1) = 0.069044 fft 11: mflops = 9.81555 (norm. = 0.0663999), norm. avg. (of 1) = 0.0663999 Benchmarking for array size = 9: 0. CWP (min N): elapsed time t=1.65408 s, 524288 iters, t-(init.)=1.57125 s t(norm)=0.105047, mflops=47.5977 1. CWP (best N) (N=15): elapsed time t=1.17402 s, 262144 iters, t-(init.)=1.11152 s t(norm)=0.148623, mflops=33.6421 2. FFTPACK (f2c): elapsed time t=1.3144 s, 262144 iters, t-(init.)=1.27207 s t(norm)=0.17009, mflops=29.3962 (err=2.8e-16) FFTW_MEASURE plan: (cost = 9.853668e-07) FFTW_NOTW 9 3. FFTW: elapsed time t=1.08594 s, 1048576 iters, t-(init.)=0.916516 s t(norm)=0.0306372, mflops=163.2 (err=1.4e-16) FFTW_ESTIMATE plan: (cost = 4.851000e+02) FFTW_NOTW 9 4. FFTW_ESTIMATE: elapsed time t=1.0789 s, 1048576 iters, t-(init.)=0.913292 s t(norm)=0.0305294, mflops=163.777 (err=1.4e-16) 5. Frigo-old: elapsed time t=1.67536 s, 262144 iters, t-(init.)=1.63388 s t(norm)=0.218469, mflops=22.8865 (err=3.1e-16) 6. GSL: elapsed time t=1.05957 s, 262144 iters, t-(init.)=1.01803 s t(norm)=0.136122, mflops=36.7318 (err=1.4e-16) 7. NAPACK (f2c): elapsed time t=1.51198 s, 131072 iters, t-(init.)=1.49112 s t(norm)=0.398761, mflops=12.5388 (err=5.8e-16) 8. Nielsen: elapsed time t=1.59626 s, 131072 iters, t-(init.)=1.57557 s t(norm)=0.421345, mflops=11.8668 (err=4.5e-16) 9. Singleton (f2c): elapsed time t=1.01852 s, 262144 iters, t-(init.)=0.977051 s t(norm)=0.130643, mflops=38.2723 (err=1.7e-16) 10. Temperton (f2c): elapsed time t=1.28734 s, 131072 iters, t-(init.)=1.26656 s t(norm)=0.338708, mflops=14.762 (err=1.7e-16) 11. Valkenburg: elapsed time t=1.89141 s, 131072 iters, t-(init.)=1.87029 s t(norm)=0.500158, mflops=9.99683 (err=2.6e-16) Top mflops for N=9 = 163.777 Normalized results and averages for N=9: fft 0: mflops = 47.5977 (norm. = 0.290626), norm. avg. (of 2) = 0.238169 fft 1: mflops = 33.6421 (norm. = 0.205415), norm. avg. (of 2) = 0.164581 fft 2: mflops = 29.3962 (norm. = 0.179489), norm. avg. (of 2) = 0.168622 fft 3: mflops = 163.2 (norm. = 0.996482), norm. avg. (of 2) = 0.995057 fft 4: mflops = 163.777 (norm. = 1), norm. avg. (of 2) = 1 fft 5: mflops = 22.8865 (norm. = 0.139742), norm. avg. (of 2) = 0.159091 fft 6: mflops = 36.7318 (norm. = 0.22428), norm. avg. (of 2) = 0.244048 fft 7: mflops = 12.5388 (norm. = 0.0765607), norm. avg. (of 2) = 0.072153 fft 8: mflops = 11.8668 (norm. = 0.0724571), norm. avg. (of 2) = 0.0623391 fft 9: mflops = 38.2723 (norm. = 0.233686), norm. avg. (of 2) = 0.186009 fft 10: mflops = 14.762 (norm. = 0.0901348), norm. avg. (of 2) = 0.0795894 fft 11: mflops = 9.99683 (norm. = 0.0610395), norm. avg. (of 2) = 0.0637197 Benchmarking for array size = 12: 0. CWP (min N): elapsed time t=1.95133 s, 524288 iters, t-(init.)=1.84736 s t(norm)=0.0819061, mflops=61.0455 1. CWP (best N) (N=15): elapsed time t=1.17405 s, 262144 iters, t-(init.)=1.11146 s t(norm)=0.0985574, mflops=50.7319 2. FFTPACK (f2c): elapsed time t=1.56331 s, 262144 iters, t-(init.)=1.51036 s t(norm)=0.133929, mflops=37.3331 (err=1.9e-16) FFTW_MEASURE plan: (cost = 1.042450e-06) FFTW_NOTW 12 3. FFTW: elapsed time t=1.15656 s, 1048576 iters, t-(init.)=0.945 s t(norm)=0.0209491, mflops=238.673 (err=1.3e-16) FFTW_ESTIMATE plan: (cost = 4.920000e+02) FFTW_NOTW 12 4. FFTW_ESTIMATE: elapsed time t=1.14956 s, 1048576 iters, t-(init.)=0.941485 s t(norm)=0.0208712, mflops=239.564 (err=1.3e-16) 5. Frigo-old: elapsed time t=1.46662 s, 262144 iters, t-(init.)=1.41464 s t(norm)=0.125441, mflops=39.8592 (err=2.3e-16) 6. GSL: elapsed time t=1.0318 s, 262144 iters, t-(init.)=0.979822 s t(norm)=0.0868843, mflops=57.5478 (err=1.5e-16) 7. NAPACK (f2c): elapsed time t=1.08881 s, 65536 iters, t-(init.)=1.07577 s t(norm)=0.381568, mflops=13.1038 (err=4.2e-16) 8. Nielsen: elapsed time t=1.85205 s, 131072 iters, t-(init.)=1.826 s t(norm)=0.323836, mflops=15.4399 (err=4.8e-16) 9. Singleton (f2c): elapsed time t=1.42526 s, 262144 iters, t-(init.)=1.37326 s t(norm)=0.121772, mflops=41.0605 (err=1.9e-16) 10. Temperton (f2c): elapsed time t=1.48826 s, 131072 iters, t-(init.)=1.4623 s t(norm)=0.259335, mflops=19.2801 (err=1.2e-16) 11. Valkenburg: elapsed time t=1.40376 s, 65536 iters, t-(init.)=1.3905 s t(norm)=0.493202, mflops=10.1378 (err=1.9e-16) Top mflops for N=12 = 239.564 Normalized results and averages for N=12: fft 0: mflops = 61.0455 (norm. = 0.254819), norm. avg. (of 3) = 0.243719 fft 1: mflops = 50.7319 (norm. = 0.211767), norm. avg. (of 3) = 0.18031 fft 2: mflops = 37.3331 (norm. = 0.155837), norm. avg. (of 3) = 0.16436 fft 3: mflops = 238.673 (norm. = 0.99628), norm. avg. (of 3) = 0.995465 fft 4: mflops = 239.564 (norm. = 1), norm. avg. (of 3) = 1 fft 5: mflops = 39.8592 (norm. = 0.166382), norm. avg. (of 3) = 0.161522 fft 6: mflops = 57.5478 (norm. = 0.240218), norm. avg. (of 3) = 0.242771 fft 7: mflops = 13.1038 (norm. = 0.0546985), norm. avg. (of 3) = 0.0663348 fft 8: mflops = 15.4399 (norm. = 0.06445), norm. avg. (of 3) = 0.0630427 fft 9: mflops = 41.0605 (norm. = 0.171396), norm. avg. (of 3) = 0.181138 fft 10: mflops = 19.2801 (norm. = 0.0804798), norm. avg. (of 3) = 0.0798862 fft 11: mflops = 10.1378 (norm. = 0.0423178), norm. avg. (of 3) = 0.0565857 Benchmarking for array size = 15: 0. CWP (min N): elapsed time t=1.17231 s, 262144 iters, t-(init.)=1.10972 s t(norm)=0.0722358, mflops=69.2177 1. CWP (best N): elapsed time t=1.17385 s, 262144 iters, t-(init.)=1.1113 s t(norm)=0.0723382, mflops=69.1198 2. FFTPACK (f2c): elapsed time t=1.00977 s, 131072 iters, t-(init.)=0.978006 s t(norm)=0.127324, mflops=39.27 (err=3.6e-16) FFTW_MEASURE plan: (cost = 1.779877e-06) FFTW_NOTW 15 3. FFTW: elapsed time t=1.94758 s, 1048576 iters, t-(init.)=1.69387 s t(norm)=0.027565, mflops=181.389 (err=1.7e-16) FFTW_ESTIMATE plan: (cost = 4.485000e+02) FFTW_NOTW 15 4. FFTW_ESTIMATE: elapsed time t=1.94052 s, 1048576 iters, t-(init.)=1.69017 s t(norm)=0.0275047, mflops=181.787 (err=1.7e-16) 5. Frigo-old: elapsed time t=1.41565 s, 131072 iters, t-(init.)=1.38433 s t(norm)=0.180222, mflops=27.7436 (err=2.7e-16) 6. GSL: elapsed time t=1.97632 s, 262144 iters, t-(init.)=1.91374 s t(norm)=0.124572, mflops=40.1375 (err=1.9e-16) 7. NAPACK (f2c): elapsed time t=1.06172 s, 32768 iters, t-(init.)=1.05384 s t(norm)=0.548783, mflops=9.11107 (err=9.4e-16) 8. Nielsen: elapsed time t=1.08052 s, 65536 iters, t-(init.)=1.06479 s t(norm)=0.277243, mflops=18.0347 (err=4.5e-15) 9. Singleton (f2c): elapsed time t=1.73825 s, 262144 iters, t-(init.)=1.6757 s t(norm)=0.109077, mflops=45.839 (err=2.0e-16) 10. Temperton (f2c): elapsed time t=1.93994 s, 131072 iters, t-(init.)=1.90864 s t(norm)=0.248481, mflops=20.1223 (err=2.5e-16) 11. Valkenburg: elapsed time t=1.05997 s, 32768 iters, t-(init.)=1.05202 s t(norm)=0.547839, mflops=9.12677 (err=2.5e-16) Top mflops for N=15 = 181.787 Normalized results and averages for N=15: fft 0: mflops = 69.2177 (norm. = 0.380763), norm. avg. (of 4) = 0.27798 fft 1: mflops = 69.1198 (norm. = 0.380224), norm. avg. (of 4) = 0.230288 fft 2: mflops = 39.27 (norm. = 0.216022), norm. avg. (of 4) = 0.177276 fft 3: mflops = 181.389 (norm. = 0.997812), norm. avg. (of 4) = 0.996052 fft 4: mflops = 181.787 (norm. = 1), norm. avg. (of 4) = 1 fft 5: mflops = 27.7436 (norm. = 0.152616), norm. avg. (of 4) = 0.159295 fft 6: mflops = 40.1375 (norm. = 0.220794), norm. avg. (of 4) = 0.237277 fft 7: mflops = 9.11107 (norm. = 0.0501195), norm. avg. (of 4) = 0.062281 fft 8: mflops = 18.0347 (norm. = 0.0992078), norm. avg. (of 4) = 0.072084 fft 9: mflops = 45.839 (norm. = 0.252158), norm. avg. (of 4) = 0.198893 fft 10: mflops = 20.1223 (norm. = 0.110692), norm. avg. (of 4) = 0.0875875 fft 11: mflops = 9.12677 (norm. = 0.0502058), norm. avg. (of 4) = 0.0549907 Benchmarking for array size = 18: 0. CWP (min N): elapsed time t=1.39853 s, 262144 iters, t-(init.)=1.32543 s t(norm)=0.0673624, mflops=74.2254 1. CWP (best N) (N=28): elapsed time t=1.50079 s, 262144 iters, t-(init.)=1.39238 s t(norm)=0.0707649, mflops=70.6565 2. FFTPACK (f2c): elapsed time t=1.78586 s, 131072 iters, t-(init.)=1.7487 s t(norm)=0.177748, mflops=28.1297 (err=2.6e-16) FFTW_MEASURE plan: (cost = 2.681824e-06) FFTW_TWIDDLE 3 FFTW_NOTW 6 3. FFTW: elapsed time t=1.4491 s, 524288 iters, t-(init.)=1.30111 s t(norm)=0.0330631, mflops=151.226 (err=1.9e-16) FFTW_ESTIMATE plan: (cost = 1.168200e+03) FFTW_TWIDDLE 2 FFTW_NOTW 9 4. FFTW_ESTIMATE: elapsed time t=1.46157 s, 524288 iters, t-(init.)=1.31532 s t(norm)=0.0334242, mflops=149.592 (err=2.3e-16) 5. Frigo-old: elapsed time t=1.9008 s, 131072 iters, t-(init.)=1.8642 s t(norm)=0.189488, mflops=26.3869 (err=3.8e-16) 6. GSL: elapsed time t=1.616 s, 262144 iters, t-(init.)=1.54263 s t(norm)=0.0784012, mflops=63.7746 (err=2.4e-16) 7. NAPACK (f2c): elapsed time t=1.65759 s, 65536 iters, t-(init.)=1.6393 s t(norm)=0.333256, mflops=15.0035 (err=6.0e-16) 8. Nielsen: elapsed time t=1.81643 s, 65536 iters, t-(init.)=1.79811 s t(norm)=0.36554, mflops=13.6784 (err=7.7e-16) 9. Singleton (f2c): elapsed time t=1.79021 s, 262144 iters, t-(init.)=1.71706 s t(norm)=0.0872657, mflops=57.2963 (err=1.7e-16) 10. Temperton (f2c): elapsed time t=1.46869 s, 65536 iters, t-(init.)=1.45036 s t(norm)=0.294845, mflops=16.958 (err=2.8e-16) 11. Valkenburg: elapsed time t=1.20353 s, 32768 iters, t-(init.)=1.19426 s t(norm)=0.485564, mflops=10.2973 (err=2.8e-16) Top mflops for N=18 = 151.226 Normalized results and averages for N=18: fft 0: mflops = 74.2254 (norm. = 0.490824), norm. avg. (of 5) = 0.320549 fft 1: mflops = 70.6565 (norm. = 0.467224), norm. avg. (of 5) = 0.277675 fft 2: mflops = 28.1297 (norm. = 0.18601), norm. avg. (of 5) = 0.179023 fft 3: mflops = 151.226 (norm. = 1), norm. avg. (of 5) = 0.996841 fft 4: mflops = 149.592 (norm. = 0.989194), norm. avg. (of 5) = 0.997839 fft 5: mflops = 26.3869 (norm. = 0.174486), norm. avg. (of 5) = 0.162333 fft 6: mflops = 63.7746 (norm. = 0.421716), norm. avg. (of 5) = 0.274165 fft 7: mflops = 15.0035 (norm. = 0.0992123), norm. avg. (of 5) = 0.0696672 fft 8: mflops = 13.6784 (norm. = 0.0904499), norm. avg. (of 5) = 0.0757572 fft 9: mflops = 57.2963 (norm. = 0.378878), norm. avg. (of 5) = 0.23489 fft 10: mflops = 16.958 (norm. = 0.112137), norm. avg. (of 5) = 0.0924974 fft 11: mflops = 10.2973 (norm. = 0.068092), norm. avg. (of 5) = 0.057611 Benchmarking for array size = 24: 0. CWP (min N): elapsed time t=1.50435 s, 262144 iters, t-(init.)=1.41005 s t(norm)=0.0488817, mflops=102.288 1. CWP (best N) (N=28): elapsed time t=1.50071 s, 262144 iters, t-(init.)=1.3923 s t(norm)=0.0482666, mflops=103.591 2. FFTPACK (f2c): elapsed time t=1.11445 s, 65536 iters, t-(init.)=1.09058 s t(norm)=0.151227, mflops=33.0629 (err=2.4e-16) FFTW_MEASURE plan: (cost = 3.004578e-06) FFTW_TWIDDLE 2 FFTW_NOTW 12 3. FFTW: elapsed time t=1.62569 s, 524288 iters, t-(init.)=1.43542 s t(norm)=0.0248808, mflops=200.959 (err=2.0e-16) FFTW_ESTIMATE plan: (cost = 1.248000e+03) FFTW_TWIDDLE 2 FFTW_NOTW 12 4. FFTW_ESTIMATE: elapsed time t=1.62244 s, 524288 iters, t-(init.)=1.43393 s t(norm)=0.0248549, mflops=201.167 (err=2.0e-16) 5. Frigo-old: elapsed time t=1.43601 s, 131072 iters, t-(init.)=1.38892 s t(norm)=0.0962989, mflops=51.9216 (err=2.7e-16) 6. GSL: elapsed time t=1.8017 s, 262144 iters, t-(init.)=1.70725 s t(norm)=0.0591848, mflops=84.4812 (err=2.2e-16) 7. NAPACK (f2c): elapsed time t=1.08125 s, 32768 iters, t-(init.)=1.0694 s t(norm)=0.29658, mflops=16.8588 (err=8.2e-16) 8. Nielsen: elapsed time t=1.57155 s, 65536 iters, t-(init.)=1.54796 s t(norm)=0.214651, mflops=23.2937 (err=1.4e-15) 9. Singleton (f2c): elapsed time t=1.39098 s, 131072 iters, t-(init.)=1.34377 s t(norm)=0.0931684, mflops=53.6663 (err=2.2e-16) 10. Temperton (f2c): elapsed time t=1.66269 s, 65536 iters, t-(init.)=1.63909 s t(norm)=0.227287, mflops=21.9986 (err=2.7e-16) 11. Valkenburg: elapsed time t=1.74284 s, 32768 iters, t-(init.)=1.73078 s t(norm)=0.480005, mflops=10.4166 (err=2.9e-16) Top mflops for N=24 = 201.167 Normalized results and averages for N=24: fft 0: mflops = 102.288 (norm. = 0.508471), norm. avg. (of 6) = 0.351869 fft 1: mflops = 103.591 (norm. = 0.51495), norm. avg. (of 6) = 0.317221 fft 2: mflops = 33.0629 (norm. = 0.164355), norm. avg. (of 6) = 0.176578 fft 3: mflops = 200.959 (norm. = 0.998961), norm. avg. (of 6) = 0.997195 fft 4: mflops = 201.167 (norm. = 1), norm. avg. (of 6) = 0.998199 fft 5: mflops = 51.9216 (norm. = 0.258102), norm. avg. (of 6) = 0.178295 fft 6: mflops = 84.4812 (norm. = 0.419955), norm. avg. (of 6) = 0.298463 fft 7: mflops = 16.8588 (norm. = 0.083805), norm. avg. (of 6) = 0.0720235 fft 8: mflops = 23.2937 (norm. = 0.115792), norm. avg. (of 6) = 0.0824297 fft 9: mflops = 53.6663 (norm. = 0.266774), norm. avg. (of 6) = 0.240204 fft 10: mflops = 21.9986 (norm. = 0.109355), norm. avg. (of 6) = 0.095307 fft 11: mflops = 10.4166 (norm. = 0.0517805), norm. avg. (of 6) = 0.0566393 Benchmarking for array size = 36: 0. CWP (min N): elapsed time t=1.06507 s, 131072 iters, t-(init.)=0.996785 s t(norm)=0.0408606, mflops=122.367 1. CWP (best N): elapsed time t=1.06797 s, 131072 iters, t-(init.)=0.99967 s t(norm)=0.0409789, mflops=122.014 2. FFTPACK (f2c): elapsed time t=1.73536 s, 65536 iters, t-(init.)=1.70088 s t(norm)=0.139446, mflops=35.8561 (err=3.7e-16) FFTW_MEASURE plan: (cost = 4.716858e-06) FFTW_TWIDDLE 3 FFTW_NOTW 12 3. FFTW: elapsed time t=1.27243 s, 262144 iters, t-(init.)=1.13501 s t(norm)=0.0232634, mflops=214.93 (err=3.5e-16) FFTW_ESTIMATE plan: (cost = 1.803600e+03) FFTW_TWIDDLE 3 FFTW_NOTW 12 4. FFTW_ESTIMATE: elapsed time t=1.27047 s, 262144 iters, t-(init.)=1.13378 s t(norm)=0.0232382, mflops=215.163 (err=3.5e-16) 5. Frigo-old: elapsed time t=1.87489 s, 65536 iters, t-(init.)=1.84061 s t(norm)=0.150902, mflops=33.134 (err=4.8e-16) 6. GSL: elapsed time t=1.33116 s, 131072 iters, t-(init.)=1.26276 s t(norm)=0.0517637, mflops=96.5927 (err=2.8e-16) 7. NAPACK (f2c): elapsed time t=1.68764 s, 32768 iters, t-(init.)=1.67053 s t(norm)=0.273916, mflops=18.2538 (err=1.0e-15) 8. Nielsen: elapsed time t=1.49345 s, 32768 iters, t-(init.)=1.4763 s t(norm)=0.242069, mflops=20.6553 (err=9.7e-16) 9. Singleton (f2c): elapsed time t=1.45762 s, 131072 iters, t-(init.)=1.38932 s t(norm)=0.0569518, mflops=87.7936 (err=2.7e-16) 10. Temperton (f2c): elapsed time t=1.23119 s, 32768 iters, t-(init.)=1.21409 s t(norm)=0.199075, mflops=25.1162 (err=3.9e-16) 11. Valkenburg: elapsed time t=1.45521 s, 16384 iters, t-(init.)=1.44667 s t(norm)=0.474421, mflops=10.5392 (err=4.0e-16) Top mflops for N=36 = 215.163 Normalized results and averages for N=36: fft 0: mflops = 122.367 (norm. = 0.568718), norm. avg. (of 7) = 0.382847 fft 1: mflops = 122.014 (norm. = 0.567077), norm. avg. (of 7) = 0.352915 fft 2: mflops = 35.8561 (norm. = 0.166646), norm. avg. (of 7) = 0.175159 fft 3: mflops = 214.93 (norm. = 0.998916), norm. avg. (of 7) = 0.997441 fft 4: mflops = 215.163 (norm. = 1), norm. avg. (of 7) = 0.998456 fft 5: mflops = 33.134 (norm. = 0.153995), norm. avg. (of 7) = 0.174823 fft 6: mflops = 96.5927 (norm. = 0.448928), norm. avg. (of 7) = 0.319958 fft 7: mflops = 18.2538 (norm. = 0.0848369), norm. avg. (of 7) = 0.073854 fft 8: mflops = 20.6553 (norm. = 0.0959982), norm. avg. (of 7) = 0.0843681 fft 9: mflops = 87.7936 (norm. = 0.408033), norm. avg. (of 7) = 0.26418 fft 10: mflops = 25.1162 (norm. = 0.116731), norm. avg. (of 7) = 0.0983675 fft 11: mflops = 10.5392 (norm. = 0.0489822), norm. avg. (of 7) = 0.0555454 Benchmarking for array size = 80: 0. CWP (min N): elapsed time t=1.15775 s, 65536 iters, t-(init.)=1.08475 s t(norm)=0.0327273, mflops=152.778 1. CWP (best N) (N=84): elapsed time t=1.02548 s, 65536 iters, t-(init.)=0.948977 s t(norm)=0.028631, mflops=174.636 2. FFTPACK (f2c): elapsed time t=1.8421 s, 32768 iters, t-(init.)=1.80554 s t(norm)=0.108948, mflops=45.8935 (err=7.7e-16) FFTW_MEASURE plan: (cost = 1.124988e-05) FFTW_TWIDDLE 5 FFTW_NOTW 16 3. FFTW: elapsed time t=1.50625 s, 131072 iters, t-(init.)=1.35996 s t(norm)=0.0205152, mflops=243.722 (err=7.3e-16) FFTW_ESTIMATE plan: (cost = 2.600000e+03) FFTW_TWIDDLE 5 FFTW_NOTW 16 4. FFTW_ESTIMATE: elapsed time t=1.50536 s, 131072 iters, t-(init.)=1.35925 s t(norm)=0.0205045, mflops=243.849 (err=7.3e-16) 5. Frigo-old: elapsed time t=1.28148 s, 32768 iters, t-(init.)=1.24501 s t(norm)=0.0751251, mflops=66.5556 (err=7.1e-16) 6. GSL: elapsed time t=1.33283 s, 32768 iters, t-(init.)=1.2964 s t(norm)=0.0782257, mflops=63.9176 (err=6.9e-16) 7. NAPACK (f2c): elapsed time t=1.64568 s, 8192 iters, t-(init.)=1.63648 s t(norm)=0.394986, mflops=12.6587 (err=1.1e-15) 8. Nielsen: elapsed time t=1.09666 s, 16384 iters, t-(init.)=1.07833 s t(norm)=0.130135, mflops=38.4218 (err=5.4e-15) 9. Singleton (f2c): elapsed time t=1.29992 s, 65536 iters, t-(init.)=1.22702 s t(norm)=0.0370195, mflops=135.064 (err=1.3e-15) 10. Temperton (f2c): elapsed time t=1.42784 s, 16384 iters, t-(init.)=1.4096 s t(norm)=0.170112, mflops=29.3923 (err=7.0e-16) 11. Valkenburg: elapsed time t=1.03447 s, 4096 iters, t-(init.)=1.02992 s t(norm)=0.49717, mflops=10.0569 (err=8.4e-16) Top mflops for N=80 = 243.849 Normalized results and averages for N=80: fft 0: mflops = 152.778 (norm. = 0.626525), norm. avg. (of 8) = 0.413307 fft 1: mflops = 174.636 (norm. = 0.716163), norm. avg. (of 8) = 0.398321 fft 2: mflops = 45.8935 (norm. = 0.188204), norm. avg. (of 8) = 0.17679 fft 3: mflops = 243.722 (norm. = 0.999477), norm. avg. (of 8) = 0.997695 fft 4: mflops = 243.849 (norm. = 1), norm. avg. (of 8) = 0.998649 fft 5: mflops = 66.5556 (norm. = 0.272937), norm. avg. (of 8) = 0.187088 fft 6: mflops = 63.9176 (norm. = 0.262119), norm. avg. (of 8) = 0.312728 fft 7: mflops = 12.6587 (norm. = 0.0519119), norm. avg. (of 8) = 0.0711112 fft 8: mflops = 38.4218 (norm. = 0.157564), norm. avg. (of 8) = 0.0935175 fft 9: mflops = 135.064 (norm. = 0.553882), norm. avg. (of 8) = 0.300392 fft 10: mflops = 29.3923 (norm. = 0.120535), norm. avg. (of 8) = 0.101138 fft 11: mflops = 10.0569 (norm. = 0.0412424), norm. avg. (of 8) = 0.0537575 Benchmarking for array size = 108: 0. CWP (min N) (N=110): elapsed time t=1.94017 s, 65536 iters, t-(init.)=1.8407 s t(norm)=0.0385001, mflops=129.87 1. CWP (best N) (N=112): elapsed time t=1.52259 s, 65536 iters, t-(init.)=1.42148 s t(norm)=0.0297317, mflops=168.171 2. FFTPACK (f2c): elapsed time t=1.40011 s, 16384 iters, t-(init.)=1.37565 s t(norm)=0.115092, mflops=43.4436 (err=4.7e-16) FFTW_MEASURE plan: (cost = 1.666431e-05) FFTW_TWIDDLE 9 FFTW_NOTW 12 3. FFTW: elapsed time t=1.11203 s, 65536 iters, t-(init.)=1.01402 s t(norm)=0.0212092, mflops=235.747 (err=3.7e-16) FFTW_ESTIMATE plan: (cost = 4.633200e+03) FFTW_TWIDDLE 9 FFTW_NOTW 12 4. FFTW_ESTIMATE: elapsed time t=1.11103 s, 65536 iters, t-(init.)=1.01341 s t(norm)=0.0211964, mflops=235.89 (err=3.7e-16) 5. Frigo-old: elapsed time t=1.98484 s, 16384 iters, t-(init.)=1.96042 s t(norm)=0.164016, mflops=30.4848 (err=5.5e-16) 6. GSL: elapsed time t=1.36784 s, 32768 iters, t-(init.)=1.31905 s t(norm)=0.0551783, mflops=90.6154 (err=4.7e-16) 7. NAPACK (f2c): elapsed time t=1.50862 s, 8192 iters, t-(init.)=1.49639 s t(norm)=0.250388, mflops=19.969 (err=2.7e-15) 8. Nielsen: elapsed time t=1.28203 s, 8192 iters, t-(init.)=1.26981 s t(norm)=0.212474, mflops=23.5322 (err=1.1e-15) 9. Singleton (f2c): elapsed time t=1.27557 s, 32768 iters, t-(init.)=1.22662 s t(norm)=0.0513117, mflops=97.4436 (err=5.1e-16) 10. Temperton (f2c): elapsed time t=1.12401 s, 8192 iters, t-(init.)=1.11175 s t(norm)=0.186027, mflops=26.8778 (err=3.8e-16) 11. Valkenburg: elapsed time t=1.40162 s, 4096 iters, t-(init.)=1.39551 s t(norm)=0.467015, mflops=10.7063 (err=5.2e-16) Top mflops for N=108 = 235.89 Normalized results and averages for N=108: fft 0: mflops = 129.87 (norm. = 0.550554), norm. avg. (of 9) = 0.428557 fft 1: mflops = 168.171 (norm. = 0.712921), norm. avg. (of 9) = 0.433276 fft 2: mflops = 43.4436 (norm. = 0.184169), norm. avg. (of 9) = 0.17761 fft 3: mflops = 235.747 (norm. = 0.999394), norm. avg. (of 9) = 0.997884 fft 4: mflops = 235.89 (norm. = 1), norm. avg. (of 9) = 0.998799 fft 5: mflops = 30.4848 (norm. = 0.129233), norm. avg. (of 9) = 0.180659 fft 6: mflops = 90.6154 (norm. = 0.384143), norm. avg. (of 9) = 0.320663 fft 7: mflops = 19.969 (norm. = 0.084654), norm. avg. (of 9) = 0.072616 fft 8: mflops = 23.5322 (norm. = 0.0997596), norm. avg. (of 9) = 0.0942111 fft 9: mflops = 97.4436 (norm. = 0.41309), norm. avg. (of 9) = 0.312914 fft 10: mflops = 26.8778 (norm. = 0.113942), norm. avg. (of 9) = 0.102561 fft 11: mflops = 10.7063 (norm. = 0.0453869), norm. avg. (of 9) = 0.0528274 Benchmarking for array size = 210: 0. CWP (min N): elapsed time t=1.52516 s, 32768 iters, t-(init.)=1.43139 s t(norm)=0.0269647, mflops=185.428 1. CWP (best N): elapsed time t=1.5254 s, 32768 iters, t-(init.)=1.43165 s t(norm)=0.0269695, mflops=185.394 2. FFTPACK (f2c): elapsed time t=1.05465 s, 4096 iters, t-(init.)=1.04284 s t(norm)=0.157162, mflops=31.8144 (err=5.7e-16) FFTW_MEASURE plan: (cost = 4.332031e-05) FFTW_TWIDDLE 3 FFTW_TWIDDLE 7 FFTW_NOTW 10 3. FFTW: elapsed time t=1.45277 s, 32768 iters, t-(init.)=1.35888 s t(norm)=0.0255988, mflops=195.322 (err=4.6e-16) FFTW_ESTIMATE plan: (cost = 9.324000e+03) FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 15 4. FFTW_ESTIMATE: elapsed time t=1.51935 s, 32768 iters, t-(init.)=1.42563 s t(norm)=0.0268562, mflops=186.177 (err=4.6e-16) 5. Frigo-old: elapsed time t=1.03239 s, 4096 iters, t-(init.)=1.0206 s t(norm)=0.153809, mflops=32.5078 (err=5.8e-16) 6. GSL: elapsed time t=1.04906 s, 8192 iters, t-(init.)=1.02559 s t(norm)=0.0772805, mflops=64.6994 (err=5.3e-16) 7. NAPACK (f2c): elapsed time t=1.61255 s, 2048 iters, t-(init.)=1.60672 s t(norm)=0.484282, mflops=10.3246 (err=1.4e-14) 8. Nielsen: elapsed time t=1.05731 s, 4096 iters, t-(init.)=1.04553 s t(norm)=0.157566, mflops=31.7327 (err=7.6e-15) 9. Singleton (f2c): elapsed time t=1.71139 s, 16384 iters, t-(init.)=1.6643 s t(norm)=0.0627046, mflops=79.739 (err=6.7e-16) 10. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 11. Valkenburg: elapsed time t=1.92386 s, 2048 iters, t-(init.)=1.91795 s t(norm)=0.578089, mflops=8.64919 (err=6.5e-16) Top mflops for N=210 = 195.322 Normalized results and averages for N=210: fft 0: mflops = 185.428 (norm. = 0.949344), norm. avg. (of 10) = 0.480636 fft 1: mflops = 185.394 (norm. = 0.949174), norm. avg. (of 10) = 0.484866 fft 2: mflops = 31.8144 (norm. = 0.162882), norm. avg. (of 10) = 0.176137 fft 3: mflops = 195.322 (norm. = 1), norm. avg. (of 10) = 0.998096 fft 4: mflops = 186.177 (norm. = 0.953179), norm. avg. (of 10) = 0.994237 fft 5: mflops = 32.5078 (norm. = 0.166432), norm. avg. (of 10) = 0.179237 fft 6: mflops = 64.6994 (norm. = 0.331245), norm. avg. (of 10) = 0.321721 fft 7: mflops = 10.3246 (norm. = 0.0528593), norm. avg. (of 10) = 0.0706403 fft 8: mflops = 31.7327 (norm. = 0.162464), norm. avg. (of 10) = 0.101036 fft 9: mflops = 79.739 (norm. = 0.408244), norm. avg. (of 10) = 0.322447 fft 10: mflops = -1 (norm. = -0.00511976), norm. avg. (of 9) = 0.102561 fft 11: mflops = 8.64919 (norm. = 0.0442818), norm. avg. (of 10) = 0.0519729 Benchmarking for array size = 504: 0. CWP (min N): elapsed time t=1.96048 s, 16384 iters, t-(init.)=1.8489 s t(norm)=0.0249413, mflops=200.471 1. CWP (best N): elapsed time t=1.96071 s, 16384 iters, t-(init.)=1.84885 s t(norm)=0.0249406, mflops=200.476 2. FFTPACK (f2c): elapsed time t=1.4999 s, 2048 iters, t-(init.)=1.48587 s t(norm)=0.160353, mflops=31.1813 (err=9.8e-16) FFTW_MEASURE plan: (cost = 1.018320e-04) FFTW_TWIDDLE 7 FFTW_TWIDDLE 6 FFTW_NOTW 12 3. FFTW: elapsed time t=1.68424 s, 16384 iters, t-(init.)=1.57253 s t(norm)=0.0212131, mflops=235.703 (err=9.2e-16) FFTW_ESTIMATE plan: (cost = 2.147040e+04) FFTW_TWIDDLE 6 FFTW_TWIDDLE 6 FFTW_NOTW 14 4. FFTW_ESTIMATE: elapsed time t=1.91881 s, 16384 iters, t-(init.)=1.8071 s t(norm)=0.0243774, mflops=205.108 (err=8.8e-16) 5. Frigo-old: elapsed time t=1.22829 s, 2048 iters, t-(init.)=1.21427 s t(norm)=0.131042, mflops=38.1557 (err=1.0e-15) 6. GSL: elapsed time t=1.09167 s, 4096 iters, t-(init.)=1.06369 s t(norm)=0.0573959, mflops=87.1142 (err=8.9e-16) 7. NAPACK (f2c): elapsed time t=1.71845 s, 1024 iters, t-(init.)=1.71128 s t(norm)=0.369356, mflops=13.5371 (err=4.2e-14) 8. Nielsen: elapsed time t=1.54587 s, 2048 iters, t-(init.)=1.53186 s t(norm)=0.165315, mflops=30.2452 (err=5.8e-15) 9. Singleton (f2c): elapsed time t=1.00326 s, 4096 iters, t-(init.)=0.975363 s t(norm)=0.0526297, mflops=95.0034 (err=1.3e-15) 10. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 11. Valkenburg: elapsed time t=1.22349 s, 512 iters, t-(init.)=1.22001 s t(norm)=0.526645, mflops=9.49406 (err=1.0e-15) Top mflops for N=504 = 235.703 Normalized results and averages for N=504: fft 0: mflops = 200.471 (norm. = 0.850523), norm. avg. (of 11) = 0.514262 fft 1: mflops = 200.476 (norm. = 0.850546), norm. avg. (of 11) = 0.51811 fft 2: mflops = 31.1813 (norm. = 0.13229), norm. avg. (of 11) = 0.172151 fft 3: mflops = 235.703 (norm. = 1), norm. avg. (of 11) = 0.998269 fft 4: mflops = 205.108 (norm. = 0.870196), norm. avg. (of 11) = 0.982961 fft 5: mflops = 38.1557 (norm. = 0.16188), norm. avg. (of 11) = 0.177659 fft 6: mflops = 87.1142 (norm. = 0.369593), norm. avg. (of 11) = 0.326073 fft 7: mflops = 13.5371 (norm. = 0.0574327), norm. avg. (of 11) = 0.0694396 fft 8: mflops = 30.2452 (norm. = 0.128319), norm. avg. (of 11) = 0.103517 fft 9: mflops = 95.0034 (norm. = 0.403063), norm. avg. (of 11) = 0.329776 fft 10: mflops = -1 (norm. = -0.00424262), norm. avg. (of 9) = 0.102561 fft 11: mflops = 9.49406 (norm. = 0.0402797), norm. avg. (of 11) = 0.0509099 Benchmarking for array size = 1000: 0. CWP (min N) (N=1001): elapsed time t=1.62558 s, 4096 iters, t-(init.)=1.56981 s t(norm)=0.0384571, mflops=130.015 1. CWP (best N) (N=1008): elapsed time t=1.12917 s, 4096 iters, t-(init.)=1.0734 s t(norm)=0.026296, mflops=190.143 2. FFTPACK (f2c): elapsed time t=1.83595 s, 1024 iters, t-(init.)=1.82195 s t(norm)=0.178536, mflops=28.0056 (err=3.1e-15) FFTW_MEASURE plan: (cost = 4.316719e-04) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_NOTW 10 3. FFTW: elapsed time t=1.7601 s, 4096 iters, t-(init.)=1.70463 s t(norm)=0.0417597, mflops=119.733 (err=3.1e-15) FFTW_ESTIMATE plan: (cost = 5.220000e+04) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_NOTW 10 4. FFTW_ESTIMATE: elapsed time t=1.76017 s, 4096 iters, t-(init.)=1.70472 s t(norm)=0.0417621, mflops=119.726 (err=3.1e-15) 5. Frigo-old: elapsed time t=1.05839 s, 512 iters, t-(init.)=1.05125 s t(norm)=0.206027, mflops=24.2687 (err=3.1e-15) 6. GSL: elapsed time t=1.48176 s, 1024 iters, t-(init.)=1.46781 s t(norm)=0.143833, mflops=34.7626 (err=3.1e-15) 7. NAPACK (f2c): elapsed time t=1.17502 s, 256 iters, t-(init.)=1.17142 s t(norm)=0.459158, mflops=10.8895 (err=1.8e-14) 8. Nielsen: elapsed time t=1.23363 s, 1024 iters, t-(init.)=1.21959 s t(norm)=0.11951, mflops=41.8375 (err=1.5e-14) 9. Singleton (f2c): elapsed time t=1.57343 s, 4096 iters, t-(init.)=1.51816 s t(norm)=0.0371917, mflops=134.439 (err=4.7e-15) 10. Temperton (f2c): elapsed time t=1.87446 s, 1024 iters, t-(init.)=1.86064 s t(norm)=0.182327, mflops=27.4233 (err=3.0e-15) 11. Valkenburg: elapsed time t=1.53089 s, 256 iters, t-(init.)=1.52735 s t(norm)=0.598671, mflops=8.35184 (err=3.0e-15) Top mflops for N=1000 = 190.143 Normalized results and averages for N=1000: fft 0: mflops = 130.015 (norm. = 0.683774), norm. avg. (of 12) = 0.528388 fft 1: mflops = 190.143 (norm. = 1), norm. avg. (of 12) = 0.558267 fft 2: mflops = 28.0056 (norm. = 0.147287), norm. avg. (of 12) = 0.170079 fft 3: mflops = 119.733 (norm. = 0.629697), norm. avg. (of 12) = 0.967554 fft 4: mflops = 119.726 (norm. = 0.629661), norm. avg. (of 12) = 0.953519 fft 5: mflops = 24.2687 (norm. = 0.127634), norm. avg. (of 12) = 0.17349 fft 6: mflops = 34.7626 (norm. = 0.182823), norm. avg. (of 12) = 0.314136 fft 7: mflops = 10.8895 (norm. = 0.05727), norm. avg. (of 12) = 0.0684255 fft 8: mflops = 41.8375 (norm. = 0.220032), norm. avg. (of 12) = 0.113226 fft 9: mflops = 134.439 (norm. = 0.70704), norm. avg. (of 12) = 0.361215 fft 10: mflops = 27.4233 (norm. = 0.144224), norm. avg. (of 10) = 0.106727 fft 11: mflops = 8.35184 (norm. = 0.0439239), norm. avg. (of 12) = 0.0503277 Benchmarking for array size = 1960: 0. CWP (min N) (N=1980): elapsed time t=1.48357 s, 2048 iters, t-(init.)=1.42572 s t(norm)=0.0324761, mflops=153.959 1. CWP (best N) (N=1980): elapsed time t=1.4833 s, 2048 iters, t-(init.)=1.42551 s t(norm)=0.0324714, mflops=153.982 2. FFTPACK (f2c): elapsed time t=1.47679 s, 128 iters, t-(init.)=1.47279 s t(norm)=0.536773, mflops=9.31492 (err=1.5e-15) FFTW_MEASURE plan: (cost = 1.083391e-03) FFTW_TWIDDLE 7 FFTW_TWIDDLE 5 FFTW_TWIDDLE 7 FFTW_NOTW 8 3. FFTW: elapsed time t=1.1838 s, 512 iters, t-(init.)=1.16891 s t(norm)=0.106505, mflops=46.9461 (err=1.5e-15) FFTW_ESTIMATE plan: (cost = 9.662800e+04) FFTW_TWIDDLE 10 FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 14 4. FFTW_ESTIMATE: elapsed time t=1.16672 s, 512 iters, t-(init.)=1.15182 s t(norm)=0.104948, mflops=47.6425 (err=1.5e-15) 5. Frigo-old: elapsed time t=1.49084 s, 256 iters, t-(init.)=1.48335 s t(norm)=0.270312, mflops=18.4971 (err=1.5e-15) 6. GSL: elapsed time t=1.71634 s, 256 iters, t-(init.)=1.70854 s t(norm)=0.311347, mflops=16.0592 (err=1.6e-15) 7. NAPACK (f2c): elapsed time t=1.13332 s, 64 iters, t-(init.)=1.13101 s t(norm)=0.824418, mflops=6.06489 (err=1.3e-13) 8. Nielsen: elapsed time t=1.26612 s, 256 iters, t-(init.)=1.25842 s t(norm)=0.229323, mflops=21.8033 (err=1.7e-14) 9. Singleton (f2c): elapsed time t=1.1856 s, 1024 iters, t-(init.)=1.15699 s t(norm)=0.0527098, mflops=94.859 (err=2.3e-15) 10. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 11. Valkenburg: elapsed time t=1.12674 s, 64 iters, t-(init.)=1.1243 s t(norm)=0.819527, mflops=6.10108 (err=1.4e-15) Top mflops for N=1960 = 153.982 Normalized results and averages for N=1960: fft 0: mflops = 153.959 (norm. = 0.999856), norm. avg. (of 13) = 0.564654 fft 1: mflops = 153.982 (norm. = 1), norm. avg. (of 13) = 0.592247 fft 2: mflops = 9.31492 (norm. = 0.0604937), norm. avg. (of 13) = 0.161649 fft 3: mflops = 46.9461 (norm. = 0.304881), norm. avg. (of 13) = 0.91658 fft 4: mflops = 47.6425 (norm. = 0.309403), norm. avg. (of 13) = 0.903972 fft 5: mflops = 18.4971 (norm. = 0.120126), norm. avg. (of 13) = 0.169385 fft 6: mflops = 16.0592 (norm. = 0.104293), norm. avg. (of 13) = 0.297994 fft 7: mflops = 6.06489 (norm. = 0.0393871), norm. avg. (of 13) = 0.0661918 fft 8: mflops = 21.8033 (norm. = 0.141597), norm. avg. (of 13) = 0.115409 fft 9: mflops = 94.859 (norm. = 0.616041), norm. avg. (of 13) = 0.380817 fft 10: mflops = -1 (norm. = -0.00649428), norm. avg. (of 10) = 0.106727 fft 11: mflops = 6.10108 (norm. = 0.0396221), norm. avg. (of 13) = 0.0495042 Benchmarking for array size = 4725: 0. CWP (min N) (N=5005): elapsed time t=1.00063 s, 256 iters, t-(init.)=0.865084 s t(norm)=0.0585922, mflops=85.3356 1. CWP (best N) (N=5040): elapsed time t=1.82459 s, 512 iters, t-(init.)=1.55191 s t(norm)=0.0525554, mflops=95.1378 2. FFTPACK (f2c): elapsed time t=1.70389 s, 64 iters, t-(init.)=1.67092 s t(norm)=0.452686, mflops=11.0452 (err=2.4e-15) FFTW_MEASURE plan: (cost = 3.066125e-03) FFTW_TWIDDLE 9 FFTW_TWIDDLE 5 FFTW_TWIDDLE 7 FFTW_NOTW 15 3. FFTW: elapsed time t=1.58532 s, 256 iters, t-(init.)=1.45638 s t(norm)=0.0986409, mflops=50.6889 (err=2.4e-15) FFTW_ESTIMATE plan: (cost = 1.946700e+05) FFTW_TWIDDLE 5 FFTW_TWIDDLE 7 FFTW_TWIDDLE 9 FFTW_NOTW 15 4. FFTW_ESTIMATE: elapsed time t=1.79775 s, 256 iters, t-(init.)=1.66895 s t(norm)=0.113038, mflops=44.2329 (err=2.3e-15) 5. Frigo-old: elapsed time t=1.47854 s, 64 iters, t-(init.)=1.44565 s t(norm)=0.391657, mflops=12.7663 (err=2.3e-15) 6. GSL: elapsed time t=1.02765 s, 64 iters, t-(init.)=0.994611 s t(norm)=0.26946, mflops=18.5556 (err=2.4e-15) 7. NAPACK (f2c): elapsed time t=1.46175 s, 32 iters, t-(init.)=1.44487 s t(norm)=0.782891, mflops=6.38658 (err=3.5e-13) 8. Nielsen: elapsed time t=1.10057 s, 64 iters, t-(init.)=1.06729 s t(norm)=0.28915, mflops=17.2921 (err=4.4e-14) 9. Singleton (f2c): elapsed time t=1.90971 s, 256 iters, t-(init.)=1.78187 s t(norm)=0.120686, mflops=41.4297 (err=3.3e-15) 10. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 11. Valkenburg: elapsed time t=1.46652 s, 32 iters, t-(init.)=1.44923 s t(norm)=0.785251, mflops=6.36739 (err=2.3e-15) Top mflops for N=4725 = 95.1378 Normalized results and averages for N=4725: fft 0: mflops = 85.3356 (norm. = 0.896969), norm. avg. (of 14) = 0.588391 fft 1: mflops = 95.1378 (norm. = 1), norm. avg. (of 14) = 0.621372 fft 2: mflops = 11.0452 (norm. = 0.116097), norm. avg. (of 14) = 0.158396 fft 3: mflops = 50.6889 (norm. = 0.532795), norm. avg. (of 14) = 0.889166 fft 4: mflops = 44.2329 (norm. = 0.464935), norm. avg. (of 14) = 0.872612 fft 5: mflops = 12.7663 (norm. = 0.134187), norm. avg. (of 14) = 0.166871 fft 6: mflops = 18.5556 (norm. = 0.195039), norm. avg. (of 14) = 0.29064 fft 7: mflops = 6.38658 (norm. = 0.0671299), norm. avg. (of 14) = 0.0662588 fft 8: mflops = 17.2921 (norm. = 0.181758), norm. avg. (of 14) = 0.120148 fft 9: mflops = 41.4297 (norm. = 0.43547), norm. avg. (of 14) = 0.384721 fft 10: mflops = -1 (norm. = -0.0105111), norm. avg. (of 10) = 0.106727 fft 11: mflops = 6.36739 (norm. = 0.0669281), norm. avg. (of 14) = 0.0507488 Benchmarking for array size = 10368: 0. CWP (min N) (N=10920): elapsed time t=1.2119 s, 128 iters, t-(init.)=1.064 s t(norm)=0.0601015, mflops=83.1926 1. CWP (best N) (N=11088): elapsed time t=1.15953 s, 128 iters, t-(init.)=1.00948 s t(norm)=0.0570221, mflops=87.6852 2. FFTPACK (f2c): elapsed time t=1.43939 s, 32 iters, t-(init.)=1.40272 s t(norm)=0.316939, mflops=15.7759 (err=4.7e-15) FFTW_MEASURE plan: (cost = 7.121125e-03) FFTW_TWIDDLE 32 FFTW_TWIDDLE 3 FFTW_TWIDDLE 9 FFTW_NOTW 12 3. FFTW: elapsed time t=1.09509 s, 64 iters, t-(init.)=1.0223 s t(norm)=0.115492, mflops=43.293 (err=4.7e-15) FFTW_ESTIMATE plan: (cost = 1.254528e+05) FFTW_TWIDDLE 9 FFTW_TWIDDLE 6 FFTW_TWIDDLE 6 FFTW_NOTW 32 4. FFTW_ESTIMATE: elapsed time t=1.96202 s, 128 iters, t-(init.)=1.81878 s t(norm)=0.102736, mflops=48.6682 (err=4.7e-15) 5. Frigo-old: elapsed time t=1.28127 s, 32 iters, t-(init.)=1.24387 s t(norm)=0.281048, mflops=17.7906 (err=4.8e-15) 6. GSL: elapsed time t=1.05576 s, 32 iters, t-(init.)=1.01801 s t(norm)=0.230015, mflops=21.7377 (err=4.7e-15) 7. NAPACK (f2c): elapsed time t=1.76313 s, 16 iters, t-(init.)=1.7429 s t(norm)=0.787604, mflops=6.34837 (err=7.8e-14) 8. Nielsen: elapsed time t=1.36805 s, 32 iters, t-(init.)=1.33054 s t(norm)=0.300629, mflops=16.6318 (err=1.1e-14) 9. Singleton (f2c): elapsed time t=1.37131 s, 64 iters, t-(init.)=1.30118 s t(norm)=0.146998, mflops=34.0141 (err=6.7e-15) 10. Temperton (f2c): elapsed time t=1.13936 s, 32 iters, t-(init.)=1.10434 s t(norm)=0.24952, mflops=20.0384 (err=4.7e-15) 11. Valkenburg: elapsed time t=1.94359 s, 16 iters, t-(init.)=1.92338 s t(norm)=0.86916, mflops=5.75268 (err=4.7e-15) Top mflops for N=10368 = 87.6852 Normalized results and averages for N=10368: fft 0: mflops = 83.1926 (norm. = 0.948764), norm. avg. (of 15) = 0.612416 fft 1: mflops = 87.6852 (norm. = 1), norm. avg. (of 15) = 0.646614 fft 2: mflops = 15.7759 (norm. = 0.179915), norm. avg. (of 15) = 0.15983 fft 3: mflops = 43.293 (norm. = 0.493732), norm. avg. (of 15) = 0.862804 fft 4: mflops = 48.6682 (norm. = 0.555033), norm. avg. (of 15) = 0.85144 fft 5: mflops = 17.7906 (norm. = 0.202891), norm. avg. (of 15) = 0.169272 fft 6: mflops = 21.7377 (norm. = 0.247906), norm. avg. (of 15) = 0.287791 fft 7: mflops = 6.34837 (norm. = 0.0723995), norm. avg. (of 15) = 0.0666682 fft 8: mflops = 16.6318 (norm. = 0.189676), norm. avg. (of 15) = 0.124783 fft 9: mflops = 34.0141 (norm. = 0.387911), norm. avg. (of 15) = 0.384933 fft 10: mflops = 20.0384 (norm. = 0.228527), norm. avg. (of 11) = 0.1178 fft 11: mflops = 5.75268 (norm. = 0.065606), norm. avg. (of 15) = 0.0517392 Benchmarking for array size = 27000: 0. CWP (min N) (N=27720): elapsed time t=1.59461 s, 64 iters, t-(init.)=1.40555 s t(norm)=0.0552555, mflops=90.4887 1. CWP (best N) (N=27720): elapsed time t=1.59493 s, 64 iters, t-(init.)=1.40653 s t(norm)=0.0552941, mflops=90.4255 2. FFTPACK (f2c): elapsed time t=1.9881 s, 16 iters, t-(init.)=1.93823 s t(norm)=0.304786, mflops=16.405 (err=7.3e-15) FFTW_MEASURE plan: (cost = 3.286900e-02) FFTW_TWIDDLE 8 FFTW_TWIDDLE 9 FFTW_TWIDDLE 5 FFTW_TWIDDLE 5 FFTW_NOTW 15 3. FFTW: elapsed time t=1.43042 s, 32 iters, t-(init.)=1.33099 s t(norm)=0.104649, mflops=47.779 (err=7.3e-15) FFTW_ESTIMATE plan: (cost = 1.231200e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_TWIDDLE 2 FFTW_TWIDDLE 9 FFTW_NOTW 15 4. FFTW_ESTIMATE: elapsed time t=1.39044 s, 32 iters, t-(init.)=1.29069 s t(norm)=0.10148, mflops=49.2707 (err=7.3e-15) 5. Frigo-old: elapsed time t=1.27426 s, 8 iters, t-(init.)=1.24462 s t(norm)=0.391431, mflops=12.7737 (err=7.3e-15) 6. GSL: elapsed time t=1.54605 s, 16 iters, t-(init.)=1.49293 s t(norm)=0.234763, mflops=21.2981 (err=7.3e-15) 7. NAPACK (f2c): elapsed time t=1.30687 s, 4 iters, t-(init.)=1.28818 s t(norm)=0.81026, mflops=6.17086 (err=1.0e-12) 8. Nielsen: elapsed time t=1.73804 s, 16 iters, t-(init.)=1.68461 s t(norm)=0.264904, mflops=18.8748 (err=2.0e-13) 9. Singleton (f2c): elapsed time t=1.93099 s, 32 iters, t-(init.)=1.8388 s t(norm)=0.144575, mflops=34.5841 (err=1.1e-14) 10. Temperton (f2c): elapsed time t=1.63073 s, 16 iters, t-(init.)=1.58433 s t(norm)=0.249135, mflops=20.0694 (err=7.3e-15) 11. Valkenburg: elapsed time t=1.52557 s, 4 iters, t-(init.)=1.50676 s t(norm)=0.947745, mflops=5.27568 (err=7.3e-15) Top mflops for N=27000 = 90.4887 Normalized results and averages for N=27000: fft 0: mflops = 90.4887 (norm. = 1), norm. avg. (of 16) = 0.63664 fft 1: mflops = 90.4255 (norm. = 0.999303), norm. avg. (of 16) = 0.668657 fft 2: mflops = 16.405 (norm. = 0.181293), norm. avg. (of 16) = 0.161172 fft 3: mflops = 47.779 (norm. = 0.528011), norm. avg. (of 16) = 0.841879 fft 4: mflops = 49.2707 (norm. = 0.544496), norm. avg. (of 16) = 0.832256 fft 5: mflops = 12.7737 (norm. = 0.141163), norm. avg. (of 16) = 0.167515 fft 6: mflops = 21.2981 (norm. = 0.235367), norm. avg. (of 16) = 0.284515 fft 7: mflops = 6.17086 (norm. = 0.0681948), norm. avg. (of 16) = 0.0667636 fft 8: mflops = 18.8748 (norm. = 0.208587), norm. avg. (of 16) = 0.130021 fft 9: mflops = 34.5841 (norm. = 0.382193), norm. avg. (of 16) = 0.384762 fft 10: mflops = 20.0694 (norm. = 0.22179), norm. avg. (of 12) = 0.126466 fft 11: mflops = 5.27568 (norm. = 0.0583021), norm. avg. (of 16) = 0.0521494 Benchmarking for array size = 75600: 0. CWP (min N) (N=80080): elapsed time t=1.97326 s, 16 iters, t-(init.)=1.69711 s t(norm)=0.0865745, mflops=57.7537 1. CWP (best N) (N=80080): elapsed time t=1.97363 s, 16 iters, t-(init.)=1.69728 s t(norm)=0.0865829, mflops=57.7481 2. FFTPACK (f2c): elapsed time t=1.20925 s, 2 iters, t-(init.)=1.16657 s t(norm)=0.476082, mflops=10.5024 (err=9.4e-15) FFTW_MEASURE plan: (cost = 1.610780e-01) FFTW_TWIDDLE 16 FFTW_TWIDDLE 9 FFTW_TWIDDLE 5 FFTW_TWIDDLE 7 FFTW_NOTW 15 3. FFTW: elapsed time t=1.27796 s, 8 iters, t-(init.)=1.14921 s t(norm)=0.117249, mflops=42.6443 (err=9.4e-15) FFTW_ESTIMATE plan: (cost = 2.971080e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 8 FFTW_NOTW 15 4. FFTW_ESTIMATE: elapsed time t=1.33175 s, 8 iters, t-(init.)=1.20298 s t(norm)=0.122735, mflops=40.7383 (err=9.4e-15) 5. Frigo-old: elapsed time t=1.11714 s, 2 iters, t-(init.)=1.07457 s t(norm)=0.438537, mflops=11.4015 (err=9.4e-15) 6. GSL: elapsed time t=1.58177 s, 4 iters, t-(init.)=1.51009 s t(norm)=0.308137, mflops=16.2266 (err=9.4e-15) 7. NAPACK (f2c): elapsed time t=1.05903 s, 1 iters, t-(init.)=1.02994 s t(norm)=0.84064, mflops=5.94785 (err=5.1e-12) 8. Nielsen: elapsed time t=1.6995 s, 4 iters, t-(init.)=1.62744 s t(norm)=0.332081, mflops=15.0566 (err=4.7e-13) 9. Singleton (f2c): elapsed time t=1.13616 s, 4 iters, t-(init.)=1.08125 s t(norm)=0.220631, mflops=22.6622 (err=1.3e-14) 10. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 11. Valkenburg: elapsed time t=1.3767 s, 1 iters, t-(init.)=1.34799 s t(norm)=1.10024, mflops=4.54447 (err=9.5e-15) Top mflops for N=75600 = 57.7537 Normalized results and averages for N=75600: fft 0: mflops = 57.7537 (norm. = 1), norm. avg. (of 17) = 0.658014 fft 1: mflops = 57.7481 (norm. = 0.999903), norm. avg. (of 17) = 0.688142 fft 2: mflops = 10.5024 (norm. = 0.181848), norm. avg. (of 17) = 0.162388 fft 3: mflops = 42.6443 (norm. = 0.738382), norm. avg. (of 17) = 0.835791 fft 4: mflops = 40.7383 (norm. = 0.70538), norm. avg. (of 17) = 0.824793 fft 5: mflops = 11.4015 (norm. = 0.197417), norm. avg. (of 17) = 0.169274 fft 6: mflops = 16.2266 (norm. = 0.280961), norm. avg. (of 17) = 0.284306 fft 7: mflops = 5.94785 (norm. = 0.102986), norm. avg. (of 17) = 0.0688943 fft 8: mflops = 15.0566 (norm. = 0.260703), norm. avg. (of 17) = 0.137708 fft 9: mflops = 22.6622 (norm. = 0.392394), norm. avg. (of 17) = 0.385211 fft 10: mflops = -1 (norm. = -0.0173149), norm. avg. (of 12) = 0.126466 fft 11: mflops = 4.54447 (norm. = 0.078687), norm. avg. (of 17) = 0.0537105 Benchmarking for array size = 165375: 0. CWP (min N) (N=180180): elapsed time t=1.01385 s, 2 iters, t-(init.)=0.880889 s t(norm)=0.153634, mflops=32.5448 1. CWP (best N) (N=180180): elapsed time t=1.01347 s, 2 iters, t-(init.)=0.880721 s t(norm)=0.153605, mflops=32.551 2. FFTPACK (f2c): elapsed time t=2.32329 s, 1 iters, t-(init.)=2.25998 s t(norm)=0.788318, mflops=6.34262 (err=3.7e-14) FFTW_MEASURE plan: (cost = 4.480230e-01) FFTW_TWIDDLE 9 FFTW_TWIDDLE 5 FFTW_TWIDDLE 7 FFTW_TWIDDLE 7 FFTW_TWIDDLE 5 FFTW_NOTW 15 3. FFTW: elapsed time t=1.83419 s, 4 iters, t-(init.)=1.58053 s t(norm)=0.137828, mflops=36.277 (err=3.7e-14) FFTW_ESTIMATE plan: (cost = 8.367975e+06) FFTW_TWIDDLE 7 FFTW_TWIDDLE 5 FFTW_TWIDDLE 5 FFTW_TWIDDLE 7 FFTW_TWIDDLE 9 FFTW_NOTW 15 4. FFTW_ESTIMATE: elapsed time t=1.90579 s, 4 iters, t-(init.)=1.65216 s t(norm)=0.144075, mflops=34.7041 (err=3.7e-14) 5. Frigo-old: elapsed time t=1.92967 s, 1 iters, t-(init.)=1.86647 s t(norm)=0.651055, mflops=7.67984 (err=3.7e-14) 6. GSL: elapsed time t=1.8497 s, 2 iters, t-(init.)=1.723 s t(norm)=0.300506, mflops=16.6386 (err=3.7e-14) 7. NAPACK (f2c): elapsed time t=2.92377 s, 1 iters, t-(init.)=2.86047 s t(norm)=0.997778, mflops=5.01113 (err=1.6e-11) 8. Nielsen: elapsed time t=1.36593 s, 1 iters, t-(init.)=1.3027 s t(norm)=0.454404, mflops=11.0034 (err=1.6e-12) 9. Singleton (f2c): elapsed time t=1.14869 s, 1 iters, t-(init.)=1.09895 s t(norm)=0.383333, mflops=13.0435 (err=5.6e-14) 10. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 11. Valkenburg: elapsed time t=3.15569 s, 1 iters, t-(init.)=3.09246 s t(norm)=1.0787, mflops=4.6352 (err=3.6e-14) Top mflops for N=165375 = 36.277 Normalized results and averages for N=165375: fft 0: mflops = 32.5448 (norm. = 0.897121), norm. avg. (of 18) = 0.671298 fft 1: mflops = 32.551 (norm. = 0.897292), norm. avg. (of 18) = 0.699761 fft 2: mflops = 6.34262 (norm. = 0.174839), norm. avg. (of 18) = 0.16308 fft 3: mflops = 36.277 (norm. = 1), norm. avg. (of 18) = 0.844914 fft 4: mflops = 34.7041 (norm. = 0.956642), norm. avg. (of 18) = 0.832118 fft 5: mflops = 7.67984 (norm. = 0.2117), norm. avg. (of 18) = 0.171631 fft 6: mflops = 16.6386 (norm. = 0.458655), norm. avg. (of 18) = 0.293992 fft 7: mflops = 5.01113 (norm. = 0.138135), norm. avg. (of 18) = 0.0727411 fft 8: mflops = 11.0034 (norm. = 0.303317), norm. avg. (of 18) = 0.146908 fft 9: mflops = 13.0435 (norm. = 0.359553), norm. avg. (of 18) = 0.383786 fft 10: mflops = -1 (norm. = -0.0275657), norm. avg. (of 12) = 0.126466 fft 11: mflops = 4.6352 (norm. = 0.127773), norm. avg. (of 18) = 0.057825 Benchmarking for array size = 362880: 0. CWP (min N) (N=720720): elapsed time t=2.27546 s, 1 iters, t-(init.)=1.99883 s t(norm)=0.29824, mflops=16.765 1. CWP (best N) (N=720720): elapsed time t=2.2758 s, 1 iters, t-(init.)=1.99918 s t(norm)=0.298293, mflops=16.7621 2. FFTPACK (f2c): elapsed time t=4.03588 s, 1 iters, t-(init.)=3.89657 s t(norm)=0.581398, mflops=8.59997 (err=7.1e-14) FFTW_MEASURE plan: (cost = 9.044400e-01) FFTW_TWIDDLE 64 FFTW_TWIDDLE 10 FFTW_TWIDDLE 7 FFTW_TWIDDLE 9 FFTW_NOTW 9 3. FFTW: elapsed time t=1.84251 s, 2 iters, t-(init.)=1.56415 s t(norm)=0.116691, mflops=42.8482 (err=7.1e-14) FFTW_ESTIMATE plan: (cost = 7.511616e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 9 FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 32 4. FFTW_ESTIMATE: elapsed time t=1.06811 s, 1 iters, t-(init.)=0.928895 s t(norm)=0.138598, mflops=36.0755 (err=7.1e-14) 5. Frigo-old: elapsed time t=3.65395 s, 1 iters, t-(init.)=3.51483 s t(norm)=0.524438, mflops=9.53401 (err=7.1e-14) 6. GSL: elapsed time t=2.02348 s, 1 iters, t-(init.)=1.88441 s t(norm)=0.281167, mflops=17.783 (err=7.1e-14) 7. NAPACK (f2c): elapsed time t=6.49021 s, 1 iters, t-(init.)=6.35111 s t(norm)=0.947632, mflops=5.27631 (err=3.4e-11) 8. Nielsen: elapsed time t=3.99117 s, 1 iters, t-(init.)=3.85169 s t(norm)=0.574702, mflops=8.70017 (err=3.5e-12) 9. Singleton (f2c): elapsed time t=3.4112 s, 1 iters, t-(init.)=3.28899 s t(norm)=0.490741, mflops=10.1887 (err=1.0e-13) 10. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 11. Valkenburg: elapsed time t=9.08378 s, 1 iters, t-(init.)=8.94461 s t(norm)=1.3346, mflops=3.74644 (err=7.1e-14) Top mflops for N=362880 = 42.8482 Normalized results and averages for N=362880: fft 0: mflops = 16.765 (norm. = 0.391266), norm. avg. (of 19) = 0.656559 fft 1: mflops = 16.7621 (norm. = 0.391197), norm. avg. (of 19) = 0.683521 fft 2: mflops = 8.59997 (norm. = 0.200708), norm. avg. (of 19) = 0.16506 fft 3: mflops = 42.8482 (norm. = 1), norm. avg. (of 19) = 0.853077 fft 4: mflops = 36.0755 (norm. = 0.841939), norm. avg. (of 19) = 0.832635 fft 5: mflops = 9.53401 (norm. = 0.222507), norm. avg. (of 19) = 0.174309 fft 6: mflops = 17.783 (norm. = 0.415024), norm. avg. (of 19) = 0.300362 fft 7: mflops = 5.27631 (norm. = 0.12314), norm. avg. (of 19) = 0.0753936 fft 8: mflops = 8.70017 (norm. = 0.203046), norm. avg. (of 19) = 0.149863 fft 9: mflops = 10.1887 (norm. = 0.237785), norm. avg. (of 19) = 0.376101 fft 10: mflops = -1 (norm. = -0.0233382), norm. avg. (of 12) = 0.126466 fft 11: mflops = 3.74644 (norm. = 0.0874351), norm. avg. (of 19) = 0.0593834 ------------------------------------------------------ @@@@ bench.3d.p2.log Benchmarking for sizes: 4x4x4 (0.00128174 MB) 8x8x8 (0.00830078 MB) 16x16x16 (0.0633545 MB) 32x32x32 (0.501587 MB) 64x64x64 (4.00305 MB) 256x64x32 (8.01184 MB) 16x1024x64 (16.047 MB) 128x128x128 (32.006 MB) 512x128x64 (64.0236 MB) Maximum array size N = 4194304 Benchmarking FFTs: 0. FFTW 1. HARM (f2c) 2. NR (C) 3. PDA (f2c) 4. Singleton (f2c) 5. Temperton (f2c) Computing normalized averages (6 transforms). Benchmarking for array size = 4x4x4 (power of 2): 0. FFTW: elapsed time t=1.20295 s, 131072 iters, t-(init.)=1.08523 s t(norm)=0.0215615, mflops=231.895 (err=1.9e-16) 1. Skipping fft (all dimensions must be > 4 for HARM). 2. NR (C): elapsed time t=1.62216 s, 65536 iters, t-(init.)=1.56323 s t(norm)=0.0621172, mflops=80.493 (err=2.3e-16) 3. PDA (f2c): elapsed time t=1.32547 s, 8192 iters, t-(init.)=1.31798 s t(norm)=0.418976, mflops=11.9339 (err=2.8e-16) 4. Singleton (f2c): elapsed time t=1.76875 s, 131072 iters, t-(init.)=1.65104 s t(norm)=0.0328033, mflops=152.424 (err=1.9e-16) 5. Temperton (f2c): elapsed time t=1.82915 s, 32768 iters, t-(init.)=1.7997 s t(norm)=0.143027, mflops=34.9584 (err=1.9e-16) Top mflops for N=64 = 231.895 Normalized results and averages for N=64: fft 0: mflops = 231.895 (norm. = 1), norm. avg. (of 1) = 1 fft 1: mflops = -1 (norm. = -0.0043123), norm. avg. (of 0) = -1 fft 2: mflops = 80.493 (norm. = 0.34711), norm. avg. (of 1) = 0.34711 fft 3: mflops = 11.9339 (norm. = 0.0514625), norm. avg. (of 1) = 0.0514625 fft 4: mflops = 152.424 (norm. = 0.657298), norm. avg. (of 1) = 0.657298 fft 5: mflops = 34.9584 (norm. = 0.150751), norm. avg. (of 1) = 0.150751 Benchmarking for array size = 8x8x8 (power of 2): 0. FFTW: elapsed time t=1.47921 s, 16384 iters, t-(init.)=1.36582 s t(norm)=0.018091, mflops=276.381 (err=3.8e-16) 1. HARM (f2c): elapsed time t=1.21052 s, 2048 iters, t-(init.)=1.19626 s t(norm)=0.12676, mflops=39.4445 (err=3.6e-16) 2. NR (C): elapsed time t=1.7804 s, 8192 iters, t-(init.)=1.72367 s t(norm)=0.0456618, mflops=109.501 (err=2.9e-16) 3. PDA (f2c): elapsed time t=1.11004 s, 1024 iters, t-(init.)=1.10284 s t(norm)=0.233722, mflops=21.3929 (err=3.1e-16) 4. Singleton (f2c): elapsed time t=1.46191 s, 8192 iters, t-(init.)=1.40513 s t(norm)=0.0372233, mflops=134.324 (err=3.1e-16) 5. Temperton (f2c): elapsed time t=1.27305 s, 2048 iters, t-(init.)=1.2588 s t(norm)=0.133387, mflops=37.4849 (err=3.7e-16) Top mflops for N=512 = 276.381 Normalized results and averages for N=512: fft 0: mflops = 276.381 (norm. = 1), norm. avg. (of 2) = 1 fft 1: mflops = 39.4445 (norm. = 0.142718), norm. avg. (of 1) = 0.142718 fft 2: mflops = 109.501 (norm. = 0.396195), norm. avg. (of 2) = 0.371653 fft 3: mflops = 21.3929 (norm. = 0.0774038), norm. avg. (of 2) = 0.0644331 fft 4: mflops = 134.324 (norm. = 0.486011), norm. avg. (of 2) = 0.571655 fft 5: mflops = 37.4849 (norm. = 0.135628), norm. avg. (of 2) = 0.14319 Benchmarking for array size = 16x16x16 (power of 2): 0. FFTW: elapsed time t=1.58156 s, 512 iters, t-(init.)=1.35976 s t(norm)=0.0540319, mflops=92.5379 (err=4.1e-16) 1. HARM (f2c): elapsed time t=1.96523 s, 256 iters, t-(init.)=1.85426 s t(norm)=0.147363, mflops=33.9298 (err=4.0e-16) 2. NR (C): elapsed time t=1.54763 s, 128 iters, t-(init.)=1.49221 s t(norm)=0.23718, mflops=21.081 (err=4.7e-16) 3. PDA (f2c): elapsed time t=1.41383 s, 128 iters, t-(init.)=1.3583 s t(norm)=0.215895, mflops=23.1594 (err=3.8e-16) 4. Singleton (f2c): elapsed time t=1.79099 s, 256 iters, t-(init.)=1.67991 s t(norm)=0.133507, mflops=37.4511 (err=4.7e-16) 5. Temperton (f2c): elapsed time t=1.96155 s, 256 iters, t-(init.)=1.85054 s t(norm)=0.147068, mflops=33.9979 (err=4.1e-16) Top mflops for N=4096 = 92.5379 Normalized results and averages for N=4096: fft 0: mflops = 92.5379 (norm. = 1), norm. avg. (of 3) = 1 fft 1: mflops = 33.9298 (norm. = 0.366658), norm. avg. (of 2) = 0.254688 fft 2: mflops = 21.081 (norm. = 0.22781), norm. avg. (of 3) = 0.323705 fft 3: mflops = 23.1594 (norm. = 0.250269), norm. avg. (of 3) = 0.126378 fft 4: mflops = 37.4511 (norm. = 0.404711), norm. avg. (of 3) = 0.516007 fft 5: mflops = 33.9979 (norm. = 0.367395), norm. avg. (of 3) = 0.217925 Benchmarking for array size = 32x32x32 (power of 2): 0. FFTW: elapsed time t=1.64759 s, 64 iters, t-(init.)=1.42373 s t(norm)=0.0452593, mflops=110.475 (err=4.8e-16) 1. HARM (f2c): elapsed time t=1.2141 s, 16 iters, t-(init.)=1.15752 s t(norm)=0.147187, mflops=33.9704 (err=4.8e-16) 2. NR (C): elapsed time t=1.93342 s, 16 iters, t-(init.)=1.87698 s t(norm)=0.23867, mflops=20.9494 (err=6.0e-16) 3. PDA (f2c): elapsed time t=1.77189 s, 16 iters, t-(init.)=1.71566 s t(norm)=0.218158, mflops=22.9192 (err=4.5e-16) 4. Singleton (f2c): elapsed time t=1.24763 s, 16 iters, t-(init.)=1.19104 s t(norm)=0.151448, mflops=33.0146 (err=4.9e-16) 5. Temperton (f2c): elapsed time t=1.40377 s, 16 iters, t-(init.)=1.34697 s t(norm)=0.171276, mflops=29.1926 (err=5.1e-16) Top mflops for N=32768 = 110.475 Normalized results and averages for N=32768: fft 0: mflops = 110.475 (norm. = 1), norm. avg. (of 4) = 1 fft 1: mflops = 33.9704 (norm. = 0.307495), norm. avg. (of 3) = 0.272291 fft 2: mflops = 20.9494 (norm. = 0.189631), norm. avg. (of 4) = 0.290187 fft 3: mflops = 22.9192 (norm. = 0.207461), norm. avg. (of 4) = 0.146649 fft 4: mflops = 33.0146 (norm. = 0.298843), norm. avg. (of 4) = 0.461716 fft 5: mflops = 29.1926 (norm. = 0.264247), norm. avg. (of 4) = 0.229505 Benchmarking for array size = 64x64x64 (power of 2): 0. FFTW: elapsed time t=1.4494 s, 2 iters, t-(init.)=1.24848 s t(norm)=0.132293, mflops=37.7948 (err=1.0e-15) 1. HARM (f2c): elapsed time t=1.27654 s, 1 iters, t-(init.)=1.1759 s t(norm)=0.249205, mflops=20.0638 (err=1.0e-15) 2. NR (C): elapsed time t=4.03301 s, 1 iters, t-(init.)=3.93233 s t(norm)=0.83337, mflops=5.99974 (err=1.0e-15) 3. PDA (f2c): elapsed time t=2.15248 s, 1 iters, t-(init.)=2.05206 s t(norm)=0.434889, mflops=11.4972 (err=1.0e-15) 4. Singleton (f2c): elapsed time t=2.15278 s, 1 iters, t-(init.)=2.05215 s t(norm)=0.434907, mflops=11.4967 (err=1.4e-15) 5. Temperton (f2c): elapsed time t=1.33861 s, 1 iters, t-(init.)=1.23801 s t(norm)=0.262367, mflops=19.0572 (err=9.9e-16) Top mflops for N=262144 = 37.7948 Normalized results and averages for N=262144: fft 0: mflops = 37.7948 (norm. = 1), norm. avg. (of 5) = 1 fft 1: mflops = 20.0638 (norm. = 0.530862), norm. avg. (of 4) = 0.336933 fft 2: mflops = 5.99974 (norm. = 0.158745), norm. avg. (of 5) = 0.263898 fft 3: mflops = 11.4972 (norm. = 0.304201), norm. avg. (of 5) = 0.178159 fft 4: mflops = 11.4967 (norm. = 0.304188), norm. avg. (of 5) = 0.430211 fft 5: mflops = 19.0572 (norm. = 0.50423), norm. avg. (of 5) = 0.28445 Benchmarking for array size = 256x64x32 (power of 2): 0. FFTW: elapsed time t=1.48964 s, 1 iters, t-(init.)=1.28861 s t(norm)=0.129359, mflops=38.6521 (err=9.2e-16) 1. HARM (f2c): elapsed time t=2.79 s, 1 iters, t-(init.)=2.58874 s t(norm)=0.259876, mflops=19.24 (err=9.4e-16) 2. NR (C): elapsed time t=8.47126 s, 1 iters, t-(init.)=8.26998 s t(norm)=0.830197, mflops=6.02267 (err=9.6e-16) 3. PDA (f2c): elapsed time t=3.79129 s, 1 iters, t-(init.)=3.59024 s t(norm)=0.360412, mflops=13.873 (err=8.8e-16) 4. Singleton (f2c): elapsed time t=4.74386 s, 1 iters, t-(init.)=4.54262 s t(norm)=0.456019, mflops=10.9645 (err=1.3e-15) 5. Temperton (f2c): elapsed time t=3.27994 s, 1 iters, t-(init.)=3.07857 s t(norm)=0.309047, mflops=16.1788 (err=9.2e-16) Top mflops for N=524288 = 38.6521 Normalized results and averages for N=524288: fft 0: mflops = 38.6521 (norm. = 1), norm. avg. (of 6) = 1 fft 1: mflops = 19.24 (norm. = 0.497774), norm. avg. (of 5) = 0.369101 fft 2: mflops = 6.02267 (norm. = 0.155817), norm. avg. (of 6) = 0.245885 fft 3: mflops = 13.873 (norm. = 0.35892), norm. avg. (of 6) = 0.208286 fft 4: mflops = 10.9645 (norm. = 0.283671), norm. avg. (of 6) = 0.405787 fft 5: mflops = 16.1788 (norm. = 0.418574), norm. avg. (of 6) = 0.306804 Benchmarking for array size = 16x1024x64 (power of 2): 0. FFTW: elapsed time t=3.18217 s, 1 iters, t-(init.)=2.78006 s t(norm)=0.132563, mflops=37.7178 (err=1.2e-15) 1. HARM (f2c): elapsed time t=5.71683 s, 1 iters, t-(init.)=5.31435 s t(norm)=0.253408, mflops=19.731 (err=1.2e-15) 2. NR (C): elapsed time t=17.9175 s, 1 iters, t-(init.)=17.5153 s t(norm)=0.835195, mflops=5.98662 (err=1.3e-15) 3. PDA (f2c): elapsed time t=9.27486 s, 1 iters, t-(init.)=8.87275 s t(norm)=0.423085, mflops=11.8179 (err=1.2e-15) 4. Singleton (f2c): elapsed time t=9.6234 s, 1 iters, t-(init.)=9.2211 s t(norm)=0.439697, mflops=11.3715 (err=1.7e-15) 5. Skipping fft (Temperton can't handle dimensions > 256). Top mflops for N=1048576 = 37.7178 Normalized results and averages for N=1048576: fft 0: mflops = 37.7178 (norm. = 1), norm. avg. (of 7) = 1 fft 1: mflops = 19.731 (norm. = 0.523123), norm. avg. (of 6) = 0.394772 fft 2: mflops = 5.98662 (norm. = 0.158722), norm. avg. (of 7) = 0.233433 fft 3: mflops = 11.8179 (norm. = 0.313326), norm. avg. (of 7) = 0.223292 fft 4: mflops = 11.3715 (norm. = 0.301489), norm. avg. (of 7) = 0.390887 fft 5: mflops = -1 (norm. = -0.0265127), norm. avg. (of 6) = 0.306804 Benchmarking for array size = 128x128x128 (power of 2): 0. FFTW: elapsed time t=6.20177 s, 1 iters, t-(init.)=5.39717 s t(norm)=0.122551, mflops=40.7994 (err=8.1e-16) 1. HARM (f2c): elapsed time t=12.7883 s, 1 iters, t-(init.)=11.9837 s t(norm)=0.272108, mflops=18.375 (err=8.0e-16) 2. NR (C): elapsed time t=37.5269 s, 1 iters, t-(init.)=36.7225 s t(norm)=0.833842, mflops=5.99634 (err=8.7e-16) 3. PDA (f2c): elapsed time t=17.4164 s, 1 iters, t-(init.)=16.6116 s t(norm)=0.377191, mflops=13.2559 (err=7.8e-16) 4. Singleton (f2c): elapsed time t=27.8769 s, 1 iters, t-(init.)=27.0724 s t(norm)=0.614721, mflops=8.13377 (err=1.1e-15) 5. Temperton (f2c): elapsed time t=22.1898 s, 1 iters, t-(init.)=21.3852 s t(norm)=0.485584, mflops=10.2969 (err=8.1e-16) Top mflops for N=2097152 = 40.7994 Normalized results and averages for N=2097152: fft 0: mflops = 40.7994 (norm. = 1), norm. avg. (of 8) = 1 fft 1: mflops = 18.375 (norm. = 0.450375), norm. avg. (of 7) = 0.402715 fft 2: mflops = 5.99634 (norm. = 0.146971), norm. avg. (of 8) = 0.222625 fft 3: mflops = 13.2559 (norm. = 0.324904), norm. avg. (of 8) = 0.235993 fft 4: mflops = 8.13377 (norm. = 0.19936), norm. avg. (of 8) = 0.366947 fft 5: mflops = 10.2969 (norm. = 0.252378), norm. avg. (of 7) = 0.299029 Benchmarking for array size = 512x128x64 (power of 2): 0. FFTW: elapsed time t=12.2494 s, 1 iters, t-(init.)=10.6398 s t(norm)=0.115306, mflops=43.3628 (err=1.0e-15) 1. HARM (f2c): elapsed time t=26.1704 s, 1 iters, t-(init.)=24.5614 s t(norm)=0.266177, mflops=18.7845 (err=1.0e-15) 2. NR (C): elapsed time t=78.088 s, 1 iters, t-(init.)=76.4792 s t(norm)=0.828821, mflops=6.03266 (err=1.0e-15) 3. PDA (f2c): elapsed time t=34.9063 s, 1 iters, t-(init.)=33.2971 s t(norm)=0.360848, mflops=13.8563 (err=1.0e-15) 4. Singleton (f2c): elapsed time t=48.4575 s, 1 iters, t-(init.)=46.8479 s t(norm)=0.507701, mflops=9.84832 (err=1.4e-15) 5. Skipping fft (Temperton can't handle dimensions > 256). Top mflops for N=4194304 = 43.3628 Normalized results and averages for N=4194304: fft 0: mflops = 43.3628 (norm. = 1), norm. avg. (of 9) = 1 fft 1: mflops = 18.7845 (norm. = 0.433193), norm. avg. (of 8) = 0.406525 fft 2: mflops = 6.03266 (norm. = 0.139121), norm. avg. (of 9) = 0.213347 fft 3: mflops = 13.8563 (norm. = 0.319542), norm. avg. (of 9) = 0.245276 fft 4: mflops = 9.84832 (norm. = 0.227114), norm. avg. (of 9) = 0.35141 fft 5: mflops = -1 (norm. = -0.0230612), norm. avg. (of 7) = 0.299029 ------------------------------------------------------ @@@@ bench.3d.np2.log Benchmarking for sizes: 5x5x5 (0.0022583 MB) 6x6x6 (0.00369263 MB) 7x7x7 (0.00567627 MB) 9x9x9 (0.0116577 MB) 10x10x10 (0.0158386 MB) 11x11x11 (0.0209351 MB) 12x12x12 (0.0270386 MB) 13x13x13 (0.0342407 MB) 14x14x14 (0.0426331 MB) 15x15x15 (0.0523071 MB) 24x25x28 (0.257751 MB) 48x48x48 (1.68982 MB) 49x49x49 (1.79755 MB) 60x60x60 (3.29877 MB) 72x60x56 (3.69482 MB) 75x75x75 (6.44086 MB) 80x80x80 (7.81628 MB) 84x84x84 (9.04791 MB) 96x96x96 (13.5045 MB) 105x105x105 (17.6689 MB) 112x112x112 (21.4427 MB) 120x120x120 (26.3728 MB) 144x144x144 (45.5692 MB) 180x180x180 (88.9976 MB) Maximum array size N = 5832000 Benchmarking FFTs: 0. FFTW 1. PDA (f2c) 2. Singleton (f2c) 3. Temperton (f2c) Computing normalized averages (4 transforms). Benchmarking for array size = 5x5x5: 0. FFTW: elapsed time t=1.84977 s, 65536 iters, t-(init.)=1.73724 s t(norm)=0.0304439, mflops=164.237 (err=2.4e-16) 1. PDA (f2c): elapsed time t=1.24003 s, 4096 iters, t-(init.)=1.23292 s t(norm)=0.345697, mflops=14.4635 (err=2.1e-16) 2. Singleton (f2c): elapsed time t=1.46861 s, 65536 iters, t-(init.)=1.35604 s t(norm)=0.0237636, mflops=210.406 (err=3.1e-16) 3. Temperton (f2c): elapsed time t=1.05609 s, 8192 iters, t-(init.)=1.04203 s t(norm)=0.146087, mflops=34.2262 (err=2.4e-16) Top mflops for N=125 = 210.406 Normalized results and averages for N=125: fft 0: mflops = 164.237 (norm. = 0.780572), norm. avg. (of 1) = 0.780572 fft 1: mflops = 14.4635 (norm. = 0.0687413), norm. avg. (of 1) = 0.0687413 fft 2: mflops = 210.406 (norm. = 1), norm. avg. (of 1) = 1 fft 3: mflops = 34.2262 (norm. = 0.162668), norm. avg. (of 1) = 0.162668 Benchmarking for array size = 6x6x6: 0. FFTW: elapsed time t=1.34075 s, 32768 iters, t-(init.)=1.24432 s t(norm)=0.0226701, mflops=220.555 (err=3.0e-16) 1. PDA (f2c): elapsed time t=1.07173 s, 2048 iters, t-(init.)=1.06574 s t(norm)=0.310666, mflops=16.0945 (err=3.7e-16) 2. Singleton (f2c): elapsed time t=1.25263 s, 16384 iters, t-(init.)=1.20441 s t(norm)=0.0438858, mflops=113.932 (err=3.1e-16) 3. Temperton (f2c): elapsed time t=1.13034 s, 4096 iters, t-(init.)=1.11823 s t(norm)=0.162983, mflops=30.678 (err=3.2e-16) Top mflops for N=216 = 220.555 Normalized results and averages for N=216: fft 0: mflops = 220.555 (norm. = 1), norm. avg. (of 2) = 0.890286 fft 1: mflops = 16.0945 (norm. = 0.0729726), norm. avg. (of 2) = 0.0708569 fft 2: mflops = 113.932 (norm. = 0.51657), norm. avg. (of 2) = 0.758285 fft 3: mflops = 30.678 (norm. = 0.139095), norm. avg. (of 2) = 0.150881 Benchmarking for array size = 7x7x7: 0. FFTW: elapsed time t=1.58631 s, 16384 iters, t-(init.)=1.51017 s t(norm)=0.0319075, mflops=156.703 (err=4.0e-16) 1. PDA (f2c): elapsed time t=1.69584 s, 1024 iters, t-(init.)=1.69088 s t(norm)=0.571609, mflops=8.74723 (err=4.0e-16) 2. Singleton (f2c): elapsed time t=1.21663 s, 8192 iters, t-(init.)=1.17852 s t(norm)=0.0498004, mflops=100.401 (err=4.9e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=343 = 156.703 Normalized results and averages for N=343: fft 0: mflops = 156.703 (norm. = 1), norm. avg. (of 3) = 0.926857 fft 1: mflops = 8.74723 (norm. = 0.0558205), norm. avg. (of 3) = 0.0658448 fft 2: mflops = 100.401 (norm. = 0.640708), norm. avg. (of 3) = 0.719093 fft 3: mflops = -1 (norm. = -0.00638151), norm. avg. (of 2) = 0.150881 Benchmarking for array size = 9x9x9: 0. FFTW: elapsed time t=1.46231 s, 8192 iters, t-(init.)=1.38134 s t(norm)=0.0243228, mflops=205.568 (err=5.4e-16) 1. PDA (f2c): elapsed time t=1.71207 s, 1024 iters, t-(init.)=1.7019 s t(norm)=0.239738, mflops=20.8561 (err=5.2e-16) 2. Singleton (f2c): elapsed time t=1.01134 s, 4096 iters, t-(init.)=0.970995 s t(norm)=0.0341947, mflops=146.221 (err=4.9e-16) 3. Temperton (f2c): elapsed time t=1.90767 s, 2048 iters, t-(init.)=1.88748 s t(norm)=0.132939, mflops=37.6111 (err=5.8e-16) Top mflops for N=729 = 205.568 Normalized results and averages for N=729: fft 0: mflops = 205.568 (norm. = 1), norm. avg. (of 4) = 0.945143 fft 1: mflops = 20.8561 (norm. = 0.101456), norm. avg. (of 4) = 0.0747476 fft 2: mflops = 146.221 (norm. = 0.711303), norm. avg. (of 4) = 0.717145 fft 3: mflops = 37.6111 (norm. = 0.182962), norm. avg. (of 3) = 0.161575 Benchmarking for array size = 10x10x10: 0. FFTW: elapsed time t=1.91423 s, 8192 iters, t-(init.)=1.80375 s t(norm)=0.022094, mflops=226.306 (err=3.8e-16) 1. PDA (f2c): elapsed time t=1.08545 s, 512 iters, t-(init.)=1.07832 s t(norm)=0.211332, mflops=23.6594 (err=4.2e-16) 2. Singleton (f2c): elapsed time t=1.47443 s, 4096 iters, t-(init.)=1.41896 s t(norm)=0.0347616, mflops=143.837 (err=4.4e-16) 3. Temperton (f2c): elapsed time t=1.48558 s, 1024 iters, t-(init.)=1.47171 s t(norm)=0.144215, mflops=34.6705 (err=3.6e-16) Top mflops for N=1000 = 226.306 Normalized results and averages for N=1000: fft 0: mflops = 226.306 (norm. = 1), norm. avg. (of 5) = 0.956114 fft 1: mflops = 23.6594 (norm. = 0.104546), norm. avg. (of 5) = 0.0807073 fft 2: mflops = 143.837 (norm. = 0.635586), norm. avg. (of 5) = 0.700834 fft 3: mflops = 34.6705 (norm. = 0.153202), norm. avg. (of 4) = 0.159481 Benchmarking for array size = 11x11x11: 0. FFTW: elapsed time t=1.19977 s, 2048 iters, t-(init.)=1.16284 s t(norm)=0.0411043, mflops=121.642 (err=4.0e-16) 1. PDA (f2c): elapsed time t=1.0167 s, 128 iters, t-(init.)=1.0144 s t(norm)=0.573713, mflops=8.71517 (err=4.8e-16) 2. Singleton (f2c): elapsed time t=1.53143 s, 2048 iters, t-(init.)=1.49451 s t(norm)=0.0528279, mflops=94.6469 (err=6.4e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1331 = 121.642 Normalized results and averages for N=1331: fft 0: mflops = 121.642 (norm. = 1), norm. avg. (of 6) = 0.963429 fft 1: mflops = 8.71517 (norm. = 0.0716461), norm. avg. (of 6) = 0.0791971 fft 2: mflops = 94.6469 (norm. = 0.778079), norm. avg. (of 6) = 0.713708 fft 3: mflops = -1 (norm. = -0.00822086), norm. avg. (of 4) = 0.159481 Benchmarking for array size = 12x12x12: 0. FFTW: elapsed time t=1.42322 s, 4096 iters, t-(init.)=1.32628 s t(norm)=0.0174231, mflops=286.976 (err=3.8e-16) 1. PDA (f2c): elapsed time t=1.88947 s, 512 iters, t-(init.)=1.87733 s t(norm)=0.197297, mflops=25.3425 (err=3.8e-16) 2. Singleton (f2c): elapsed time t=1.39612 s, 2048 iters, t-(init.)=1.34793 s t(norm)=0.035415, mflops=141.183 (err=4.0e-16) 3. Temperton (f2c): elapsed time t=1.03984 s, 512 iters, t-(init.)=1.02768 s t(norm)=0.108004, mflops=46.2948 (err=3.8e-16) Top mflops for N=1728 = 286.976 Normalized results and averages for N=1728: fft 0: mflops = 286.976 (norm. = 1), norm. avg. (of 7) = 0.968653 fft 1: mflops = 25.3425 (norm. = 0.0883087), norm. avg. (of 7) = 0.0804988 fft 2: mflops = 141.183 (norm. = 0.491968), norm. avg. (of 7) = 0.682031 fft 3: mflops = 46.2948 (norm. = 0.161319), norm. avg. (of 5) = 0.159849 Benchmarking for array size = 13x13x13: 0. FFTW: elapsed time t=1.73507 s, 1024 iters, t-(init.)=1.63205 s t(norm)=0.0653474, mflops=76.5142 (err=4.1e-16) 1. PDA (f2c): elapsed time t=1.91527 s, 128 iters, t-(init.)=1.90182 s t(norm)=0.609192, mflops=8.20759 (err=7.2e-16) 2. Singleton (f2c): elapsed time t=1.71903 s, 1024 iters, t-(init.)=1.61651 s t(norm)=0.0647253, mflops=77.2495 (err=4.3e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=2197 = 77.2495 Normalized results and averages for N=2197: fft 0: mflops = 76.5142 (norm. = 0.990481), norm. avg. (of 8) = 0.971382 fft 1: mflops = 8.20759 (norm. = 0.106248), norm. avg. (of 8) = 0.0837174 fft 2: mflops = 77.2495 (norm. = 1), norm. avg. (of 8) = 0.721777 fft 3: mflops = -1 (norm. = -0.0129451), norm. avg. (of 5) = 0.159849 Benchmarking for array size = 14x14x14: 0. FFTW: elapsed time t=1.70244 s, 1024 iters, t-(init.)=1.4061 s t(norm)=0.0438114, mflops=114.126 (err=3.9e-16) 1. PDA (f2c): elapsed time t=1.49238 s, 128 iters, t-(init.)=1.45542 s t(norm)=0.362785, mflops=13.7823 (err=3.8e-16) 2. Singleton (f2c): elapsed time t=1.48029 s, 512 iters, t-(init.)=1.33199 s t(norm)=0.0830044, mflops=60.2377 (err=4.6e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=2744 = 114.126 Normalized results and averages for N=2744: fft 0: mflops = 114.126 (norm. = 1), norm. avg. (of 9) = 0.974561 fft 1: mflops = 13.7823 (norm. = 0.120764), norm. avg. (of 9) = 0.0878337 fft 2: mflops = 60.2377 (norm. = 0.52782), norm. avg. (of 9) = 0.700226 fft 3: mflops = -1 (norm. = -0.00876228), norm. avg. (of 5) = 0.159849 Benchmarking for array size = 15x15x15: 0. FFTW: elapsed time t=1.0214 s, 512 iters, t-(init.)=0.838942 s t(norm)=0.0414224, mflops=120.708 (err=4.6e-16) 1. PDA (f2c): elapsed time t=1.02226 s, 128 iters, t-(init.)=0.976663 s t(norm)=0.192889, mflops=25.9216 (err=4.5e-16) 2. Singleton (f2c): elapsed time t=1.84906 s, 512 iters, t-(init.)=1.66668 s t(norm)=0.0822919, mflops=60.7593 (err=4.8e-16) 3. Temperton (f2c): elapsed time t=1.51242 s, 256 iters, t-(init.)=1.42106 s t(norm)=0.140328, mflops=35.6307 (err=4.6e-16) Top mflops for N=3375 = 120.708 Normalized results and averages for N=3375: fft 0: mflops = 120.708 (norm. = 1), norm. avg. (of 10) = 0.977105 fft 1: mflops = 25.9216 (norm. = 0.214747), norm. avg. (of 10) = 0.100525 fft 2: mflops = 60.7593 (norm. = 0.50336), norm. avg. (of 10) = 0.680539 fft 3: mflops = 35.6307 (norm. = 0.295182), norm. avg. (of 6) = 0.182405 Benchmarking for array size = 24x25x28: 0. FFTW: elapsed time t=1.71412 s, 128 iters, t-(init.)=1.48664 s t(norm)=0.0492537, mflops=101.515 (err=4.7e-16) 1. PDA (f2c): elapsed time t=1.6682 s, 32 iters, t-(init.)=1.61109 s t(norm)=0.213507, mflops=23.4185 (err=4.4e-16) 2. Singleton (f2c): elapsed time t=1.12974 s, 32 iters, t-(init.)=1.07233 s t(norm)=0.142109, mflops=35.1844 (err=5.6e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=16800 = 101.515 Normalized results and averages for N=16800: fft 0: mflops = 101.515 (norm. = 1), norm. avg. (of 11) = 0.979187 fft 1: mflops = 23.4185 (norm. = 0.23069), norm. avg. (of 11) = 0.112358 fft 2: mflops = 35.1844 (norm. = 0.346593), norm. avg. (of 11) = 0.650181 fft 3: mflops = -1 (norm. = -0.00985075), norm. avg. (of 6) = 0.182405 Benchmarking for array size = 48x48x48: 0. FFTW: elapsed time t=1.51373 s, 8 iters, t-(init.)=1.22101 s t(norm)=0.0823689, mflops=60.7025 (err=7.1e-16) 1. PDA (f2c): elapsed time t=1.79472 s, 4 iters, t-(init.)=1.65603 s t(norm)=0.223431, mflops=22.3783 (err=7.1e-16) 2. Singleton (f2c): elapsed time t=1.21631 s, 2 iters, t-(init.)=1.14549 s t(norm)=0.309097, mflops=16.1762 (err=8.2e-16) 3. Temperton (f2c): elapsed time t=1.59726 s, 4 iters, t-(init.)=1.45354 s t(norm)=0.19611, mflops=25.4958 (err=7.6e-16) Top mflops for N=110592 = 60.7025 Normalized results and averages for N=110592: fft 0: mflops = 60.7025 (norm. = 1), norm. avg. (of 12) = 0.980921 fft 1: mflops = 22.3783 (norm. = 0.368655), norm. avg. (of 12) = 0.133716 fft 2: mflops = 16.1762 (norm. = 0.266483), norm. avg. (of 12) = 0.618206 fft 3: mflops = 25.4958 (norm. = 0.420013), norm. avg. (of 7) = 0.216349 Benchmarking for array size = 49x49x49: 0. FFTW: elapsed time t=1.54675 s, 8 iters, t-(init.)=1.21627 s t(norm)=0.0767191, mflops=65.1728 (err=8.7e-16) 1. PDA (f2c): elapsed time t=1.54927 s, 2 iters, t-(init.)=1.47372 s t(norm)=0.371834, mflops=13.4469 (err=8.8e-16) 2. Singleton (f2c): elapsed time t=1.09756 s, 2 iters, t-(init.)=1.01627 s t(norm)=0.256416, mflops=19.4996 (err=1.1e-15) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=117649 = 65.1728 Normalized results and averages for N=117649: fft 0: mflops = 65.1728 (norm. = 1), norm. avg. (of 13) = 0.982389 fft 1: mflops = 13.4469 (norm. = 0.206326), norm. avg. (of 13) = 0.139302 fft 2: mflops = 19.4996 (norm. = 0.299198), norm. avg. (of 13) = 0.593667 fft 3: mflops = -1 (norm. = -0.0153438), norm. avg. (of 7) = 0.216349 Benchmarking for array size = 60x60x60: 0. FFTW: elapsed time t=1.57316 s, 4 iters, t-(init.)=1.24529 s t(norm)=0.0813346, mflops=61.4745 (err=4.9e-16) 1. PDA (f2c): elapsed time t=1.75839 s, 2 iters, t-(init.)=1.59852 s t(norm)=0.208811, mflops=23.9451 (err=5.0e-16) 2. Singleton (f2c): elapsed time t=1.95123 s, 1 iters, t-(init.)=1.8774 s t(norm)=0.490482, mflops=10.1941 (err=6.0e-16) 3. Temperton (f2c): elapsed time t=1.88478 s, 2 iters, t-(init.)=1.72231 s t(norm)=0.224982, mflops=22.224 (err=4.7e-16) Top mflops for N=216000 = 61.4745 Normalized results and averages for N=216000: fft 0: mflops = 61.4745 (norm. = 1), norm. avg. (of 14) = 0.983647 fft 1: mflops = 23.9451 (norm. = 0.389512), norm. avg. (of 14) = 0.157174 fft 2: mflops = 10.1941 (norm. = 0.165826), norm. avg. (of 14) = 0.563107 fft 3: mflops = 22.224 (norm. = 0.361516), norm. avg. (of 8) = 0.234494 Benchmarking for array size = 72x60x56: 0. FFTW: elapsed time t=1.80307 s, 4 iters, t-(init.)=1.43185 s t(norm)=0.0827365, mflops=60.4328 (err=5.7e-16) 1. PDA (f2c): elapsed time t=1.19281 s, 1 iters, t-(init.)=1.10022 s t(norm)=0.254295, mflops=19.6622 (err=6.1e-16) 2. Singleton (f2c): elapsed time t=2.22295 s, 1 iters, t-(init.)=2.13049 s t(norm)=0.492425, mflops=10.1538 (err=7.0e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=241920 = 60.4328 Normalized results and averages for N=241920: fft 0: mflops = 60.4328 (norm. = 1), norm. avg. (of 15) = 0.984737 fft 1: mflops = 19.6622 (norm. = 0.325356), norm. avg. (of 15) = 0.168386 fft 2: mflops = 10.1538 (norm. = 0.168019), norm. avg. (of 15) = 0.536767 fft 3: mflops = -1 (norm. = -0.0165473), norm. avg. (of 8) = 0.234494 Benchmarking for array size = 75x75x75: 0. FFTW: elapsed time t=1.57882 s, 2 iters, t-(init.)=1.25947 s t(norm)=0.0798819, mflops=62.5924 (err=9.0e-16) 1. PDA (f2c): elapsed time t=1.77452 s, 1 iters, t-(init.)=1.61611 s t(norm)=0.205003, mflops=24.3899 (err=9.5e-16) 2. Singleton (f2c): elapsed time t=3.41143 s, 1 iters, t-(init.)=3.2494 s t(norm)=0.412186, mflops=12.1305 (err=1.3e-15) 3. Temperton (f2c): elapsed time t=1.79345 s, 1 iters, t-(init.)=1.63568 s t(norm)=0.207485, mflops=24.0981 (err=1.1e-15) Top mflops for N=421875 = 62.5924 Normalized results and averages for N=421875: fft 0: mflops = 62.5924 (norm. = 1), norm. avg. (of 16) = 0.985691 fft 1: mflops = 24.3899 (norm. = 0.389663), norm. avg. (of 16) = 0.182216 fft 2: mflops = 12.1305 (norm. = 0.193801), norm. avg. (of 16) = 0.515332 fft 3: mflops = 24.0981 (norm. = 0.385001), norm. avg. (of 9) = 0.251217 Benchmarking for array size = 80x80x80: 0. FFTW: elapsed time t=1.00576 s, 1 iters, t-(init.)=0.812727 s t(norm)=0.0836958, mflops=59.7401 (err=1.5e-15) 1. PDA (f2c): elapsed time t=2.26881 s, 1 iters, t-(init.)=2.07841 s t(norm)=0.214038, mflops=23.3603 (err=1.5e-15) 2. Singleton (f2c): elapsed time t=4.19287 s, 1 iters, t-(init.)=3.99649 s t(norm)=0.411565, mflops=12.1487 (err=2.3e-15) 3. Temperton (f2c): elapsed time t=2.44756 s, 1 iters, t-(init.)=2.2545 s t(norm)=0.232172, mflops=21.5358 (err=1.5e-15) Top mflops for N=512000 = 59.7401 Normalized results and averages for N=512000: fft 0: mflops = 59.7401 (norm. = 1), norm. avg. (of 17) = 0.986533 fft 1: mflops = 23.3603 (norm. = 0.391032), norm. avg. (of 17) = 0.194499 fft 2: mflops = 12.1487 (norm. = 0.20336), norm. avg. (of 17) = 0.496981 fft 3: mflops = 21.5358 (norm. = 0.360491), norm. avg. (of 10) = 0.262145 Benchmarking for array size = 84x84x84: 0. FFTW: elapsed time t=1.1111 s, 1 iters, t-(init.)=0.883707 s t(norm)=0.0777483, mflops=64.3101 (err=7.6e-16) 1. PDA (f2c): elapsed time t=3.307 s, 1 iters, t-(init.)=3.08012 s t(norm)=0.270988, mflops=18.451 (err=6.9e-16) 2. Singleton (f2c): elapsed time t=6.06473 s, 1 iters, t-(init.)=5.84643 s t(norm)=0.514368, mflops=9.72068 (err=8.6e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=592704 = 64.3101 Normalized results and averages for N=592704: fft 0: mflops = 64.3101 (norm. = 1), norm. avg. (of 18) = 0.987281 fft 1: mflops = 18.451 (norm. = 0.286907), norm. avg. (of 18) = 0.199633 fft 2: mflops = 9.72068 (norm. = 0.151153), norm. avg. (of 18) = 0.477768 fft 3: mflops = -1 (norm. = -0.0155497), norm. avg. (of 10) = 0.262145 Benchmarking for array size = 96x96x96: 0. FFTW: elapsed time t=2.22144 s, 1 iters, t-(init.)=1.88232 s t(norm)=0.107697, mflops=46.4265 (err=8.0e-16) 1. PDA (f2c): elapsed time t=5.19328 s, 1 iters, t-(init.)=4.85365 s t(norm)=0.277702, mflops=18.0049 (err=7.7e-16) 2. Singleton (f2c): elapsed time t=10.3612 s, 1 iters, t-(init.)=10.024 s t(norm)=0.573523, mflops=8.71805 (err=8.2e-16) 3. Temperton (f2c): elapsed time t=5.02604 s, 1 iters, t-(init.)=4.68799 s t(norm)=0.268225, mflops=18.6411 (err=8.9e-16) Top mflops for N=884736 = 46.4265 Normalized results and averages for N=884736: fft 0: mflops = 46.4265 (norm. = 1), norm. avg. (of 19) = 0.98795 fft 1: mflops = 18.0049 (norm. = 0.387815), norm. avg. (of 19) = 0.209537 fft 2: mflops = 8.71805 (norm. = 0.187782), norm. avg. (of 19) = 0.462506 fft 3: mflops = 18.6411 (norm. = 0.401519), norm. avg. (of 11) = 0.274815 Benchmarking for array size = 105x105x105: 0. FFTW: elapsed time t=2.20108 s, 1 iters, t-(init.)=1.7581 s t(norm)=0.0753976, mflops=66.3151 (err=7.9e-16) 1. PDA (f2c): elapsed time t=6.50414 s, 1 iters, t-(init.)=6.06192 s t(norm)=0.25997, mflops=19.233 (err=8.1e-16) 2. Singleton (f2c): elapsed time t=9.87474 s, 1 iters, t-(init.)=9.43345 s t(norm)=0.404561, mflops=12.3591 (err=9.7e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1157625 = 66.3151 Normalized results and averages for N=1157625: fft 0: mflops = 66.3151 (norm. = 1), norm. avg. (of 20) = 0.988553 fft 1: mflops = 19.233 (norm. = 0.290024), norm. avg. (of 20) = 0.213562 fft 2: mflops = 12.3591 (norm. = 0.186369), norm. avg. (of 20) = 0.448699 fft 3: mflops = -1 (norm. = -0.0150795), norm. avg. (of 11) = 0.274815 Benchmarking for array size = 112x112x112: 0. FFTW: elapsed time t=2.83413 s, 1 iters, t-(init.)=2.2956 s t(norm)=0.0800096, mflops=62.4925 (err=7.2e-16) 1. PDA (f2c): elapsed time t=8.247 s, 1 iters, t-(init.)=7.70851 s t(norm)=0.268669, mflops=18.6103 (err=7.0e-16) 2. Singleton (f2c): elapsed time t=12.2899 s, 1 iters, t-(init.)=11.751 s t(norm)=0.409564, mflops=12.2081 (err=6.8e-16) 3. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1404928 = 62.4925 Normalized results and averages for N=1404928: fft 0: mflops = 62.4925 (norm. = 1), norm. avg. (of 21) = 0.989098 fft 1: mflops = 18.6103 (norm. = 0.2978), norm. avg. (of 21) = 0.217573 fft 2: mflops = 12.2081 (norm. = 0.195353), norm. avg. (of 21) = 0.436635 fft 3: mflops = -1 (norm. = -0.0160019), norm. avg. (of 11) = 0.274815 Benchmarking for array size = 120x120x120: 0. FFTW: elapsed time t=3.55794 s, 1 iters, t-(init.)=2.8949 s t(norm)=0.080851, mflops=61.8422 (err=5.9e-16) 1. PDA (f2c): elapsed time t=7.93557 s, 1 iters, t-(init.)=7.27278 s t(norm)=0.20312, mflops=24.616 (err=6.2e-16) 2. Singleton (f2c): elapsed time t=24.536 s, 1 iters, t-(init.)=23.8738 s t(norm)=0.666766, mflops=7.49889 (err=7.4e-16) 3. Temperton (f2c): elapsed time t=10.2026 s, 1 iters, t-(init.)=9.53969 s t(norm)=0.266432, mflops=18.7665 (err=5.7e-16) Top mflops for N=1728000 = 61.8422 Normalized results and averages for N=1728000: fft 0: mflops = 61.8422 (norm. = 1), norm. avg. (of 22) = 0.989593 fft 1: mflops = 24.616 (norm. = 0.398045), norm. avg. (of 22) = 0.225776 fft 2: mflops = 7.49889 (norm. = 0.121259), norm. avg. (of 22) = 0.422299 fft 3: mflops = 18.7665 (norm. = 0.303458), norm. avg. (of 12) = 0.277202 Benchmarking for array size = 144x144x144: 0. FFTW: elapsed time t=6.28855 s, 1 iters, t-(init.)=5.14294 s t(norm)=0.0800734, mflops=62.4427 (err=1.2e-15) 1. PDA (f2c): elapsed time t=14.7598 s, 1 iters, t-(init.)=13.6142 s t(norm)=0.211968, mflops=23.5885 (err=1.1e-15) 2. Singleton (f2c): elapsed time t=36.1841 s, 1 iters, t-(init.)=35.0386 s t(norm)=0.545536, mflops=9.16531 (err=1.5e-15) 3. Temperton (f2c): elapsed time t=16.9166 s, 1 iters, t-(init.)=15.7712 s t(norm)=0.245551, mflops=20.3623 (err=1.2e-15) Top mflops for N=2985984 = 62.4427 Normalized results and averages for N=2985984: fft 0: mflops = 62.4427 (norm. = 1), norm. avg. (of 23) = 0.990046 fft 1: mflops = 23.5885 (norm. = 0.377762), norm. avg. (of 23) = 0.232384 fft 2: mflops = 9.16531 (norm. = 0.146779), norm. avg. (of 23) = 0.41032 fft 3: mflops = 20.3623 (norm. = 0.326096), norm. avg. (of 13) = 0.280963 Benchmarking for array size = 180x180x180: 0. FFTW: elapsed time t=12.5463 s, 1 iters, t-(init.)=10.3087 s t(norm)=0.0786459, mflops=63.5761 Found big error at 3212824: (0.375958,0.483562)*1.71468e-07 = (6.44647e-08,8.29152e-08) != (6.3e-08,8.2e-08) Found big error at 4438815: (0.489888,27.966)*1.71468e-07 = (8.4e-08,4.79526e-06) != (8.4e-08,4.72e-06) Found big error at 4438880: (28.641,2.65203)*1.71468e-07 = (4.911e-06,4.54737e-07) != (4.911e-06,5.3e-07) Found big error at 4438901: (17.3152,5.68773)*1.71468e-07 = (2.969e-06,9.75263e-07) != (2.969e-06,9e-07) Found big error at 4438904: (26.0282,12.5139)*1.71468e-07 = (4.463e-06,2.14574e-06) != (4.463e-06,2.221e-06) Found big error at 4439002: (33.0348,21.0948)*1.71468e-07 = (5.6644e-06,3.61709e-06) != (5.602e-06,3.575e-06) Found big error at 4439012: (2.13099,57.9123)*1.71468e-07 = (3.65396e-07,9.93009e-06) != (3.03e-07,9.888e-06) Found big error at 4439041: (43.0262,13.658)*1.71468e-07 = (7.3776e-06,2.34191e-06) != (7.44e-06,2.384e-06) Found big error at 4439046: (14.8098,23.6026)*1.71468e-07 = (2.5394e-06,4.04709e-06) != (2.477e-06,4.005e-06) Found big error at 4439090: (7.88134,31.2775)*1.71468e-07 = (1.3514e-06,5.36309e-06) != (1.289e-06,5.321e-06) Found big error at 4439112: (27.3369,31.7149)*1.71468e-07 = (4.6874e-06,5.43809e-06) != (4.625e-06,5.396e-06) (err=2.5e-05) 1. PDA (f2c): elapsed time t=30.0541 s, 1 iters, t-(init.)=27.8172 s t(norm)=0.21222, mflops=23.5605 Found big error at 43650: (0.0385033,0.00678272)*0.000414087 = (1.59437e-05,2.80863e-06) != (1.5939e-05,2.473e-06) Found big error at 58875: (0.000517776,0.000152142)*0.000414087 = (2.14404e-07,6.3e-08) != (2.11e-07,6.3e-08) Found big error at 59850: (0.014057,0.0671133)*0.000414087 = (5.82083e-06,2.77907e-05) != (5.825e-06,2.8286e-05) Found big error at 76050: (0.0739279,0.070626)*0.000414087 = (3.06126e-05,2.92453e-05) != (3.0609e-05,2.86e-05) Found big error at 92250: (0.0354228,-0.00106427)*0.000414087 = (1.46681e-05,-4.407e-07) != (1.4671e-05,3.42e-07) Found big error at 108450: (0.0193756,0.00903893)*0.000414087 = (8.02317e-06,3.7429e-06) != (8.021e-06,2.838e-06) Found big error at 124650: (0.00478305,0.0522584)*0.000414087 = (1.9806e-06,2.16395e-05) != (1.982e-06,2.2649e-05) Found big error at 140850: (0.0169375,0.00804283)*0.000414087 = (7.01361e-06,3.33043e-06) != (7.013e-06,2.236e-06) Found big error at 157050: (0.077832,0.0756362)*0.000414087 = (3.22292e-05,3.13199e-05) != (3.2229e-05,3.2478e-05) Found big error at 173250: (0.0776456,0.0533588)*0.000414087 = (3.2152e-05,2.20952e-05) != (3.2153e-05,2.0896e-05) Found big error at 189450: (0.0418048,0.0338433)*0.000414087 = (1.73108e-05,1.40141e-05) != (1.7309e-05,1.5231e-05) (err=1.2e-05) 2. Singleton (f2c): elapsed time t=83.2909 s, 1 iters, t-(init.)=81.0533 s t(norm)=0.618361, mflops=8.08589 Found big error at 76050: (188.708,162.676)*1.71468e-07 = (3.23574e-05,2.78936e-05) != (3.0609e-05,2.86e-05) Found big error at 173250: (189.426,132.696)*1.71468e-07 = (3.24804e-05,2.2753e-05) != (3.2153e-05,2.0896e-05) Found big error at 270450: (107.72,19.7133)*1.71468e-07 = (1.84705e-05,3.38019e-06) != (2.0355e-05,3.446e-06) Found big error at 367650: (90.887,63.0634)*1.71468e-07 = (1.55842e-05,1.08133e-05) != (1.5128e-05,1.2643e-05) Found big error at 464850: (192.204,111.651)*1.71468e-07 = (3.29568e-05,1.91446e-05) != (3.1262e-05,1.8318e-05) Found big error at 562050: (68.4331,137.25)*1.71468e-07 = (1.17341e-05,2.35339e-05) != (1.2895e-05,2.2048e-05) Found big error at 659250: (168.556,133.783)*1.71468e-07 = (2.89019e-05,2.29395e-05) != (3.0114e-05,2.4384e-05) Found big error at 756450: (107.775,139.786)*1.71468e-07 = (1.848e-05,2.39687e-05) != (1.6815e-05,2.4854e-05) Found big error at 853650: (3.48615,138.659)*1.71468e-07 = (5.97762e-07,2.37756e-05) != (7.8e-08,2.1963e-05) Found big error at 950850: (-0.0996133,77.6912)*1.71468e-07 = (-1.70805e-08,1.33215e-05) != (1.864e-06,1.319e-05) Found big error at 1048050: (37.9397,151.006)*1.71468e-07 = (6.50544e-06,2.58927e-05) != (6.243e-06,2.776e-05) (err=4.8e-06) 3. Temperton (f2c): elapsed time t=33.1753 s, 1 iters, t-(init.)=30.9379 s t(norm)=0.236027, mflops=21.184 Found big error at 76050: (188.981,162.565)*1.71468e-07 = (3.24042e-05,2.78747e-05) != (3.0609e-05,2.86e-05) Found big error at 173250: (189.477,132.986)*1.71468e-07 = (3.24892e-05,2.28028e-05) != (3.2153e-05,2.0896e-05) Found big error at 270450: (107.425,19.703)*1.71468e-07 = (1.842e-05,3.37843e-06) != (2.0355e-05,3.446e-06) Found big error at 367650: (90.9582,62.7776)*1.71468e-07 = (1.55964e-05,1.07643e-05) != (1.5128e-05,1.2643e-05) Found big error at 464850: (192.469,111.781)*1.71468e-07 = (3.30022e-05,1.91668e-05) != (3.1262e-05,1.8318e-05) Found big error at 562050: (68.2517,137.482)*1.71468e-07 = (1.1703e-05,2.35737e-05) != (1.2895e-05,2.2048e-05) Found big error at 659250: (168.367,133.557)*1.71468e-07 = (2.88694e-05,2.29008e-05) != (3.0114e-05,2.4384e-05) Found big error at 756450: (108.035,139.647)*1.71468e-07 = (1.85245e-05,2.3945e-05) != (1.6815e-05,2.4854e-05) Found big error at 853650: (3.56735,138.943)*1.71468e-07 = (6.11685e-07,2.38242e-05) != (7.8e-08,2.1963e-05) Found big error at 950850: (-0.393476,77.7118)*1.71468e-07 = (-6.74685e-08,1.33251e-05) != (1.864e-06,1.319e-05) Found big error at 1048050: (37.9807,150.714)*1.71468e-07 = (6.51246e-06,2.58427e-05) != (6.243e-06,2.776e-05) (err=2.6e-05) Top mflops for N=5832000 = 63.5761 Normalized results and averages for N=5832000: fft 0: mflops = 63.5761 (norm. = 1), norm. avg. (of 24) = 0.990461 fft 1: mflops = 23.5605 (norm. = 0.370587), norm. avg. (of 24) = 0.238143 fft 2: mflops = 8.08589 (norm. = 0.127184), norm. avg. (of 24) = 0.398523 fft 3: mflops = 21.184 (norm. = 0.333207), norm. avg. (of 14) = 0.284695 ------------------------------------------------------ @@@@ bench.1d.p2.dat N, Arndt DIF, Arndt DIT, Arndt Split-Radix, Arndt 4-step, Beauregard, Bergland, CWP (min N), CWP (best N), Edelblute, FFTPACK (f2c), FFTW, FFTW_ESTIMATE, Frigo-old, Green, GSL, GSL DIT, GSL DIF, Krukar, Mayer (Buneman), Mayer (simple), Mayer (lookup), NAPACK (f2c), Nielsen, NR (C), Ooura (C), QFT, Ransom, Singleton (f2c), Temperton (f2c), Valkenburg 2, 34.9955, 33.41, 19.8316, 1.4112, 4.98101, 5.68379, 4.97309, 4.68362, , 7.39798, 35.3998, 35.83, 54.058, , 13.4556, 8.90698, 8.56982, 29.7339, , , , 3.34833, 2.5264, 10.5371, 33.0549, , , 7.23149, 3.32311, 8.66919 4, 70.7838, 72.0402, 31.7694, 6.55162, 11.6136, 20.3067, 19.0643, 9.51516, 29.5366, 18.7906, 115.522, 116.671, 172.467, , 34.7833, 18.2477, 17.6482, 83.2027, 35.7165, 36.4916, 33.9592, 7.46115, 9.20605, 19.2836, 63.9496, , 5.37709, 25.403, 10.4405, 9.23592 8, 111.474, 111.523, 41.091, 8.81294, 16.4177, 36.1713, 46.5059, 28.5862, 32.8268, 31.2487, 193.913, 195.053, 258.561, 83.1961, 49.3821, 31.7505, 30.871, 114.704, 60.1088, 60.222, 58.0567, 11.1932, 20.357, 32.8641, 118.214, , 6.2875, 28.6987, 14.7701, 9.7132 16, 73.549, 74.0212, 49.909, 17.1332, 18.6766, 60.1635, 82.8437, 60.2893, 37.2819, 46.3919, 262.152, 262.884, 276.514, 131.597, 74.2255, 46.3724, 45.7286, 132.889, 59.4217, 70.1526, 70.3378, 15.7711, 19.5064, 47.8296, 153.705, 90.6361, 20.7272, 75.3566, 21.1215, 10.103 32, 88.7067, 89.3457, 58.783, 20.551, 19.7554, 90.7942, 88.1131, 112.825, 42.3716, 38.4563, 312.836, 313.206, 316.026, 181.461, 69.1734, 60.732, 61.4429, 136.709, 69.3599, 86.3543, 88.2111, 18.0135, 27.9833, 63.5644, 175.081, 82.7394, 20.9077, 98.8084, 19.7309, 10.4076 64, 91.2701, 91.6691, 67.0343, 30.4287, 20.0914, 110.608, 101.247, 132.665, 47.7002, 45.7881, 338.588, 266.97, 201.039, 254.01, 83.7425, 71.0258, 73.5516, 137.037, 72.9456, 94.508, 97.8479, 20.9227, 35.9758, 76.0687, 196.89, 79.133, 42.0373, 139.852, 25.833, 10.6352 128, 101.519, 101.955, 74.6614, 30.6652, 20.1235, 124.788, 119.883, 186.239, 53.2094, 51.5511, 295.797, 295.952, 223.391, 279.058, 86.0906, 78.6312, 83.0641, 66.2286, 79.9889, 106.226, 109.924, 21.4372, 32.7365, 86.3092, 200.741, 76.9817, 39.5727, 131.05, 24.7325, 10.7917 256, 107.631, 107.755, 81.5305, 35.3214, 20.1504, 142.243, 141.837, 193.087, 58.3608, 55.9123, 315.854, 310.362, 249.685, 298.452, 96.2106, 83.2482, 89.3501, 67.1172, 84.4689, 112.156, 116.604, 23.1054, 37.2063, 93.3818, 214.934, 75.2765, 59.9004, 176.419, 27.2997, 10.9227 512, 113.181, 113.224, 80.3009, 37.4606, 20.1943, 154.475, 145.94, 182.752, 59.1356, 46.3298, 317.734, 295.603, 240.61, 323.226, 85.4903, 83.8832, 93.3043, 70.7741, 91.0164, 121.406, 125.688, 22.9286, 41.5382, 94.9746, 213.656, 70.323, 55.3172, 177.798, 24.3721, 10.9826 1024, 119.201, 118.815, 85.209, 43.7709, 20.1457, 158.174, 144.678, 144.659, 63.3344, 27.6381, 135.953, 134.567, 60.655, 331.465, 42.8045, 83.099, 95.3841, 55.9692, 95.4499, 125.719, 130.124, 20.1491, 33.6278, 94.6007, 222.868, 25.0342, 73.6218, 193.294, 27.5754, 9.265 2048, 112.235, 108.938, 75.3985, 38.0197, 19.3126, 139.577, 102.241, 115.771, 58.8088, 15.7247, 43.0806, 42.8589, 26.4238, 141.815, 19.6237, 73.9043, 85.5867, 46.9324, 94.8366, 124.098, 94.3821, 5.42769, 22.2994, 82.5008, 107.319, 23.2998, 55.2133, 142.309, 20.1244, 6.1915 4096, 22.761, 21.4867, 18.0256, 28.9807, 16.2296, 52.0758, 72.4665, 86.8698, 17.144, 17.0498, 45.0866, 42.2877, 27.4475, 54.3579, 22.6471, 22.5624, 22.7812, 42.713, 83.7536, 108.561, 56.747, 5.69977, 17.5608, 22.9473, 59.0996, 21.7914, 53.9784, 45.2057, 19.1153, 5.70535 8192, 24.2882, 22.8789, 16.8701, 24.2322, 16.1135, 47.0286, 73.3163, 80.1645, 16.2224, 14.4483, 45.3585, 45.3143, 29.0635, 53.2515, 17.638, 22.2804, 22.5283, , 25.6663, 27.1183, 22.8322, 5.52315, 16.359, 22.6605, 56.958, 19.5695, 44.5624, 41.25, 17.8337, 5.4199 16384, 21.8241, 20.6088, 16.7708, 30.9176, 16.1129, 50.4652, 79.3728, 79.3097, 16.1669, 14.9955, 43.3952, 46.0682, 27.1727, 51.7625, 19.2594, 22.3291, 22.5372, , 24.9005, 26.0841, 22.5107, 5.7141, 17.5177, 22.5828, 60.4442, 17.79, 61.6731, 44.8147, 19.1619, 5.18923 32768, 23.2089, 21.9981, 16.62, 25.2725, 15.9416, 49.6159, 77.8625, 77.9876, 16.0638, 15.543, 46.4582, 45.7758, 21.4703, 51.0453, 20.8771, 22.1578, 22.3684, , 24.7183, 25.7754, 22.4685, 5.51323, 17.8727, 22.3256, 58.2582, 15.7761, 49.6241, 37.0693, 18.0949, 4.87724 65536, 20.189, 19.1719, 15.8955, 32.8168, 15.4886, 44.1898, 61.954, 61.9794, 15.3716, 13.025, 44.5281, 36.7426, 18.9307, 48.3581, 20.1864, 20.763, 21.0529, , 24.068, 25.0813, 21.7385, 5.67705, 15.9667, 21.3482, 54.6037, 11.2435, 40.6363, 40.1563, 18.068, 4.51699 131072, 6.55994, 6.1758, 4.57506, 14.6712, 10.2782, 16.9665, 39.2574, 39.2504, 4.57416, 9.15353, 34.7816, 27.6848, 15.6566, 17.6717, 12.7126, 6.8286, 7.05538, , 19.881, 20.5752, 18.1031, 5.41337, 8.27718, 6.83959, 21.3327, 10.2603, 20.2546, 13.2746, 10.8317, 3.43008 262144, 5.18909, 5.16219, 4.25411, 16.3852, 10.2402, 16.1989, 24.6455, 24.6349, 4.23776, 9.0542, 35.5753, 26.4865, 14.0024, 16.6462, 12.9496, 6.60701, 6.47892, , 7.41888, 7.48846, 7.16967, 5.63515, 6.89636, 6.43873, 22.3318, 8.23924, 28.9359, 13.0064, 10.3175, 3.30092 524288, 5.42874, 5.38401, 4.16927, 10.2971, 10.2612, 14.6349, 24.9132, 24.9142, 4.14715, 9.36735, 31.1927, 26.5029, 12.4792, 16.3769, 13.6613, 6.5776, 6.44071, , 6.86085, 6.93694, 6.58819, 5.36947, 6.531, 6.40436, 21.5577, 5.7232, 22.251, 10.9683, 9.64679, 3.20654 1048576, 4.98462, 4.95771, 4.11852, 16.4842, 10.278, 14.404, , , 4.0954, 9.26652, 31.6659, 23.9698, 12.1926, , 14.2456, 6.55835, 6.41344, , , 6.81481, 6.48966, 5.52051, 6.78024, 6.38154, 22.5831, 5.15103, 31.6593, 12.7471, 9.9899, 3.10258 Norm. Avg., 0.327138, 0.320704, 0.21887, 0.227143, 0.150707, 0.443257, 0.567998, 0.608716, 0.17767, 0.174803, 0.771798, 0.709978, 0.568407, 0.688319, 0.26718, 0.234283, 0.243542, 0.361777, 0.328257, 0.377148, 0.328315, 0.0819112, 0.148038, 0.24751, 0.63985, 0.215903, 0.399467, 0.43008, 0.160922, 0.0615374 ------------------------------------------------------ @@@@ bench.1d.np2.dat N, CWP (min N), CWP (best N), FFTPACK (f2c), FFTW, FFTW_ESTIMATE, Frigo-old, GSL, NAPACK (f2c), Nielsen, Singleton (f2c), Temperton (f2c), Valkenburg 6, 27.4528, 18.293, 23.3201, 146.884, 147.825, 26.3779, 38.9986, 10.0144, 7.71959, 20.449, 10.2064, 9.81555 9, 47.5977, 33.6421, 29.3962, 163.2, 163.777, 22.8865, 36.7318, 12.5388, 11.8668, 38.2723, 14.762, 9.99683 12, 61.0455, 50.7319, 37.3331, 238.673, 239.564, 39.8592, 57.5478, 13.1038, 15.4399, 41.0605, 19.2801, 10.1378 15, 69.2177, 69.1198, 39.27, 181.389, 181.787, 27.7436, 40.1375, 9.11107, 18.0347, 45.839, 20.1223, 9.12677 18, 74.2254, 70.6565, 28.1297, 151.226, 149.592, 26.3869, 63.7746, 15.0035, 13.6784, 57.2963, 16.958, 10.2973 24, 102.288, 103.591, 33.0629, 200.959, 201.167, 51.9216, 84.4812, 16.8588, 23.2937, 53.6663, 21.9986, 10.4166 36, 122.367, 122.014, 35.8561, 214.93, 215.163, 33.134, 96.5927, 18.2538, 20.6553, 87.7936, 25.1162, 10.5392 80, 152.778, 174.636, 45.8935, 243.722, 243.849, 66.5556, 63.9176, 12.6587, 38.4218, 135.064, 29.3923, 10.0569 108, 129.87, 168.171, 43.4436, 235.747, 235.89, 30.4848, 90.6154, 19.969, 23.5322, 97.4436, 26.8778, 10.7063 210, 185.428, 185.394, 31.8144, 195.322, 186.177, 32.5078, 64.6994, 10.3246, 31.7327, 79.739, , 8.64919 504, 200.471, 200.476, 31.1813, 235.703, 205.108, 38.1557, 87.1142, 13.5371, 30.2452, 95.0034, , 9.49406 1000, 130.015, 190.143, 28.0056, 119.733, 119.726, 24.2687, 34.7626, 10.8895, 41.8375, 134.439, 27.4233, 8.35184 1960, 153.959, 153.982, 9.31492, 46.9461, 47.6425, 18.4971, 16.0592, 6.06489, 21.8033, 94.859, , 6.10108 4725, 85.3356, 95.1378, 11.0452, 50.6889, 44.2329, 12.7663, 18.5556, 6.38658, 17.2921, 41.4297, , 6.36739 10368, 83.1926, 87.6852, 15.7759, 43.293, 48.6682, 17.7906, 21.7377, 6.34837, 16.6318, 34.0141, 20.0384, 5.75268 27000, 90.4887, 90.4255, 16.405, 47.779, 49.2707, 12.7737, 21.2981, 6.17086, 18.8748, 34.5841, 20.0694, 5.27568 75600, 57.7537, 57.7481, 10.5024, 42.6443, 40.7383, 11.4015, 16.2266, 5.94785, 15.0566, 22.6622, , 4.54447 165375, 32.5448, 32.551, 6.34262, 36.277, 34.7041, 7.67984, 16.6386, 5.01113, 11.0034, 13.0435, , 4.6352 362880, 16.765, 16.7621, 8.59997, 42.8482, 36.0755, 9.53401, 17.783, 5.27631, 8.70017, 10.1887, , 3.74644 Norm. Avg., 0.656559, 0.683521, 0.16506, 0.853077, 0.832635, 0.174309, 0.300362, 0.0753936, 0.149863, 0.376101, 0.126466, 0.0593834 ------------------------------------------------------ @@@@ bench.3d.p2.dat Array Dimensions, FFTW, HARM (f2c), NR (C), PDA (f2c), Singleton (f2c), Temperton (f2c) 4x4x4, 231.895, , 80.493, 11.9339, 152.424, 34.9584 8x8x8, 276.381, 39.4445, 109.501, 21.3929, 134.324, 37.4849 16x16x16, 92.5379, 33.9298, 21.081, 23.1594, 37.4511, 33.9979 32x32x32, 110.475, 33.9704, 20.9494, 22.9192, 33.0146, 29.1926 64x64x64, 37.7948, 20.0638, 5.99974, 11.4972, 11.4967, 19.0572 256x64x32, 38.6521, 19.24, 6.02267, 13.873, 10.9645, 16.1788 16x1024x64, 37.7178, 19.731, 5.98662, 11.8179, 11.3715, 128x128x128, 40.7994, 18.375, 5.99634, 13.2559, 8.13377, 10.2969 512x128x64, 43.3628, 18.7845, 6.03266, 13.8563, 9.84832, Norm. Avg., 1, 0.406525, 0.213347, 0.245276, 0.35141, 0.299029 ------------------------------------------------------ @@@@ bench.3d.np2.dat Array Dimensions, FFTW, PDA (f2c), Singleton (f2c), Temperton (f2c) 5x5x5, 164.237, 14.4635, 210.406, 34.2262 6x6x6, 220.555, 16.0945, 113.932, 30.678 7x7x7, 156.703, 8.74723, 100.401, 9x9x9, 205.568, 20.8561, 146.221, 37.6111 10x10x10, 226.306, 23.6594, 143.837, 34.6705 11x11x11, 121.642, 8.71517, 94.6469, 12x12x12, 286.976, 25.3425, 141.183, 46.2948 13x13x13, 76.5142, 8.20759, 77.2495, 14x14x14, 114.126, 13.7823, 60.2377, 15x15x15, 120.708, 25.9216, 60.7593, 35.6307 24x25x28, 101.515, 23.4185, 35.1844, 48x48x48, 60.7025, 22.3783, 16.1762, 25.4958 49x49x49, 65.1728, 13.4469, 19.4996, 60x60x60, 61.4745, 23.9451, 10.1941, 22.224 72x60x56, 60.4328, 19.6622, 10.1538, 75x75x75, 62.5924, 24.3899, 12.1305, 24.0981 80x80x80, 59.7401, 23.3603, 12.1487, 21.5358 84x84x84, 64.3101, 18.451, 9.72068, 96x96x96, 46.4265, 18.0049, 8.71805, 18.6411 105x105x105, 66.3151, 19.233, 12.3591, 112x112x112, 62.4925, 18.6103, 12.2081, 120x120x120, 61.8422, 24.616, 7.49889, 18.7665 144x144x144, 62.4427, 23.5885, 9.16531, 20.3623 180x180x180, 63.5761, 23.5605, 8.08589, 21.184 Norm. Avg., 0.990461, 0.238143, 0.398523, 0.284695 @@@@ end