To: benchfft@theory.lcs.mit.edu Subject: SUBMIT ------------------- @@SUBMIT@@ @ name = Steven G. Johnson @ email = stevenj@alum.mit.edu @ organization = MIT @ computer manufacturer = @ computer model = @ CPU manufacturer = Cyrix @ CPU model = 6x86 @ CPU speed = 166 @ RAM = 64 @ L2 cache size = 512 kB @ operating system = Linux 2.0.18 @ C compiler = gcc 2.7.2.1 @ C compiler flags = -pedantic -ansi -O6 -fomit-frame-pointer -Wall @ Fortran compiler = g77 0.5.21 @ Fortran compiler flags = -O6 -fomit-frame-pointer @ remarks = @ FFTW version = FFTW V1.2 @ floating-point precision = double @ floating-point size = 8 bytes ------------------------------------------------------ @@@@ bench.1d.p2.log Benchmarking for sizes: 2 (0.000228882 MB) 4 (0.000534058 MB) 8 (0.000839233 MB) 16 (0.00164795 MB) 32 (0.00297546 MB) 64 (0.00616455 MB) 128 (0.0119019 MB) 256 (0.0238037 MB) 512 (0.0476074 MB) 1024 (0.0939941 MB) 2048 (0.189575 MB) 4096 (0.37915 MB) 8192 (0.765991 MB) 16384 (1.51184 MB) 32768 (3.02368 MB) 65536 (6.09973 MB) 131072 (12.1995 MB) 262144 (25.4987 MB) Maximum array size = 360360 Benchmarking FFTs: 0. Arndt DIF 1. Arndt DIT 2. Arndt Split-Radix 3. Arndt 4-step 4. Bailey 5. Beauregard 6. Bergland 7. Brenner 8. Burrus 9. CWP (min N) 10. CWP (best N) 11. Edelblute 12. FFTPACK 13. FFTPACK (f2c) 14. FFTW 15. FFTW_ESTIMATE 16. Frigo-old 17. Green 18. GSL 19. GSL DIT 20. GSL DIF 21. Krukar 22. Mayer (Buneman) 23. Mayer (simple) 24. Mayer (lookup) 25. Monro 26. NAPACK (f2c) 27. Nielsen 28. NR (C) 29. NR (F) 30. Ooura (C) 31. Ooura (F) 32. QFT 33. Ransom 34. SCIPORT 35. Singleton 36. Singleton (f2c) 37. Sorensen 38. Sorensen DIT 39. Temperton 40. Temperton (f2c) 41. Valkenburg Computing normalized averages (42 transforms). Benchmarking for array size = 2 (power of 2): 0. Arndt DIF: elapsed time t=1.5 s, 1048576 iters, t-(init.)=1.18 s t(norm)=0.562668, mflops=8.88624 (err=5.6e-17) 1. Arndt DIT: elapsed time t=1.52 s, 1048576 iters, t-(init.)=1.24 s t(norm)=0.591278, mflops=8.45626 (err=5.6e-17) 2. Arndt Split-Radix: elapsed time t=1.96 s, 1048576 iters, t-(init.)=1.65 s t(norm)=0.786781, mflops=6.35501 (err=5.6e-17) 3. Arndt 4-step: elapsed time t=1.09 s, 65536 iters, t-(init.)=1.07 s t(norm)=8.16345, mflops=0.612486 (err=5.6e-17) 4. Bailey: elapsed time t=1.26 s, 262144 iters, t-(init.)=1.18 s t(norm)=2.25067, mflops=2.22156 (err=5.6e-17) 5. Beauregard: elapsed time t=1.8 s, 262144 iters, t-(init.)=1.73 s t(norm)=3.29971, mflops=1.51528 (err=8.4e-17) 6. Bergland: elapsed time t=1.11 s, 262144 iters, t-(init.)=1.03 s t(norm)=1.96457, mflops=2.54509 (err=8.4e-17) 7. Brenner: elapsed time t=1.14 s, 262144 iters, t-(init.)=1.06 s t(norm)=2.02179, mflops=2.47306 (err=8.4e-17) 8. Burrus: elapsed time t=1.62 s, 1048576 iters, t-(init.)=1.31 s t(norm)=0.624657, mflops=8.0044 (err=5.6e-17) 9. CWP (min N): elapsed time t=1.23 s, 262144 iters, t-(init.)=1.14 s t(norm)=2.17438, mflops=2.29951 10. CWP (best N) (N=3): elapsed time t=1.64 s, 262144 iters, t-(init.)=1.56 s t(norm)=2.97546, mflops=1.68041 11. Skipping fft (Edelblute can't handle N <= 2). 12. FFTPACK: elapsed time t=1.41 s, 524288 iters, t-(init.)=1.26 s t(norm)=1.20163, mflops=4.16102 (err=8.4e-17) 13. FFTPACK (f2c): elapsed time t=1.72 s, 524288 iters, t-(init.)=1.56 s t(norm)=1.48773, mflops=3.36082 (err=8.4e-17) FFTW_MEASURE plan: (cost = 1.182556e-06) FFTW_NOTW 2 14. FFTW: elapsed time t=1.39 s, 1048576 iters, t-(init.)=1.1 s t(norm)=0.524521, mflops=9.53251 (err=8.4e-17) FFTW_ESTIMATE plan: (cost = 1.820000e+02) FFTW_NOTW 2 15. FFTW_ESTIMATE: elapsed time t=1.39 s, 1048576 iters, t-(init.)=1.08 s t(norm)=0.514984, mflops=9.70904 (err=8.4e-17) 16. Frigo-old: elapsed time t=1.29 s, 1048576 iters, t-(init.)=0.96 s t(norm)=0.457764, mflops=10.9227 (err=8.4e-17) 17. Skipping fft (Green can't handle this size.). 18. GSL: elapsed time t=1.09 s, 262144 iters, t-(init.)=1.02 s t(norm)=1.9455, mflops=2.57004 (err=8.4e-17) 19. GSL DIT: elapsed time t=1.74 s, 262144 iters, t-(init.)=1.66 s t(norm)=3.1662, mflops=1.57918 (err=8.4e-17) 20. GSL DIF: elapsed time t=1.02 s, 131072 iters, t-(init.)=0.98 s t(norm)=3.7384, mflops=1.33747 (err=8.4e-17) 21. Krukar: elapsed time t=1.98 s, 1048576 iters, t-(init.)=1.69 s t(norm)=0.805855, mflops=6.20459 (err=8.4e-17) 22. Skipping fft (Mayer can't handle N <= 2). 23. Skipping fft (Mayer can't handle N <= 2). 24. Skipping fft (Mayer can't handle N <= 2). 25. Skipping fft (Monro can't handle N <= 2). 26. NAPACK (f2c): elapsed time t=1.63 s, 131072 iters, t-(init.)=1.59 s t(norm)=6.06537, mflops=0.824352 (err=8.3e-17) 27. Nielsen: elapsed time t=1.03 s, 65536 iters, t-(init.)=1 s t(norm)=7.62939, mflops=0.65536 (err=5.6e-17) 28. NR (C): elapsed time t=1.78 s, 262144 iters, t-(init.)=1.7 s t(norm)=3.24249, mflops=1.54202 (err=8.4e-17) 29. NR (F): elapsed time t=1.9 s, 262144 iters, t-(init.)=1.82 s t(norm)=3.47137, mflops=1.44035 (err=8.4e-17) 30. Ooura (C): elapsed time t=1.51 s, 1048576 iters, t-(init.)=1.21 s t(norm)=0.576973, mflops=8.66592 (err=8.4e-17) 31. Ooura (F): elapsed time t=1.63 s, 1048576 iters, t-(init.)=1.31 s t(norm)=0.624657, mflops=8.0044 (err=8.4e-17) 32. Skipping fft (QFT requires N >= 16). 33. Skipping fft (Ransom doesn't work for N=2). 34. Skipping fft (SCIPORT can't handle N < 4). 35. Singleton: elapsed time t=1.9 s, 262144 iters, t-(init.)=1.83 s t(norm)=3.49045, mflops=1.43248 (err=8.4e-17) 36. Singleton (f2c): elapsed time t=1.76 s, 262144 iters, t-(init.)=1.69 s t(norm)=3.22342, mflops=1.55115 (err=8.4e-17) 37. Sorensen: elapsed time t=1.64 s, 1048576 iters, t-(init.)=1.34 s t(norm)=0.638962, mflops=7.82519 (err=5.6e-17) 38. Sorensen DIT: elapsed time t=1.75 s, 1048576 iters, t-(init.)=1.43 s t(norm)=0.681877, mflops=7.3327 (err=5.6e-17) 39. Temperton: elapsed time t=1.41 s, 262144 iters, t-(init.)=1.34 s t(norm)=2.55585, mflops=1.9563 (err=8.4e-17) 40. Temperton (f2c): elapsed time t=1.63 s, 262144 iters, t-(init.)=1.55 s t(norm)=2.95639, mflops=1.69125 (err=8.4e-17) 41. Valkenburg: elapsed time t=1.15 s, 262144 iters, t-(init.)=1.07 s t(norm)=2.04086, mflops=2.44994 (err=8.3e-17) Top mflops for N=2 = 10.9227 Normalized results and averages for N=2: fft 0: mflops = 8.88624 (norm. = 0.813559), norm. avg. (of 1) = 0.813559 fft 1: mflops = 8.45626 (norm. = 0.774194), norm. avg. (of 1) = 0.774194 fft 2: mflops = 6.35501 (norm. = 0.581818), norm. avg. (of 1) = 0.581818 fft 3: mflops = 0.612486 (norm. = 0.0560748), norm. avg. (of 1) = 0.0560748 fft 4: mflops = 2.22156 (norm. = 0.20339), norm. avg. (of 1) = 0.20339 fft 5: mflops = 1.51528 (norm. = 0.138728), norm. avg. (of 1) = 0.138728 fft 6: mflops = 2.54509 (norm. = 0.23301), norm. avg. (of 1) = 0.23301 fft 7: mflops = 2.47306 (norm. = 0.226415), norm. avg. (of 1) = 0.226415 fft 8: mflops = 8.0044 (norm. = 0.732824), norm. avg. (of 1) = 0.732824 fft 9: mflops = 2.29951 (norm. = 0.210526), norm. avg. (of 1) = 0.210526 fft 10: mflops = 1.68041 (norm. = 0.153846), norm. avg. (of 1) = 0.153846 fft 11: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 12: mflops = 4.16102 (norm. = 0.380952), norm. avg. (of 1) = 0.380952 fft 13: mflops = 3.36082 (norm. = 0.307692), norm. avg. (of 1) = 0.307692 fft 14: mflops = 9.53251 (norm. = 0.872727), norm. avg. (of 1) = 0.872727 fft 15: mflops = 9.70904 (norm. = 0.888889), norm. avg. (of 1) = 0.888889 fft 16: mflops = 10.9227 (norm. = 1), norm. avg. (of 1) = 1 fft 17: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 18: mflops = 2.57004 (norm. = 0.235294), norm. avg. (of 1) = 0.235294 fft 19: mflops = 1.57918 (norm. = 0.144578), norm. avg. (of 1) = 0.144578 fft 20: mflops = 1.33747 (norm. = 0.122449), norm. avg. (of 1) = 0.122449 fft 21: mflops = 6.20459 (norm. = 0.568047), norm. avg. (of 1) = 0.568047 fft 22: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 23: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 24: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 25: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 26: mflops = 0.824352 (norm. = 0.0754717), norm. avg. (of 1) = 0.0754717 fft 27: mflops = 0.65536 (norm. = 0.06), norm. avg. (of 1) = 0.06 fft 28: mflops = 1.54202 (norm. = 0.141176), norm. avg. (of 1) = 0.141176 fft 29: mflops = 1.44035 (norm. = 0.131868), norm. avg. (of 1) = 0.131868 fft 30: mflops = 8.66592 (norm. = 0.793388), norm. avg. (of 1) = 0.793388 fft 31: mflops = 8.0044 (norm. = 0.732824), norm. avg. (of 1) = 0.732824 fft 32: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 33: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 34: mflops = -1 (norm. = -0.0915527), norm. avg. (of 0) = -1 fft 35: mflops = 1.43248 (norm. = 0.131148), norm. avg. (of 1) = 0.131148 fft 36: mflops = 1.55115 (norm. = 0.142012), norm. avg. (of 1) = 0.142012 fft 37: mflops = 7.82519 (norm. = 0.716418), norm. avg. (of 1) = 0.716418 fft 38: mflops = 7.3327 (norm. = 0.671329), norm. avg. (of 1) = 0.671329 fft 39: mflops = 1.9563 (norm. = 0.179104), norm. avg. (of 1) = 0.179104 fft 40: mflops = 1.69125 (norm. = 0.154839), norm. avg. (of 1) = 0.154839 fft 41: mflops = 2.44994 (norm. = 0.224299), norm. avg. (of 1) = 0.224299 Benchmarking for array size = 4 (power of 2): 0. Arndt DIF: elapsed time t=1.89 s, 524288 iters, t-(init.)=1.7 s t(norm)=0.405312, mflops=12.3362 (err=9.6e-17) 1. Arndt DIT: elapsed time t=1.98 s, 524288 iters, t-(init.)=1.8 s t(norm)=0.429153, mflops=11.6508 (err=9.6e-17) 2. Arndt Split-Radix: elapsed time t=1 s, 131072 iters, t-(init.)=0.95 s t(norm)=0.905991, mflops=5.51882 (err=1.5e-16) 3. Arndt 4-step: elapsed time t=1.39 s, 65536 iters, t-(init.)=1.37 s t(norm)=2.61307, mflops=1.91346 (err=1.5e-16) 4. Bailey: elapsed time t=1.71 s, 131072 iters, t-(init.)=1.66 s t(norm)=1.5831, mflops=3.15836 (err=1.5e-16) 5. Beauregard: elapsed time t=1.32 s, 65536 iters, t-(init.)=1.3 s t(norm)=2.47955, mflops=2.01649 (err=1.4e-16) 6. Bergland: elapsed time t=1.72 s, 262144 iters, t-(init.)=1.62 s t(norm)=0.772476, mflops=6.47269 (err=7.6e-17) 7. Brenner: elapsed time t=1.14 s, 131072 iters, t-(init.)=1.09 s t(norm)=1.03951, mflops=4.80998 (err=1.4e-16) 8. Burrus: elapsed time t=1.02 s, 131072 iters, t-(init.)=0.97 s t(norm)=0.925064, mflops=5.40503 (err=1.5e-16) 9. CWP (min N): elapsed time t=1.76 s, 262144 iters, t-(init.)=1.66 s t(norm)=0.79155, mflops=6.31672 10. CWP (best N) (N=15): elapsed time t=1.09 s, 32768 iters, t-(init.)=1.07 s t(norm)=4.08173, mflops=1.22497 11. Edelblute: elapsed time t=1.97 s, 262144 iters, t-(init.)=1.87 s t(norm)=0.891685, mflops=5.60736 (err=1.5e-16) 12. FFTPACK: elapsed time t=1.31 s, 262144 iters, t-(init.)=1.22 s t(norm)=0.581741, mflops=8.59489 (err=1.2e-16) 13. FFTPACK (f2c): elapsed time t=1.5 s, 262144 iters, t-(init.)=1.41 s t(norm)=0.67234, mflops=7.43671 (err=1.2e-16) FFTW_MEASURE plan: (cost = 2.975464e-06) FFTW_NOTW 4 14. FFTW: elapsed time t=1.59 s, 524288 iters, t-(init.)=1.4 s t(norm)=0.333786, mflops=14.9797 (err=1.4e-16) FFTW_ESTIMATE plan: (cost = 3.176000e+02) FFTW_NOTW 4 15. FFTW_ESTIMATE: elapsed time t=1.58 s, 524288 iters, t-(init.)=1.39 s t(norm)=0.331402, mflops=15.0874 (err=1.4e-16) 16. Frigo-old: elapsed time t=1.48 s, 524288 iters, t-(init.)=1.29 s t(norm)=0.30756, mflops=16.257 (err=1.4e-16) 17. Skipping fft (Green can't handle this size.). 18. GSL: elapsed time t=1.05 s, 131072 iters, t-(init.)=1 s t(norm)=0.953674, mflops=5.24288 (err=1.4e-16) 19. GSL DIT: elapsed time t=1.94 s, 131072 iters, t-(init.)=1.9 s t(norm)=1.81198, mflops=2.75941 (err=1.4e-16) 20. GSL DIF: elapsed time t=1.16 s, 65536 iters, t-(init.)=1.13 s t(norm)=2.1553, mflops=2.31986 (err=1.8e-16) 21. Krukar: elapsed time t=1.92 s, 524288 iters, t-(init.)=1.72 s t(norm)=0.41008, mflops=12.1927 (err=1.4e-16) 22. Mayer (Buneman): elapsed time t=1.69 s, 262144 iters, t-(init.)=1.59 s t(norm)=0.758171, mflops=6.59482 (err=8.1e-17) 23. Mayer (simple): elapsed time t=1.73 s, 262144 iters, t-(init.)=1.64 s t(norm)=0.782013, mflops=6.39376 24. Mayer (lookup): elapsed time t=1.72 s, 262144 iters, t-(init.)=1.62 s t(norm)=0.772476, mflops=6.47269 (err=8.1e-17) 25. Monro: elapsed time t=1.21 s, 65536 iters, t-(init.)=1.19 s t(norm)=2.26974, mflops=2.20289 (err=8.1e-17) 26. NAPACK (f2c): elapsed time t=1.73 s, 65536 iters, t-(init.)=1.7 s t(norm)=3.24249, mflops=1.54202 (err=1.8e-16) 27. Nielsen: elapsed time t=1.11 s, 65536 iters, t-(init.)=1.08 s t(norm)=2.05994, mflops=2.42726 (err=1.5e-16) 28. NR (C): elapsed time t=1 s, 65536 iters, t-(init.)=0.97 s t(norm)=1.85013, mflops=2.70252 (err=1.4e-16) 29. NR (F): elapsed time t=1.07 s, 65536 iters, t-(init.)=1.05 s t(norm)=2.00272, mflops=2.49661 (err=1.4e-16) 30. Ooura (C): elapsed time t=1.06 s, 262144 iters, t-(init.)=0.96 s t(norm)=0.457764, mflops=10.9227 (err=9.8e-17) 31. Ooura (F): elapsed time t=1.12 s, 262144 iters, t-(init.)=1.02 s t(norm)=0.486374, mflops=10.2802 (err=9.8e-17) 32. Skipping fft (QFT requires N >= 16). 33. Ransom: elapsed time t=1.93 s, 65536 iters, t-(init.)=1.9 s t(norm)=3.62396, mflops=1.37971 (err=2.1e-16) 34. SCIPORT: elapsed time t=1.45 s, 131072 iters, t-(init.)=1.4 s t(norm)=1.33514, mflops=3.74491 (err=8.0e-09) 35. Singleton: elapsed time t=1.41 s, 131072 iters, t-(init.)=1.36 s t(norm)=1.297, mflops=3.85506 (err=1.5e-16) 36. Singleton (f2c): elapsed time t=1.34 s, 131072 iters, t-(init.)=1.29 s t(norm)=1.23024, mflops=4.06425 (err=1.1e-16) 37. Sorensen: elapsed time t=1.38 s, 262144 iters, t-(init.)=1.28 s t(norm)=0.610352, mflops=8.192 (err=1.5e-16) 38. Sorensen DIT: elapsed time t=1.94 s, 262144 iters, t-(init.)=1.84 s t(norm)=0.87738, mflops=5.69878 (err=8.1e-17) 39. Temperton: elapsed time t=1.06 s, 131072 iters, t-(init.)=1.01 s t(norm)=0.963211, mflops=5.19097 (err=1.7e-16) 40. Temperton (f2c): elapsed time t=1.2 s, 131072 iters, t-(init.)=1.15 s t(norm)=1.09673, mflops=4.55903 (err=1.7e-16) 41. Valkenburg: elapsed time t=1.1 s, 65536 iters, t-(init.)=1.08 s t(norm)=2.05994, mflops=2.42726 (err=1.8e-16) Top mflops for N=4 = 16.257 Normalized results and averages for N=4: fft 0: mflops = 12.3362 (norm. = 0.758824), norm. avg. (of 2) = 0.786191 fft 1: mflops = 11.6508 (norm. = 0.716667), norm. avg. (of 2) = 0.74543 fft 2: mflops = 5.51882 (norm. = 0.339474), norm. avg. (of 2) = 0.460646 fft 3: mflops = 1.91346 (norm. = 0.117701), norm. avg. (of 2) = 0.0868877 fft 4: mflops = 3.15836 (norm. = 0.194277), norm. avg. (of 2) = 0.198833 fft 5: mflops = 2.01649 (norm. = 0.124038), norm. avg. (of 2) = 0.131383 fft 6: mflops = 6.47269 (norm. = 0.398148), norm. avg. (of 2) = 0.315579 fft 7: mflops = 4.80998 (norm. = 0.295872), norm. avg. (of 2) = 0.261143 fft 8: mflops = 5.40503 (norm. = 0.332474), norm. avg. (of 2) = 0.532649 fft 9: mflops = 6.31672 (norm. = 0.388554), norm. avg. (of 2) = 0.29954 fft 10: mflops = 1.22497 (norm. = 0.0753505), norm. avg. (of 2) = 0.114598 fft 11: mflops = 5.60736 (norm. = 0.34492), norm. avg. (of 1) = 0.34492 fft 12: mflops = 8.59489 (norm. = 0.528689), norm. avg. (of 2) = 0.45482 fft 13: mflops = 7.43671 (norm. = 0.457447), norm. avg. (of 2) = 0.38257 fft 14: mflops = 14.9797 (norm. = 0.921429), norm. avg. (of 2) = 0.897078 fft 15: mflops = 15.0874 (norm. = 0.928058), norm. avg. (of 2) = 0.908473 fft 16: mflops = 16.257 (norm. = 1), norm. avg. (of 2) = 1 fft 17: mflops = -1 (norm. = -0.061512), norm. avg. (of 0) = -1 fft 18: mflops = 5.24288 (norm. = 0.3225), norm. avg. (of 2) = 0.278897 fft 19: mflops = 2.75941 (norm. = 0.169737), norm. avg. (of 2) = 0.157158 fft 20: mflops = 2.31986 (norm. = 0.142699), norm. avg. (of 2) = 0.132574 fft 21: mflops = 12.1927 (norm. = 0.75), norm. avg. (of 2) = 0.659024 fft 22: mflops = 6.59482 (norm. = 0.40566), norm. avg. (of 1) = 0.40566 fft 23: mflops = 6.39376 (norm. = 0.393293), norm. avg. (of 1) = 0.393293 fft 24: mflops = 6.47269 (norm. = 0.398148), norm. avg. (of 1) = 0.398148 fft 25: mflops = 2.20289 (norm. = 0.135504), norm. avg. (of 1) = 0.135504 fft 26: mflops = 1.54202 (norm. = 0.0948529), norm. avg. (of 2) = 0.0851623 fft 27: mflops = 2.42726 (norm. = 0.149306), norm. avg. (of 2) = 0.104653 fft 28: mflops = 2.70252 (norm. = 0.166237), norm. avg. (of 2) = 0.153707 fft 29: mflops = 2.49661 (norm. = 0.153571), norm. avg. (of 2) = 0.14272 fft 30: mflops = 10.9227 (norm. = 0.671875), norm. avg. (of 2) = 0.732632 fft 31: mflops = 10.2802 (norm. = 0.632353), norm. avg. (of 2) = 0.682589 fft 32: mflops = -1 (norm. = -0.061512), norm. avg. (of 0) = -1 fft 33: mflops = 1.37971 (norm. = 0.0848684), norm. avg. (of 1) = 0.0848684 fft 34: mflops = 3.74491 (norm. = 0.230357), norm. avg. (of 1) = 0.230357 fft 35: mflops = 3.85506 (norm. = 0.237132), norm. avg. (of 2) = 0.18414 fft 36: mflops = 4.06425 (norm. = 0.25), norm. avg. (of 2) = 0.196006 fft 37: mflops = 8.192 (norm. = 0.503906), norm. avg. (of 2) = 0.610162 fft 38: mflops = 5.69878 (norm. = 0.350543), norm. avg. (of 2) = 0.510936 fft 39: mflops = 5.19097 (norm. = 0.319307), norm. avg. (of 2) = 0.249206 fft 40: mflops = 4.55903 (norm. = 0.280435), norm. avg. (of 2) = 0.217637 fft 41: mflops = 2.42726 (norm. = 0.149306), norm. avg. (of 2) = 0.186802 Benchmarking for array size = 8 (power of 2): 0. Arndt DIF: elapsed time t=1.3 s, 131072 iters, t-(init.)=1.23 s t(norm)=0.391006, mflops=12.7875 (err=1.6e-16) 1. Arndt DIT: elapsed time t=1.29 s, 131072 iters, t-(init.)=1.22 s t(norm)=0.387828, mflops=12.8923 (err=1.6e-16) 2. Arndt Split-Radix: elapsed time t=1.38 s, 65536 iters, t-(init.)=1.35 s t(norm)=0.858307, mflops=5.82542 (err=2.0e-16) 3. Arndt 4-step: elapsed time t=1.76 s, 32768 iters, t-(init.)=1.74 s t(norm)=2.21252, mflops=2.25986 (err=2.1e-16) 4. Bailey: elapsed time t=1.79 s, 65536 iters, t-(init.)=1.75 s t(norm)=1.11262, mflops=4.4939 (err=1.6e-16) 5. Beauregard: elapsed time t=1.57 s, 32768 iters, t-(init.)=1.55 s t(norm)=1.97093, mflops=2.53688 (err=1.3e-16) 6. Bergland: elapsed time t=1.18 s, 65536 iters, t-(init.)=1.15 s t(norm)=0.73115, mflops=6.83854 (err=2.0e-16) 7. Brenner: elapsed time t=1.52 s, 65536 iters, t-(init.)=1.49 s t(norm)=0.947316, mflops=5.27807 (err=1.3e-16) 8. Burrus: elapsed time t=1.58 s, 65536 iters, t-(init.)=1.54 s t(norm)=0.979106, mflops=5.1067 (err=1.9e-16) 9. CWP (min N): elapsed time t=1.71 s, 131072 iters, t-(init.)=1.64 s t(norm)=0.521342, mflops=9.59063 10. CWP (best N) (N=15): elapsed time t=1.07 s, 32768 iters, t-(init.)=1.04 s t(norm)=1.32243, mflops=3.78092 11. Edelblute: elapsed time t=1.56 s, 65536 iters, t-(init.)=1.52 s t(norm)=0.96639, mflops=5.17389 (err=1.9e-16) 12. FFTPACK: elapsed time t=1.71 s, 131072 iters, t-(init.)=1.64 s t(norm)=0.521342, mflops=9.59063 (err=1.8e-16) 13. FFTPACK (f2c): elapsed time t=1.83 s, 131072 iters, t-(init.)=1.76 s t(norm)=0.559489, mflops=8.93673 (err=1.8e-16) FFTW_MEASURE plan: (cost = 8.239746e-06) FFTW_NOTW 8 14. FFTW: elapsed time t=1.09 s, 131072 iters, t-(init.)=1.02 s t(norm)=0.324249, mflops=15.4202 (err=1.0e-16) FFTW_ESTIMATE plan: (cost = 4.688000e+02) FFTW_NOTW 8 15. FFTW_ESTIMATE: elapsed time t=1.09 s, 131072 iters, t-(init.)=1.02 s t(norm)=0.324249, mflops=15.4202 (err=1.0e-16) 16. Frigo-old: elapsed time t=1.11 s, 131072 iters, t-(init.)=1.04 s t(norm)=0.330607, mflops=15.1237 (err=1.0e-16) 17. Green: elapsed time t=1.81 s, 131072 iters, t-(init.)=1.74 s t(norm)=0.553131, mflops=9.03945 (err=1.2e-16) 18. GSL: elapsed time t=1.28 s, 65536 iters, t-(init.)=1.24 s t(norm)=0.788371, mflops=6.34219 (err=1.2e-16) 19. GSL DIT: elapsed time t=1.89 s, 65536 iters, t-(init.)=1.85 s t(norm)=1.1762, mflops=4.25098 (err=1.2e-16) 20. GSL DIF: elapsed time t=1.15 s, 32768 iters, t-(init.)=1.14 s t(norm)=1.44958, mflops=3.44926 (err=1.6e-16) 21. Krukar: elapsed time t=1.5 s, 131072 iters, t-(init.)=1.43 s t(norm)=0.454585, mflops=10.999 (err=1.2e-16) 22. Mayer (Buneman): elapsed time t=1.97 s, 131072 iters, t-(init.)=1.89 s t(norm)=0.600815, mflops=8.32203 (err=1.5e-16) 23. Mayer (simple): elapsed time t=2 s, 131072 iters, t-(init.)=1.92 s t(norm)=0.610352, mflops=8.192 24. Mayer (lookup): elapsed time t=1.98 s, 131072 iters, t-(init.)=1.91 s t(norm)=0.607173, mflops=8.23489 (err=1.5e-16) 25. Monro: elapsed time t=1.96 s, 65536 iters, t-(init.)=1.92 s t(norm)=1.2207, mflops=4.096 (err=1.3e-08) 26. NAPACK (f2c): elapsed time t=1.82 s, 32768 iters, t-(init.)=1.8 s t(norm)=2.28882, mflops=2.18453 (err=2.8e-16) 27. Nielsen: elapsed time t=1.76 s, 65536 iters, t-(init.)=1.73 s t(norm)=1.0999, mflops=4.54585 (err=1.3e-15) 28. NR (C): elapsed time t=1.94 s, 65536 iters, t-(init.)=1.9 s t(norm)=1.20799, mflops=4.13912 (err=1.2e-16) 29. NR (F): elapsed time t=1.05 s, 32768 iters, t-(init.)=1.03 s t(norm)=1.30971, mflops=3.81763 (err=1.5e-16) 30. Ooura (C): elapsed time t=1.38 s, 131072 iters, t-(init.)=1.31 s t(norm)=0.416438, mflops=12.0066 (err=1.6e-16) 31. Ooura (F): elapsed time t=1.45 s, 131072 iters, t-(init.)=1.37 s t(norm)=0.435511, mflops=11.4808 (err=1.6e-16) 32. Skipping fft (QFT requires N >= 16). 33. Ransom: elapsed time t=1.07 s, 16384 iters, t-(init.)=1.06 s t(norm)=2.69572, mflops=1.85479 (err=7.5e-16) 34. SCIPORT: elapsed time t=1.23 s, 32768 iters, t-(init.)=1.22 s t(norm)=1.55131, mflops=3.22308 (err=4.5e-08) 35. Singleton: elapsed time t=1.07 s, 32768 iters, t-(init.)=1.05 s t(norm)=1.33514, mflops=3.74491 (err=1.2e-16) 36. Singleton (f2c): elapsed time t=1.88 s, 65536 iters, t-(init.)=1.85 s t(norm)=1.1762, mflops=4.25098 (err=1.2e-16) 37. Sorensen: elapsed time t=1.73 s, 131072 iters, t-(init.)=1.66 s t(norm)=0.5277, mflops=9.47508 (err=2.4e-16) 38. Sorensen DIT: elapsed time t=1.53 s, 65536 iters, t-(init.)=1.49 s t(norm)=0.947316, mflops=5.27807 (err=1.8e-16) 39. Temperton: elapsed time t=1.21 s, 65536 iters, t-(init.)=1.17 s t(norm)=0.743866, mflops=6.72164 (err=7.5e-09) 40. Temperton (f2c): elapsed time t=1.33 s, 65536 iters, t-(init.)=1.3 s t(norm)=0.826518, mflops=6.04948 (err=1.4e-16) 41. Valkenburg: elapsed time t=1.6 s, 32768 iters, t-(init.)=1.59 s t(norm)=2.02179, mflops=2.47306 (err=1.3e-16) Top mflops for N=8 = 15.4202 Normalized results and averages for N=8: fft 0: mflops = 12.7875 (norm. = 0.829268), norm. avg. (of 3) = 0.80055 fft 1: mflops = 12.8923 (norm. = 0.836066), norm. avg. (of 3) = 0.775642 fft 2: mflops = 5.82542 (norm. = 0.377778), norm. avg. (of 3) = 0.433023 fft 3: mflops = 2.25986 (norm. = 0.146552), norm. avg. (of 3) = 0.106776 fft 4: mflops = 4.4939 (norm. = 0.291429), norm. avg. (of 3) = 0.229699 fft 5: mflops = 2.53688 (norm. = 0.164516), norm. avg. (of 3) = 0.142428 fft 6: mflops = 6.83854 (norm. = 0.443478), norm. avg. (of 3) = 0.358212 fft 7: mflops = 5.27807 (norm. = 0.342282), norm. avg. (of 3) = 0.28819 fft 8: mflops = 5.1067 (norm. = 0.331169), norm. avg. (of 3) = 0.465489 fft 9: mflops = 9.59063 (norm. = 0.621951), norm. avg. (of 3) = 0.407011 fft 10: mflops = 3.78092 (norm. = 0.245192), norm. avg. (of 3) = 0.15813 fft 11: mflops = 5.17389 (norm. = 0.335526), norm. avg. (of 2) = 0.340223 fft 12: mflops = 9.59063 (norm. = 0.621951), norm. avg. (of 3) = 0.510531 fft 13: mflops = 8.93673 (norm. = 0.579545), norm. avg. (of 3) = 0.448228 fft 14: mflops = 15.4202 (norm. = 1), norm. avg. (of 3) = 0.931385 fft 15: mflops = 15.4202 (norm. = 1), norm. avg. (of 3) = 0.938982 fft 16: mflops = 15.1237 (norm. = 0.980769), norm. avg. (of 3) = 0.99359 fft 17: mflops = 9.03945 (norm. = 0.586207), norm. avg. (of 1) = 0.586207 fft 18: mflops = 6.34219 (norm. = 0.41129), norm. avg. (of 3) = 0.323028 fft 19: mflops = 4.25098 (norm. = 0.275676), norm. avg. (of 3) = 0.196664 fft 20: mflops = 3.44926 (norm. = 0.223684), norm. avg. (of 3) = 0.162944 fft 21: mflops = 10.999 (norm. = 0.713287), norm. avg. (of 3) = 0.677111 fft 22: mflops = 8.32203 (norm. = 0.539683), norm. avg. (of 2) = 0.472671 fft 23: mflops = 8.192 (norm. = 0.53125), norm. avg. (of 2) = 0.462271 fft 24: mflops = 8.23489 (norm. = 0.534031), norm. avg. (of 2) = 0.46609 fft 25: mflops = 4.096 (norm. = 0.265625), norm. avg. (of 2) = 0.200565 fft 26: mflops = 2.18453 (norm. = 0.141667), norm. avg. (of 3) = 0.103997 fft 27: mflops = 4.54585 (norm. = 0.294798), norm. avg. (of 3) = 0.168034 fft 28: mflops = 4.13912 (norm. = 0.268421), norm. avg. (of 3) = 0.191945 fft 29: mflops = 3.81763 (norm. = 0.247573), norm. avg. (of 3) = 0.177671 fft 30: mflops = 12.0066 (norm. = 0.778626), norm. avg. (of 3) = 0.747963 fft 31: mflops = 11.4808 (norm. = 0.744526), norm. avg. (of 3) = 0.703234 fft 32: mflops = -1 (norm. = -0.0648499), norm. avg. (of 0) = -1 fft 33: mflops = 1.85479 (norm. = 0.120283), norm. avg. (of 2) = 0.102576 fft 34: mflops = 3.22308 (norm. = 0.209016), norm. avg. (of 2) = 0.219687 fft 35: mflops = 3.74491 (norm. = 0.242857), norm. avg. (of 3) = 0.203712 fft 36: mflops = 4.25098 (norm. = 0.275676), norm. avg. (of 3) = 0.222563 fft 37: mflops = 9.47508 (norm. = 0.614458), norm. avg. (of 3) = 0.611594 fft 38: mflops = 5.27807 (norm. = 0.342282), norm. avg. (of 3) = 0.454718 fft 39: mflops = 6.72164 (norm. = 0.435897), norm. avg. (of 3) = 0.311436 fft 40: mflops = 6.04948 (norm. = 0.392308), norm. avg. (of 3) = 0.27586 fft 41: mflops = 2.47306 (norm. = 0.160377), norm. avg. (of 3) = 0.177994 Benchmarking for array size = 16 (power of 2): 0. Arndt DIF: elapsed time t=1.43 s, 32768 iters, t-(init.)=1.4 s t(norm)=0.667572, mflops=7.48983 (err=1.6e-16) 1. Arndt DIT: elapsed time t=1.37 s, 32768 iters, t-(init.)=1.34 s t(norm)=0.638962, mflops=7.82519 (err=1.2e-16) 2. Arndt Split-Radix: elapsed time t=1.78 s, 32768 iters, t-(init.)=1.75 s t(norm)=0.834465, mflops=5.99186 (err=1.4e-16) 3. Arndt 4-step: elapsed time t=1.74 s, 16384 iters, t-(init.)=1.73 s t(norm)=1.64986, mflops=3.03057 (err=1.3e-16) 4. Bailey: elapsed time t=1.07 s, 16384 iters, t-(init.)=1.06 s t(norm)=1.01089, mflops=4.94611 (err=1.4e-16) 5. Beauregard: elapsed time t=1.97 s, 16384 iters, t-(init.)=1.95 s t(norm)=1.85966, mflops=2.68866 (err=1.9e-16) 6. Bergland: elapsed time t=1.42 s, 32768 iters, t-(init.)=1.39 s t(norm)=0.662804, mflops=7.54371 (err=2.0e-16) 7. Brenner: elapsed time t=1.68 s, 32768 iters, t-(init.)=1.64 s t(norm)=0.782013, mflops=6.39376 (err=1.5e-16) 8. Burrus: elapsed time t=1.1 s, 16384 iters, t-(init.)=1.09 s t(norm)=1.03951, mflops=4.80998 (err=1.3e-16) 9. CWP (min N): elapsed time t=1.86 s, 65536 iters, t-(init.)=1.8 s t(norm)=0.429153, mflops=11.6508 10. CWP (best N) (N=28): elapsed time t=1.04 s, 16384 iters, t-(init.)=1.02 s t(norm)=0.972748, mflops=5.14008 11. Edelblute: elapsed time t=1.07 s, 16384 iters, t-(init.)=1.05 s t(norm)=1.00136, mflops=4.99322 (err=1.3e-16) 12. FFTPACK: elapsed time t=1.03 s, 32768 iters, t-(init.)=1 s t(norm)=0.476837, mflops=10.4858 (err=1.5e-16) 13. FFTPACK (f2c): elapsed time t=1.08 s, 32768 iters, t-(init.)=1.05 s t(norm)=0.500679, mflops=9.98644 (err=1.3e-16) FFTW_MEASURE plan: (cost = 2.258301e-05) FFTW_NOTW 16 14. FFTW: elapsed time t=1.48 s, 65536 iters, t-(init.)=1.42 s t(norm)=0.338554, mflops=14.7687 (err=1.6e-16) FFTW_ESTIMATE plan: (cost = 4.256000e+02) FFTW_NOTW 16 15. FFTW_ESTIMATE: elapsed time t=1.5 s, 65536 iters, t-(init.)=1.44 s t(norm)=0.343323, mflops=14.5636 (err=1.6e-16) 16. Frigo-old: elapsed time t=1.59 s, 65536 iters, t-(init.)=1.53 s t(norm)=0.36478, mflops=13.7069 (err=1.6e-16) 17. Green: elapsed time t=1.4 s, 32768 iters, t-(init.)=1.37 s t(norm)=0.653267, mflops=7.65384 (err=1.9e-16) 18. GSL: elapsed time t=1.41 s, 32768 iters, t-(init.)=1.38 s t(norm)=0.658035, mflops=7.59838 (err=1.5e-16) 19. GSL DIT: elapsed time t=1.91 s, 32768 iters, t-(init.)=1.88 s t(norm)=0.896454, mflops=5.57753 (err=1.6e-16) 20. GSL DIF: elapsed time t=1.2 s, 16384 iters, t-(init.)=1.18 s t(norm)=1.12534, mflops=4.44312 (err=1.9e-16) 21. Krukar: elapsed time t=1.98 s, 65536 iters, t-(init.)=1.92 s t(norm)=0.457764, mflops=10.9227 (err=1.7e-16) 22. Mayer (Buneman): elapsed time t=1.49 s, 32768 iters, t-(init.)=1.46 s t(norm)=0.696182, mflops=7.18203 (err=1.3e-16) 23. Mayer (simple): elapsed time t=1.4 s, 32768 iters, t-(init.)=1.37 s t(norm)=0.653267, mflops=7.65384 24. Mayer (lookup): elapsed time t=1.31 s, 32768 iters, t-(init.)=1.28 s t(norm)=0.610352, mflops=8.192 (err=1.4e-16) 25. Monro: elapsed time t=1.97 s, 32768 iters, t-(init.)=1.94 s t(norm)=0.925064, mflops=5.40503 (err=1.2e-08) 26. NAPACK (f2c): elapsed time t=1.83 s, 16384 iters, t-(init.)=1.81 s t(norm)=1.72615, mflops=2.89662 (err=3.5e-16) 27. Nielsen: elapsed time t=1.12 s, 16384 iters, t-(init.)=1.1 s t(norm)=1.04904, mflops=4.76625 (err=1.3e-16) 28. NR (C): elapsed time t=1.95 s, 32768 iters, t-(init.)=1.92 s t(norm)=0.915527, mflops=5.46133 (err=1.6e-16) 29. NR (F): elapsed time t=1.06 s, 16384 iters, t-(init.)=1.04 s t(norm)=0.991821, mflops=5.04123 (err=1.7e-16) 30. Ooura (C): elapsed time t=1.93 s, 65536 iters, t-(init.)=1.87 s t(norm)=0.445843, mflops=11.2147 (err=1.4e-16) 31. Ooura (F): elapsed time t=1.98 s, 65536 iters, t-(init.)=1.92 s t(norm)=0.457764, mflops=10.9227 (err=1.4e-16) 32. QFT: elapsed time t=1.81 s, 65536 iters, t-(init.)=1.75 s t(norm)=0.417233, mflops=11.9837 (err=1.3e-16) 33. Ransom: elapsed time t=1.48 s, 16384 iters, t-(init.)=1.47 s t(norm)=1.4019, mflops=3.56659 (err=6.0e-16) 34. SCIPORT: elapsed time t=1.7 s, 16384 iters, t-(init.)=1.69 s t(norm)=1.61171, mflops=3.1023 (err=5.2e-08) 35. Singleton: elapsed time t=1.76 s, 32768 iters, t-(init.)=1.73 s t(norm)=0.824928, mflops=6.06113 (err=1.7e-16) 36. Singleton (f2c): elapsed time t=1.66 s, 32768 iters, t-(init.)=1.63 s t(norm)=0.777245, mflops=6.43298 (err=1.9e-16) 37. Sorensen: elapsed time t=1.1 s, 32768 iters, t-(init.)=1.07 s t(norm)=0.510216, mflops=9.79978 (err=1.3e-16) 38. Sorensen DIT: elapsed time t=1.02 s, 16384 iters, t-(init.)=1 s t(norm)=0.953674, mflops=5.24288 (err=1.4e-16) 39. Temperton: elapsed time t=1.36 s, 32768 iters, t-(init.)=1.33 s t(norm)=0.634193, mflops=7.88403 (err=2.9e-08) 40. Temperton (f2c): elapsed time t=1.55 s, 32768 iters, t-(init.)=1.52 s t(norm)=0.724792, mflops=6.89853 (err=1.5e-16) 41. Valkenburg: elapsed time t=1.04 s, 8192 iters, t-(init.)=1.03 s t(norm)=1.96457, mflops=2.54509 (err=3.0e-16) Top mflops for N=16 = 14.7687 Normalized results and averages for N=16: fft 0: mflops = 7.48983 (norm. = 0.507143), norm. avg. (of 4) = 0.727199 fft 1: mflops = 7.82519 (norm. = 0.529851), norm. avg. (of 4) = 0.714194 fft 2: mflops = 5.99186 (norm. = 0.405714), norm. avg. (of 4) = 0.426196 fft 3: mflops = 3.03057 (norm. = 0.205202), norm. avg. (of 4) = 0.131382 fft 4: mflops = 4.94611 (norm. = 0.334906), norm. avg. (of 4) = 0.256 fft 5: mflops = 2.68866 (norm. = 0.182051), norm. avg. (of 4) = 0.152334 fft 6: mflops = 7.54371 (norm. = 0.510791), norm. avg. (of 4) = 0.396357 fft 7: mflops = 6.39376 (norm. = 0.432927), norm. avg. (of 4) = 0.324374 fft 8: mflops = 4.80998 (norm. = 0.325688), norm. avg. (of 4) = 0.430539 fft 9: mflops = 11.6508 (norm. = 0.788889), norm. avg. (of 4) = 0.50248 fft 10: mflops = 5.14008 (norm. = 0.348039), norm. avg. (of 4) = 0.205607 fft 11: mflops = 4.99322 (norm. = 0.338095), norm. avg. (of 3) = 0.339514 fft 12: mflops = 10.4858 (norm. = 0.71), norm. avg. (of 4) = 0.560398 fft 13: mflops = 9.98644 (norm. = 0.67619), norm. avg. (of 4) = 0.505219 fft 14: mflops = 14.7687 (norm. = 1), norm. avg. (of 4) = 0.948539 fft 15: mflops = 14.5636 (norm. = 0.986111), norm. avg. (of 4) = 0.950764 fft 16: mflops = 13.7069 (norm. = 0.928105), norm. avg. (of 4) = 0.977218 fft 17: mflops = 7.65384 (norm. = 0.518248), norm. avg. (of 2) = 0.552228 fft 18: mflops = 7.59838 (norm. = 0.514493), norm. avg. (of 4) = 0.370894 fft 19: mflops = 5.57753 (norm. = 0.37766), norm. avg. (of 4) = 0.241913 fft 20: mflops = 4.44312 (norm. = 0.300847), norm. avg. (of 4) = 0.19742 fft 21: mflops = 10.9227 (norm. = 0.739583), norm. avg. (of 4) = 0.692729 fft 22: mflops = 7.18203 (norm. = 0.486301), norm. avg. (of 3) = 0.477215 fft 23: mflops = 7.65384 (norm. = 0.518248), norm. avg. (of 3) = 0.48093 fft 24: mflops = 8.192 (norm. = 0.554687), norm. avg. (of 3) = 0.495622 fft 25: mflops = 5.40503 (norm. = 0.365979), norm. avg. (of 3) = 0.255703 fft 26: mflops = 2.89662 (norm. = 0.196133), norm. avg. (of 4) = 0.127031 fft 27: mflops = 4.76625 (norm. = 0.322727), norm. avg. (of 4) = 0.206708 fft 28: mflops = 5.46133 (norm. = 0.369792), norm. avg. (of 4) = 0.236407 fft 29: mflops = 5.04123 (norm. = 0.341346), norm. avg. (of 4) = 0.21859 fft 30: mflops = 11.2147 (norm. = 0.759358), norm. avg. (of 4) = 0.750812 fft 31: mflops = 10.9227 (norm. = 0.739583), norm. avg. (of 4) = 0.712322 fft 32: mflops = 11.9837 (norm. = 0.811429), norm. avg. (of 1) = 0.811429 fft 33: mflops = 3.56659 (norm. = 0.241497), norm. avg. (of 3) = 0.148883 fft 34: mflops = 3.1023 (norm. = 0.210059), norm. avg. (of 3) = 0.216478 fft 35: mflops = 6.06113 (norm. = 0.410405), norm. avg. (of 4) = 0.255385 fft 36: mflops = 6.43298 (norm. = 0.435583), norm. avg. (of 4) = 0.275818 fft 37: mflops = 9.79978 (norm. = 0.663551), norm. avg. (of 4) = 0.624583 fft 38: mflops = 5.24288 (norm. = 0.355), norm. avg. (of 4) = 0.429789 fft 39: mflops = 7.88403 (norm. = 0.533835), norm. avg. (of 4) = 0.367036 fft 40: mflops = 6.89853 (norm. = 0.467105), norm. avg. (of 4) = 0.323672 fft 41: mflops = 2.54509 (norm. = 0.17233), norm. avg. (of 4) = 0.176578 Benchmarking for array size = 32 (power of 2): 0. Arndt DIF: elapsed time t=1.65 s, 16384 iters, t-(init.)=1.63 s t(norm)=0.621796, mflops=8.04123 (err=6.0e-16) 1. Arndt DIT: elapsed time t=1.61 s, 16384 iters, t-(init.)=1.58 s t(norm)=0.602722, mflops=8.2957 (err=5.5e-16) 2. Arndt Split-Radix: elapsed time t=1.07 s, 8192 iters, t-(init.)=1.05 s t(norm)=0.801086, mflops=6.24152 (err=3.6e-16) 3. Arndt 4-step: elapsed time t=1.94 s, 8192 iters, t-(init.)=1.93 s t(norm)=1.47247, mflops=3.39565 (err=3.2e-16) 4. Bailey: elapsed time t=1.13 s, 8192 iters, t-(init.)=1.12 s t(norm)=0.854492, mflops=5.85143 (err=6.7e-16) 5. Beauregard: elapsed time t=1.23 s, 4096 iters, t-(init.)=1.22 s t(norm)=1.86157, mflops=2.6859 (err=6.6e-16) 6. Bergland: elapsed time t=1.55 s, 16384 iters, t-(init.)=1.53 s t(norm)=0.583649, mflops=8.5668 (err=6.4e-16) 7. Brenner: elapsed time t=1.95 s, 16384 iters, t-(init.)=1.93 s t(norm)=0.736237, mflops=6.7913 (err=6.0e-16) 8. Burrus: elapsed time t=1.34 s, 8192 iters, t-(init.)=1.33 s t(norm)=1.01471, mflops=4.92752 (err=3.5e-16) 9. CWP (min N) (N=33): elapsed time t=1.52 s, 16384 iters, t-(init.)=1.49 s t(norm)=0.56839, mflops=8.79678 10. CWP (best N) (N=35): elapsed time t=1.51 s, 16384 iters, t-(init.)=1.48 s t(norm)=0.564575, mflops=8.85622 11. Edelblute: elapsed time t=1.29 s, 8192 iters, t-(init.)=1.27 s t(norm)=0.968933, mflops=5.16031 (err=3.5e-16) 12. FFTPACK: elapsed time t=1.4 s, 16384 iters, t-(init.)=1.37 s t(norm)=0.522614, mflops=9.5673 (err=4.4e-16) 13. FFTPACK (f2c): elapsed time t=1.46 s, 16384 iters, t-(init.)=1.43 s t(norm)=0.545502, mflops=9.16587 (err=5.3e-16) FFTW_MEASURE plan: (cost = 5.859375e-05) FFTW_NOTW 32 14. FFTW: elapsed time t=1.9 s, 32768 iters, t-(init.)=1.84 s t(norm)=0.350952, mflops=14.247 (err=6.1e-16) FFTW_ESTIMATE plan: (cost = 3.200000e+01) FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.91 s, 32768 iters, t-(init.)=1.86 s t(norm)=0.354767, mflops=14.0938 (err=6.1e-16) 16. Frigo-old: elapsed time t=1.02 s, 16384 iters, t-(init.)=0.99 s t(norm)=0.377655, mflops=13.2396 (err=5.5e-16) 17. Green: elapsed time t=1.65 s, 16384 iters, t-(init.)=1.63 s t(norm)=0.621796, mflops=8.04123 (err=6.9e-16) 18. GSL: elapsed time t=1.86 s, 16384 iters, t-(init.)=1.83 s t(norm)=0.69809, mflops=7.1624 (err=5.0e-16) 19. GSL DIT: elapsed time t=1.02 s, 8192 iters, t-(init.)=1.01 s t(norm)=0.770569, mflops=6.48871 (err=6.1e-16) 20. GSL DIF: elapsed time t=1.26 s, 8192 iters, t-(init.)=1.25 s t(norm)=0.953674, mflops=5.24288 (err=4.3e-16) 21. Krukar: elapsed time t=1.27 s, 16384 iters, t-(init.)=1.25 s t(norm)=0.476837, mflops=10.4858 (err=8.6e-16) 22. Mayer (Buneman): elapsed time t=1.75 s, 16384 iters, t-(init.)=1.73 s t(norm)=0.659943, mflops=7.57642 (err=3.5e-16) 23. Mayer (simple): elapsed time t=1.58 s, 16384 iters, t-(init.)=1.56 s t(norm)=0.595093, mflops=8.40205 24. Mayer (lookup): elapsed time t=1.5 s, 16384 iters, t-(init.)=1.48 s t(norm)=0.564575, mflops=8.85622 (err=5.9e-16) 25. Monro: elapsed time t=1.02 s, 8192 iters, t-(init.)=1 s t(norm)=0.762939, mflops=6.5536 (err=1.2e-07) 26. NAPACK (f2c): elapsed time t=1 s, 4096 iters, t-(init.)=1 s t(norm)=1.52588, mflops=3.2768 (err=9.3e-16) 27. Nielsen: elapsed time t=1.09 s, 8192 iters, t-(init.)=1.08 s t(norm)=0.823975, mflops=6.06815 (err=3.1e-15) 28. NR (C): elapsed time t=1.01 s, 8192 iters, t-(init.)=0.99 s t(norm)=0.75531, mflops=6.6198 (err=6.1e-16) 29. NR (F): elapsed time t=1.11 s, 8192 iters, t-(init.)=1.1 s t(norm)=0.839233, mflops=5.95782 (err=7.0e-16) 30. Ooura (C): elapsed time t=1.2 s, 16384 iters, t-(init.)=1.18 s t(norm)=0.450134, mflops=11.1078 (err=4.3e-16) 31. Ooura (F): elapsed time t=1.25 s, 16384 iters, t-(init.)=1.22 s t(norm)=0.465393, mflops=10.7436 (err=4.3e-16) 32. QFT: elapsed time t=1.22 s, 16384 iters, t-(init.)=1.19 s t(norm)=0.453949, mflops=11.0145 (err=4.6e-16) 33. Ransom: elapsed time t=1.64 s, 8192 iters, t-(init.)=1.63 s t(norm)=1.24359, mflops=4.02061 (err=3.8e-15) 34. SCIPORT: elapsed time t=1.11 s, 4096 iters, t-(init.)=1.11 s t(norm)=1.69373, mflops=2.95207 (err=3.1e-07) 35. Singleton: elapsed time t=1.01 s, 8192 iters, t-(init.)=1 s t(norm)=0.762939, mflops=6.5536 (err=7.1e-16) 36. Singleton (f2c): elapsed time t=1.87 s, 16384 iters, t-(init.)=1.84 s t(norm)=0.701904, mflops=7.12348 (err=5.8e-16) 37. Sorensen: elapsed time t=1.32 s, 16384 iters, t-(init.)=1.3 s t(norm)=0.495911, mflops=10.0825 (err=3.4e-16) 38. Sorensen DIT: elapsed time t=1.23 s, 8192 iters, t-(init.)=1.22 s t(norm)=0.930786, mflops=5.3718 (err=5.1e-16) 39. Temperton: elapsed time t=1.67 s, 16384 iters, t-(init.)=1.64 s t(norm)=0.62561, mflops=7.9922 (err=1.8e-07) 40. Temperton (f2c): elapsed time t=1.03 s, 8192 iters, t-(init.)=1.02 s t(norm)=0.778198, mflops=6.4251 (err=5.1e-16) 41. Valkenburg: elapsed time t=1.28 s, 4096 iters, t-(init.)=1.28 s t(norm)=1.95313, mflops=2.56 (err=8.4e-16) Top mflops for N=32 = 14.247 Normalized results and averages for N=32: fft 0: mflops = 8.04123 (norm. = 0.564417), norm. avg. (of 5) = 0.694642 fft 1: mflops = 8.2957 (norm. = 0.582278), norm. avg. (of 5) = 0.687811 fft 2: mflops = 6.24152 (norm. = 0.438095), norm. avg. (of 5) = 0.428576 fft 3: mflops = 3.39565 (norm. = 0.238342), norm. avg. (of 5) = 0.152774 fft 4: mflops = 5.85143 (norm. = 0.410714), norm. avg. (of 5) = 0.286943 fft 5: mflops = 2.6859 (norm. = 0.188525), norm. avg. (of 5) = 0.159572 fft 6: mflops = 8.5668 (norm. = 0.601307), norm. avg. (of 5) = 0.437347 fft 7: mflops = 6.7913 (norm. = 0.476684), norm. avg. (of 5) = 0.354836 fft 8: mflops = 4.92752 (norm. = 0.345865), norm. avg. (of 5) = 0.413604 fft 9: mflops = 8.79678 (norm. = 0.61745), norm. avg. (of 5) = 0.525474 fft 10: mflops = 8.85622 (norm. = 0.621622), norm. avg. (of 5) = 0.28881 fft 11: mflops = 5.16031 (norm. = 0.362205), norm. avg. (of 4) = 0.345187 fft 12: mflops = 9.5673 (norm. = 0.671533), norm. avg. (of 5) = 0.582625 fft 13: mflops = 9.16587 (norm. = 0.643357), norm. avg. (of 5) = 0.532846 fft 14: mflops = 14.247 (norm. = 1), norm. avg. (of 5) = 0.958831 fft 15: mflops = 14.0938 (norm. = 0.989247), norm. avg. (of 5) = 0.958461 fft 16: mflops = 13.2396 (norm. = 0.929293), norm. avg. (of 5) = 0.967633 fft 17: mflops = 8.04123 (norm. = 0.564417), norm. avg. (of 3) = 0.556291 fft 18: mflops = 7.1624 (norm. = 0.502732), norm. avg. (of 5) = 0.397262 fft 19: mflops = 6.48871 (norm. = 0.455446), norm. avg. (of 5) = 0.284619 fft 20: mflops = 5.24288 (norm. = 0.368), norm. avg. (of 5) = 0.231536 fft 21: mflops = 10.4858 (norm. = 0.736), norm. avg. (of 5) = 0.701383 fft 22: mflops = 7.57642 (norm. = 0.531792), norm. avg. (of 4) = 0.490859 fft 23: mflops = 8.40205 (norm. = 0.589744), norm. avg. (of 4) = 0.508134 fft 24: mflops = 8.85622 (norm. = 0.621622), norm. avg. (of 4) = 0.527122 fft 25: mflops = 6.5536 (norm. = 0.46), norm. avg. (of 4) = 0.306777 fft 26: mflops = 3.2768 (norm. = 0.23), norm. avg. (of 5) = 0.147625 fft 27: mflops = 6.06815 (norm. = 0.425926), norm. avg. (of 5) = 0.250551 fft 28: mflops = 6.6198 (norm. = 0.464646), norm. avg. (of 5) = 0.282055 fft 29: mflops = 5.95782 (norm. = 0.418182), norm. avg. (of 5) = 0.258508 fft 30: mflops = 11.1078 (norm. = 0.779661), norm. avg. (of 5) = 0.756582 fft 31: mflops = 10.7436 (norm. = 0.754098), norm. avg. (of 5) = 0.720677 fft 32: mflops = 11.0145 (norm. = 0.773109), norm. avg. (of 2) = 0.792269 fft 33: mflops = 4.02061 (norm. = 0.282209), norm. avg. (of 4) = 0.182214 fft 34: mflops = 2.95207 (norm. = 0.207207), norm. avg. (of 4) = 0.21416 fft 35: mflops = 6.5536 (norm. = 0.46), norm. avg. (of 5) = 0.296308 fft 36: mflops = 7.12348 (norm. = 0.5), norm. avg. (of 5) = 0.320654 fft 37: mflops = 10.0825 (norm. = 0.707692), norm. avg. (of 5) = 0.641205 fft 38: mflops = 5.3718 (norm. = 0.377049), norm. avg. (of 5) = 0.419241 fft 39: mflops = 7.9922 (norm. = 0.560976), norm. avg. (of 5) = 0.405824 fft 40: mflops = 6.4251 (norm. = 0.45098), norm. avg. (of 5) = 0.349133 fft 41: mflops = 2.56 (norm. = 0.179687), norm. avg. (of 5) = 0.1772 Benchmarking for array size = 64 (power of 2): 0. Arndt DIF: elapsed time t=1.06 s, 4096 iters, t-(init.)=1.05 s t(norm)=0.667572, mflops=7.48983 (err=2.3e-16) 1. Arndt DIT: elapsed time t=1.99 s, 8192 iters, t-(init.)=1.97 s t(norm)=0.626246, mflops=7.98408 (err=3.4e-16) 2. Arndt Split-Radix: elapsed time t=1.22 s, 4096 iters, t-(init.)=1.21 s t(norm)=0.769297, mflops=6.49944 (err=4.4e-16) 3. Arndt 4-step: elapsed time t=1.92 s, 4096 iters, t-(init.)=1.9 s t(norm)=1.20799, mflops=4.13912 (err=3.6e-16) 4. Bailey: elapsed time t=1.3 s, 4096 iters, t-(init.)=1.29 s t(norm)=0.82016, mflops=6.09637 (err=4.8e-16) 5. Beauregard: elapsed time t=1.52 s, 2048 iters, t-(init.)=1.51 s t(norm)=1.92006, mflops=2.60408 (err=3.8e-16) 6. Bergland: elapsed time t=1.81 s, 8192 iters, t-(init.)=1.79 s t(norm)=0.569026, mflops=8.78695 (err=2.5e-16) 7. Brenner: elapsed time t=1.08 s, 4096 iters, t-(init.)=1.07 s t(norm)=0.680288, mflops=7.34983 (err=3.8e-16) 8. Burrus: elapsed time t=1.55 s, 4096 iters, t-(init.)=1.53 s t(norm)=0.972748, mflops=5.14008 (err=4.2e-16) 9. CWP (min N) (N=65): elapsed time t=1.78 s, 8192 iters, t-(init.)=1.75 s t(norm)=0.55631, mflops=8.98779 10. CWP (best N) (N=84): elapsed time t=1.95 s, 8192 iters, t-(init.)=1.92 s t(norm)=0.610352, mflops=8.192 11. Edelblute: elapsed time t=1.5 s, 4096 iters, t-(init.)=1.49 s t(norm)=0.947316, mflops=5.27807 (err=3.6e-16) 12. FFTPACK: elapsed time t=1.64 s, 8192 iters, t-(init.)=1.61 s t(norm)=0.511805, mflops=9.76934 (err=4.2e-16) 13. FFTPACK (f2c): elapsed time t=1.71 s, 8192 iters, t-(init.)=1.69 s t(norm)=0.537237, mflops=9.30689 (err=4.5e-16) FFTW_MEASURE plan: (cost = 1.464844e-04) FFTW_NOTW 64 14. FFTW: elapsed time t=1.19 s, 8192 iters, t-(init.)=1.16 s t(norm)=0.368754, mflops=13.5592 (err=4.2e-16) FFTW_ESTIMATE plan: (cost = 7.680000e+02) FFTW_TWIDDLE 2 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.31 s, 8192 iters, t-(init.)=1.29 s t(norm)=0.41008, mflops=12.1927 (err=2.6e-16) 16. Frigo-old: elapsed time t=1.42 s, 8192 iters, t-(init.)=1.39 s t(norm)=0.441869, mflops=11.3156 (err=4.5e-16) 17. Green: elapsed time t=1.84 s, 8192 iters, t-(init.)=1.81 s t(norm)=0.575384, mflops=8.68986 (err=3.7e-16) 18. GSL: elapsed time t=1.02 s, 4096 iters, t-(init.)=1.01 s t(norm)=0.642141, mflops=7.78646 (err=3.8e-16) 19. GSL DIT: elapsed time t=1.1 s, 4096 iters, t-(init.)=1.09 s t(norm)=0.693003, mflops=7.21497 (err=3.2e-16) 20. GSL DIF: elapsed time t=1.39 s, 4096 iters, t-(init.)=1.38 s t(norm)=0.87738, mflops=5.69878 (err=3.1e-16) 21. Krukar: elapsed time t=1.6 s, 8192 iters, t-(init.)=1.57 s t(norm)=0.49909, mflops=10.0182 (err=5.3e-16) 22. Mayer (Buneman): elapsed time t=1.07 s, 4096 iters, t-(init.)=1.06 s t(norm)=0.67393, mflops=7.41917 (err=2.0e-16) 23. Mayer (simple): elapsed time t=1.92 s, 8192 iters, t-(init.)=1.9 s t(norm)=0.603994, mflops=8.27823 24. Mayer (lookup): elapsed time t=1.8 s, 8192 iters, t-(init.)=1.77 s t(norm)=0.562668, mflops=8.88624 (err=3.4e-16) 25. Monro: elapsed time t=1.08 s, 4096 iters, t-(init.)=1.06 s t(norm)=0.67393, mflops=7.41917 (err=4.9e-08) 26. NAPACK (f2c): elapsed time t=1.07 s, 2048 iters, t-(init.)=1.07 s t(norm)=1.36058, mflops=3.67492 (err=1.0e-15) 27. Nielsen: elapsed time t=1.15 s, 4096 iters, t-(init.)=1.14 s t(norm)=0.724792, mflops=6.89853 (err=6.5e-15) 28. NR (C): elapsed time t=1.09 s, 4096 iters, t-(init.)=1.08 s t(norm)=0.686646, mflops=7.28178 (err=3.2e-16) 29. NR (F): elapsed time t=1.2 s, 4096 iters, t-(init.)=1.19 s t(norm)=0.756582, mflops=6.60867 (err=3.4e-16) 30. Ooura (C): elapsed time t=1.46 s, 8192 iters, t-(init.)=1.43 s t(norm)=0.454585, mflops=10.999 (err=2.9e-16) 31. Ooura (F): elapsed time t=1.49 s, 8192 iters, t-(init.)=1.46 s t(norm)=0.464122, mflops=10.773 (err=2.9e-16) 32. QFT: elapsed time t=1.56 s, 8192 iters, t-(init.)=1.53 s t(norm)=0.486374, mflops=10.2802 (err=5.9e-16) 33. Ransom: elapsed time t=1.44 s, 4096 iters, t-(init.)=1.42 s t(norm)=0.902812, mflops=5.53825 (err=2.5e-15) 34. SCIPORT: elapsed time t=1.34 s, 2048 iters, t-(init.)=1.33 s t(norm)=1.69118, mflops=2.95651 (err=2.0e-07) 35. Singleton: elapsed time t=1.09 s, 4096 iters, t-(init.)=1.08 s t(norm)=0.686646, mflops=7.28178 (err=3.5e-16) 36. Singleton (f2c): elapsed time t=1.03 s, 4096 iters, t-(init.)=1.02 s t(norm)=0.648499, mflops=7.71012 (err=4.6e-16) 37. Sorensen: elapsed time t=1.54 s, 8192 iters, t-(init.)=1.51 s t(norm)=0.480016, mflops=10.4163 (err=3.4e-16) 38. Sorensen DIT: elapsed time t=1.41 s, 4096 iters, t-(init.)=1.4 s t(norm)=0.890096, mflops=5.61737 (err=5.4e-16) 39. Temperton: elapsed time t=1.81 s, 8192 iters, t-(init.)=1.78 s t(norm)=0.565847, mflops=8.83631 (err=1.2e-07) 40. Temperton (f2c): elapsed time t=1.08 s, 4096 iters, t-(init.)=1.07 s t(norm)=0.680288, mflops=7.34983 (err=3.7e-16) 41. Valkenburg: elapsed time t=1.51 s, 2048 iters, t-(init.)=1.5 s t(norm)=1.90735, mflops=2.62144 (err=8.5e-16) Top mflops for N=64 = 13.5592 Normalized results and averages for N=64: fft 0: mflops = 7.48983 (norm. = 0.552381), norm. avg. (of 6) = 0.670932 fft 1: mflops = 7.98408 (norm. = 0.588832), norm. avg. (of 6) = 0.671315 fft 2: mflops = 6.49944 (norm. = 0.479339), norm. avg. (of 6) = 0.437036 fft 3: mflops = 4.13912 (norm. = 0.305263), norm. avg. (of 6) = 0.178189 fft 4: mflops = 6.09637 (norm. = 0.449612), norm. avg. (of 6) = 0.314055 fft 5: mflops = 2.60408 (norm. = 0.192053), norm. avg. (of 6) = 0.164985 fft 6: mflops = 8.78695 (norm. = 0.648045), norm. avg. (of 6) = 0.472463 fft 7: mflops = 7.34983 (norm. = 0.542056), norm. avg. (of 6) = 0.386039 fft 8: mflops = 5.14008 (norm. = 0.379085), norm. avg. (of 6) = 0.407851 fft 9: mflops = 8.98779 (norm. = 0.662857), norm. avg. (of 6) = 0.548371 fft 10: mflops = 8.192 (norm. = 0.604167), norm. avg. (of 6) = 0.341369 fft 11: mflops = 5.27807 (norm. = 0.389262), norm. avg. (of 5) = 0.354002 fft 12: mflops = 9.76934 (norm. = 0.720497), norm. avg. (of 6) = 0.605604 fft 13: mflops = 9.30689 (norm. = 0.686391), norm. avg. (of 6) = 0.558437 fft 14: mflops = 13.5592 (norm. = 1), norm. avg. (of 6) = 0.965693 fft 15: mflops = 12.1927 (norm. = 0.899225), norm. avg. (of 6) = 0.948588 fft 16: mflops = 11.3156 (norm. = 0.834532), norm. avg. (of 6) = 0.94545 fft 17: mflops = 8.68986 (norm. = 0.640884), norm. avg. (of 4) = 0.577439 fft 18: mflops = 7.78646 (norm. = 0.574257), norm. avg. (of 6) = 0.426761 fft 19: mflops = 7.21497 (norm. = 0.53211), norm. avg. (of 6) = 0.325868 fft 20: mflops = 5.69878 (norm. = 0.42029), norm. avg. (of 6) = 0.262995 fft 21: mflops = 10.0182 (norm. = 0.738854), norm. avg. (of 6) = 0.707628 fft 22: mflops = 7.41917 (norm. = 0.54717), norm. avg. (of 5) = 0.502121 fft 23: mflops = 8.27823 (norm. = 0.610526), norm. avg. (of 5) = 0.528612 fft 24: mflops = 8.88624 (norm. = 0.655367), norm. avg. (of 5) = 0.552771 fft 25: mflops = 7.41917 (norm. = 0.54717), norm. avg. (of 5) = 0.354856 fft 26: mflops = 3.67492 (norm. = 0.271028), norm. avg. (of 6) = 0.168192 fft 27: mflops = 6.89853 (norm. = 0.508772), norm. avg. (of 6) = 0.293588 fft 28: mflops = 7.28178 (norm. = 0.537037), norm. avg. (of 6) = 0.324552 fft 29: mflops = 6.60867 (norm. = 0.487395), norm. avg. (of 6) = 0.296656 fft 30: mflops = 10.999 (norm. = 0.811189), norm. avg. (of 6) = 0.765683 fft 31: mflops = 10.773 (norm. = 0.794521), norm. avg. (of 6) = 0.732984 fft 32: mflops = 10.2802 (norm. = 0.75817), norm. avg. (of 3) = 0.780903 fft 33: mflops = 5.53825 (norm. = 0.408451), norm. avg. (of 5) = 0.227461 fft 34: mflops = 2.95651 (norm. = 0.218045), norm. avg. (of 5) = 0.214937 fft 35: mflops = 7.28178 (norm. = 0.537037), norm. avg. (of 6) = 0.33643 fft 36: mflops = 7.71012 (norm. = 0.568627), norm. avg. (of 6) = 0.361983 fft 37: mflops = 10.4163 (norm. = 0.768212), norm. avg. (of 6) = 0.662373 fft 38: mflops = 5.61737 (norm. = 0.414286), norm. avg. (of 6) = 0.418415 fft 39: mflops = 8.83631 (norm. = 0.651685), norm. avg. (of 6) = 0.446801 fft 40: mflops = 7.34983 (norm. = 0.542056), norm. avg. (of 6) = 0.381287 fft 41: mflops = 2.62144 (norm. = 0.193333), norm. avg. (of 6) = 0.179889 Benchmarking for array size = 128 (power of 2): 0. Arndt DIF: elapsed time t=1.17 s, 2048 iters, t-(init.)=1.15 s t(norm)=0.6267, mflops=7.9783 (err=3.8e-16) 1. Arndt DIT: elapsed time t=1.12 s, 2048 iters, t-(init.)=1.1 s t(norm)=0.599452, mflops=8.34095 (err=5.1e-16) 2. Arndt Split-Radix: elapsed time t=1.36 s, 2048 iters, t-(init.)=1.35 s t(norm)=0.735692, mflops=6.79633 (err=6.1e-16) 3. Arndt 4-step: elapsed time t=1.15 s, 1024 iters, t-(init.)=1.14 s t(norm)=1.2425, mflops=4.02414 (err=3.3e-16) 4. Bailey: elapsed time t=1.39 s, 2048 iters, t-(init.)=1.38 s t(norm)=0.75204, mflops=6.64858 (err=6.1e-16) 5. Beauregard: elapsed time t=1.79 s, 1024 iters, t-(init.)=1.79 s t(norm)=1.95095, mflops=2.56286 (err=9.3e-16) 6. Bergland: elapsed time t=1.05 s, 2048 iters, t-(init.)=1.03 s t(norm)=0.561305, mflops=8.90781 (err=6.2e-16) 7. Brenner: elapsed time t=1.22 s, 2048 iters, t-(init.)=1.21 s t(norm)=0.659398, mflops=7.58268 (err=6.6e-16) 8. Burrus: elapsed time t=1.72 s, 2048 iters, t-(init.)=1.71 s t(norm)=0.931876, mflops=5.36552 (err=5.3e-16) 9. CWP (min N) (N=130): elapsed time t=1.96 s, 4096 iters, t-(init.)=1.93 s t(norm)=0.525883, mflops=9.50781 10. CWP (best N) (N=140): elapsed time t=1.82 s, 4096 iters, t-(init.)=1.79 s t(norm)=0.487736, mflops=10.2514 11. Edelblute: elapsed time t=1.68 s, 2048 iters, t-(init.)=1.66 s t(norm)=0.904628, mflops=5.52713 (err=6.7e-16) 12. FFTPACK: elapsed time t=1.86 s, 4096 iters, t-(init.)=1.84 s t(norm)=0.50136, mflops=9.97287 (err=5.3e-16) 13. FFTPACK (f2c): elapsed time t=1.89 s, 4096 iters, t-(init.)=1.86 s t(norm)=0.50681, mflops=9.86563 (err=6.2e-16) FFTW_MEASURE plan: (cost = 3.710938e-04) FFTW_TWIDDLE 16 FFTW_NOTW 8 14. FFTW: elapsed time t=1.52 s, 4096 iters, t-(init.)=1.49 s t(norm)=0.405993, mflops=12.3155 (err=5.8e-16) FFTW_ESTIMATE plan: (cost = 1.075200e+03) FFTW_TWIDDLE 4 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.53 s, 4096 iters, t-(init.)=1.51 s t(norm)=0.411442, mflops=12.1524 (err=3.6e-16) 16. Frigo-old: elapsed time t=1.65 s, 4096 iters, t-(init.)=1.63 s t(norm)=0.44414, mflops=11.2577 (err=4.4e-16) 17. Green: elapsed time t=1.13 s, 2048 iters, t-(init.)=1.12 s t(norm)=0.610352, mflops=8.192 (err=6.9e-16) 18. GSL: elapsed time t=1.19 s, 2048 iters, t-(init.)=1.18 s t(norm)=0.643049, mflops=7.77546 (err=8.2e-16) 19. GSL DIT: elapsed time t=1.21 s, 2048 iters, t-(init.)=1.2 s t(norm)=0.653948, mflops=7.64587 (err=7.5e-16) 20. GSL DIF: elapsed time t=1.54 s, 2048 iters, t-(init.)=1.53 s t(norm)=0.833784, mflops=5.99676 (err=7.6e-16) 21. Krukar: elapsed time t=1.95 s, 4096 iters, t-(init.)=1.93 s t(norm)=0.525883, mflops=9.50781 (err=6.5e-16) 22. Mayer (Buneman): elapsed time t=1.18 s, 2048 iters, t-(init.)=1.17 s t(norm)=0.637599, mflops=7.84191 (err=3.1e-16) 23. Mayer (simple): elapsed time t=1.07 s, 2048 iters, t-(init.)=1.06 s t(norm)=0.577654, mflops=8.6557 24. Mayer (lookup): elapsed time t=1.98 s, 4096 iters, t-(init.)=1.95 s t(norm)=0.531333, mflops=9.4103 (err=3.5e-16) 25. Monro: elapsed time t=1.24 s, 2048 iters, t-(init.)=1.23 s t(norm)=0.670297, mflops=7.45938 (err=8.3e-08) 26. NAPACK (f2c): elapsed time t=1.18 s, 1024 iters, t-(init.)=1.18 s t(norm)=1.2861, mflops=3.88773 (err=1.6e-15) 27. Nielsen: elapsed time t=1.32 s, 2048 iters, t-(init.)=1.31 s t(norm)=0.713893, mflops=7.00385 (err=1.7e-15) 28. NR (C): elapsed time t=1.18 s, 2048 iters, t-(init.)=1.17 s t(norm)=0.637599, mflops=7.84191 (err=7.5e-16) 29. NR (F): elapsed time t=1.3 s, 2048 iters, t-(init.)=1.28 s t(norm)=0.697545, mflops=7.168 (err=6.9e-16) 30. Ooura (C): elapsed time t=1.73 s, 4096 iters, t-(init.)=1.71 s t(norm)=0.465938, mflops=10.731 (err=6.7e-16) 31. Ooura (F): elapsed time t=1.75 s, 4096 iters, t-(init.)=1.72 s t(norm)=0.468663, mflops=10.6687 (err=6.7e-16) 32. QFT: elapsed time t=1.85 s, 4096 iters, t-(init.)=1.83 s t(norm)=0.498635, mflops=10.0274 (err=4.9e-16) 33. Ransom: elapsed time t=1.66 s, 2048 iters, t-(init.)=1.65 s t(norm)=0.899179, mflops=5.56063 (err=1.7e-15) 34. SCIPORT: elapsed time t=1.6 s, 1024 iters, t-(init.)=1.6 s t(norm)=1.74386, mflops=2.8672 (err=1.6e-07) 35. Singleton: elapsed time t=1.3 s, 2048 iters, t-(init.)=1.29 s t(norm)=0.702994, mflops=7.11243 (err=6.2e-16) 36. Singleton (f2c): elapsed time t=1.2 s, 2048 iters, t-(init.)=1.19 s t(norm)=0.648499, mflops=7.71012 (err=5.7e-16) 37. Sorensen: elapsed time t=1.78 s, 4096 iters, t-(init.)=1.75 s t(norm)=0.476837, mflops=10.4858 (err=4.3e-16) 38. Sorensen DIT: elapsed time t=1.56 s, 2048 iters, t-(init.)=1.54 s t(norm)=0.839233, mflops=5.95782 (err=4.0e-16) 39. Temperton: elapsed time t=1.1 s, 2048 iters, t-(init.)=1.09 s t(norm)=0.594003, mflops=8.41747 (err=9.9e-08) 40. Temperton (f2c): elapsed time t=1.28 s, 2048 iters, t-(init.)=1.26 s t(norm)=0.686646, mflops=7.28178 (err=7.7e-16) 41. Valkenburg: elapsed time t=1.76 s, 1024 iters, t-(init.)=1.76 s t(norm)=1.91825, mflops=2.60655 (err=8.6e-16) Top mflops for N=128 = 12.3155 Normalized results and averages for N=128: fft 0: mflops = 7.9783 (norm. = 0.647826), norm. avg. (of 7) = 0.667631 fft 1: mflops = 8.34095 (norm. = 0.677273), norm. avg. (of 7) = 0.672166 fft 2: mflops = 6.79633 (norm. = 0.551852), norm. avg. (of 7) = 0.453439 fft 3: mflops = 4.02414 (norm. = 0.326754), norm. avg. (of 7) = 0.199413 fft 4: mflops = 6.64858 (norm. = 0.539855), norm. avg. (of 7) = 0.346312 fft 5: mflops = 2.56286 (norm. = 0.208101), norm. avg. (of 7) = 0.171145 fft 6: mflops = 8.90781 (norm. = 0.723301), norm. avg. (of 7) = 0.508297 fft 7: mflops = 7.58268 (norm. = 0.615702), norm. avg. (of 7) = 0.418848 fft 8: mflops = 5.36552 (norm. = 0.435673), norm. avg. (of 7) = 0.411825 fft 9: mflops = 9.50781 (norm. = 0.772021), norm. avg. (of 7) = 0.580321 fft 10: mflops = 10.2514 (norm. = 0.832402), norm. avg. (of 7) = 0.411517 fft 11: mflops = 5.52713 (norm. = 0.448795), norm. avg. (of 6) = 0.3698 fft 12: mflops = 9.97287 (norm. = 0.809783), norm. avg. (of 7) = 0.634772 fft 13: mflops = 9.86563 (norm. = 0.801075), norm. avg. (of 7) = 0.5931 fft 14: mflops = 12.3155 (norm. = 1), norm. avg. (of 7) = 0.970594 fft 15: mflops = 12.1524 (norm. = 0.986755), norm. avg. (of 7) = 0.954041 fft 16: mflops = 11.2577 (norm. = 0.91411), norm. avg. (of 7) = 0.940973 fft 17: mflops = 8.192 (norm. = 0.665179), norm. avg. (of 5) = 0.594987 fft 18: mflops = 7.77546 (norm. = 0.631356), norm. avg. (of 7) = 0.455989 fft 19: mflops = 7.64587 (norm. = 0.620833), norm. avg. (of 7) = 0.368006 fft 20: mflops = 5.99676 (norm. = 0.486928), norm. avg. (of 7) = 0.294985 fft 21: mflops = 9.50781 (norm. = 0.772021), norm. avg. (of 7) = 0.716827 fft 22: mflops = 7.84191 (norm. = 0.636752), norm. avg. (of 6) = 0.52456 fft 23: mflops = 8.6557 (norm. = 0.70283), norm. avg. (of 6) = 0.557648 fft 24: mflops = 9.4103 (norm. = 0.764103), norm. avg. (of 6) = 0.587993 fft 25: mflops = 7.45938 (norm. = 0.605691), norm. avg. (of 6) = 0.396662 fft 26: mflops = 3.88773 (norm. = 0.315678), norm. avg. (of 7) = 0.189261 fft 27: mflops = 7.00385 (norm. = 0.568702), norm. avg. (of 7) = 0.33289 fft 28: mflops = 7.84191 (norm. = 0.636752), norm. avg. (of 7) = 0.369152 fft 29: mflops = 7.168 (norm. = 0.582031), norm. avg. (of 7) = 0.337424 fft 30: mflops = 10.731 (norm. = 0.871345), norm. avg. (of 7) = 0.780778 fft 31: mflops = 10.6687 (norm. = 0.866279), norm. avg. (of 7) = 0.752026 fft 32: mflops = 10.0274 (norm. = 0.814208), norm. avg. (of 4) = 0.789229 fft 33: mflops = 5.56063 (norm. = 0.451515), norm. avg. (of 6) = 0.264804 fft 34: mflops = 2.8672 (norm. = 0.232812), norm. avg. (of 6) = 0.217916 fft 35: mflops = 7.11243 (norm. = 0.577519), norm. avg. (of 7) = 0.370871 fft 36: mflops = 7.71012 (norm. = 0.62605), norm. avg. (of 7) = 0.399707 fft 37: mflops = 10.4858 (norm. = 0.851429), norm. avg. (of 7) = 0.689381 fft 38: mflops = 5.95782 (norm. = 0.483766), norm. avg. (of 7) = 0.427751 fft 39: mflops = 8.41747 (norm. = 0.683486), norm. avg. (of 7) = 0.480613 fft 40: mflops = 7.28178 (norm. = 0.59127), norm. avg. (of 7) = 0.411285 fft 41: mflops = 2.60655 (norm. = 0.211648), norm. avg. (of 7) = 0.184426 Benchmarking for array size = 256 (power of 2): 0. Arndt DIF: elapsed time t=1.34 s, 1024 iters, t-(init.)=1.33 s t(norm)=0.634193, mflops=7.88403 (err=4.8e-16) 1. Arndt DIT: elapsed time t=1.26 s, 1024 iters, t-(init.)=1.25 s t(norm)=0.596046, mflops=8.38861 (err=5.1e-16) 2. Arndt Split-Radix: elapsed time t=1.5 s, 1024 iters, t-(init.)=1.49 s t(norm)=0.710487, mflops=7.03742 (err=5.5e-16) 3. Arndt 4-step: elapsed time t=1.29 s, 512 iters, t-(init.)=1.28 s t(norm)=1.2207, mflops=4.096 (err=5.7e-16) 4. Bailey: elapsed time t=1.61 s, 1024 iters, t-(init.)=1.59 s t(norm)=0.758171, mflops=6.59482 (err=5.5e-16) 5. Beauregard: elapsed time t=1.03 s, 256 iters, t-(init.)=1.03 s t(norm)=1.96457, mflops=2.54509 (err=4.8e-16) 6. Bergland: elapsed time t=1.14 s, 1024 iters, t-(init.)=1.13 s t(norm)=0.538826, mflops=9.27943 (err=5.7e-16) 7. Brenner: elapsed time t=1.33 s, 1024 iters, t-(init.)=1.32 s t(norm)=0.629425, mflops=7.94376 (err=4.8e-16) 8. Burrus: elapsed time t=1.87 s, 1024 iters, t-(init.)=1.85 s t(norm)=0.882149, mflops=5.66798 (err=5.4e-16) 9. CWP (min N) (N=260): elapsed time t=1.04 s, 1024 iters, t-(init.)=1.02 s t(norm)=0.486374, mflops=10.2802 10. CWP (best N) (N=280): elapsed time t=1.01 s, 1024 iters, t-(init.)=1 s t(norm)=0.476837, mflops=10.4858 11. Edelblute: elapsed time t=1.82 s, 1024 iters, t-(init.)=1.81 s t(norm)=0.863075, mflops=5.79324 (err=5.9e-16) 12. FFTPACK: elapsed time t=1.07 s, 1024 iters, t-(init.)=1.06 s t(norm)=0.505447, mflops=9.89223 (err=4.4e-16) 13. FFTPACK (f2c): elapsed time t=1.09 s, 1024 iters, t-(init.)=1.08 s t(norm)=0.514984, mflops=9.70904 (err=4.5e-16) FFTW_MEASURE plan: (cost = 8.593750e-04) FFTW_TWIDDLE 16 FFTW_NOTW 16 14. FFTW: elapsed time t=1.75 s, 2048 iters, t-(init.)=1.73 s t(norm)=0.412464, mflops=12.1223 (err=4.7e-16) FFTW_ESTIMATE plan: (cost = 9.216000e+02) FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.78 s, 2048 iters, t-(init.)=1.76 s t(norm)=0.419617, mflops=11.9156 (err=4.6e-16) 16. Frigo-old: elapsed time t=1.92 s, 2048 iters, t-(init.)=1.89 s t(norm)=0.450611, mflops=11.096 (err=4.5e-16) 17. Green: elapsed time t=1.29 s, 1024 iters, t-(init.)=1.28 s t(norm)=0.610352, mflops=8.192 (err=4.8e-16) 18. GSL: elapsed time t=1.29 s, 1024 iters, t-(init.)=1.28 s t(norm)=0.610352, mflops=8.192 (err=4.7e-16) 19. GSL DIT: elapsed time t=1.34 s, 1024 iters, t-(init.)=1.33 s t(norm)=0.634193, mflops=7.88403 (err=5.0e-16) 20. GSL DIF: elapsed time t=1.69 s, 1024 iters, t-(init.)=1.68 s t(norm)=0.801086, mflops=6.24152 (err=4.9e-16) 21. Krukar: elapsed time t=1.14 s, 1024 iters, t-(init.)=1.13 s t(norm)=0.538826, mflops=9.27943 (err=5.0e-16) 22. Mayer (Buneman): elapsed time t=1.34 s, 1024 iters, t-(init.)=1.32 s t(norm)=0.629425, mflops=7.94376 (err=4.7e-16) 23. Mayer (simple): elapsed time t=1.21 s, 1024 iters, t-(init.)=1.19 s t(norm)=0.567436, mflops=8.81156 24. Mayer (lookup): elapsed time t=1.14 s, 1024 iters, t-(init.)=1.12 s t(norm)=0.534058, mflops=9.36229 (err=5.7e-16) 25. Monro: elapsed time t=1.32 s, 1024 iters, t-(init.)=1.31 s t(norm)=0.624657, mflops=8.0044 (err=8.2e-08) 26. NAPACK (f2c): elapsed time t=1.28 s, 512 iters, t-(init.)=1.28 s t(norm)=1.2207, mflops=4.096 (err=3.9e-15) 27. Nielsen: elapsed time t=1.43 s, 1024 iters, t-(init.)=1.41 s t(norm)=0.67234, mflops=7.43671 (err=3.8e-15) 28. NR (C): elapsed time t=1.29 s, 1024 iters, t-(init.)=1.28 s t(norm)=0.610352, mflops=8.192 (err=4.9e-16) 29. NR (F): elapsed time t=1.43 s, 1024 iters, t-(init.)=1.42 s t(norm)=0.677109, mflops=7.38434 (err=4.5e-16) 30. Ooura (C): elapsed time t=1 s, 1024 iters, t-(init.)=0.99 s t(norm)=0.472069, mflops=10.5917 (err=5.0e-16) 31. Ooura (F): elapsed time t=1 s, 1024 iters, t-(init.)=0.99 s t(norm)=0.472069, mflops=10.5917 (err=5.0e-16) 32. QFT: elapsed time t=1.09 s, 1024 iters, t-(init.)=1.08 s t(norm)=0.514984, mflops=9.70904 (err=7.0e-16) 33. Ransom: elapsed time t=1.65 s, 1024 iters, t-(init.)=1.64 s t(norm)=0.782013, mflops=6.39376 (err=2.0e-15) 34. SCIPORT: elapsed time t=1.86 s, 512 iters, t-(init.)=1.85 s t(norm)=1.7643, mflops=2.83399 (err=1.4e-07) 35. Singleton: elapsed time t=1.36 s, 1024 iters, t-(init.)=1.35 s t(norm)=0.64373, mflops=7.76723 (err=5.0e-16) 36. Singleton (f2c): elapsed time t=1.31 s, 1024 iters, t-(init.)=1.3 s t(norm)=0.619888, mflops=8.06597 (err=5.4e-16) 37. Sorensen: elapsed time t=1.02 s, 1024 iters, t-(init.)=1.01 s t(norm)=0.481606, mflops=10.3819 (err=6.0e-16) 38. Sorensen DIT: elapsed time t=1.74 s, 1024 iters, t-(init.)=1.73 s t(norm)=0.824928, mflops=6.06113 (err=5.7e-16) 39. Temperton: elapsed time t=1.21 s, 1024 iters, t-(init.)=1.2 s t(norm)=0.572205, mflops=8.73813 (err=9.1e-08) 40. Temperton (f2c): elapsed time t=1.45 s, 1024 iters, t-(init.)=1.44 s t(norm)=0.686646, mflops=7.28178 (err=4.5e-16) 41. Valkenburg: elapsed time t=1 s, 256 iters, t-(init.)=0.99 s t(norm)=1.88828, mflops=2.64792 (err=6.4e-16) Top mflops for N=256 = 12.1223 Normalized results and averages for N=256: fft 0: mflops = 7.88403 (norm. = 0.650376), norm. avg. (of 8) = 0.665474 fft 1: mflops = 8.38861 (norm. = 0.692), norm. avg. (of 8) = 0.674645 fft 2: mflops = 7.03742 (norm. = 0.580537), norm. avg. (of 8) = 0.469326 fft 3: mflops = 4.096 (norm. = 0.337891), norm. avg. (of 8) = 0.216722 fft 4: mflops = 6.59482 (norm. = 0.544025), norm. avg. (of 8) = 0.371026 fft 5: mflops = 2.54509 (norm. = 0.209951), norm. avg. (of 8) = 0.175995 fft 6: mflops = 9.27943 (norm. = 0.765487), norm. avg. (of 8) = 0.540446 fft 7: mflops = 7.94376 (norm. = 0.655303), norm. avg. (of 8) = 0.448405 fft 8: mflops = 5.66798 (norm. = 0.467568), norm. avg. (of 8) = 0.418793 fft 9: mflops = 10.2802 (norm. = 0.848039), norm. avg. (of 8) = 0.613786 fft 10: mflops = 10.4858 (norm. = 0.865), norm. avg. (of 8) = 0.468202 fft 11: mflops = 5.79324 (norm. = 0.477901), norm. avg. (of 7) = 0.385243 fft 12: mflops = 9.89223 (norm. = 0.816038), norm. avg. (of 8) = 0.65743 fft 13: mflops = 9.70904 (norm. = 0.800926), norm. avg. (of 8) = 0.619078 fft 14: mflops = 12.1223 (norm. = 1), norm. avg. (of 8) = 0.974269 fft 15: mflops = 11.9156 (norm. = 0.982955), norm. avg. (of 8) = 0.957655 fft 16: mflops = 11.096 (norm. = 0.915344), norm. avg. (of 8) = 0.937769 fft 17: mflops = 8.192 (norm. = 0.675781), norm. avg. (of 6) = 0.608453 fft 18: mflops = 8.192 (norm. = 0.675781), norm. avg. (of 8) = 0.483463 fft 19: mflops = 7.88403 (norm. = 0.650376), norm. avg. (of 8) = 0.403302 fft 20: mflops = 6.24152 (norm. = 0.514881), norm. avg. (of 8) = 0.322472 fft 21: mflops = 9.27943 (norm. = 0.765487), norm. avg. (of 8) = 0.72291 fft 22: mflops = 7.94376 (norm. = 0.655303), norm. avg. (of 7) = 0.543237 fft 23: mflops = 8.81156 (norm. = 0.726891), norm. avg. (of 7) = 0.581826 fft 24: mflops = 9.36229 (norm. = 0.772321), norm. avg. (of 7) = 0.614326 fft 25: mflops = 8.0044 (norm. = 0.660305), norm. avg. (of 7) = 0.434325 fft 26: mflops = 4.096 (norm. = 0.337891), norm. avg. (of 8) = 0.20784 fft 27: mflops = 7.43671 (norm. = 0.613475), norm. avg. (of 8) = 0.367963 fft 28: mflops = 8.192 (norm. = 0.675781), norm. avg. (of 8) = 0.40748 fft 29: mflops = 7.38434 (norm. = 0.609155), norm. avg. (of 8) = 0.37139 fft 30: mflops = 10.5917 (norm. = 0.873737), norm. avg. (of 8) = 0.792397 fft 31: mflops = 10.5917 (norm. = 0.873737), norm. avg. (of 8) = 0.76724 fft 32: mflops = 9.70904 (norm. = 0.800926), norm. avg. (of 5) = 0.791568 fft 33: mflops = 6.39376 (norm. = 0.527439), norm. avg. (of 7) = 0.302323 fft 34: mflops = 2.83399 (norm. = 0.233784), norm. avg. (of 7) = 0.220183 fft 35: mflops = 7.76723 (norm. = 0.640741), norm. avg. (of 8) = 0.404605 fft 36: mflops = 8.06597 (norm. = 0.665385), norm. avg. (of 8) = 0.432917 fft 37: mflops = 10.3819 (norm. = 0.856436), norm. avg. (of 8) = 0.710263 fft 38: mflops = 6.06113 (norm. = 0.5), norm. avg. (of 8) = 0.436782 fft 39: mflops = 8.73813 (norm. = 0.720833), norm. avg. (of 8) = 0.510641 fft 40: mflops = 7.28178 (norm. = 0.600694), norm. avg. (of 8) = 0.434961 fft 41: mflops = 2.64792 (norm. = 0.218434), norm. avg. (of 8) = 0.188677 Benchmarking for array size = 512 (power of 2): 0. Arndt DIF: elapsed time t=1.45 s, 512 iters, t-(init.)=1.43 s t(norm)=0.606113, mflops=8.24929 (err=5.4e-16) 1. Arndt DIT: elapsed time t=1.37 s, 512 iters, t-(init.)=1.36 s t(norm)=0.576443, mflops=8.67388 (err=5.5e-16) 2. Arndt Split-Radix: elapsed time t=1.63 s, 512 iters, t-(init.)=1.62 s t(norm)=0.686646, mflops=7.28178 (err=6.0e-16) 3. Arndt 4-step: elapsed time t=1.4 s, 256 iters, t-(init.)=1.4 s t(norm)=1.18679, mflops=4.21303 (err=5.5e-16) 4. Bailey: elapsed time t=1.83 s, 512 iters, t-(init.)=1.82 s t(norm)=0.771417, mflops=6.48158 (err=5.7e-16) 5. Beauregard: elapsed time t=1.17 s, 128 iters, t-(init.)=1.17 s t(norm)=1.98364, mflops=2.52062 (err=6.6e-16) 6. Bergland: elapsed time t=1.26 s, 512 iters, t-(init.)=1.24 s t(norm)=0.525581, mflops=9.51329 (err=5.5e-16) 7. Brenner: elapsed time t=1.49 s, 512 iters, t-(init.)=1.48 s t(norm)=0.627306, mflops=7.97059 (err=5.7e-16) 8. Burrus: elapsed time t=1.01 s, 256 iters, t-(init.)=1 s t(norm)=0.847711, mflops=5.89824 (err=5.8e-16) 9. CWP (min N) (N=520): elapsed time t=1.16 s, 512 iters, t-(init.)=1.15 s t(norm)=0.487434, mflops=10.2578 10. CWP (best N) (N=560): elapsed time t=1.14 s, 512 iters, t-(init.)=1.13 s t(norm)=0.478956, mflops=10.4394 11. Edelblute: elapsed time t=1.97 s, 512 iters, t-(init.)=1.96 s t(norm)=0.830756, mflops=6.01861 (err=6.1e-16) 12. FFTPACK: elapsed time t=1.33 s, 512 iters, t-(init.)=1.32 s t(norm)=0.559489, mflops=8.93673 (err=5.5e-16) 13. FFTPACK (f2c): elapsed time t=1.36 s, 512 iters, t-(init.)=1.35 s t(norm)=0.572205, mflops=8.73813 (err=5.4e-16) FFTW_MEASURE plan: (cost = 1.953125e-03) FFTW_TWIDDLE 32 FFTW_NOTW 16 14. FFTW: elapsed time t=1 s, 512 iters, t-(init.)=0.98 s t(norm)=0.415378, mflops=12.0372 (err=5.3e-16) FFTW_ESTIMATE plan: (cost = 1.843200e+03) FFTW_TWIDDLE 16 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.01 s, 512 iters, t-(init.)=1 s t(norm)=0.423855, mflops=11.7965 (err=5.4e-16) 16. Frigo-old: elapsed time t=1.17 s, 512 iters, t-(init.)=1.16 s t(norm)=0.491672, mflops=10.1694 (err=5.4e-16) 17. Green: elapsed time t=1.41 s, 512 iters, t-(init.)=1.4 s t(norm)=0.593397, mflops=8.42606 (err=5.9e-16) 18. GSL: elapsed time t=1.58 s, 512 iters, t-(init.)=1.57 s t(norm)=0.665453, mflops=7.51368 (err=6.1e-16) 19. GSL DIT: elapsed time t=1.47 s, 512 iters, t-(init.)=1.46 s t(norm)=0.618829, mflops=8.07978 (err=5.9e-16) 20. GSL DIF: elapsed time t=1.86 s, 512 iters, t-(init.)=1.85 s t(norm)=0.784132, mflops=6.37648 (err=5.6e-16) 21. Krukar: elapsed time t=1.29 s, 512 iters, t-(init.)=1.28 s t(norm)=0.542535, mflops=9.216 (err=5.6e-16) 22. Mayer (Buneman): elapsed time t=1.44 s, 512 iters, t-(init.)=1.43 s t(norm)=0.606113, mflops=8.24929 (err=5.7e-16) 23. Mayer (simple): elapsed time t=1.29 s, 512 iters, t-(init.)=1.27 s t(norm)=0.538296, mflops=9.28857 24. Mayer (lookup): elapsed time t=1.23 s, 512 iters, t-(init.)=1.21 s t(norm)=0.512865, mflops=9.74916 (err=5.1e-16) 25. Monro: elapsed time t=1.46 s, 512 iters, t-(init.)=1.45 s t(norm)=0.61459, mflops=8.1355 (err=7.5e-08) 26. NAPACK (f2c): elapsed time t=1.44 s, 256 iters, t-(init.)=1.43 s t(norm)=1.21223, mflops=4.12464 (err=6.0e-15) 27. Nielsen: elapsed time t=1.58 s, 512 iters, t-(init.)=1.56 s t(norm)=0.661214, mflops=7.56185 (err=3.0e-15) 28. NR (C): elapsed time t=1.41 s, 512 iters, t-(init.)=1.4 s t(norm)=0.593397, mflops=8.42606 (err=5.9e-16) 29. NR (F): elapsed time t=1.57 s, 512 iters, t-(init.)=1.56 s t(norm)=0.661214, mflops=7.56185 (err=5.8e-16) 30. Ooura (C): elapsed time t=1.12 s, 512 iters, t-(init.)=1.11 s t(norm)=0.470479, mflops=10.6275 (err=5.3e-16) 31. Ooura (F): elapsed time t=1.14 s, 512 iters, t-(init.)=1.13 s t(norm)=0.478956, mflops=10.4394 (err=5.3e-16) 32. QFT: elapsed time t=1.29 s, 512 iters, t-(init.)=1.28 s t(norm)=0.542535, mflops=9.216 (err=7.4e-16) 33. Ransom: elapsed time t=1.84 s, 512 iters, t-(init.)=1.83 s t(norm)=0.775655, mflops=6.44616 (err=1.6e-15) 34. SCIPORT: elapsed time t=1.09 s, 128 iters, t-(init.)=1.09 s t(norm)=1.84801, mflops=2.70561 (err=1.3e-07) 35. Singleton: elapsed time t=1.53 s, 512 iters, t-(init.)=1.52 s t(norm)=0.64426, mflops=7.76084 (err=7.9e-16) 36. Singleton (f2c): elapsed time t=1.46 s, 512 iters, t-(init.)=1.45 s t(norm)=0.61459, mflops=8.1355 (err=8.0e-16) 37. Sorensen: elapsed time t=1.14 s, 512 iters, t-(init.)=1.13 s t(norm)=0.478956, mflops=10.4394 (err=5.7e-16) 38. Sorensen DIT: elapsed time t=1.83 s, 512 iters, t-(init.)=1.82 s t(norm)=0.771417, mflops=6.48158 (err=5.5e-16) 39. Temperton: elapsed time t=1.41 s, 512 iters, t-(init.)=1.39 s t(norm)=0.589159, mflops=8.48668 (err=9.2e-08) 40. Temperton (f2c): elapsed time t=1.75 s, 512 iters, t-(init.)=1.74 s t(norm)=0.737508, mflops=6.77959 (err=6.1e-16) 41. Valkenburg: elapsed time t=1.12 s, 128 iters, t-(init.)=1.12 s t(norm)=1.89887, mflops=2.63314 (err=6.6e-16) Top mflops for N=512 = 12.0372 Normalized results and averages for N=512: fft 0: mflops = 8.24929 (norm. = 0.685315), norm. avg. (of 9) = 0.667679 fft 1: mflops = 8.67388 (norm. = 0.720588), norm. avg. (of 9) = 0.67975 fft 2: mflops = 7.28178 (norm. = 0.604938), norm. avg. (of 9) = 0.484394 fft 3: mflops = 4.21303 (norm. = 0.35), norm. avg. (of 9) = 0.231531 fft 4: mflops = 6.48158 (norm. = 0.538462), norm. avg. (of 9) = 0.38963 fft 5: mflops = 2.52062 (norm. = 0.209402), norm. avg. (of 9) = 0.179707 fft 6: mflops = 9.51329 (norm. = 0.790323), norm. avg. (of 9) = 0.56821 fft 7: mflops = 7.97059 (norm. = 0.662162), norm. avg. (of 9) = 0.472156 fft 8: mflops = 5.89824 (norm. = 0.49), norm. avg. (of 9) = 0.426705 fft 9: mflops = 10.2578 (norm. = 0.852174), norm. avg. (of 9) = 0.640273 fft 10: mflops = 10.4394 (norm. = 0.867257), norm. avg. (of 9) = 0.512542 fft 11: mflops = 6.01861 (norm. = 0.5), norm. avg. (of 8) = 0.399588 fft 12: mflops = 8.93673 (norm. = 0.742424), norm. avg. (of 9) = 0.666874 fft 13: mflops = 8.73813 (norm. = 0.725926), norm. avg. (of 9) = 0.63095 fft 14: mflops = 12.0372 (norm. = 1), norm. avg. (of 9) = 0.977128 fft 15: mflops = 11.7965 (norm. = 0.98), norm. avg. (of 9) = 0.960138 fft 16: mflops = 10.1694 (norm. = 0.844828), norm. avg. (of 9) = 0.927442 fft 17: mflops = 8.42606 (norm. = 0.7), norm. avg. (of 7) = 0.621531 fft 18: mflops = 7.51368 (norm. = 0.624204), norm. avg. (of 9) = 0.499101 fft 19: mflops = 8.07978 (norm. = 0.671233), norm. avg. (of 9) = 0.433072 fft 20: mflops = 6.37648 (norm. = 0.52973), norm. avg. (of 9) = 0.345501 fft 21: mflops = 9.216 (norm. = 0.765625), norm. avg. (of 9) = 0.727656 fft 22: mflops = 8.24929 (norm. = 0.685315), norm. avg. (of 8) = 0.560997 fft 23: mflops = 9.28857 (norm. = 0.771654), norm. avg. (of 8) = 0.605554 fft 24: mflops = 9.74916 (norm. = 0.809917), norm. avg. (of 8) = 0.638775 fft 25: mflops = 8.1355 (norm. = 0.675862), norm. avg. (of 8) = 0.464517 fft 26: mflops = 4.12464 (norm. = 0.342657), norm. avg. (of 9) = 0.22282 fft 27: mflops = 7.56185 (norm. = 0.628205), norm. avg. (of 9) = 0.396879 fft 28: mflops = 8.42606 (norm. = 0.7), norm. avg. (of 9) = 0.439983 fft 29: mflops = 7.56185 (norm. = 0.628205), norm. avg. (of 9) = 0.399925 fft 30: mflops = 10.6275 (norm. = 0.882883), norm. avg. (of 9) = 0.802451 fft 31: mflops = 10.4394 (norm. = 0.867257), norm. avg. (of 9) = 0.778353 fft 32: mflops = 9.216 (norm. = 0.765625), norm. avg. (of 6) = 0.787244 fft 33: mflops = 6.44616 (norm. = 0.535519), norm. avg. (of 8) = 0.331473 fft 34: mflops = 2.70561 (norm. = 0.224771), norm. avg. (of 8) = 0.220756 fft 35: mflops = 7.76084 (norm. = 0.644737), norm. avg. (of 9) = 0.431286 fft 36: mflops = 8.1355 (norm. = 0.675862), norm. avg. (of 9) = 0.459911 fft 37: mflops = 10.4394 (norm. = 0.867257), norm. avg. (of 9) = 0.727706 fft 38: mflops = 6.48158 (norm. = 0.538462), norm. avg. (of 9) = 0.44808 fft 39: mflops = 8.48668 (norm. = 0.705036), norm. avg. (of 9) = 0.53224 fft 40: mflops = 6.77959 (norm. = 0.563218), norm. avg. (of 9) = 0.449212 fft 41: mflops = 2.63314 (norm. = 0.21875), norm. avg. (of 9) = 0.192018 Benchmarking for array size = 1024 (power of 2): 0. Arndt DIF: elapsed time t=1.65 s, 256 iters, t-(init.)=1.64 s t(norm)=0.62561, mflops=7.9922 (err=5.2e-16) 1. Arndt DIT: elapsed time t=1.53 s, 256 iters, t-(init.)=1.51 s t(norm)=0.576019, mflops=8.68026 (err=4.9e-16) 2. Arndt Split-Radix: elapsed time t=1.81 s, 256 iters, t-(init.)=1.8 s t(norm)=0.686646, mflops=7.28178 (err=5.1e-16) 3. Arndt 4-step: elapsed time t=1.46 s, 128 iters, t-(init.)=1.45 s t(norm)=1.10626, mflops=4.51972 (err=4.4e-16) 4. Bailey: elapsed time t=1.21 s, 128 iters, t-(init.)=1.21 s t(norm)=0.923157, mflops=5.4162 (err=5.6e-16) 5. Beauregard: elapsed time t=1.33 s, 64 iters, t-(init.)=1.33 s t(norm)=2.02942, mflops=2.46376 (err=5.1e-16) 6. Bergland: elapsed time t=1.47 s, 256 iters, t-(init.)=1.46 s t(norm)=0.556946, mflops=8.97753 (err=5.0e-16) 7. Brenner: elapsed time t=1.68 s, 256 iters, t-(init.)=1.67 s t(norm)=0.637054, mflops=7.84862 (err=5.1e-16) 8. Burrus: elapsed time t=1.12 s, 128 iters, t-(init.)=1.12 s t(norm)=0.854492, mflops=5.85143 (err=5.2e-16) 9. CWP (min N) (N=1040): elapsed time t=1.3 s, 256 iters, t-(init.)=1.29 s t(norm)=0.492096, mflops=10.1606 10. CWP (best N) (N=1040): elapsed time t=1.27 s, 256 iters, t-(init.)=1.26 s t(norm)=0.480652, mflops=10.4025 11. Edelblute: elapsed time t=1.09 s, 128 iters, t-(init.)=1.09 s t(norm)=0.831604, mflops=6.01248 (err=5.2e-16) 12. FFTPACK: elapsed time t=1.55 s, 256 iters, t-(init.)=1.53 s t(norm)=0.583649, mflops=8.5668 (err=4.9e-16) 13. FFTPACK (f2c): elapsed time t=1.6 s, 256 iters, t-(init.)=1.59 s t(norm)=0.606537, mflops=8.24352 (err=4.7e-16) FFTW_MEASURE plan: (cost = 4.531250e-03) FFTW_TWIDDLE 32 FFTW_NOTW 32 14. FFTW: elapsed time t=1.16 s, 256 iters, t-(init.)=1.15 s t(norm)=0.43869, mflops=11.3976 (err=4.6e-16) FFTW_ESTIMATE plan: (cost = 1.126400e+04) FFTW_TWIDDLE 8 FFTW_TWIDDLE 4 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.23 s, 256 iters, t-(init.)=1.22 s t(norm)=0.465393, mflops=10.7436 (err=4.8e-16) 16. Frigo-old: elapsed time t=1.4 s, 256 iters, t-(init.)=1.39 s t(norm)=0.530243, mflops=9.42964 (err=4.7e-16) 17. Green: elapsed time t=1.68 s, 256 iters, t-(init.)=1.67 s t(norm)=0.637054, mflops=7.84862 (err=5.9e-16) 18. GSL: elapsed time t=1.9 s, 256 iters, t-(init.)=1.89 s t(norm)=0.720978, mflops=6.93503 (err=4.9e-16) 19. GSL DIT: elapsed time t=1.65 s, 256 iters, t-(init.)=1.64 s t(norm)=0.62561, mflops=7.9922 (err=5.1e-16) 20. GSL DIF: elapsed time t=1.03 s, 128 iters, t-(init.)=1.02 s t(norm)=0.778198, mflops=6.4251 (err=4.9e-16) 21. Krukar: elapsed time t=1.57 s, 256 iters, t-(init.)=1.55 s t(norm)=0.591278, mflops=8.45626 (err=5.4e-16) 22. Mayer (Buneman): elapsed time t=1.61 s, 256 iters, t-(init.)=1.59 s t(norm)=0.606537, mflops=8.24352 (err=4.6e-16) 23. Mayer (simple): elapsed time t=1.46 s, 256 iters, t-(init.)=1.45 s t(norm)=0.553131, mflops=9.03945 24. Mayer (lookup): elapsed time t=1.4 s, 256 iters, t-(init.)=1.39 s t(norm)=0.530243, mflops=9.42964 (err=4.6e-16) 25. Monro: elapsed time t=1.61 s, 256 iters, t-(init.)=1.6 s t(norm)=0.610352, mflops=8.192 (err=8.4e-08) 26. NAPACK (f2c): elapsed time t=1.66 s, 128 iters, t-(init.)=1.65 s t(norm)=1.25885, mflops=3.97188 (err=1.5e-14) 27. Nielsen: elapsed time t=1.86 s, 256 iters, t-(init.)=1.85 s t(norm)=0.705719, mflops=7.08497 (err=6.2e-15) 28. NR (C): elapsed time t=1.55 s, 256 iters, t-(init.)=1.54 s t(norm)=0.587463, mflops=8.51117 (err=5.1e-16) 29. NR (F): elapsed time t=1.73 s, 256 iters, t-(init.)=1.72 s t(norm)=0.656128, mflops=7.62047 (err=5.0e-16) 30. Ooura (C): elapsed time t=1.27 s, 256 iters, t-(init.)=1.26 s t(norm)=0.480652, mflops=10.4025 (err=4.6e-16) 31. Ooura (F): elapsed time t=1.27 s, 256 iters, t-(init.)=1.26 s t(norm)=0.480652, mflops=10.4025 (err=4.6e-16) 32. QFT: elapsed time t=1.51 s, 256 iters, t-(init.)=1.5 s t(norm)=0.572205, mflops=8.73813 (err=9.5e-16) 33. Ransom: elapsed time t=1.9 s, 256 iters, t-(init.)=1.89 s t(norm)=0.720978, mflops=6.93503 (err=1.8e-15) 34. SCIPORT: elapsed time t=1.31 s, 64 iters, t-(init.)=1.31 s t(norm)=1.9989, mflops=2.50137 (err=1.4e-07) 35. Singleton: elapsed time t=1.69 s, 256 iters, t-(init.)=1.68 s t(norm)=0.640869, mflops=7.8019 (err=6.0e-16) 36. Singleton (f2c): elapsed time t=1.62 s, 256 iters, t-(init.)=1.6 s t(norm)=0.610352, mflops=8.192 (err=6.0e-16) 37. Sorensen: elapsed time t=1.36 s, 256 iters, t-(init.)=1.35 s t(norm)=0.514984, mflops=9.70904 (err=4.9e-16) 38. Sorensen DIT: elapsed time t=1.97 s, 256 iters, t-(init.)=1.95 s t(norm)=0.743866, mflops=6.72164 (err=4.9e-16) 39. Temperton: elapsed time t=1.63 s, 256 iters, t-(init.)=1.62 s t(norm)=0.617981, mflops=8.09086 (err=9.8e-08) 40. Temperton (f2c): elapsed time t=1.92 s, 256 iters, t-(init.)=1.91 s t(norm)=0.728607, mflops=6.86241 (err=4.8e-16) 41. Valkenburg: elapsed time t=1.29 s, 64 iters, t-(init.)=1.28 s t(norm)=1.95313, mflops=2.56 (err=8.2e-16) Top mflops for N=1024 = 11.3976 Normalized results and averages for N=1024: fft 0: mflops = 7.9922 (norm. = 0.70122), norm. avg. (of 10) = 0.671033 fft 1: mflops = 8.68026 (norm. = 0.761589), norm. avg. (of 10) = 0.687934 fft 2: mflops = 7.28178 (norm. = 0.638889), norm. avg. (of 10) = 0.499843 fft 3: mflops = 4.51972 (norm. = 0.396552), norm. avg. (of 10) = 0.248033 fft 4: mflops = 5.4162 (norm. = 0.475207), norm. avg. (of 10) = 0.398188 fft 5: mflops = 2.46376 (norm. = 0.216165), norm. avg. (of 10) = 0.183353 fft 6: mflops = 8.97753 (norm. = 0.787671), norm. avg. (of 10) = 0.590156 fft 7: mflops = 7.84862 (norm. = 0.688623), norm. avg. (of 10) = 0.493803 fft 8: mflops = 5.85143 (norm. = 0.513393), norm. avg. (of 10) = 0.435374 fft 9: mflops = 10.1606 (norm. = 0.891473), norm. avg. (of 10) = 0.665393 fft 10: mflops = 10.4025 (norm. = 0.912698), norm. avg. (of 10) = 0.552557 fft 11: mflops = 6.01248 (norm. = 0.527523), norm. avg. (of 9) = 0.413803 fft 12: mflops = 8.5668 (norm. = 0.751634), norm. avg. (of 10) = 0.67535 fft 13: mflops = 8.24352 (norm. = 0.72327), norm. avg. (of 10) = 0.640182 fft 14: mflops = 11.3976 (norm. = 1), norm. avg. (of 10) = 0.979416 fft 15: mflops = 10.7436 (norm. = 0.942623), norm. avg. (of 10) = 0.958386 fft 16: mflops = 9.42964 (norm. = 0.827338), norm. avg. (of 10) = 0.917432 fft 17: mflops = 7.84862 (norm. = 0.688623), norm. avg. (of 8) = 0.629917 fft 18: mflops = 6.93503 (norm. = 0.608466), norm. avg. (of 10) = 0.510037 fft 19: mflops = 7.9922 (norm. = 0.70122), norm. avg. (of 10) = 0.459887 fft 20: mflops = 6.4251 (norm. = 0.563725), norm. avg. (of 10) = 0.367323 fft 21: mflops = 8.45626 (norm. = 0.741935), norm. avg. (of 10) = 0.729084 fft 22: mflops = 8.24352 (norm. = 0.72327), norm. avg. (of 9) = 0.579027 fft 23: mflops = 9.03945 (norm. = 0.793103), norm. avg. (of 9) = 0.626393 fft 24: mflops = 9.42964 (norm. = 0.827338), norm. avg. (of 9) = 0.659726 fft 25: mflops = 8.192 (norm. = 0.71875), norm. avg. (of 9) = 0.492765 fft 26: mflops = 3.97188 (norm. = 0.348485), norm. avg. (of 10) = 0.235386 fft 27: mflops = 7.08497 (norm. = 0.621622), norm. avg. (of 10) = 0.419353 fft 28: mflops = 8.51117 (norm. = 0.746753), norm. avg. (of 10) = 0.47066 fft 29: mflops = 7.62047 (norm. = 0.668605), norm. avg. (of 10) = 0.426793 fft 30: mflops = 10.4025 (norm. = 0.912698), norm. avg. (of 10) = 0.813476 fft 31: mflops = 10.4025 (norm. = 0.912698), norm. avg. (of 10) = 0.791788 fft 32: mflops = 8.73813 (norm. = 0.766667), norm. avg. (of 7) = 0.784305 fft 33: mflops = 6.93503 (norm. = 0.608466), norm. avg. (of 9) = 0.36225 fft 34: mflops = 2.50137 (norm. = 0.219466), norm. avg. (of 9) = 0.220613 fft 35: mflops = 7.8019 (norm. = 0.684524), norm. avg. (of 10) = 0.45661 fft 36: mflops = 8.192 (norm. = 0.71875), norm. avg. (of 10) = 0.485794 fft 37: mflops = 9.70904 (norm. = 0.851852), norm. avg. (of 10) = 0.740121 fft 38: mflops = 6.72164 (norm. = 0.589744), norm. avg. (of 10) = 0.462246 fft 39: mflops = 8.09086 (norm. = 0.709877), norm. avg. (of 10) = 0.550004 fft 40: mflops = 6.86241 (norm. = 0.602094), norm. avg. (of 10) = 0.4645 fft 41: mflops = 2.56 (norm. = 0.224609), norm. avg. (of 10) = 0.195277 Benchmarking for array size = 2048 (power of 2): 0. Arndt DIF: elapsed time t=2 s, 128 iters, t-(init.)=1.97 s t(norm)=0.683178, mflops=7.31874 (err=4.6e-16) 1. Arndt DIT: elapsed time t=1.84 s, 128 iters, t-(init.)=1.81 s t(norm)=0.627691, mflops=7.9657 (err=4.6e-16) 2. Arndt Split-Radix: elapsed time t=1.14 s, 64 iters, t-(init.)=1.13 s t(norm)=0.783747, mflops=6.37961 (err=4.8e-16) 3. Arndt 4-step: elapsed time t=1.67 s, 64 iters, t-(init.)=1.65 s t(norm)=1.14441, mflops=4.36907 (err=4.7e-16) 4. Bailey: elapsed time t=1.37 s, 64 iters, t-(init.)=1.35 s t(norm)=0.936335, mflops=5.33997 (err=5.0e-16) 5. Beauregard: elapsed time t=1.52 s, 32 iters, t-(init.)=1.52 s t(norm)=2.10849, mflops=2.37137 (err=4.7e-16) 6. Bergland: elapsed time t=1.72 s, 128 iters, t-(init.)=1.69 s t(norm)=0.586076, mflops=8.53131 (err=4.9e-16) 7. Brenner: elapsed time t=1.97 s, 128 iters, t-(init.)=1.95 s t(norm)=0.676242, mflops=7.39381 (err=5.0e-16) 8. Burrus: elapsed time t=1.33 s, 64 iters, t-(init.)=1.31 s t(norm)=0.908592, mflops=5.50302 (err=4.6e-16) 9. CWP (min N) (N=2145): elapsed time t=1.65 s, 128 iters, t-(init.)=1.62 s t(norm)=0.561801, mflops=8.89995 10. CWP (best N) (N=2184): elapsed time t=1.55 s, 128 iters, t-(init.)=1.52 s t(norm)=0.527122, mflops=9.48547 11. Edelblute: elapsed time t=1.3 s, 64 iters, t-(init.)=1.29 s t(norm)=0.89472, mflops=5.58834 (err=4.7e-16) 12. FFTPACK: elapsed time t=1.73 s, 128 iters, t-(init.)=1.7 s t(norm)=0.589544, mflops=8.48113 (err=4.5e-16) 13. FFTPACK (f2c): elapsed time t=1.76 s, 128 iters, t-(init.)=1.73 s t(norm)=0.599948, mflops=8.33406 (err=4.6e-16) FFTW_MEASURE plan: (cost = 1.000000e-02) FFTW_TWIDDLE 64 FFTW_NOTW 32 14. FFTW: elapsed time t=1.37 s, 128 iters, t-(init.)=1.35 s t(norm)=0.468167, mflops=10.6799 (err=4.5e-16) FFTW_ESTIMATE plan: (cost = 1.269760e+04) FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.41 s, 128 iters, t-(init.)=1.38 s t(norm)=0.478571, mflops=10.4478 (err=4.4e-16) 16. Frigo-old: elapsed time t=1.57 s, 128 iters, t-(init.)=1.54 s t(norm)=0.534058, mflops=9.36229 (err=4.6e-16) 17. Green: elapsed time t=1 s, 64 iters, t-(init.)=0.98 s t(norm)=0.67971, mflops=7.35608 (err=5.8e-16) 18. GSL: elapsed time t=1.03 s, 64 iters, t-(init.)=1.02 s t(norm)=0.707453, mflops=7.06761 (err=4.7e-16) 19. GSL DIT: elapsed time t=1.07 s, 64 iters, t-(init.)=1.06 s t(norm)=0.735196, mflops=6.80091 (err=4.5e-16) 20. GSL DIF: elapsed time t=1.37 s, 64 iters, t-(init.)=1.36 s t(norm)=0.943271, mflops=5.30071 (err=4.4e-16) 21. Krukar: elapsed time t=1.77 s, 128 iters, t-(init.)=1.75 s t(norm)=0.606884, mflops=8.23881 (err=5.0e-16) 22. Mayer (Buneman): elapsed time t=1.74 s, 128 iters, t-(init.)=1.71 s t(norm)=0.593012, mflops=8.43153 (err=4.5e-16) 23. Mayer (simple): elapsed time t=1.57 s, 128 iters, t-(init.)=1.54 s t(norm)=0.534058, mflops=9.36229 24. Mayer (lookup): elapsed time t=1.56 s, 128 iters, t-(init.)=1.53 s t(norm)=0.53059, mflops=9.42348 (err=4.5e-16) 25. Monro: elapsed time t=1.02 s, 64 iters, t-(init.)=1.01 s t(norm)=0.700517, mflops=7.13758 (err=1.0e-07) 26. NAPACK (f2c): elapsed time t=1.82 s, 64 iters, t-(init.)=1.81 s t(norm)=1.25538, mflops=3.98285 (err=1.5e-14) 27. Nielsen: elapsed time t=1.12 s, 64 iters, t-(init.)=1.11 s t(norm)=0.769875, mflops=6.49456 (err=1.1e-14) 28. NR (C): elapsed time t=1.94 s, 128 iters, t-(init.)=1.91 s t(norm)=0.66237, mflops=7.54865 (err=4.5e-16) 29. NR (F): elapsed time t=1.04 s, 64 iters, t-(init.)=1.02 s t(norm)=0.707453, mflops=7.06761 (err=4.5e-16) 30. Ooura (C): elapsed time t=1.48 s, 128 iters, t-(init.)=1.45 s t(norm)=0.502846, mflops=9.94339 (err=4.6e-16) 31. Ooura (F): elapsed time t=1.51 s, 128 iters, t-(init.)=1.48 s t(norm)=0.51325, mflops=9.74184 (err=4.6e-16) 32. QFT: elapsed time t=1.66 s, 128 iters, t-(init.)=1.63 s t(norm)=0.565269, mflops=8.84535 (err=1.2e-15) 33. Ransom: elapsed time t=1.13 s, 64 iters, t-(init.)=1.12 s t(norm)=0.776811, mflops=6.43657 (err=2.1e-15) 34. SCIPORT: elapsed time t=1.42 s, 32 iters, t-(init.)=1.42 s t(norm)=1.96977, mflops=2.53837 (err=1.6e-07) 35. Singleton: elapsed time t=1.1 s, 64 iters, t-(init.)=1.09 s t(norm)=0.756004, mflops=6.61372 (err=5.9e-16) 36. Singleton (f2c): elapsed time t=1.04 s, 64 iters, t-(init.)=1.03 s t(norm)=0.714389, mflops=6.99899 (err=5.9e-16) 37. Sorensen: elapsed time t=1.71 s, 128 iters, t-(init.)=1.69 s t(norm)=0.586076, mflops=8.53131 (err=4.5e-16) 38. Sorensen DIT: elapsed time t=1.16 s, 64 iters, t-(init.)=1.15 s t(norm)=0.797619, mflops=6.26866 (err=4.4e-16) 39. Temperton: elapsed time t=1.92 s, 128 iters, t-(init.)=1.89 s t(norm)=0.655434, mflops=7.62853 (err=1.0e-07) 40. Temperton (f2c): elapsed time t=1.11 s, 64 iters, t-(init.)=1.09 s t(norm)=0.756004, mflops=6.61372 (err=4.7e-16) 41. Valkenburg: elapsed time t=1.42 s, 32 iters, t-(init.)=1.41 s t(norm)=1.9559, mflops=2.55637 (err=7.4e-16) Top mflops for N=2048 = 10.6799 Normalized results and averages for N=2048: fft 0: mflops = 7.31874 (norm. = 0.685279), norm. avg. (of 11) = 0.672328 fft 1: mflops = 7.9657 (norm. = 0.745856), norm. avg. (of 11) = 0.693199 fft 2: mflops = 6.37961 (norm. = 0.597345), norm. avg. (of 11) = 0.508707 fft 3: mflops = 4.36907 (norm. = 0.409091), norm. avg. (of 11) = 0.262675 fft 4: mflops = 5.33997 (norm. = 0.5), norm. avg. (of 11) = 0.407443 fft 5: mflops = 2.37137 (norm. = 0.222039), norm. avg. (of 11) = 0.18687 fft 6: mflops = 8.53131 (norm. = 0.798817), norm. avg. (of 11) = 0.609125 fft 7: mflops = 7.39381 (norm. = 0.692308), norm. avg. (of 11) = 0.511848 fft 8: mflops = 5.50302 (norm. = 0.515267), norm. avg. (of 11) = 0.442637 fft 9: mflops = 8.89995 (norm. = 0.833333), norm. avg. (of 11) = 0.680661 fft 10: mflops = 9.48547 (norm. = 0.888158), norm. avg. (of 11) = 0.583067 fft 11: mflops = 5.58834 (norm. = 0.523256), norm. avg. (of 10) = 0.424748 fft 12: mflops = 8.48113 (norm. = 0.794118), norm. avg. (of 11) = 0.686147 fft 13: mflops = 8.33406 (norm. = 0.780347), norm. avg. (of 11) = 0.652924 fft 14: mflops = 10.6799 (norm. = 1), norm. avg. (of 11) = 0.981287 fft 15: mflops = 10.4478 (norm. = 0.978261), norm. avg. (of 11) = 0.960193 fft 16: mflops = 9.36229 (norm. = 0.876623), norm. avg. (of 11) = 0.913722 fft 17: mflops = 7.35608 (norm. = 0.688776), norm. avg. (of 9) = 0.636457 fft 18: mflops = 7.06761 (norm. = 0.661765), norm. avg. (of 11) = 0.523831 fft 19: mflops = 6.80091 (norm. = 0.636792), norm. avg. (of 11) = 0.475969 fft 20: mflops = 5.30071 (norm. = 0.496324), norm. avg. (of 11) = 0.379051 fft 21: mflops = 8.23881 (norm. = 0.771429), norm. avg. (of 11) = 0.732933 fft 22: mflops = 8.43153 (norm. = 0.789474), norm. avg. (of 10) = 0.600072 fft 23: mflops = 9.36229 (norm. = 0.876623), norm. avg. (of 10) = 0.651416 fft 24: mflops = 9.42348 (norm. = 0.882353), norm. avg. (of 10) = 0.681989 fft 25: mflops = 7.13758 (norm. = 0.668317), norm. avg. (of 10) = 0.51032 fft 26: mflops = 3.98285 (norm. = 0.372928), norm. avg. (of 11) = 0.24789 fft 27: mflops = 6.49456 (norm. = 0.608108), norm. avg. (of 11) = 0.436513 fft 28: mflops = 7.54865 (norm. = 0.706806), norm. avg. (of 11) = 0.492128 fft 29: mflops = 7.06761 (norm. = 0.661765), norm. avg. (of 11) = 0.448154 fft 30: mflops = 9.94339 (norm. = 0.931034), norm. avg. (of 11) = 0.824163 fft 31: mflops = 9.74184 (norm. = 0.912162), norm. avg. (of 11) = 0.802731 fft 32: mflops = 8.84535 (norm. = 0.828221), norm. avg. (of 8) = 0.789794 fft 33: mflops = 6.43657 (norm. = 0.602679), norm. avg. (of 10) = 0.386292 fft 34: mflops = 2.53837 (norm. = 0.237676), norm. avg. (of 10) = 0.222319 fft 35: mflops = 6.61372 (norm. = 0.619266), norm. avg. (of 11) = 0.471397 fft 36: mflops = 6.99899 (norm. = 0.65534), norm. avg. (of 11) = 0.501208 fft 37: mflops = 8.53131 (norm. = 0.798817), norm. avg. (of 11) = 0.745457 fft 38: mflops = 6.26866 (norm. = 0.586957), norm. avg. (of 11) = 0.473583 fft 39: mflops = 7.62853 (norm. = 0.714286), norm. avg. (of 11) = 0.564938 fft 40: mflops = 6.61372 (norm. = 0.619266), norm. avg. (of 11) = 0.47857 fft 41: mflops = 2.55637 (norm. = 0.239362), norm. avg. (of 11) = 0.199285 Benchmarking for array size = 4096 (power of 2): 0. Arndt DIF: elapsed time t=1.23 s, 32 iters, t-(init.)=1.21 s t(norm)=0.769297, mflops=6.49944 (err=1.0e-15) 1. Arndt DIT: elapsed time t=1.08 s, 32 iters, t-(init.)=1.06 s t(norm)=0.67393, mflops=7.41917 (err=1.1e-15) 2. Arndt Split-Radix: elapsed time t=1.36 s, 32 iters, t-(init.)=1.34 s t(norm)=0.851949, mflops=5.8689 (err=1.0e-15) 3. Arndt 4-step: elapsed time t=1.81 s, 32 iters, t-(init.)=1.79 s t(norm)=1.13805, mflops=4.39347 (err=1.0e-15) 4. Bailey: elapsed time t=1.57 s, 32 iters, t-(init.)=1.56 s t(norm)=0.991821, mflops=5.04123 (err=1.0e-15) 5. Beauregard: elapsed time t=1.7 s, 16 iters, t-(init.)=1.69 s t(norm)=2.14895, mflops=2.32672 (err=1.0e-15) 6. Bergland: elapsed time t=1 s, 32 iters, t-(init.)=0.98 s t(norm)=0.623067, mflops=8.02482 (err=1.1e-15) 7. Brenner: elapsed time t=1.11 s, 32 iters, t-(init.)=1.09 s t(norm)=0.693003, mflops=7.21497 (err=1.1e-15) 8. Burrus: elapsed time t=1.56 s, 32 iters, t-(init.)=1.54 s t(norm)=0.979106, mflops=5.1067 (err=1.0e-15) 9. CWP (min N) (N=4290): elapsed time t=1.81 s, 64 iters, t-(init.)=1.77 s t(norm)=0.562668, mflops=8.88624 10. CWP (best N) (N=4368): elapsed time t=1.69 s, 64 iters, t-(init.)=1.65 s t(norm)=0.524521, mflops=9.53251 11. Edelblute: elapsed time t=1.53 s, 32 iters, t-(init.)=1.51 s t(norm)=0.960032, mflops=5.20816 (err=1.1e-15) 12. FFTPACK: elapsed time t=1.94 s, 64 iters, t-(init.)=1.9 s t(norm)=0.603994, mflops=8.27823 (err=1.0e-15) 13. FFTPACK (f2c): elapsed time t=1.98 s, 64 iters, t-(init.)=1.94 s t(norm)=0.616709, mflops=8.10755 (err=1.0e-15) FFTW_MEASURE plan: (cost = 2.250000e-02) FFTW_TWIDDLE 64 FFTW_NOTW 64 14. FFTW: elapsed time t=1.48 s, 64 iters, t-(init.)=1.44 s t(norm)=0.457764, mflops=10.9227 (err=1.0e-15) FFTW_ESTIMATE plan: (cost = 2.539520e+04) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.51 s, 64 iters, t-(init.)=1.47 s t(norm)=0.4673, mflops=10.6998 (err=1.1e-15) 16. Frigo-old: elapsed time t=1.84 s, 64 iters, t-(init.)=1.8 s t(norm)=0.572205, mflops=8.73813 (err=1.1e-15) 17. Green: elapsed time t=1.13 s, 32 iters, t-(init.)=1.11 s t(norm)=0.705719, mflops=7.08497 (err=1.1e-15) 18. GSL: elapsed time t=1.17 s, 32 iters, t-(init.)=1.15 s t(norm)=0.73115, mflops=6.83854 (err=1.0e-15) 19. GSL DIT: elapsed time t=1.29 s, 32 iters, t-(init.)=1.27 s t(norm)=0.807444, mflops=6.19238 (err=1.0e-15) 20. GSL DIF: elapsed time t=1.62 s, 32 iters, t-(init.)=1.6 s t(norm)=1.01725, mflops=4.9152 (err=1.0e-15) 21. Krukar: elapsed time t=1.95 s, 64 iters, t-(init.)=1.91 s t(norm)=0.607173, mflops=8.23489 (err=1.1e-15) 22. Mayer (Buneman): elapsed time t=1.03 s, 32 iters, t-(init.)=1.01 s t(norm)=0.642141, mflops=7.78646 (err=1.1e-15) 23. Mayer (simple): elapsed time t=1.91 s, 64 iters, t-(init.)=1.87 s t(norm)=0.594457, mflops=8.41104 24. Mayer (lookup): elapsed time t=1.88 s, 64 iters, t-(init.)=1.84 s t(norm)=0.58492, mflops=8.54817 (err=1.1e-15) 25. Monro: elapsed time t=1.2 s, 32 iters, t-(init.)=1.18 s t(norm)=0.750224, mflops=6.66468 (err=1.1e-07) 26. NAPACK (f2c): elapsed time t=1.94 s, 32 iters, t-(init.)=1.92 s t(norm)=1.2207, mflops=4.096 (err=4.5e-14) 27. Nielsen: elapsed time t=1.27 s, 32 iters, t-(init.)=1.25 s t(norm)=0.794729, mflops=6.29146 (err=2.2e-14) 28. NR (C): elapsed time t=1.16 s, 32 iters, t-(init.)=1.14 s t(norm)=0.724792, mflops=6.89853 (err=1.0e-15) 29. NR (F): elapsed time t=1.25 s, 32 iters, t-(init.)=1.23 s t(norm)=0.782013, mflops=6.39376 (err=1.0e-15) 30. Ooura (C): elapsed time t=1.69 s, 64 iters, t-(init.)=1.65 s t(norm)=0.524521, mflops=9.53251 (err=1.1e-15) 31. Ooura (F): elapsed time t=1.7 s, 64 iters, t-(init.)=1.66 s t(norm)=0.5277, mflops=9.47508 (err=1.1e-15) 32. QFT: elapsed time t=1.99 s, 64 iters, t-(init.)=1.96 s t(norm)=0.623067, mflops=8.02482 (err=1.9e-15) 33. Ransom: elapsed time t=1.15 s, 32 iters, t-(init.)=1.13 s t(norm)=0.718435, mflops=6.95958 (err=2.6e-15) 34. SCIPORT: elapsed time t=1.63 s, 16 iters, t-(init.)=1.62 s t(norm)=2.05994, mflops=2.42726 (err=1.7e-07) 35. Singleton: elapsed time t=1.18 s, 32 iters, t-(init.)=1.16 s t(norm)=0.737508, mflops=6.77959 (err=1.6e-15) 36. Singleton (f2c): elapsed time t=1.15 s, 32 iters, t-(init.)=1.13 s t(norm)=0.718435, mflops=6.95958 (err=1.6e-15) 37. Sorensen: elapsed time t=1.04 s, 32 iters, t-(init.)=1.02 s t(norm)=0.648499, mflops=7.71012 (err=1.1e-15) 38. Sorensen DIT: elapsed time t=1.36 s, 32 iters, t-(init.)=1.34 s t(norm)=0.851949, mflops=5.8689 (err=1.0e-15) 39. Temperton: elapsed time t=1.1 s, 32 iters, t-(init.)=1.08 s t(norm)=0.686646, mflops=7.28178 (err=1.2e-07) 40. Temperton (f2c): elapsed time t=1.25 s, 32 iters, t-(init.)=1.23 s t(norm)=0.782013, mflops=6.39376 (err=1.0e-15) 41. Valkenburg: elapsed time t=1.57 s, 16 iters, t-(init.)=1.56 s t(norm)=1.98364, mflops=2.52062 (err=1.1e-15) Top mflops for N=4096 = 10.9227 Normalized results and averages for N=4096: fft 0: mflops = 6.49944 (norm. = 0.595041), norm. avg. (of 12) = 0.665887 fft 1: mflops = 7.41917 (norm. = 0.679245), norm. avg. (of 12) = 0.692037 fft 2: mflops = 5.8689 (norm. = 0.537313), norm. avg. (of 12) = 0.511091 fft 3: mflops = 4.39347 (norm. = 0.402235), norm. avg. (of 12) = 0.274305 fft 4: mflops = 5.04123 (norm. = 0.461538), norm. avg. (of 12) = 0.411951 fft 5: mflops = 2.32672 (norm. = 0.213018), norm. avg. (of 12) = 0.189049 fft 6: mflops = 8.02482 (norm. = 0.734694), norm. avg. (of 12) = 0.619589 fft 7: mflops = 7.21497 (norm. = 0.66055), norm. avg. (of 12) = 0.52424 fft 8: mflops = 5.1067 (norm. = 0.467532), norm. avg. (of 12) = 0.444711 fft 9: mflops = 8.88624 (norm. = 0.813559), norm. avg. (of 12) = 0.691736 fft 10: mflops = 9.53251 (norm. = 0.872727), norm. avg. (of 12) = 0.607205 fft 11: mflops = 5.20816 (norm. = 0.476821), norm. avg. (of 11) = 0.429482 fft 12: mflops = 8.27823 (norm. = 0.757895), norm. avg. (of 12) = 0.692126 fft 13: mflops = 8.10755 (norm. = 0.742268), norm. avg. (of 12) = 0.66037 fft 14: mflops = 10.9227 (norm. = 1), norm. avg. (of 12) = 0.982846 fft 15: mflops = 10.6998 (norm. = 0.979592), norm. avg. (of 12) = 0.96181 fft 16: mflops = 8.73813 (norm. = 0.8), norm. avg. (of 12) = 0.904245 fft 17: mflops = 7.08497 (norm. = 0.648649), norm. avg. (of 10) = 0.637676 fft 18: mflops = 6.83854 (norm. = 0.626087), norm. avg. (of 12) = 0.532352 fft 19: mflops = 6.19238 (norm. = 0.566929), norm. avg. (of 12) = 0.483549 fft 20: mflops = 4.9152 (norm. = 0.45), norm. avg. (of 12) = 0.384963 fft 21: mflops = 8.23489 (norm. = 0.753927), norm. avg. (of 12) = 0.734683 fft 22: mflops = 7.78646 (norm. = 0.712871), norm. avg. (of 11) = 0.610326 fft 23: mflops = 8.41104 (norm. = 0.770053), norm. avg. (of 11) = 0.662201 fft 24: mflops = 8.54817 (norm. = 0.782609), norm. avg. (of 11) = 0.691136 fft 25: mflops = 6.66468 (norm. = 0.610169), norm. avg. (of 11) = 0.519398 fft 26: mflops = 4.096 (norm. = 0.375), norm. avg. (of 12) = 0.258483 fft 27: mflops = 6.29146 (norm. = 0.576), norm. avg. (of 12) = 0.448137 fft 28: mflops = 6.89853 (norm. = 0.631579), norm. avg. (of 12) = 0.503748 fft 29: mflops = 6.39376 (norm. = 0.585366), norm. avg. (of 12) = 0.459588 fft 30: mflops = 9.53251 (norm. = 0.872727), norm. avg. (of 12) = 0.82821 fft 31: mflops = 9.47508 (norm. = 0.86747), norm. avg. (of 12) = 0.808126 fft 32: mflops = 8.02482 (norm. = 0.734694), norm. avg. (of 9) = 0.783672 fft 33: mflops = 6.95958 (norm. = 0.637168), norm. avg. (of 11) = 0.409099 fft 34: mflops = 2.42726 (norm. = 0.222222), norm. avg. (of 11) = 0.222311 fft 35: mflops = 6.77959 (norm. = 0.62069), norm. avg. (of 12) = 0.483838 fft 36: mflops = 6.95958 (norm. = 0.637168), norm. avg. (of 12) = 0.512538 fft 37: mflops = 7.71012 (norm. = 0.705882), norm. avg. (of 12) = 0.742159 fft 38: mflops = 5.8689 (norm. = 0.537313), norm. avg. (of 12) = 0.478894 fft 39: mflops = 7.28178 (norm. = 0.666667), norm. avg. (of 12) = 0.573416 fft 40: mflops = 6.39376 (norm. = 0.585366), norm. avg. (of 12) = 0.487469 fft 41: mflops = 2.52062 (norm. = 0.230769), norm. avg. (of 12) = 0.201909 Benchmarking for array size = 8192 (power of 2): 0. Arndt DIF: elapsed time t=1.23 s, 16 iters, t-(init.)=1.21 s t(norm)=0.710121, mflops=7.04106 (err=1.3e-15) 1. Arndt DIT: elapsed time t=1.14 s, 16 iters, t-(init.)=1.12 s t(norm)=0.657302, mflops=7.60686 (err=1.3e-15) 2. Arndt Split-Radix: elapsed time t=1.41 s, 16 iters, t-(init.)=1.39 s t(norm)=0.815758, mflops=6.12927 (err=1.3e-15) 3. Arndt 4-step: elapsed time t=2.02 s, 16 iters, t-(init.)=2.01 s t(norm)=1.17962, mflops=4.23865 (err=1.4e-15) 4. Bailey: elapsed time t=1.7 s, 16 iters, t-(init.)=1.68 s t(norm)=0.985953, mflops=5.07124 (err=1.3e-15) 5. Beauregard: elapsed time t=1.84 s, 8 iters, t-(init.)=1.83 s t(norm)=2.14797, mflops=2.32778 (err=1.3e-15) 6. Bergland: elapsed time t=1.06 s, 16 iters, t-(init.)=1.04 s t(norm)=0.610352, mflops=8.192 (err=1.4e-15) 7. Brenner: elapsed time t=1.19 s, 16 iters, t-(init.)=1.17 s t(norm)=0.686646, mflops=7.28178 (err=1.4e-15) 8. Burrus: elapsed time t=1.61 s, 16 iters, t-(init.)=1.6 s t(norm)=0.939002, mflops=5.3248 (err=1.3e-15) 9. CWP (min N) (N=8580): elapsed time t=1.9 s, 32 iters, t-(init.)=1.86 s t(norm)=0.545795, mflops=9.16095 10. CWP (best N) (N=9240): elapsed time t=1.96 s, 32 iters, t-(init.)=1.92 s t(norm)=0.563401, mflops=8.87467 11. Edelblute: elapsed time t=1.57 s, 16 iters, t-(init.)=1.56 s t(norm)=0.915527, mflops=5.46133 (err=1.3e-15) 12. FFTPACK: elapsed time t=1.1 s, 16 iters, t-(init.)=1.08 s t(norm)=0.633827, mflops=7.88859 (err=1.3e-15) 13. FFTPACK (f2c): elapsed time t=1.13 s, 16 iters, t-(init.)=1.11 s t(norm)=0.651433, mflops=7.67539 (err=1.3e-15) FFTW_MEASURE plan: (cost = 5.250000e-02) FFTW_TWIDDLE 32 FFTW_TWIDDLE 32 FFTW_NOTW 8 14. FFTW: elapsed time t=1.64 s, 32 iters, t-(init.)=1.61 s t(norm)=0.472436, mflops=10.5835 (err=1.3e-15) FFTW_ESTIMATE plan: (cost = 5.079040e+04) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.63 s, 32 iters, t-(init.)=1.6 s t(norm)=0.469501, mflops=10.6496 (err=1.4e-15) 16. Frigo-old: elapsed time t=1.02 s, 16 iters, t-(init.)=1 s t(norm)=0.586877, mflops=8.51968 (err=1.4e-15) 17. Green: elapsed time t=1.22 s, 16 iters, t-(init.)=1.2 s t(norm)=0.704252, mflops=7.09973 (err=1.4e-15) 18. GSL: elapsed time t=1.32 s, 16 iters, t-(init.)=1.31 s t(norm)=0.768808, mflops=6.50357 (err=1.3e-15) 19. GSL DIT: elapsed time t=1.35 s, 16 iters, t-(init.)=1.33 s t(norm)=0.780546, mflops=6.40577 (err=1.3e-15) 20. GSL DIF: elapsed time t=1.71 s, 16 iters, t-(init.)=1.7 s t(norm)=0.99769, mflops=5.01158 (err=1.3e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1.09 s, 16 iters, t-(init.)=1.07 s t(norm)=0.627958, mflops=7.96232 (err=1.3e-15) 23. Mayer (simple): elapsed time t=1.02 s, 16 iters, t-(init.)=1 s t(norm)=0.586877, mflops=8.51968 24. Mayer (lookup): elapsed time t=1.03 s, 16 iters, t-(init.)=1.02 s t(norm)=0.598614, mflops=8.35263 (err=1.4e-15) 25. Monro: elapsed time t=1.27 s, 16 iters, t-(init.)=1.26 s t(norm)=0.739464, mflops=6.76165 (err=1.2e-07) 26. NAPACK (f2c): elapsed time t=1.06 s, 8 iters, t-(init.)=1.05 s t(norm)=1.23244, mflops=4.05699 (err=4.1e-14) 27. Nielsen: elapsed time t=1.38 s, 16 iters, t-(init.)=1.37 s t(norm)=0.804021, mflops=6.21874 (err=1.1e-14) 28. NR (C): elapsed time t=1.21 s, 16 iters, t-(init.)=1.19 s t(norm)=0.698383, mflops=7.15939 (err=1.3e-15) 29. NR (F): elapsed time t=1.3 s, 16 iters, t-(init.)=1.28 s t(norm)=0.751202, mflops=6.656 (err=1.3e-15) 30. Ooura (C): elapsed time t=1.83 s, 32 iters, t-(init.)=1.79 s t(norm)=0.525254, mflops=9.5192 (err=1.4e-15) 31. Ooura (F): elapsed time t=1.82 s, 32 iters, t-(init.)=1.79 s t(norm)=0.525254, mflops=9.5192 (err=1.4e-15) 32. QFT: elapsed time t=1.1 s, 16 iters, t-(init.)=1.08 s t(norm)=0.633827, mflops=7.88859 (err=2.8e-15) 33. Ransom: elapsed time t=1.26 s, 16 iters, t-(init.)=1.24 s t(norm)=0.727727, mflops=6.87071 (err=3.2e-15) 34. SCIPORT: elapsed time t=1.79 s, 8 iters, t-(init.)=1.78 s t(norm)=2.08928, mflops=2.39317 (err=1.9e-07) 35. Singleton: elapsed time t=1.28 s, 16 iters, t-(init.)=1.27 s t(norm)=0.745333, mflops=6.70841 (err=2.0e-15) 36. Singleton (f2c): elapsed time t=1.23 s, 16 iters, t-(init.)=1.21 s t(norm)=0.710121, mflops=7.04106 (err=2.0e-15) 37. Sorensen: elapsed time t=1.11 s, 16 iters, t-(init.)=1.09 s t(norm)=0.639695, mflops=7.81622 (err=1.4e-15) 38. Sorensen DIT: elapsed time t=1.41 s, 16 iters, t-(init.)=1.4 s t(norm)=0.821627, mflops=6.08549 (err=1.3e-15) 39. Temperton: elapsed time t=1.21 s, 16 iters, t-(init.)=1.19 s t(norm)=0.698383, mflops=7.15939 (err=1.4e-07) 40. Temperton (f2c): elapsed time t=1.46 s, 16 iters, t-(init.)=1.44 s t(norm)=0.845102, mflops=5.91644 (err=1.3e-15) 41. Valkenburg: elapsed time t=1.79 s, 8 iters, t-(init.)=1.78 s t(norm)=2.08928, mflops=2.39317 (err=1.4e-15) Top mflops for N=8192 = 10.6496 Normalized results and averages for N=8192: fft 0: mflops = 7.04106 (norm. = 0.661157), norm. avg. (of 13) = 0.665524 fft 1: mflops = 7.60686 (norm. = 0.714286), norm. avg. (of 13) = 0.693748 fft 2: mflops = 6.12927 (norm. = 0.57554), norm. avg. (of 13) = 0.516049 fft 3: mflops = 4.23865 (norm. = 0.39801), norm. avg. (of 13) = 0.283821 fft 4: mflops = 5.07124 (norm. = 0.47619), norm. avg. (of 13) = 0.416893 fft 5: mflops = 2.32778 (norm. = 0.218579), norm. avg. (of 13) = 0.191321 fft 6: mflops = 8.192 (norm. = 0.769231), norm. avg. (of 13) = 0.6311 fft 7: mflops = 7.28178 (norm. = 0.683761), norm. avg. (of 13) = 0.536511 fft 8: mflops = 5.3248 (norm. = 0.5), norm. avg. (of 13) = 0.448964 fft 9: mflops = 9.16095 (norm. = 0.860215), norm. avg. (of 13) = 0.704696 fft 10: mflops = 8.87467 (norm. = 0.833333), norm. avg. (of 13) = 0.624599 fft 11: mflops = 5.46133 (norm. = 0.512821), norm. avg. (of 12) = 0.436427 fft 12: mflops = 7.88859 (norm. = 0.740741), norm. avg. (of 13) = 0.695866 fft 13: mflops = 7.67539 (norm. = 0.720721), norm. avg. (of 13) = 0.665012 fft 14: mflops = 10.5835 (norm. = 0.993789), norm. avg. (of 13) = 0.983688 fft 15: mflops = 10.6496 (norm. = 1), norm. avg. (of 13) = 0.964747 fft 16: mflops = 8.51968 (norm. = 0.8), norm. avg. (of 13) = 0.896226 fft 17: mflops = 7.09973 (norm. = 0.666667), norm. avg. (of 11) = 0.640312 fft 18: mflops = 6.50357 (norm. = 0.610687), norm. avg. (of 13) = 0.538378 fft 19: mflops = 6.40577 (norm. = 0.601504), norm. avg. (of 13) = 0.492623 fft 20: mflops = 5.01158 (norm. = 0.470588), norm. avg. (of 13) = 0.39155 fft 21: mflops = -1 (norm. = -0.0939002), norm. avg. (of 12) = 0.734683 fft 22: mflops = 7.96232 (norm. = 0.747664), norm. avg. (of 12) = 0.621771 fft 23: mflops = 8.51968 (norm. = 0.8), norm. avg. (of 12) = 0.673685 fft 24: mflops = 8.35263 (norm. = 0.784314), norm. avg. (of 12) = 0.698901 fft 25: mflops = 6.76165 (norm. = 0.634921), norm. avg. (of 12) = 0.529024 fft 26: mflops = 4.05699 (norm. = 0.380952), norm. avg. (of 13) = 0.267903 fft 27: mflops = 6.21874 (norm. = 0.583942), norm. avg. (of 13) = 0.458583 fft 28: mflops = 7.15939 (norm. = 0.672269), norm. avg. (of 13) = 0.516712 fft 29: mflops = 6.656 (norm. = 0.625), norm. avg. (of 13) = 0.472312 fft 30: mflops = 9.5192 (norm. = 0.893855), norm. avg. (of 13) = 0.83326 fft 31: mflops = 9.5192 (norm. = 0.893855), norm. avg. (of 13) = 0.81472 fft 32: mflops = 7.88859 (norm. = 0.740741), norm. avg. (of 10) = 0.779379 fft 33: mflops = 6.87071 (norm. = 0.645161), norm. avg. (of 12) = 0.428771 fft 34: mflops = 2.39317 (norm. = 0.224719), norm. avg. (of 12) = 0.222511 fft 35: mflops = 6.70841 (norm. = 0.629921), norm. avg. (of 13) = 0.495075 fft 36: mflops = 7.04106 (norm. = 0.661157), norm. avg. (of 13) = 0.52397 fft 37: mflops = 7.81622 (norm. = 0.733945), norm. avg. (of 13) = 0.741527 fft 38: mflops = 6.08549 (norm. = 0.571429), norm. avg. (of 13) = 0.486012 fft 39: mflops = 7.15939 (norm. = 0.672269), norm. avg. (of 13) = 0.58102 fft 40: mflops = 5.91644 (norm. = 0.555556), norm. avg. (of 13) = 0.492707 fft 41: mflops = 2.39317 (norm. = 0.224719), norm. avg. (of 13) = 0.203663 Benchmarking for array size = 16384 (power of 2): 0. Arndt DIF: elapsed time t=1.39 s, 8 iters, t-(init.)=1.37 s t(norm)=0.746591, mflops=6.69711 (err=1.7e-15) 1. Arndt DIT: elapsed time t=1.23 s, 8 iters, t-(init.)=1.21 s t(norm)=0.659398, mflops=7.58268 (err=1.8e-15) 2. Arndt Split-Radix: elapsed time t=1.55 s, 8 iters, t-(init.)=1.53 s t(norm)=0.833784, mflops=5.99676 (err=1.8e-15) 3. Arndt 4-step: elapsed time t=1.98 s, 8 iters, t-(init.)=1.96 s t(norm)=1.06812, mflops=4.68114 (err=1.8e-15) 4. Bailey: elapsed time t=1.98 s, 8 iters, t-(init.)=1.96 s t(norm)=1.06812, mflops=4.68114 (err=1.7e-15) 5. Beauregard: elapsed time t=1 s, 2 iters, t-(init.)=1 s t(norm)=2.17983, mflops=2.29376 (err=1.8e-15) 6. Bergland: elapsed time t=1.13 s, 8 iters, t-(init.)=1.11 s t(norm)=0.604902, mflops=8.2658 (err=1.8e-15) 7. Brenner: elapsed time t=1.3 s, 8 iters, t-(init.)=1.28 s t(norm)=0.697545, mflops=7.168 (err=1.8e-15) 8. Burrus: elapsed time t=1.75 s, 8 iters, t-(init.)=1.73 s t(norm)=0.942775, mflops=5.30349 (err=1.8e-15) 9. CWP (min N) (N=17160): elapsed time t=1.02 s, 8 iters, t-(init.)=1 s t(norm)=0.544957, mflops=9.17504 10. CWP (best N) (N=17160): elapsed time t=1.01 s, 8 iters, t-(init.)=0.99 s t(norm)=0.539507, mflops=9.26772 11. Edelblute: elapsed time t=1.72 s, 8 iters, t-(init.)=1.7 s t(norm)=0.926426, mflops=5.39708 (err=1.8e-15) 12. FFTPACK: elapsed time t=1.24 s, 8 iters, t-(init.)=1.22 s t(norm)=0.664847, mflops=7.52052 (err=1.8e-15) 13. FFTPACK (f2c): elapsed time t=1.26 s, 8 iters, t-(init.)=1.24 s t(norm)=0.675746, mflops=7.39923 (err=1.8e-15) FFTW_MEASURE plan: (cost = 1.150000e-01) FFTW_TWIDDLE 64 FFTW_TWIDDLE 16 FFTW_NOTW 16 14. FFTW: elapsed time t=1.78 s, 16 iters, t-(init.)=1.74 s t(norm)=0.474112, mflops=10.546 (err=1.9e-15) FFTW_ESTIMATE plan: (cost = 1.441792e+05) FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.86 s, 16 iters, t-(init.)=1.82 s t(norm)=0.495911, mflops=10.0825 (err=1.8e-15) 16. Frigo-old: elapsed time t=1.14 s, 8 iters, t-(init.)=1.12 s t(norm)=0.610352, mflops=8.192 (err=1.9e-15) 17. Green: elapsed time t=1.37 s, 8 iters, t-(init.)=1.35 s t(norm)=0.735692, mflops=6.79633 (err=1.8e-15) 18. GSL: elapsed time t=1.41 s, 8 iters, t-(init.)=1.39 s t(norm)=0.75749, mflops=6.60075 (err=1.8e-15) 19. GSL DIT: elapsed time t=1.49 s, 8 iters, t-(init.)=1.47 s t(norm)=0.801086, mflops=6.24152 (err=1.8e-15) 20. GSL DIF: elapsed time t=1.88 s, 8 iters, t-(init.)=1.87 s t(norm)=1.01907, mflops=4.90644 (err=1.8e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1.21 s, 8 iters, t-(init.)=1.19 s t(norm)=0.648499, mflops=7.71012 (err=1.8e-15) 23. Mayer (simple): elapsed time t=1.15 s, 8 iters, t-(init.)=1.13 s t(norm)=0.615801, mflops=8.1195 24. Mayer (lookup): elapsed time t=1.14 s, 8 iters, t-(init.)=1.12 s t(norm)=0.610352, mflops=8.192 (err=1.9e-15) 25. Monro: elapsed time t=1.37 s, 8 iters, t-(init.)=1.35 s t(norm)=0.735692, mflops=6.79633 (err=1.4e-07) 26. NAPACK (f2c): elapsed time t=1.13 s, 4 iters, t-(init.)=1.12 s t(norm)=1.2207, mflops=4.096 (err=2.3e-13) 27. Nielsen: elapsed time t=1.51 s, 8 iters, t-(init.)=1.49 s t(norm)=0.811986, mflops=6.15774 (err=1.3e-13) 28. NR (C): elapsed time t=1.35 s, 8 iters, t-(init.)=1.33 s t(norm)=0.724792, mflops=6.89853 (err=1.8e-15) 29. NR (F): elapsed time t=1.45 s, 8 iters, t-(init.)=1.43 s t(norm)=0.779288, mflops=6.41611 (err=1.8e-15) 30. Ooura (C): elapsed time t=1.97 s, 16 iters, t-(init.)=1.93 s t(norm)=0.525883, mflops=9.50781 (err=1.9e-15) 31. Ooura (F): elapsed time t=1.97 s, 16 iters, t-(init.)=1.93 s t(norm)=0.525883, mflops=9.50781 (err=1.9e-15) 32. QFT: elapsed time t=1.25 s, 8 iters, t-(init.)=1.23 s t(norm)=0.670297, mflops=7.45938 (err=3.8e-15) 33. Ransom: elapsed time t=1.29 s, 8 iters, t-(init.)=1.27 s t(norm)=0.692095, mflops=7.22444 (err=4.0e-15) 34. SCIPORT: elapsed time t=1 s, 2 iters, t-(init.)=1 s t(norm)=2.17983, mflops=2.29376 (err=2.1e-07) 35. Singleton: elapsed time t=1.37 s, 8 iters, t-(init.)=1.35 s t(norm)=0.735692, mflops=6.79633 (err=2.5e-15) 36. Singleton (f2c): elapsed time t=1.34 s, 8 iters, t-(init.)=1.32 s t(norm)=0.719343, mflops=6.95079 (err=2.5e-15) 37. Sorensen: elapsed time t=1.25 s, 8 iters, t-(init.)=1.23 s t(norm)=0.670297, mflops=7.45938 (err=1.9e-15) 38. Sorensen DIT: elapsed time t=1.53 s, 8 iters, t-(init.)=1.51 s t(norm)=0.822885, mflops=6.07619 (err=1.8e-15) 39. Temperton: elapsed time t=1.29 s, 8 iters, t-(init.)=1.27 s t(norm)=0.692095, mflops=7.22444 (err=1.5e-07) 40. Temperton (f2c): elapsed time t=1.51 s, 8 iters, t-(init.)=1.49 s t(norm)=0.811986, mflops=6.15774 (err=1.8e-15) 41. Valkenburg: elapsed time t=2 s, 4 iters, t-(init.)=1.99 s t(norm)=2.16893, mflops=2.30529 (err=1.7e-15) Top mflops for N=16384 = 10.546 Normalized results and averages for N=16384: fft 0: mflops = 6.69711 (norm. = 0.635036), norm. avg. (of 14) = 0.663346 fft 1: mflops = 7.58268 (norm. = 0.719008), norm. avg. (of 14) = 0.695552 fft 2: mflops = 5.99676 (norm. = 0.568627), norm. avg. (of 14) = 0.519804 fft 3: mflops = 4.68114 (norm. = 0.443878), norm. avg. (of 14) = 0.295253 fft 4: mflops = 4.68114 (norm. = 0.443878), norm. avg. (of 14) = 0.41882 fft 5: mflops = 2.29376 (norm. = 0.2175), norm. avg. (of 14) = 0.193191 fft 6: mflops = 8.2658 (norm. = 0.783784), norm. avg. (of 14) = 0.642006 fft 7: mflops = 7.168 (norm. = 0.679688), norm. avg. (of 14) = 0.546738 fft 8: mflops = 5.30349 (norm. = 0.50289), norm. avg. (of 14) = 0.452816 fft 9: mflops = 9.17504 (norm. = 0.87), norm. avg. (of 14) = 0.716503 fft 10: mflops = 9.26772 (norm. = 0.878788), norm. avg. (of 14) = 0.642756 fft 11: mflops = 5.39708 (norm. = 0.511765), norm. avg. (of 13) = 0.442222 fft 12: mflops = 7.52052 (norm. = 0.713115), norm. avg. (of 14) = 0.697098 fft 13: mflops = 7.39923 (norm. = 0.701613), norm. avg. (of 14) = 0.667626 fft 14: mflops = 10.546 (norm. = 1), norm. avg. (of 14) = 0.984853 fft 15: mflops = 10.0825 (norm. = 0.956044), norm. avg. (of 14) = 0.964126 fft 16: mflops = 8.192 (norm. = 0.776786), norm. avg. (of 14) = 0.887695 fft 17: mflops = 6.79633 (norm. = 0.644444), norm. avg. (of 12) = 0.640656 fft 18: mflops = 6.60075 (norm. = 0.625899), norm. avg. (of 14) = 0.544629 fft 19: mflops = 6.24152 (norm. = 0.591837), norm. avg. (of 14) = 0.499709 fft 20: mflops = 4.90644 (norm. = 0.465241), norm. avg. (of 14) = 0.396813 fft 21: mflops = -1 (norm. = -0.0948225), norm. avg. (of 12) = 0.734683 fft 22: mflops = 7.71012 (norm. = 0.731092), norm. avg. (of 13) = 0.630181 fft 23: mflops = 8.1195 (norm. = 0.769912), norm. avg. (of 13) = 0.681087 fft 24: mflops = 8.192 (norm. = 0.776786), norm. avg. (of 13) = 0.704892 fft 25: mflops = 6.79633 (norm. = 0.644444), norm. avg. (of 13) = 0.537903 fft 26: mflops = 4.096 (norm. = 0.388393), norm. avg. (of 14) = 0.27651 fft 27: mflops = 6.15774 (norm. = 0.583893), norm. avg. (of 14) = 0.467534 fft 28: mflops = 6.89853 (norm. = 0.654135), norm. avg. (of 14) = 0.526528 fft 29: mflops = 6.41611 (norm. = 0.608392), norm. avg. (of 14) = 0.482032 fft 30: mflops = 9.50781 (norm. = 0.901554), norm. avg. (of 14) = 0.838138 fft 31: mflops = 9.50781 (norm. = 0.901554), norm. avg. (of 14) = 0.820923 fft 32: mflops = 7.45938 (norm. = 0.707317), norm. avg. (of 11) = 0.772828 fft 33: mflops = 7.22444 (norm. = 0.685039), norm. avg. (of 13) = 0.448484 fft 34: mflops = 2.29376 (norm. = 0.2175), norm. avg. (of 13) = 0.222126 fft 35: mflops = 6.79633 (norm. = 0.644444), norm. avg. (of 14) = 0.505744 fft 36: mflops = 6.95079 (norm. = 0.659091), norm. avg. (of 14) = 0.533621 fft 37: mflops = 7.45938 (norm. = 0.707317), norm. avg. (of 14) = 0.739084 fft 38: mflops = 6.07619 (norm. = 0.576159), norm. avg. (of 14) = 0.492451 fft 39: mflops = 7.22444 (norm. = 0.685039), norm. avg. (of 14) = 0.58845 fft 40: mflops = 6.15774 (norm. = 0.583893), norm. avg. (of 14) = 0.49922 fft 41: mflops = 2.30529 (norm. = 0.218593), norm. avg. (of 14) = 0.20473 Benchmarking for array size = 32768 (power of 2): 0. Arndt DIF: elapsed time t=1.55 s, 4 iters, t-(init.)=1.53 s t(norm)=0.778198, mflops=6.4251 (err=2.1e-15) 1. Arndt DIT: elapsed time t=1.39 s, 4 iters, t-(init.)=1.36 s t(norm)=0.691732, mflops=7.22824 (err=2.1e-15) 2. Arndt Split-Radix: elapsed time t=1.77 s, 4 iters, t-(init.)=1.74 s t(norm)=0.88501, mflops=5.64966 (err=2.1e-15) 3. Arndt 4-step: elapsed time t=1.13 s, 2 iters, t-(init.)=1.12 s t(norm)=1.13932, mflops=4.38857 (err=2.1e-15) 4. Bailey: elapsed time t=1.07 s, 2 iters, t-(init.)=1.06 s t(norm)=1.07829, mflops=4.63698 (err=2.1e-15) 5. Beauregard: elapsed time t=1.09 s, 1 iters, t-(init.)=1.09 s t(norm)=2.21761, mflops=2.25468 (err=2.2e-15) 6. Bergland: elapsed time t=1.25 s, 4 iters, t-(init.)=1.23 s t(norm)=0.62561, mflops=7.9922 (err=2.2e-15) 7. Brenner: elapsed time t=1.44 s, 4 iters, t-(init.)=1.41 s t(norm)=0.717163, mflops=6.97191 (err=2.2e-15) 8. Burrus: elapsed time t=1.97 s, 4 iters, t-(init.)=1.94 s t(norm)=0.986735, mflops=5.06722 (err=2.1e-15) 9. CWP (min N) (N=34320): elapsed time t=1.07 s, 4 iters, t-(init.)=1.04 s t(norm)=0.528971, mflops=9.45231 10. CWP (best N) (N=34320): elapsed time t=1.07 s, 4 iters, t-(init.)=1.05 s t(norm)=0.534058, mflops=9.36229 11. Edelblute: elapsed time t=1.93 s, 4 iters, t-(init.)=1.91 s t(norm)=0.971476, mflops=5.14681 (err=2.1e-15) 12. FFTPACK: elapsed time t=1.32 s, 4 iters, t-(init.)=1.29 s t(norm)=0.656128, mflops=7.62047 (err=2.1e-15) 13. FFTPACK (f2c): elapsed time t=1.34 s, 4 iters, t-(init.)=1.31 s t(norm)=0.6663, mflops=7.50412 (err=2.1e-15) FFTW_MEASURE plan: (cost = 2.500000e-01) FFTW_TWIDDLE 64 FFTW_TWIDDLE 32 FFTW_NOTW 16 14. FFTW: elapsed time t=1.98 s, 8 iters, t-(init.)=1.93 s t(norm)=0.490824, mflops=10.1869 (err=2.1e-15) FFTW_ESTIMATE plan: (cost = 2.883584e+05) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1 s, 4 iters, t-(init.)=0.98 s t(norm)=0.498454, mflops=10.031 (err=2.1e-15) 16. Frigo-old: elapsed time t=1.29 s, 4 iters, t-(init.)=1.27 s t(norm)=0.645955, mflops=7.74047 (err=2.2e-15) 17. Green: elapsed time t=1.5 s, 4 iters, t-(init.)=1.47 s t(norm)=0.747681, mflops=6.68735 (err=2.2e-15) 18. GSL: elapsed time t=1.52 s, 4 iters, t-(init.)=1.49 s t(norm)=0.757853, mflops=6.59758 (err=2.2e-15) 19. GSL DIT: elapsed time t=1.7 s, 4 iters, t-(init.)=1.67 s t(norm)=0.849406, mflops=5.88647 (err=2.2e-15) 20. GSL DIF: elapsed time t=1.1 s, 2 iters, t-(init.)=1.08 s t(norm)=1.09863, mflops=4.55111 (err=2.2e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1.34 s, 4 iters, t-(init.)=1.32 s t(norm)=0.671387, mflops=7.44727 (err=2.1e-15) 23. Mayer (simple): elapsed time t=1.22 s, 4 iters, t-(init.)=1.2 s t(norm)=0.610352, mflops=8.192 24. Mayer (lookup): elapsed time t=1.23 s, 4 iters, t-(init.)=1.2 s t(norm)=0.610352, mflops=8.192 (err=2.1e-15) 25. Monro: elapsed time t=1.57 s, 4 iters, t-(init.)=1.55 s t(norm)=0.788371, mflops=6.34219 (err=1.5e-07) 26. NAPACK (f2c): elapsed time t=1.23 s, 2 iters, t-(init.)=1.22 s t(norm)=1.24105, mflops=4.02885 (err=5.7e-13) 27. Nielsen: elapsed time t=1.69 s, 4 iters, t-(init.)=1.67 s t(norm)=0.849406, mflops=5.88647 (err=2.3e-13) 28. NR (C): elapsed time t=1.53 s, 4 iters, t-(init.)=1.5 s t(norm)=0.762939, mflops=6.5536 (err=2.2e-15) 29. NR (F): elapsed time t=1.63 s, 4 iters, t-(init.)=1.6 s t(norm)=0.813802, mflops=6.144 (err=2.2e-15) 30. Ooura (C): elapsed time t=1.09 s, 4 iters, t-(init.)=1.07 s t(norm)=0.54423, mflops=9.18729 (err=2.2e-15) 31. Ooura (F): elapsed time t=1.1 s, 4 iters, t-(init.)=1.08 s t(norm)=0.549316, mflops=9.10222 (err=2.2e-15) 32. QFT: elapsed time t=1.41 s, 4 iters, t-(init.)=1.39 s t(norm)=0.706991, mflops=7.07223 (err=4.9e-15) 33. Ransom: elapsed time t=1.44 s, 4 iters, t-(init.)=1.42 s t(norm)=0.722249, mflops=6.92282 (err=3.6e-15) 34. SCIPORT: elapsed time t=1.09 s, 1 iters, t-(init.)=1.09 s t(norm)=2.21761, mflops=2.25468 (err=2.3e-07) 35. Singleton: elapsed time t=1.66 s, 4 iters, t-(init.)=1.64 s t(norm)=0.834147, mflops=5.99415 (err=3.2e-15) 36. Singleton (f2c): elapsed time t=1.6 s, 4 iters, t-(init.)=1.58 s t(norm)=0.80363, mflops=6.22177 (err=3.2e-15) 37. Sorensen: elapsed time t=1.45 s, 4 iters, t-(init.)=1.43 s t(norm)=0.727336, mflops=6.87441 (err=2.1e-15) 38. Sorensen DIT: elapsed time t=1.72 s, 4 iters, t-(init.)=1.7 s t(norm)=0.864665, mflops=5.78259 (err=2.1e-15) 39. Temperton: elapsed time t=1.48 s, 4 iters, t-(init.)=1.45 s t(norm)=0.737508, mflops=6.77959 (err=1.5e-07) 40. Temperton (f2c): elapsed time t=1.68 s, 4 iters, t-(init.)=1.66 s t(norm)=0.84432, mflops=5.92193 (err=2.2e-15) 41. Valkenburg: elapsed time t=1.09 s, 1 iters, t-(init.)=1.09 s t(norm)=2.21761, mflops=2.25468 (err=2.3e-15) Top mflops for N=32768 = 10.1869 Normalized results and averages for N=32768: fft 0: mflops = 6.4251 (norm. = 0.630719), norm. avg. (of 15) = 0.661171 fft 1: mflops = 7.22824 (norm. = 0.709559), norm. avg. (of 15) = 0.696486 fft 2: mflops = 5.64966 (norm. = 0.554598), norm. avg. (of 15) = 0.522124 fft 3: mflops = 4.38857 (norm. = 0.430804), norm. avg. (of 15) = 0.30429 fft 4: mflops = 4.63698 (norm. = 0.455189), norm. avg. (of 15) = 0.421245 fft 5: mflops = 2.25468 (norm. = 0.22133), norm. avg. (of 15) = 0.195067 fft 6: mflops = 7.9922 (norm. = 0.784553), norm. avg. (of 15) = 0.651509 fft 7: mflops = 6.97191 (norm. = 0.684397), norm. avg. (of 15) = 0.555915 fft 8: mflops = 5.06722 (norm. = 0.497423), norm. avg. (of 15) = 0.45579 fft 9: mflops = 9.45231 (norm. = 0.927885), norm. avg. (of 15) = 0.730595 fft 10: mflops = 9.36229 (norm. = 0.919048), norm. avg. (of 15) = 0.661175 fft 11: mflops = 5.14681 (norm. = 0.505236), norm. avg. (of 14) = 0.446723 fft 12: mflops = 7.62047 (norm. = 0.748062), norm. avg. (of 15) = 0.700495 fft 13: mflops = 7.50412 (norm. = 0.736641), norm. avg. (of 15) = 0.672227 fft 14: mflops = 10.1869 (norm. = 1), norm. avg. (of 15) = 0.985863 fft 15: mflops = 10.031 (norm. = 0.984694), norm. avg. (of 15) = 0.965497 fft 16: mflops = 7.74047 (norm. = 0.759843), norm. avg. (of 15) = 0.879171 fft 17: mflops = 6.68735 (norm. = 0.656463), norm. avg. (of 13) = 0.641872 fft 18: mflops = 6.59758 (norm. = 0.647651), norm. avg. (of 15) = 0.551497 fft 19: mflops = 5.88647 (norm. = 0.577844), norm. avg. (of 15) = 0.504918 fft 20: mflops = 4.55111 (norm. = 0.446759), norm. avg. (of 15) = 0.400143 fft 21: mflops = -1 (norm. = -0.0981649), norm. avg. (of 12) = 0.734683 fft 22: mflops = 7.44727 (norm. = 0.731061), norm. avg. (of 14) = 0.637386 fft 23: mflops = 8.192 (norm. = 0.804167), norm. avg. (of 14) = 0.689878 fft 24: mflops = 8.192 (norm. = 0.804167), norm. avg. (of 14) = 0.711983 fft 25: mflops = 6.34219 (norm. = 0.622581), norm. avg. (of 14) = 0.543951 fft 26: mflops = 4.02885 (norm. = 0.395492), norm. avg. (of 15) = 0.284442 fft 27: mflops = 5.88647 (norm. = 0.577844), norm. avg. (of 15) = 0.474888 fft 28: mflops = 6.5536 (norm. = 0.643333), norm. avg. (of 15) = 0.534315 fft 29: mflops = 6.144 (norm. = 0.603125), norm. avg. (of 15) = 0.490105 fft 30: mflops = 9.18729 (norm. = 0.901869), norm. avg. (of 15) = 0.842387 fft 31: mflops = 9.10222 (norm. = 0.893519), norm. avg. (of 15) = 0.825762 fft 32: mflops = 7.07223 (norm. = 0.694245), norm. avg. (of 12) = 0.766279 fft 33: mflops = 6.92282 (norm. = 0.679577), norm. avg. (of 14) = 0.464991 fft 34: mflops = 2.25468 (norm. = 0.22133), norm. avg. (of 14) = 0.222069 fft 35: mflops = 5.99415 (norm. = 0.588415), norm. avg. (of 15) = 0.511256 fft 36: mflops = 6.22177 (norm. = 0.610759), norm. avg. (of 15) = 0.538764 fft 37: mflops = 6.87441 (norm. = 0.674825), norm. avg. (of 15) = 0.7348 fft 38: mflops = 5.78259 (norm. = 0.567647), norm. avg. (of 15) = 0.497464 fft 39: mflops = 6.77959 (norm. = 0.665517), norm. avg. (of 15) = 0.593588 fft 40: mflops = 5.92193 (norm. = 0.581325), norm. avg. (of 15) = 0.504694 fft 41: mflops = 2.25468 (norm. = 0.22133), norm. avg. (of 15) = 0.205837 Benchmarking for array size = 65536 (power of 2): 0. Arndt DIF: elapsed time t=1.75 s, 2 iters, t-(init.)=1.73 s t(norm)=0.824928, mflops=6.06113 (err=4.0e-15) 1. Arndt DIT: elapsed time t=1.57 s, 2 iters, t-(init.)=1.54 s t(norm)=0.734329, mflops=6.80894 (err=4.1e-15) 2. Arndt Split-Radix: elapsed time t=1.96 s, 2 iters, t-(init.)=1.94 s t(norm)=0.925064, mflops=5.40503 (err=4.1e-15) 3. Arndt 4-step: elapsed time t=1.13 s, 1 iters, t-(init.)=1.11 s t(norm)=1.05858, mflops=4.72332 (err=4.2e-15) 4. Bailey: elapsed time t=1.2 s, 1 iters, t-(init.)=1.18 s t(norm)=1.12534, mflops=4.44312 (err=4.0e-15) 5. Beauregard: elapsed time t=2.33 s, 1 iters, t-(init.)=2.32 s t(norm)=2.21252, mflops=2.25986 (err=4.2e-15) 6. Bergland: elapsed time t=1.38 s, 2 iters, t-(init.)=1.35 s t(norm)=0.64373, mflops=7.76723 (err=4.3e-15) 7. Brenner: elapsed time t=1.58 s, 2 iters, t-(init.)=1.56 s t(norm)=0.743866, mflops=6.72164 (err=4.3e-15) 8. Burrus: elapsed time t=1.08 s, 1 iters, t-(init.)=1.07 s t(norm)=1.02043, mflops=4.89989 (err=4.1e-15) 9. CWP (min N) (N=72072): elapsed time t=1.24 s, 2 iters, t-(init.)=1.21 s t(norm)=0.576973, mflops=8.66592 10. CWP (best N) (N=72072): elapsed time t=1.25 s, 2 iters, t-(init.)=1.23 s t(norm)=0.58651, mflops=8.52501 11. Edelblute: elapsed time t=1.06 s, 1 iters, t-(init.)=1.04 s t(norm)=0.991821, mflops=5.04123 (err=4.1e-15) 12. FFTPACK: elapsed time t=1.41 s, 2 iters, t-(init.)=1.38 s t(norm)=0.658035, mflops=7.59838 (err=4.2e-15) 13. FFTPACK (f2c): elapsed time t=1.45 s, 2 iters, t-(init.)=1.42 s t(norm)=0.677109, mflops=7.38434 (err=4.2e-15) FFTW_MEASURE plan: (cost = 5.500000e-01) FFTW_TWIDDLE 32 FFTW_TWIDDLE 32 FFTW_NOTW 64 14. FFTW: elapsed time t=1.07 s, 2 iters, t-(init.)=1.04 s t(norm)=0.495911, mflops=10.0825 (err=4.4e-15) FFTW_ESTIMATE plan: (cost = 5.767168e+05) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.06 s, 2 iters, t-(init.)=1.03 s t(norm)=0.491142, mflops=10.1803 (err=4.3e-15) 16. Frigo-old: elapsed time t=1.41 s, 2 iters, t-(init.)=1.38 s t(norm)=0.658035, mflops=7.59838 (err=4.4e-15) 17. Green: elapsed time t=1.62 s, 2 iters, t-(init.)=1.59 s t(norm)=0.758171, mflops=6.59482 (err=4.3e-15) 18. GSL: elapsed time t=1.62 s, 2 iters, t-(init.)=1.59 s t(norm)=0.758171, mflops=6.59482 (err=4.2e-15) 19. GSL DIT: elapsed time t=1.9 s, 2 iters, t-(init.)=1.87 s t(norm)=0.891685, mflops=5.60736 (err=4.2e-15) 20. GSL DIF: elapsed time t=1.19 s, 1 iters, t-(init.)=1.18 s t(norm)=1.12534, mflops=4.44312 (err=4.2e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1.42 s, 2 iters, t-(init.)=1.39 s t(norm)=0.662804, mflops=7.54371 (err=4.2e-15) 23. Mayer (simple): elapsed time t=1.35 s, 2 iters, t-(init.)=1.33 s t(norm)=0.634193, mflops=7.88403 24. Mayer (lookup): elapsed time t=1.37 s, 2 iters, t-(init.)=1.34 s t(norm)=0.638962, mflops=7.82519 (err=4.2e-15) 25. Monro: elapsed time t=1.75 s, 2 iters, t-(init.)=1.72 s t(norm)=0.82016, mflops=6.09637 (err=1.6e-07) 26. NAPACK (f2c): elapsed time t=1.37 s, 1 iters, t-(init.)=1.36 s t(norm)=1.297, mflops=3.85506 (err=8.9e-13) 27. Nielsen: elapsed time t=1.01 s, 1 iters, t-(init.)=1 s t(norm)=0.953674, mflops=5.24288 (err=2.7e-13) 28. NR (C): elapsed time t=1.81 s, 2 iters, t-(init.)=1.79 s t(norm)=0.853539, mflops=5.85797 (err=4.2e-15) 29. NR (F): elapsed time t=1.9 s, 2 iters, t-(init.)=1.87 s t(norm)=0.891685, mflops=5.60736 (err=4.2e-15) 30. Ooura (C): elapsed time t=1.21 s, 2 iters, t-(init.)=1.18 s t(norm)=0.562668, mflops=8.88624 (err=4.4e-15) 31. Ooura (F): elapsed time t=1.18 s, 2 iters, t-(init.)=1.15 s t(norm)=0.548363, mflops=9.11805 (err=4.4e-15) 32. QFT: elapsed time t=1.59 s, 2 iters, t-(init.)=1.57 s t(norm)=0.748634, mflops=6.67883 (err=7.9e-15) 33. Ransom: elapsed time t=1.57 s, 2 iters, t-(init.)=1.54 s t(norm)=0.734329, mflops=6.80894 (err=6.9e-15) 34. SCIPORT: elapsed time t=2.44 s, 1 iters, t-(init.)=2.43 s t(norm)=2.31743, mflops=2.15756 (err=2.5e-07) 35. Singleton: elapsed time t=1.67 s, 2 iters, t-(init.)=1.64 s t(norm)=0.782013, mflops=6.39376 (err=5.6e-15) 36. Singleton (f2c): elapsed time t=1.64 s, 2 iters, t-(init.)=1.62 s t(norm)=0.772476, mflops=6.47269 (err=5.6e-15) 37. Sorensen: elapsed time t=1.58 s, 2 iters, t-(init.)=1.56 s t(norm)=0.743866, mflops=6.72164 (err=4.2e-15) 38. Sorensen DIT: elapsed time t=1.91 s, 2 iters, t-(init.)=1.88 s t(norm)=0.896454, mflops=5.57753 (err=4.1e-15) 39. Temperton: elapsed time t=1.58 s, 2 iters, t-(init.)=1.55 s t(norm)=0.739098, mflops=6.76501 (err=1.8e-07) 40. Temperton (f2c): elapsed time t=1.81 s, 2 iters, t-(init.)=1.79 s t(norm)=0.853539, mflops=5.85797 (err=4.2e-15) 41. Valkenburg: elapsed time t=2.38 s, 1 iters, t-(init.)=2.37 s t(norm)=2.26021, mflops=2.21219 (err=4.0e-15) Top mflops for N=65536 = 10.1803 Normalized results and averages for N=65536: fft 0: mflops = 6.06113 (norm. = 0.595376), norm. avg. (of 16) = 0.657059 fft 1: mflops = 6.80894 (norm. = 0.668831), norm. avg. (of 16) = 0.694758 fft 2: mflops = 5.40503 (norm. = 0.530928), norm. avg. (of 16) = 0.522674 fft 3: mflops = 4.72332 (norm. = 0.463964), norm. avg. (of 16) = 0.314269 fft 4: mflops = 4.44312 (norm. = 0.436441), norm. avg. (of 16) = 0.422195 fft 5: mflops = 2.25986 (norm. = 0.221983), norm. avg. (of 16) = 0.196749 fft 6: mflops = 7.76723 (norm. = 0.762963), norm. avg. (of 16) = 0.658475 fft 7: mflops = 6.72164 (norm. = 0.660256), norm. avg. (of 16) = 0.562437 fft 8: mflops = 4.89989 (norm. = 0.481308), norm. avg. (of 16) = 0.457385 fft 9: mflops = 8.66592 (norm. = 0.85124), norm. avg. (of 16) = 0.738135 fft 10: mflops = 8.52501 (norm. = 0.837398), norm. avg. (of 16) = 0.672189 fft 11: mflops = 5.04123 (norm. = 0.495192), norm. avg. (of 15) = 0.449954 fft 12: mflops = 7.59838 (norm. = 0.746377), norm. avg. (of 16) = 0.703363 fft 13: mflops = 7.38434 (norm. = 0.725352), norm. avg. (of 16) = 0.675548 fft 14: mflops = 10.0825 (norm. = 0.990385), norm. avg. (of 16) = 0.986146 fft 15: mflops = 10.1803 (norm. = 1), norm. avg. (of 16) = 0.967653 fft 16: mflops = 7.59838 (norm. = 0.746377), norm. avg. (of 16) = 0.870872 fft 17: mflops = 6.59482 (norm. = 0.647799), norm. avg. (of 14) = 0.642295 fft 18: mflops = 6.59482 (norm. = 0.647799), norm. avg. (of 16) = 0.557516 fft 19: mflops = 5.60736 (norm. = 0.550802), norm. avg. (of 16) = 0.507786 fft 20: mflops = 4.44312 (norm. = 0.436441), norm. avg. (of 16) = 0.402412 fft 21: mflops = -1 (norm. = -0.0982285), norm. avg. (of 12) = 0.734683 fft 22: mflops = 7.54371 (norm. = 0.741007), norm. avg. (of 15) = 0.644294 fft 23: mflops = 7.88403 (norm. = 0.774436), norm. avg. (of 15) = 0.695515 fft 24: mflops = 7.82519 (norm. = 0.768657), norm. avg. (of 15) = 0.715761 fft 25: mflops = 6.09637 (norm. = 0.598837), norm. avg. (of 15) = 0.54761 fft 26: mflops = 3.85506 (norm. = 0.378676), norm. avg. (of 16) = 0.290332 fft 27: mflops = 5.24288 (norm. = 0.515), norm. avg. (of 16) = 0.477395 fft 28: mflops = 5.85797 (norm. = 0.575419), norm. avg. (of 16) = 0.536884 fft 29: mflops = 5.60736 (norm. = 0.550802), norm. avg. (of 16) = 0.493899 fft 30: mflops = 8.88624 (norm. = 0.872881), norm. avg. (of 16) = 0.844293 fft 31: mflops = 9.11805 (norm. = 0.895652), norm. avg. (of 16) = 0.830131 fft 32: mflops = 6.67883 (norm. = 0.656051), norm. avg. (of 13) = 0.7578 fft 33: mflops = 6.80894 (norm. = 0.668831), norm. avg. (of 15) = 0.47858 fft 34: mflops = 2.15756 (norm. = 0.211934), norm. avg. (of 15) = 0.221393 fft 35: mflops = 6.39376 (norm. = 0.628049), norm. avg. (of 16) = 0.518555 fft 36: mflops = 6.47269 (norm. = 0.635802), norm. avg. (of 16) = 0.544829 fft 37: mflops = 6.72164 (norm. = 0.660256), norm. avg. (of 16) = 0.730141 fft 38: mflops = 5.57753 (norm. = 0.547872), norm. avg. (of 16) = 0.500615 fft 39: mflops = 6.76501 (norm. = 0.664516), norm. avg. (of 16) = 0.598021 fft 40: mflops = 5.85797 (norm. = 0.575419), norm. avg. (of 16) = 0.509114 fft 41: mflops = 2.21219 (norm. = 0.2173), norm. avg. (of 16) = 0.206553 Benchmarking for array size = 131072 (power of 2): 0. Arndt DIF: elapsed time t=1.85 s, 1 iters, t-(init.)=1.82 s t(norm)=0.816794, mflops=6.12149 (err=2.8e-15) 1. Arndt DIT: elapsed time t=1.7 s, 1 iters, t-(init.)=1.67 s t(norm)=0.749476, mflops=6.67133 (err=2.8e-15) 2. Arndt Split-Radix: elapsed time t=2.16 s, 1 iters, t-(init.)=2.13 s t(norm)=0.955918, mflops=5.23057 (err=2.8e-15) 3. Arndt 4-step: elapsed time t=2.52 s, 1 iters, t-(init.)=2.49 s t(norm)=1.11748, mflops=4.47435 (err=2.8e-15) 4. Bailey: elapsed time t=2.57 s, 1 iters, t-(init.)=2.54 s t(norm)=1.13992, mflops=4.38627 (err=2.8e-15) 5. Beauregard: elapsed time t=5.01 s, 1 iters, t-(init.)=4.98 s t(norm)=2.23496, mflops=2.23717 (err=2.9e-15) 6. Bergland: elapsed time t=1.49 s, 1 iters, t-(init.)=1.46 s t(norm)=0.65523, mflops=7.6309 (err=2.9e-15) 7. Brenner: elapsed time t=1.71 s, 1 iters, t-(init.)=1.68 s t(norm)=0.753964, mflops=6.63162 (err=2.9e-15) 8. Burrus: elapsed time t=2.35 s, 1 iters, t-(init.)=2.32 s t(norm)=1.04119, mflops=4.80221 (err=2.8e-15) 9. CWP (min N) (N=144144): elapsed time t=1.32 s, 1 iters, t-(init.)=1.29 s t(norm)=0.578936, mflops=8.63653 10. CWP (best N) (N=144144): elapsed time t=1.32 s, 1 iters, t-(init.)=1.29 s t(norm)=0.578936, mflops=8.63653 11. Edelblute: elapsed time t=2.32 s, 1 iters, t-(init.)=2.29 s t(norm)=1.02772, mflops=4.86512 (err=2.8e-15) 12. FFTPACK: elapsed time t=1.6 s, 1 iters, t-(init.)=1.57 s t(norm)=0.704597, mflops=7.09625 (err=2.9e-15) 13. FFTPACK (f2c): elapsed time t=1.64 s, 1 iters, t-(init.)=1.61 s t(norm)=0.722549, mflops=6.91995 (err=2.9e-15) FFTW_MEASURE plan: (cost = 1.200000e+00) FFTW_TWIDDLE 8 FFTW_TWIDDLE 32 FFTW_TWIDDLE 32 FFTW_NOTW 16 14. FFTW: elapsed time t=1.15 s, 1 iters, t-(init.)=1.12 s t(norm)=0.502642, mflops=9.94743 (err=2.8e-15) FFTW_ESTIMATE plan: (cost = 1.153434e+06) FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_TWIDDLE 16 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=1.15 s, 1 iters, t-(init.)=1.12 s t(norm)=0.502642, mflops=9.94743 (err=2.9e-15) 16. Frigo-old: elapsed time t=1.55 s, 1 iters, t-(init.)=1.53 s t(norm)=0.686646, mflops=7.28178 (err=2.8e-15) 17. Green: elapsed time t=1.81 s, 1 iters, t-(init.)=1.78 s t(norm)=0.798842, mflops=6.25906 (err=2.9e-15) 18. GSL: elapsed time t=1.79 s, 1 iters, t-(init.)=1.76 s t(norm)=0.789867, mflops=6.33018 (err=2.9e-15) 19. GSL DIT: elapsed time t=2.11 s, 1 iters, t-(init.)=2.08 s t(norm)=0.933479, mflops=5.35631 (err=2.9e-15) 20. GSL DIF: elapsed time t=2.57 s, 1 iters, t-(init.)=2.54 s t(norm)=1.13992, mflops=4.38627 (err=2.9e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=1.59 s, 1 iters, t-(init.)=1.56 s t(norm)=0.700109, mflops=7.14174 (err=2.8e-15) 23. Mayer (simple): elapsed time t=1.51 s, 1 iters, t-(init.)=1.48 s t(norm)=0.664206, mflops=7.52778 24. Mayer (lookup): elapsed time t=1.53 s, 1 iters, t-(init.)=1.5 s t(norm)=0.673182, mflops=7.42741 (err=2.8e-15) 25. Monro: elapsed time t=1.87 s, 1 iters, t-(init.)=1.84 s t(norm)=0.82577, mflops=6.05496 (err=1.7e-07) 26. NAPACK (f2c): elapsed time t=2.81 s, 1 iters, t-(init.)=2.78 s t(norm)=1.24763, mflops=4.0076 (err=2.1e-12) 27. Nielsen: elapsed time t=2.04 s, 1 iters, t-(init.)=2.02 s t(norm)=0.906552, mflops=5.51541 (err=9.6e-13) 28. NR (C): elapsed time t=1.94 s, 1 iters, t-(init.)=1.92 s t(norm)=0.861673, mflops=5.80267 (err=2.9e-15) 29. NR (F): elapsed time t=2.02 s, 1 iters, t-(init.)=1.99 s t(norm)=0.893088, mflops=5.59855 (err=2.9e-15) 30. Ooura (C): elapsed time t=1.28 s, 1 iters, t-(init.)=1.25 s t(norm)=0.560985, mflops=8.9129 (err=2.8e-15) 31. Ooura (F): elapsed time t=1.28 s, 1 iters, t-(init.)=1.26 s t(norm)=0.565473, mflops=8.84216 (err=2.8e-15) 32. QFT: elapsed time t=1.75 s, 1 iters, t-(init.)=1.72 s t(norm)=0.771915, mflops=6.4774 (err=8.6e-15) 33. Ransom: elapsed time t=1.71 s, 1 iters, t-(init.)=1.68 s t(norm)=0.753964, mflops=6.63162 (err=4.0e-15) 34. SCIPORT: elapsed time t=5.07 s, 1 iters, t-(init.)=5.04 s t(norm)=2.26189, mflops=2.21054 (err=2.7e-07) 35. Singleton: elapsed time t=1.85 s, 1 iters, t-(init.)=1.82 s t(norm)=0.816794, mflops=6.12149 (err=4.3e-15) 36. Singleton (f2c): elapsed time t=1.82 s, 1 iters, t-(init.)=1.79 s t(norm)=0.80333, mflops=6.22409 (err=4.2e-15) 37. Sorensen: elapsed time t=1.74 s, 1 iters, t-(init.)=1.71 s t(norm)=0.767427, mflops=6.51527 (err=2.8e-15) 38. Sorensen DIT: elapsed time t=2.07 s, 1 iters, t-(init.)=2.05 s t(norm)=0.920015, mflops=5.43469 (err=2.8e-15) 39. Temperton: elapsed time t=1.78 s, 1 iters, t-(init.)=1.76 s t(norm)=0.789867, mflops=6.33018 (err=2.0e-07) 40. Temperton (f2c): elapsed time t=2.02 s, 1 iters, t-(init.)=1.99 s t(norm)=0.893088, mflops=5.59855 (err=2.9e-15) 41. Valkenburg: elapsed time t=5.18 s, 1 iters, t-(init.)=5.15 s t(norm)=2.31126, mflops=2.16332 (err=3.1e-15) Top mflops for N=131072 = 9.94743 Normalized results and averages for N=131072: fft 0: mflops = 6.12149 (norm. = 0.615385), norm. avg. (of 17) = 0.654607 fft 1: mflops = 6.67133 (norm. = 0.670659), norm. avg. (of 17) = 0.69334 fft 2: mflops = 5.23057 (norm. = 0.525822), norm. avg. (of 17) = 0.522859 fft 3: mflops = 4.47435 (norm. = 0.449799), norm. avg. (of 17) = 0.322242 fft 4: mflops = 4.38627 (norm. = 0.440945), norm. avg. (of 17) = 0.423297 fft 5: mflops = 2.23717 (norm. = 0.2249), norm. avg. (of 17) = 0.198405 fft 6: mflops = 7.6309 (norm. = 0.767123), norm. avg. (of 17) = 0.664866 fft 7: mflops = 6.63162 (norm. = 0.666667), norm. avg. (of 17) = 0.568568 fft 8: mflops = 4.80221 (norm. = 0.482759), norm. avg. (of 17) = 0.458878 fft 9: mflops = 8.63653 (norm. = 0.868217), norm. avg. (of 17) = 0.745787 fft 10: mflops = 8.63653 (norm. = 0.868217), norm. avg. (of 17) = 0.68372 fft 11: mflops = 4.86512 (norm. = 0.489083), norm. avg. (of 16) = 0.4524 fft 12: mflops = 7.09625 (norm. = 0.713376), norm. avg. (of 17) = 0.703952 fft 13: mflops = 6.91995 (norm. = 0.695652), norm. avg. (of 17) = 0.67673 fft 14: mflops = 9.94743 (norm. = 1), norm. avg. (of 17) = 0.986961 fft 15: mflops = 9.94743 (norm. = 1), norm. avg. (of 17) = 0.969556 fft 16: mflops = 7.28178 (norm. = 0.732026), norm. avg. (of 17) = 0.862704 fft 17: mflops = 6.25906 (norm. = 0.629213), norm. avg. (of 15) = 0.641423 fft 18: mflops = 6.33018 (norm. = 0.636364), norm. avg. (of 17) = 0.562154 fft 19: mflops = 5.35631 (norm. = 0.538462), norm. avg. (of 17) = 0.50959 fft 20: mflops = 4.38627 (norm. = 0.440945), norm. avg. (of 17) = 0.404678 fft 21: mflops = -1 (norm. = -0.100528), norm. avg. (of 12) = 0.734683 fft 22: mflops = 7.14174 (norm. = 0.717949), norm. avg. (of 16) = 0.648898 fft 23: mflops = 7.52778 (norm. = 0.756757), norm. avg. (of 16) = 0.699343 fft 24: mflops = 7.42741 (norm. = 0.746667), norm. avg. (of 16) = 0.717693 fft 25: mflops = 6.05496 (norm. = 0.608696), norm. avg. (of 16) = 0.551428 fft 26: mflops = 4.0076 (norm. = 0.402878), norm. avg. (of 17) = 0.296952 fft 27: mflops = 5.51541 (norm. = 0.554455), norm. avg. (of 17) = 0.481928 fft 28: mflops = 5.80267 (norm. = 0.583333), norm. avg. (of 17) = 0.539616 fft 29: mflops = 5.59855 (norm. = 0.562814), norm. avg. (of 17) = 0.497953 fft 30: mflops = 8.9129 (norm. = 0.896), norm. avg. (of 17) = 0.847334 fft 31: mflops = 8.84216 (norm. = 0.888889), norm. avg. (of 17) = 0.833587 fft 32: mflops = 6.4774 (norm. = 0.651163), norm. avg. (of 14) = 0.750183 fft 33: mflops = 6.63162 (norm. = 0.666667), norm. avg. (of 16) = 0.490336 fft 34: mflops = 2.21054 (norm. = 0.222222), norm. avg. (of 16) = 0.221445 fft 35: mflops = 6.12149 (norm. = 0.615385), norm. avg. (of 17) = 0.524251 fft 36: mflops = 6.22409 (norm. = 0.625698), norm. avg. (of 17) = 0.549586 fft 37: mflops = 6.51527 (norm. = 0.654971), norm. avg. (of 17) = 0.725719 fft 38: mflops = 5.43469 (norm. = 0.546341), norm. avg. (of 17) = 0.503305 fft 39: mflops = 6.33018 (norm. = 0.636364), norm. avg. (of 17) = 0.600276 fft 40: mflops = 5.59855 (norm. = 0.562814), norm. avg. (of 17) = 0.512273 fft 41: mflops = 2.16332 (norm. = 0.217476), norm. avg. (of 17) = 0.207195 Benchmarking for array size = 262144 (power of 2): 0. Arndt DIF: elapsed time t=4.09 s, 1 iters, t-(init.)=4.03 s t(norm)=0.854068, mflops=5.85433 (err=6.7e-15) 1. Arndt DIT: elapsed time t=3.69 s, 1 iters, t-(init.)=3.64 s t(norm)=0.771417, mflops=6.48158 (err=6.7e-15) 2. Arndt Split-Radix: elapsed time t=4.65 s, 1 iters, t-(init.)=4.6 s t(norm)=0.974867, mflops=5.1289 (err=6.7e-15) 3. Arndt 4-step: elapsed time t=4.82 s, 1 iters, t-(init.)=4.76 s t(norm)=1.00878, mflops=4.9565 (err=6.8e-15) 4. Bailey: elapsed time t=5.61 s, 1 iters, t-(init.)=5.55 s t(norm)=1.1762, mflops=4.25098 (err=6.7e-15) 5. Beauregard: elapsed time t=10.69 s, 1 iters, t-(init.)=10.64 s t(norm)=2.25491, mflops=2.21738 (err=6.8e-15) 6. Bergland: elapsed time t=3.13 s, 1 iters, t-(init.)=3.07 s t(norm)=0.650618, mflops=7.685 (err=6.8e-15) 7. Brenner: elapsed time t=3.61 s, 1 iters, t-(init.)=3.55 s t(norm)=0.752343, mflops=6.6459 (err=6.9e-15) 8. Burrus: elapsed time t=5.02 s, 1 iters, t-(init.)=4.97 s t(norm)=1.05328, mflops=4.74707 (err=6.7e-15) 9. CWP (min N) (N=360360): elapsed time t=3.6 s, 1 iters, t-(init.)=3.53 s t(norm)=0.748105, mflops=6.68356 10. CWP (best N) (N=360360): elapsed time t=3.61 s, 1 iters, t-(init.)=3.53 s t(norm)=0.748105, mflops=6.68356 11. Edelblute: elapsed time t=4.97 s, 1 iters, t-(init.)=4.91 s t(norm)=1.04056, mflops=4.80508 (err=6.7e-15) 12. FFTPACK: elapsed time t=3.45 s, 1 iters, t-(init.)=3.39 s t(norm)=0.718435, mflops=6.95958 (err=6.8e-15) 13. FFTPACK (f2c): elapsed time t=3.51 s, 1 iters, t-(init.)=3.45 s t(norm)=0.73115, mflops=6.83854 (err=6.8e-15) FFTW_MEASURE plan: (cost = 2.580000e+00) FFTW_TWIDDLE 8 FFTW_TWIDDLE 64 FFTW_TWIDDLE 32 FFTW_NOTW 16 14. FFTW: elapsed time t=2.51 s, 1 iters, t-(init.)=2.46 s t(norm)=0.521342, mflops=9.59063 (err=6.9e-15) FFTW_ESTIMATE plan: (cost = 2.988442e+06) FFTW_TWIDDLE 16 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_TWIDDLE 8 FFTW_NOTW 32 15. FFTW_ESTIMATE: elapsed time t=2.59 s, 1 iters, t-(init.)=2.54 s t(norm)=0.538296, mflops=9.28857 (err=6.8e-15) 16. Frigo-old: elapsed time t=3.37 s, 1 iters, t-(init.)=3.32 s t(norm)=0.7036, mflops=7.10631 (err=6.9e-15) 17. Green: elapsed time t=3.8 s, 1 iters, t-(init.)=3.76 s t(norm)=0.796848, mflops=6.27472 (err=6.9e-15) 18. GSL: elapsed time t=3.78 s, 1 iters, t-(init.)=3.72 s t(norm)=0.788371, mflops=6.34219 (err=6.8e-15) 19. GSL DIT: elapsed time t=4.51 s, 1 iters, t-(init.)=4.45 s t(norm)=0.943078, mflops=5.30179 (err=6.8e-15) 20. GSL DIF: elapsed time t=5.52 s, 1 iters, t-(init.)=5.47 s t(norm)=1.15924, mflops=4.31316 (err=6.8e-15) 21. Skipping fft (Krukar can't handle N > 4096). 22. Mayer (Buneman): elapsed time t=3.46 s, 1 iters, t-(init.)=3.41 s t(norm)=0.722673, mflops=6.91876 (err=6.8e-15) 23. Mayer (simple): elapsed time t=3.3 s, 1 iters, t-(init.)=3.24 s t(norm)=0.686646, mflops=7.28178 24. Mayer (lookup): elapsed time t=3.4 s, 1 iters, t-(init.)=3.34 s t(norm)=0.707838, mflops=7.06376 (err=6.8e-15) 25. Monro: elapsed time t=4.01 s, 1 iters, t-(init.)=3.96 s t(norm)=0.839233, mflops=5.95782 (err=1.8e-07) 26. NAPACK (f2c): elapsed time t=5.8 s, 1 iters, t-(init.)=5.75 s t(norm)=1.21858, mflops=4.10312 (err=3.7e-12) 27. Nielsen: elapsed time t=4.28 s, 1 iters, t-(init.)=4.22 s t(norm)=0.894335, mflops=5.59075 (err=2.2e-12) 28. NR (C): elapsed time t=4.09 s, 1 iters, t-(init.)=4.04 s t(norm)=0.856188, mflops=5.83984 (err=6.8e-15) 29. NR (F): elapsed time t=4.32 s, 1 iters, t-(init.)=4.26 s t(norm)=0.902812, mflops=5.53825 (err=6.8e-15) 30. Ooura (C): elapsed time t=2.71 s, 1 iters, t-(init.)=2.66 s t(norm)=0.563727, mflops=8.86953 (err=6.9e-15) 31. Ooura (F): elapsed time t=2.71 s, 1 iters, t-(init.)=2.65 s t(norm)=0.561608, mflops=8.903 (err=6.9e-15) 32. QFT: elapsed time t=4.02 s, 1 iters, t-(init.)=3.96 s t(norm)=0.839233, mflops=5.95782 (err=1.4e-14) 33. Ransom: elapsed time t=3.4 s, 1 iters, t-(init.)=3.34 s t(norm)=0.707838, mflops=7.06376 (err=8.2e-15) 34. SCIPORT: elapsed time t=10.94 s, 1 iters, t-(init.)=10.88 s t(norm)=2.30577, mflops=2.16847 (err=2.8e-07) 35. Singleton: elapsed time t=3.86 s, 1 iters, t-(init.)=3.8 s t(norm)=0.805325, mflops=6.20867 (err=1.0e-14) 36. Singleton (f2c): elapsed time t=3.81 s, 1 iters, t-(init.)=3.75 s t(norm)=0.794729, mflops=6.29146 (err=1.0e-14) 37. Sorensen: elapsed time t=3.82 s, 1 iters, t-(init.)=3.77 s t(norm)=0.798967, mflops=6.25808 (err=6.8e-15) 38. Sorensen DIT: elapsed time t=4.41 s, 1 iters, t-(init.)=4.36 s t(norm)=0.924004, mflops=5.41123 (err=6.7e-15) 39. Temperton: elapsed time t=3.68 s, 1 iters, t-(init.)=3.62 s t(norm)=0.767178, mflops=6.51739 (err=2.0e-07) 40. Temperton (f2c): elapsed time t=4.18 s, 1 iters, t-(init.)=4.13 s t(norm)=0.875261, mflops=5.71258 (err=6.8e-15) 41. Valkenburg: elapsed time t=11.3 s, 1 iters, t-(init.)=11.25 s t(norm)=2.38419, mflops=2.09715 (err=6.8e-15) Top mflops for N=262144 = 9.59063 Normalized results and averages for N=262144: fft 0: mflops = 5.85433 (norm. = 0.610422), norm. avg. (of 18) = 0.652152 fft 1: mflops = 6.48158 (norm. = 0.675824), norm. avg. (of 18) = 0.692367 fft 2: mflops = 5.1289 (norm. = 0.534783), norm. avg. (of 18) = 0.523522 fft 3: mflops = 4.9565 (norm. = 0.516807), norm. avg. (of 18) = 0.333051 fft 4: mflops = 4.25098 (norm. = 0.443243), norm. avg. (of 18) = 0.424406 fft 5: mflops = 2.21738 (norm. = 0.231203), norm. avg. (of 18) = 0.200227 fft 6: mflops = 7.685 (norm. = 0.801303), norm. avg. (of 18) = 0.672446 fft 7: mflops = 6.6459 (norm. = 0.692958), norm. avg. (of 18) = 0.575478 fft 8: mflops = 4.74707 (norm. = 0.49497), norm. avg. (of 18) = 0.460883 fft 9: mflops = 6.68356 (norm. = 0.696884), norm. avg. (of 18) = 0.74307 fft 10: mflops = 6.68356 (norm. = 0.696884), norm. avg. (of 18) = 0.684451 fft 11: mflops = 4.80508 (norm. = 0.501018), norm. avg. (of 17) = 0.45526 fft 12: mflops = 6.95958 (norm. = 0.725664), norm. avg. (of 18) = 0.705158 fft 13: mflops = 6.83854 (norm. = 0.713043), norm. avg. (of 18) = 0.678748 fft 14: mflops = 9.59063 (norm. = 1), norm. avg. (of 18) = 0.987685 fft 15: mflops = 9.28857 (norm. = 0.968504), norm. avg. (of 18) = 0.969498 fft 16: mflops = 7.10631 (norm. = 0.740964), norm. avg. (of 18) = 0.855941 fft 17: mflops = 6.27472 (norm. = 0.654255), norm. avg. (of 16) = 0.642225 fft 18: mflops = 6.34219 (norm. = 0.66129), norm. avg. (of 18) = 0.567662 fft 19: mflops = 5.30179 (norm. = 0.552809), norm. avg. (of 18) = 0.511991 fft 20: mflops = 4.31316 (norm. = 0.449726), norm. avg. (of 18) = 0.407181 fft 21: mflops = -1 (norm. = -0.104268), norm. avg. (of 12) = 0.734683 fft 22: mflops = 6.91876 (norm. = 0.721408), norm. avg. (of 17) = 0.653163 fft 23: mflops = 7.28178 (norm. = 0.759259), norm. avg. (of 17) = 0.702867 fft 24: mflops = 7.06376 (norm. = 0.736527), norm. avg. (of 17) = 0.718801 fft 25: mflops = 5.95782 (norm. = 0.621212), norm. avg. (of 17) = 0.555533 fft 26: mflops = 4.10312 (norm. = 0.427826), norm. avg. (of 18) = 0.304223 fft 27: mflops = 5.59075 (norm. = 0.582938), norm. avg. (of 18) = 0.48754 fft 28: mflops = 5.83984 (norm. = 0.608911), norm. avg. (of 18) = 0.543466 fft 29: mflops = 5.53825 (norm. = 0.577465), norm. avg. (of 18) = 0.50237 fft 30: mflops = 8.86953 (norm. = 0.924812), norm. avg. (of 18) = 0.851639 fft 31: mflops = 8.903 (norm. = 0.928302), norm. avg. (of 18) = 0.838849 fft 32: mflops = 5.95782 (norm. = 0.621212), norm. avg. (of 15) = 0.741585 fft 33: mflops = 7.06376 (norm. = 0.736527), norm. avg. (of 17) = 0.504817 fft 34: mflops = 2.16847 (norm. = 0.226103), norm. avg. (of 17) = 0.221719 fft 35: mflops = 6.20867 (norm. = 0.647368), norm. avg. (of 18) = 0.531091 fft 36: mflops = 6.29146 (norm. = 0.656), norm. avg. (of 18) = 0.555498 fft 37: mflops = 6.25808 (norm. = 0.65252), norm. avg. (of 18) = 0.721652 fft 38: mflops = 5.41123 (norm. = 0.56422), norm. avg. (of 18) = 0.506689 fft 39: mflops = 6.51739 (norm. = 0.679558), norm. avg. (of 18) = 0.604681 fft 40: mflops = 5.71258 (norm. = 0.595642), norm. avg. (of 18) = 0.516904 fft 41: mflops = 2.09715 (norm. = 0.218667), norm. avg. (of 18) = 0.207833 ------------------------------------------------------ @@@@ bench.1d.np2.log Benchmarking for sizes: 6 (0.000686646 MB) 9 (0.000915527 MB) 12 (0.00114441 MB) 15 (0.00137329 MB) 18 (0.00180054 MB) 24 (0.0022583 MB) 36 (0.0032959 MB) 80 (0.00738525 MB) 108 (0.00994873 MB) 210 (0.0192261 MB) 504 (0.0461426 MB) 1000 (0.0916748 MB) 1960 (0.179749 MB) 4725 (0.437393 MB) 10368 (0.960205 MB) 27000 (2.48291 MB) 75600 (6.98975 MB) 165375 (15.3664 MB) 362880 (38.6829 MB) Maximum array size = 720720 Benchmarking FFTs: 0. Brenner 1. CWP (min N) 2. CWP (best N) 3. FFTPACK 4. FFTPACK (f2c) 5. FFTW 6. FFTW_ESTIMATE 7. Frigo-old 8. GSL 9. Nielsen 10. Singleton 11. Singleton (f2c) 12. Temperton 13. Temperton (f2c) 14. Valkenburg Computing normalized averages (15 transforms). Benchmarking for array size = 6: 0. Skipping fft (Brenner has a bug for N=3*2^m). 1. CWP (min N): elapsed time t=1.53 s, 131072 iters, t-(init.)=1.47 s t(norm)=0.723106, mflops=6.91462 2. CWP (best N) (N=15): elapsed time t=1.08 s, 32768 iters, t-(init.)=1.05 s t(norm)=2.06602, mflops=2.42012 3. FFTPACK: elapsed time t=1.43 s, 131072 iters, t-(init.)=1.37 s t(norm)=0.673915, mflops=7.41933 (err=1.2e-16) 4. FFTPACK (f2c): elapsed time t=1.52 s, 131072 iters, t-(init.)=1.46 s t(norm)=0.718187, mflops=6.96198 (err=1.5e-16) FFTW_MEASURE plan: (cost = 6.103516e-06) FFTW_NOTW 6 5. FFTW: elapsed time t=1.6 s, 262144 iters, t-(init.)=1.48 s t(norm)=0.364012, mflops=13.7358 (err=8.7e-17) FFTW_ESTIMATE plan: (cost = 4.116000e+02) FFTW_NOTW 6 6. FFTW_ESTIMATE: elapsed time t=1.62 s, 262144 iters, t-(init.)=1.51 s t(norm)=0.371391, mflops=13.4629 (err=8.7e-17) 7. Frigo-old: elapsed time t=1.22 s, 65536 iters, t-(init.)=1.19 s t(norm)=1.17074, mflops=4.27079 (err=2.6e-16) 8. GSL: elapsed time t=1.85 s, 131072 iters, t-(init.)=1.79 s t(norm)=0.880517, mflops=5.67848 (err=8.2e-17) 9. Nielsen: elapsed time t=1.26 s, 32768 iters, t-(init.)=1.25 s t(norm)=2.45954, mflops=2.0329 (err=5.7e-16) 10. Singleton: elapsed time t=1.61 s, 65536 iters, t-(init.)=1.58 s t(norm)=1.55443, mflops=3.21661 (err=1.2e-16) 11. Singleton (f2c): elapsed time t=1.44 s, 65536 iters, t-(init.)=1.41 s t(norm)=1.38718, mflops=3.60443 (err=1.2e-16) 12. Temperton: elapsed time t=1.03 s, 65536 iters, t-(init.)=1 s t(norm)=0.983818, mflops=5.08224 (err=3.9e-09) 13. Temperton (f2c): elapsed time t=1.2 s, 65536 iters, t-(init.)=1.17 s t(norm)=1.15107, mflops=4.3438 (err=2.1e-16) 14. Valkenburg: elapsed time t=1.76 s, 65536 iters, t-(init.)=1.73 s t(norm)=1.702, mflops=2.93771 (err=2.4e-16) Top mflops for N=6 = 13.7358 Normalized results and averages for N=6: fft 0: mflops = -1 (norm. = -0.0728025), norm. avg. (of 0) = -1 fft 1: mflops = 6.91462 (norm. = 0.503401), norm. avg. (of 1) = 0.503401 fft 2: mflops = 2.42012 (norm. = 0.17619), norm. avg. (of 1) = 0.17619 fft 3: mflops = 7.41933 (norm. = 0.540146), norm. avg. (of 1) = 0.540146 fft 4: mflops = 6.96198 (norm. = 0.506849), norm. avg. (of 1) = 0.506849 fft 5: mflops = 13.7358 (norm. = 1), norm. avg. (of 1) = 1 fft 6: mflops = 13.4629 (norm. = 0.980132), norm. avg. (of 1) = 0.980132 fft 7: mflops = 4.27079 (norm. = 0.310924), norm. avg. (of 1) = 0.310924 fft 8: mflops = 5.67848 (norm. = 0.413408), norm. avg. (of 1) = 0.413408 fft 9: mflops = 2.0329 (norm. = 0.148), norm. avg. (of 1) = 0.148 fft 10: mflops = 3.21661 (norm. = 0.234177), norm. avg. (of 1) = 0.234177 fft 11: mflops = 3.60443 (norm. = 0.262411), norm. avg. (of 1) = 0.262411 fft 12: mflops = 5.08224 (norm. = 0.37), norm. avg. (of 1) = 0.37 fft 13: mflops = 4.3438 (norm. = 0.316239), norm. avg. (of 1) = 0.316239 fft 14: mflops = 2.93771 (norm. = 0.213873), norm. avg. (of 1) = 0.213873 Benchmarking for array size = 9: 0. Brenner: elapsed time t=1.94 s, 32768 iters, t-(init.)=1.92 s t(norm)=2.05381, mflops=2.4345 (err=4.8e-16) 1. CWP (min N): elapsed time t=1.23 s, 65536 iters, t-(init.)=1.19 s t(norm)=0.636466, mflops=7.85587 2. CWP (best N) (N=15): elapsed time t=1.07 s, 32768 iters, t-(init.)=1.04 s t(norm)=1.11248, mflops=4.49447 3. FFTPACK: elapsed time t=1.3 s, 65536 iters, t-(init.)=1.26 s t(norm)=0.673906, mflops=7.41944 (err=1.1e-16) 4. FFTPACK (f2c): elapsed time t=1.33 s, 65536 iters, t-(init.)=1.29 s t(norm)=0.689951, mflops=7.24689 (err=2.5e-16) FFTW_MEASURE plan: (cost = 1.464844e-05) FFTW_NOTW 9 5. FFTW: elapsed time t=1.76 s, 131072 iters, t-(init.)=1.68 s t(norm)=0.44927, mflops=11.1292 (err=2.1e-16) FFTW_ESTIMATE plan: (cost = 4.851000e+02) FFTW_NOTW 9 6. FFTW_ESTIMATE: elapsed time t=1.76 s, 131072 iters, t-(init.)=1.68 s t(norm)=0.44927, mflops=11.1292 (err=2.1e-16) 7. Frigo-old: elapsed time t=1.47 s, 32768 iters, t-(init.)=1.45 s t(norm)=1.55105, mflops=3.22362 (err=3.1e-16) 8. GSL: elapsed time t=1.92 s, 65536 iters, t-(init.)=1.88 s t(norm)=1.00551, mflops=4.9726 (err=1.7e-16) 9. Nielsen: elapsed time t=1.48 s, 32768 iters, t-(init.)=1.46 s t(norm)=1.56175, mflops=3.20154 (err=9.6e-16) 10. Singleton: elapsed time t=1.04 s, 32768 iters, t-(init.)=1.02 s t(norm)=1.09109, mflops=4.58259 (err=1.5e-16) 11. Singleton (f2c): elapsed time t=1.87 s, 65536 iters, t-(init.)=1.83 s t(norm)=0.978768, mflops=5.10846 (err=1.5e-16) 12. Temperton: elapsed time t=1.69 s, 65536 iters, t-(init.)=1.65 s t(norm)=0.882496, mflops=5.66575 (err=1.9e-08) 13. Temperton (f2c): elapsed time t=1.93 s, 65536 iters, t-(init.)=1.9 s t(norm)=1.01621, mflops=4.92026 (err=1.3e-16) 14. Valkenburg: elapsed time t=1.74 s, 32768 iters, t-(init.)=1.72 s t(norm)=1.83987, mflops=2.71758 (err=4.0e-16) Top mflops for N=9 = 11.1292 Normalized results and averages for N=9: fft 0: mflops = 2.4345 (norm. = 0.21875), norm. avg. (of 1) = 0.21875 fft 1: mflops = 7.85587 (norm. = 0.705882), norm. avg. (of 2) = 0.604642 fft 2: mflops = 4.49447 (norm. = 0.403846), norm. avg. (of 2) = 0.290018 fft 3: mflops = 7.41944 (norm. = 0.666667), norm. avg. (of 2) = 0.603406 fft 4: mflops = 7.24689 (norm. = 0.651163), norm. avg. (of 2) = 0.579006 fft 5: mflops = 11.1292 (norm. = 1), norm. avg. (of 2) = 1 fft 6: mflops = 11.1292 (norm. = 1), norm. avg. (of 2) = 0.990066 fft 7: mflops = 3.22362 (norm. = 0.289655), norm. avg. (of 2) = 0.30029 fft 8: mflops = 4.9726 (norm. = 0.446809), norm. avg. (of 2) = 0.430108 fft 9: mflops = 3.20154 (norm. = 0.287671), norm. avg. (of 2) = 0.217836 fft 10: mflops = 4.58259 (norm. = 0.411765), norm. avg. (of 2) = 0.322971 fft 11: mflops = 5.10846 (norm. = 0.459016), norm. avg. (of 2) = 0.360714 fft 12: mflops = 5.66575 (norm. = 0.509091), norm. avg. (of 2) = 0.439545 fft 13: mflops = 4.92026 (norm. = 0.442105), norm. avg. (of 2) = 0.379172 fft 14: mflops = 2.71758 (norm. = 0.244186), norm. avg. (of 2) = 0.229029 Benchmarking for array size = 12: 0. Skipping fft (Brenner has a bug for N=3*2^m). 1. CWP (min N): elapsed time t=1.44 s, 65536 iters, t-(init.)=1.39 s t(norm)=0.493025, mflops=10.1415 2. CWP (best N) (N=15): elapsed time t=1.08 s, 32768 iters, t-(init.)=1.06 s t(norm)=0.751952, mflops=6.64936 3. FFTPACK: elapsed time t=1.6 s, 65536 iters, t-(init.)=1.55 s t(norm)=0.549776, mflops=9.09461 (err=2.0e-16) 4. FFTPACK (f2c): elapsed time t=1.68 s, 65536 iters, t-(init.)=1.64 s t(norm)=0.581699, mflops=8.59552 (err=2.6e-16) FFTW_MEASURE plan: (cost = 1.525879e-05) FFTW_NOTW 12 5. FFTW: elapsed time t=1.94 s, 131072 iters, t-(init.)=1.84 s t(norm)=0.326319, mflops=15.3224 (err=1.4e-16) FFTW_ESTIMATE plan: (cost = 4.920000e+02) FFTW_NOTW 12 6. FFTW_ESTIMATE: elapsed time t=1.95 s, 131072 iters, t-(init.)=1.86 s t(norm)=0.329866, mflops=15.1577 (err=1.4e-16) 7. Frigo-old: elapsed time t=1.37 s, 32768 iters, t-(init.)=1.35 s t(norm)=0.957675, mflops=5.22098 (err=2.9e-16) 8. GSL: elapsed time t=1.11 s, 32768 iters, t-(init.)=1.08 s t(norm)=0.76614, mflops=6.52623 (err=3.2e-16) 9. Nielsen: elapsed time t=1.79 s, 32768 iters, t-(init.)=1.77 s t(norm)=1.25562, mflops=3.9821 (err=6.0e-16) 10. Singleton: elapsed time t=1.53 s, 32768 iters, t-(init.)=1.5 s t(norm)=1.06408, mflops=4.69888 (err=2.5e-16) 11. Singleton (f2c): elapsed time t=1.37 s, 32768 iters, t-(init.)=1.35 s t(norm)=0.957675, mflops=5.22098 (err=2.5e-16) 12. Temperton: elapsed time t=1.82 s, 65536 iters, t-(init.)=1.77 s t(norm)=0.627809, mflops=7.96421 (err=8.2e-09) 13. Temperton (f2c): elapsed time t=1.06 s, 32768 iters, t-(init.)=1.04 s t(norm)=0.737764, mflops=6.77723 (err=1.7e-16) 14. Valkenburg: elapsed time t=1.35 s, 16384 iters, t-(init.)=1.34 s t(norm)=1.90116, mflops=2.62997 (err=4.2e-16) Top mflops for N=12 = 15.3224 Normalized results and averages for N=12: fft 0: mflops = -1 (norm. = -0.0652638), norm. avg. (of 1) = 0.21875 fft 1: mflops = 10.1415 (norm. = 0.661871), norm. avg. (of 3) = 0.623718 fft 2: mflops = 6.64936 (norm. = 0.433962), norm. avg. (of 3) = 0.338 fft 3: mflops = 9.09461 (norm. = 0.593548), norm. avg. (of 3) = 0.60012 fft 4: mflops = 8.59552 (norm. = 0.560976), norm. avg. (of 3) = 0.572996 fft 5: mflops = 15.3224 (norm. = 1), norm. avg. (of 3) = 1 fft 6: mflops = 15.1577 (norm. = 0.989247), norm. avg. (of 3) = 0.989793 fft 7: mflops = 5.22098 (norm. = 0.340741), norm. avg. (of 3) = 0.313773 fft 8: mflops = 6.52623 (norm. = 0.425926), norm. avg. (of 3) = 0.428714 fft 9: mflops = 3.9821 (norm. = 0.259887), norm. avg. (of 3) = 0.231853 fft 10: mflops = 4.69888 (norm. = 0.306667), norm. avg. (of 3) = 0.317536 fft 11: mflops = 5.22098 (norm. = 0.340741), norm. avg. (of 3) = 0.354056 fft 12: mflops = 7.96421 (norm. = 0.519774), norm. avg. (of 3) = 0.466288 fft 13: mflops = 6.77723 (norm. = 0.442308), norm. avg. (of 3) = 0.400217 fft 14: mflops = 2.62997 (norm. = 0.171642), norm. avg. (of 3) = 0.2099 Benchmarking for array size = 15: 0. Brenner: elapsed time t=1.64 s, 16384 iters, t-(init.)=1.63 s t(norm)=1.69764, mflops=2.94527 (err=4.1e-16) 1. CWP (min N): elapsed time t=1.11 s, 32768 iters, t-(init.)=1.09 s t(norm)=0.567615, mflops=8.80878 2. CWP (best N): elapsed time t=1.07 s, 32768 iters, t-(init.)=1.04 s t(norm)=0.541578, mflops=9.23228 3. FFTPACK: elapsed time t=1.18 s, 32768 iters, t-(init.)=1.15 s t(norm)=0.59886, mflops=8.3492 (err=1.4e-16) 4. FFTPACK (f2c): elapsed time t=1.31 s, 32768 iters, t-(init.)=1.29 s t(norm)=0.671765, mflops=7.44308 (err=3.0e-16) FFTW_MEASURE plan: (cost = 2.441406e-05) FFTW_NOTW 15 5. FFTW: elapsed time t=1.6 s, 65536 iters, t-(init.)=1.55 s t(norm)=0.40358, mflops=12.3891 (err=1.9e-16) FFTW_ESTIMATE plan: (cost = 4.485000e+02) FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.6 s, 65536 iters, t-(init.)=1.54 s t(norm)=0.400976, mflops=12.4696 (err=1.9e-16) 7. Frigo-old: elapsed time t=1.49 s, 16384 iters, t-(init.)=1.47 s t(norm)=1.531, mflops=3.26584 (err=2.6e-16) 8. GSL: elapsed time t=1.83 s, 32768 iters, t-(init.)=1.8 s t(norm)=0.937346, mflops=5.33421 (err=1.4e-16) 9. Nielsen: elapsed time t=1.11 s, 16384 iters, t-(init.)=1.1 s t(norm)=1.14565, mflops=4.36435 (err=4.3e-15) 10. Singleton: elapsed time t=1.99 s, 32768 iters, t-(init.)=1.96 s t(norm)=1.02067, mflops=4.89876 (err=2.2e-16) 11. Singleton (f2c): elapsed time t=1.85 s, 32768 iters, t-(init.)=1.82 s t(norm)=0.947761, mflops=5.27559 (err=2.2e-16) 12. Temperton: elapsed time t=1.31 s, 32768 iters, t-(init.)=1.28 s t(norm)=0.666557, mflops=7.50123 (err=1.0e-08) 13. Temperton (f2c): elapsed time t=1.47 s, 32768 iters, t-(init.)=1.44 s t(norm)=0.749877, mflops=6.66776 (err=1.8e-16) 14. Valkenburg: elapsed time t=1.92 s, 16384 iters, t-(init.)=1.91 s t(norm)=1.98926, mflops=2.5135 (err=2.2e-16) Top mflops for N=15 = 12.4696 Normalized results and averages for N=15: fft 0: mflops = 2.94527 (norm. = 0.236196), norm. avg. (of 2) = 0.227473 fft 1: mflops = 8.80878 (norm. = 0.706422), norm. avg. (of 4) = 0.644394 fft 2: mflops = 9.23228 (norm. = 0.740385), norm. avg. (of 4) = 0.438596 fft 3: mflops = 8.3492 (norm. = 0.669565), norm. avg. (of 4) = 0.617482 fft 4: mflops = 7.44308 (norm. = 0.596899), norm. avg. (of 4) = 0.578972 fft 5: mflops = 12.3891 (norm. = 0.993548), norm. avg. (of 4) = 0.998387 fft 6: mflops = 12.4696 (norm. = 1), norm. avg. (of 4) = 0.992345 fft 7: mflops = 3.26584 (norm. = 0.261905), norm. avg. (of 4) = 0.300806 fft 8: mflops = 5.33421 (norm. = 0.427778), norm. avg. (of 4) = 0.42848 fft 9: mflops = 4.36435 (norm. = 0.35), norm. avg. (of 4) = 0.26139 fft 10: mflops = 4.89876 (norm. = 0.392857), norm. avg. (of 4) = 0.336366 fft 11: mflops = 5.27559 (norm. = 0.423077), norm. avg. (of 4) = 0.371311 fft 12: mflops = 7.50123 (norm. = 0.601562), norm. avg. (of 4) = 0.500107 fft 13: mflops = 6.66776 (norm. = 0.534722), norm. avg. (of 4) = 0.433844 fft 14: mflops = 2.5135 (norm. = 0.201571), norm. avg. (of 4) = 0.207818 Benchmarking for array size = 18: 0. Brenner: elapsed time t=1.1 s, 8192 iters, t-(init.)=1.09 s t(norm)=1.7727, mflops=2.82055 (err=4.1e-16) 1. CWP (min N): elapsed time t=1.37 s, 32768 iters, t-(init.)=1.34 s t(norm)=0.544821, mflops=9.17732 2. CWP (best N) (N=28): elapsed time t=1.03 s, 16384 iters, t-(init.)=1.01 s t(norm)=0.821298, mflops=6.08793 3. FFTPACK: elapsed time t=1.7 s, 32768 iters, t-(init.)=1.66 s t(norm)=0.674928, mflops=7.4082 (err=2.5e-16) 4. FFTPACK (f2c): elapsed time t=1.73 s, 32768 iters, t-(init.)=1.7 s t(norm)=0.691191, mflops=7.23389 (err=2.8e-16) FFTW_MEASURE plan: (cost = 3.784180e-05) FFTW_TWIDDLE 3 FFTW_NOTW 6 5. FFTW: elapsed time t=1.24 s, 32768 iters, t-(init.)=1.21 s t(norm)=0.491966, mflops=10.1633 (err=2.0e-16) FFTW_ESTIMATE plan: (cost = 1.168200e+03) FFTW_TWIDDLE 2 FFTW_NOTW 9 6. FFTW_ESTIMATE: elapsed time t=1.29 s, 32768 iters, t-(init.)=1.26 s t(norm)=0.512295, mflops=9.76001 (err=2.2e-16) 7. Frigo-old: elapsed time t=1.69 s, 16384 iters, t-(init.)=1.68 s t(norm)=1.36612, mflops=3.66 (err=3.5e-16) 8. GSL: elapsed time t=1.89 s, 32768 iters, t-(init.)=1.86 s t(norm)=0.756245, mflops=6.61162 (err=2.1e-16) 9. Nielsen: elapsed time t=1.69 s, 16384 iters, t-(init.)=1.67 s t(norm)=1.35799, mflops=3.68192 (err=8.7e-16) 10. Singleton: elapsed time t=1.2 s, 16384 iters, t-(init.)=1.19 s t(norm)=0.967668, mflops=5.16706 (err=2.1e-16) 11. Singleton (f2c): elapsed time t=1.07 s, 16384 iters, t-(init.)=1.05 s t(norm)=0.853824, mflops=5.856 (err=2.1e-16) 12. Temperton: elapsed time t=1.85 s, 32768 iters, t-(init.)=1.82 s t(norm)=0.739981, mflops=6.75693 (err=4.5e-08) 13. Temperton (f2c): elapsed time t=1.1 s, 16384 iters, t-(init.)=1.08 s t(norm)=0.878219, mflops=5.69334 (err=2.6e-16) 14. Valkenburg: elapsed time t=1.13 s, 8192 iters, t-(init.)=1.12 s t(norm)=1.82149, mflops=2.745 (err=4.2e-16) Top mflops for N=18 = 10.1633 Normalized results and averages for N=18: fft 0: mflops = 2.82055 (norm. = 0.277523), norm. avg. (of 3) = 0.244156 fft 1: mflops = 9.17732 (norm. = 0.902985), norm. avg. (of 5) = 0.696112 fft 2: mflops = 6.08793 (norm. = 0.59901), norm. avg. (of 5) = 0.470679 fft 3: mflops = 7.4082 (norm. = 0.728916), norm. avg. (of 5) = 0.639768 fft 4: mflops = 7.23389 (norm. = 0.711765), norm. avg. (of 5) = 0.60553 fft 5: mflops = 10.1633 (norm. = 1), norm. avg. (of 5) = 0.99871 fft 6: mflops = 9.76001 (norm. = 0.960317), norm. avg. (of 5) = 0.985939 fft 7: mflops = 3.66 (norm. = 0.360119), norm. avg. (of 5) = 0.312669 fft 8: mflops = 6.61162 (norm. = 0.650538), norm. avg. (of 5) = 0.472892 fft 9: mflops = 3.68192 (norm. = 0.362275), norm. avg. (of 5) = 0.281567 fft 10: mflops = 5.16706 (norm. = 0.508403), norm. avg. (of 5) = 0.370774 fft 11: mflops = 5.856 (norm. = 0.57619), norm. avg. (of 5) = 0.412287 fft 12: mflops = 6.75693 (norm. = 0.664835), norm. avg. (of 5) = 0.533053 fft 13: mflops = 5.69334 (norm. = 0.560185), norm. avg. (of 5) = 0.459112 fft 14: mflops = 2.745 (norm. = 0.270089), norm. avg. (of 5) = 0.220272 Benchmarking for array size = 24: 0. Skipping fft (Brenner has a bug for N=3*2^m). 1. CWP (min N): elapsed time t=1.61 s, 32768 iters, t-(init.)=1.57 s t(norm)=0.435414, mflops=11.4833 2. CWP (best N) (N=28): elapsed time t=1.03 s, 16384 iters, t-(init.)=1.01 s t(norm)=0.560215, mflops=8.92515 3. FFTPACK: elapsed time t=1.07 s, 16384 iters, t-(init.)=1.05 s t(norm)=0.582401, mflops=8.58515 (err=1.7e-16) 4. FFTPACK (f2c): elapsed time t=1.11 s, 16384 iters, t-(init.)=1.09 s t(norm)=0.604588, mflops=8.27009 (err=2.5e-16) FFTW_MEASURE plan: (cost = 4.638672e-05) FFTW_TWIDDLE 2 FFTW_NOTW 12 5. FFTW: elapsed time t=1.48 s, 32768 iters, t-(init.)=1.44 s t(norm)=0.399361, mflops=12.52 (err=2.1e-16) FFTW_ESTIMATE plan: (cost = 1.248000e+03) FFTW_TWIDDLE 2 FFTW_NOTW 12 6. FFTW_ESTIMATE: elapsed time t=1.5 s, 32768 iters, t-(init.)=1.46 s t(norm)=0.404908, mflops=12.3485 (err=2.1e-16) 7. Frigo-old: elapsed time t=1.51 s, 16384 iters, t-(init.)=1.49 s t(norm)=0.826455, mflops=6.04993 (err=3.8e-16) 8. GSL: elapsed time t=1.21 s, 16384 iters, t-(init.)=1.19 s t(norm)=0.660055, mflops=7.57513 (err=2.1e-16) 9. Nielsen: elapsed time t=1.66 s, 16384 iters, t-(init.)=1.64 s t(norm)=0.909655, mflops=5.49659 (err=1.7e-15) 10. Singleton: elapsed time t=1.76 s, 16384 iters, t-(init.)=1.74 s t(norm)=0.965122, mflops=5.18069 (err=2.1e-16) 11. Singleton (f2c): elapsed time t=1.56 s, 16384 iters, t-(init.)=1.54 s t(norm)=0.854189, mflops=5.85351 (err=2.1e-16) 12. Temperton: elapsed time t=1.07 s, 16384 iters, t-(init.)=1.04 s t(norm)=0.576855, mflops=8.6677 (err=8.0e-09) 13. Temperton (f2c): elapsed time t=1.21 s, 16384 iters, t-(init.)=1.19 s t(norm)=0.660055, mflops=7.57513 (err=2.4e-16) 14. Valkenburg: elapsed time t=1.7 s, 8192 iters, t-(init.)=1.69 s t(norm)=1.87478, mflops=2.66698 (err=6.1e-16) Top mflops for N=24 = 12.52 Normalized results and averages for N=24: fft 0: mflops = -1 (norm. = -0.0798722), norm. avg. (of 3) = 0.244156 fft 1: mflops = 11.4833 (norm. = 0.917197), norm. avg. (of 6) = 0.73296 fft 2: mflops = 8.92515 (norm. = 0.712871), norm. avg. (of 6) = 0.511044 fft 3: mflops = 8.58515 (norm. = 0.685714), norm. avg. (of 6) = 0.647426 fft 4: mflops = 8.27009 (norm. = 0.66055), norm. avg. (of 6) = 0.6147 fft 5: mflops = 12.52 (norm. = 1), norm. avg. (of 6) = 0.998925 fft 6: mflops = 12.3485 (norm. = 0.986301), norm. avg. (of 6) = 0.986 fft 7: mflops = 6.04993 (norm. = 0.483221), norm. avg. (of 6) = 0.341094 fft 8: mflops = 7.57513 (norm. = 0.605042), norm. avg. (of 6) = 0.494917 fft 9: mflops = 5.49659 (norm. = 0.439024), norm. avg. (of 6) = 0.30781 fft 10: mflops = 5.18069 (norm. = 0.413793), norm. avg. (of 6) = 0.377944 fft 11: mflops = 5.85351 (norm. = 0.467532), norm. avg. (of 6) = 0.421495 fft 12: mflops = 8.6677 (norm. = 0.692308), norm. avg. (of 6) = 0.559595 fft 13: mflops = 7.57513 (norm. = 0.605042), norm. avg. (of 6) = 0.483434 fft 14: mflops = 2.66698 (norm. = 0.213018), norm. avg. (of 6) = 0.219063 Benchmarking for array size = 36: 0. Brenner: elapsed time t=1.15 s, 4096 iters, t-(init.)=1.14 s t(norm)=1.4954, mflops=3.34358 (err=1.4e-15) 1. CWP (min N): elapsed time t=1.44 s, 16384 iters, t-(init.)=1.41 s t(norm)=0.462394, mflops=10.8133 2. CWP (best N): elapsed time t=1.44 s, 16384 iters, t-(init.)=1.41 s t(norm)=0.462394, mflops=10.8133 3. FFTPACK: elapsed time t=1.9 s, 16384 iters, t-(init.)=1.87 s t(norm)=0.613246, mflops=8.15333 (err=4.3e-16) 4. FFTPACK (f2c): elapsed time t=1.94 s, 16384 iters, t-(init.)=1.91 s t(norm)=0.626364, mflops=7.98258 (err=1.1e-15) FFTW_MEASURE plan: (cost = 8.300781e-05) FFTW_TWIDDLE 3 FFTW_NOTW 12 5. FFTW: elapsed time t=1.31 s, 16384 iters, t-(init.)=1.28 s t(norm)=0.419762, mflops=11.9115 (err=6.2e-16) FFTW_ESTIMATE plan: (cost = 1.803600e+03) FFTW_TWIDDLE 3 FFTW_NOTW 12 6. FFTW_ESTIMATE: elapsed time t=1.32 s, 16384 iters, t-(init.)=1.29 s t(norm)=0.423042, mflops=11.8192 (err=6.2e-16) 7. Frigo-old: elapsed time t=1.78 s, 8192 iters, t-(init.)=1.77 s t(norm)=1.1609, mflops=4.30699 (err=6.6e-16) 8. GSL: elapsed time t=1.97 s, 16384 iters, t-(init.)=1.94 s t(norm)=0.636202, mflops=7.85914 (err=4.2e-16) 9. Nielsen: elapsed time t=1.47 s, 8192 iters, t-(init.)=1.45 s t(norm)=0.951024, mflops=5.25749 (err=1.6e-15) 10. Singleton: elapsed time t=1.26 s, 8192 iters, t-(init.)=1.24 s t(norm)=0.813289, mflops=6.14787 (err=4.2e-16) 11. Singleton (f2c): elapsed time t=1.16 s, 8192 iters, t-(init.)=1.14 s t(norm)=0.747701, mflops=6.68716 (err=4.1e-16) 12. Temperton: elapsed time t=1.83 s, 16384 iters, t-(init.)=1.8 s t(norm)=0.590291, mflops=8.47041 (err=6.4e-08) 13. Temperton (f2c): elapsed time t=1.08 s, 8192 iters, t-(init.)=1.07 s t(norm)=0.70179, mflops=7.12464 (err=3.5e-16) 14. Valkenburg: elapsed time t=1.4 s, 4096 iters, t-(init.)=1.4 s t(norm)=1.83646, mflops=2.72263 (err=8.4e-16) Top mflops for N=36 = 11.9115 Normalized results and averages for N=36: fft 0: mflops = 3.34358 (norm. = 0.280702), norm. avg. (of 4) = 0.253293 fft 1: mflops = 10.8133 (norm. = 0.907801), norm. avg. (of 7) = 0.757937 fft 2: mflops = 10.8133 (norm. = 0.907801), norm. avg. (of 7) = 0.567724 fft 3: mflops = 8.15333 (norm. = 0.684492), norm. avg. (of 7) = 0.652721 fft 4: mflops = 7.98258 (norm. = 0.670157), norm. avg. (of 7) = 0.622623 fft 5: mflops = 11.9115 (norm. = 1), norm. avg. (of 7) = 0.999078 fft 6: mflops = 11.8192 (norm. = 0.992248), norm. avg. (of 7) = 0.986892 fft 7: mflops = 4.30699 (norm. = 0.361582), norm. avg. (of 7) = 0.344021 fft 8: mflops = 7.85914 (norm. = 0.659794), norm. avg. (of 7) = 0.518471 fft 9: mflops = 5.25749 (norm. = 0.441379), norm. avg. (of 7) = 0.326891 fft 10: mflops = 6.14787 (norm. = 0.516129), norm. avg. (of 7) = 0.397684 fft 11: mflops = 6.68716 (norm. = 0.561404), norm. avg. (of 7) = 0.441482 fft 12: mflops = 8.47041 (norm. = 0.711111), norm. avg. (of 7) = 0.58124 fft 13: mflops = 7.12464 (norm. = 0.598131), norm. avg. (of 7) = 0.499819 fft 14: mflops = 2.72263 (norm. = 0.228571), norm. avg. (of 7) = 0.220421 Benchmarking for array size = 80: 0. Brenner: elapsed time t=1.12 s, 2048 iters, t-(init.)=1.11 s t(norm)=1.07165, mflops=4.6657 (err=5.1e-16) 1. CWP (min N): elapsed time t=1.73 s, 8192 iters, t-(init.)=1.7 s t(norm)=0.410317, mflops=12.1857 2. CWP (best N) (N=84): elapsed time t=1.95 s, 8192 iters, t-(init.)=1.91 s t(norm)=0.461003, mflops=10.8459 3. FFTPACK: elapsed time t=1.15 s, 4096 iters, t-(init.)=1.13 s t(norm)=0.54548, mflops=9.16624 (err=4.3e-16) 4. FFTPACK (f2c): elapsed time t=1.25 s, 4096 iters, t-(init.)=1.24 s t(norm)=0.59858, mflops=8.3531 (err=4.7e-16) FFTW_MEASURE plan: (cost = 2.148438e-04) FFTW_TWIDDLE 16 FFTW_NOTW 5 5. FFTW: elapsed time t=2 s, 8192 iters, t-(init.)=1.97 s t(norm)=0.475485, mflops=10.5156 (err=2.7e-16) FFTW_ESTIMATE plan: (cost = 2.600000e+03) FFTW_TWIDDLE 5 FFTW_NOTW 16 6. FFTW_ESTIMATE: elapsed time t=1.9 s, 8192 iters, t-(init.)=1.87 s t(norm)=0.451349, mflops=11.0779 (err=4.6e-16) 7. Frigo-old: elapsed time t=1.74 s, 4096 iters, t-(init.)=1.72 s t(norm)=0.830288, mflops=6.022 (err=3.3e-16) 8. GSL: elapsed time t=1.54 s, 4096 iters, t-(init.)=1.52 s t(norm)=0.733743, mflops=6.81437 (err=4.1e-16) 9. Nielsen: elapsed time t=1.53 s, 4096 iters, t-(init.)=1.52 s t(norm)=0.733743, mflops=6.81437 (err=8.1e-15) 10. Singleton: elapsed time t=1.42 s, 4096 iters, t-(init.)=1.4 s t(norm)=0.675816, mflops=7.39846 (err=4.3e-16) 11. Singleton (f2c): elapsed time t=1.37 s, 4096 iters, t-(init.)=1.35 s t(norm)=0.65168, mflops=7.67248 (err=3.5e-16) 12. Temperton: elapsed time t=1.11 s, 4096 iters, t-(init.)=1.1 s t(norm)=0.530998, mflops=9.41622 (err=1.7e-07) 13. Temperton (f2c): elapsed time t=1.29 s, 4096 iters, t-(init.)=1.28 s t(norm)=0.617889, mflops=8.09207 (err=4.0e-16) 14. Valkenburg: elapsed time t=1 s, 1024 iters, t-(init.)=1 s t(norm)=1.9309, mflops=2.58946 (err=5.4e-16) Top mflops for N=80 = 12.1857 Normalized results and averages for N=80: fft 0: mflops = 4.6657 (norm. = 0.382883), norm. avg. (of 5) = 0.279211 fft 1: mflops = 12.1857 (norm. = 1), norm. avg. (of 8) = 0.788195 fft 2: mflops = 10.8459 (norm. = 0.890052), norm. avg. (of 8) = 0.608015 fft 3: mflops = 9.16624 (norm. = 0.752212), norm. avg. (of 8) = 0.665158 fft 4: mflops = 8.3531 (norm. = 0.685484), norm. avg. (of 8) = 0.63048 fft 5: mflops = 10.5156 (norm. = 0.862944), norm. avg. (of 8) = 0.982062 fft 6: mflops = 11.0779 (norm. = 0.909091), norm. avg. (of 8) = 0.977167 fft 7: mflops = 6.022 (norm. = 0.494186), norm. avg. (of 8) = 0.362792 fft 8: mflops = 6.81437 (norm. = 0.559211), norm. avg. (of 8) = 0.523563 fft 9: mflops = 6.81437 (norm. = 0.559211), norm. avg. (of 8) = 0.355931 fft 10: mflops = 7.39846 (norm. = 0.607143), norm. avg. (of 8) = 0.423867 fft 11: mflops = 7.67248 (norm. = 0.62963), norm. avg. (of 8) = 0.465 fft 12: mflops = 9.41622 (norm. = 0.772727), norm. avg. (of 8) = 0.605176 fft 13: mflops = 8.09207 (norm. = 0.664063), norm. avg. (of 8) = 0.520349 fft 14: mflops = 2.58946 (norm. = 0.2125), norm. avg. (of 8) = 0.219431 Benchmarking for array size = 108: 0. Brenner: elapsed time t=1.06 s, 1024 iters, t-(init.)=1.05 s t(norm)=1.40555, mflops=3.55732 (err=8.7e-16) 1. CWP (min N) (N=110): elapsed time t=1.55 s, 4096 iters, t-(init.)=1.53 s t(norm)=0.512023, mflops=9.76518 2. CWP (best N) (N=112): elapsed time t=1.31 s, 4096 iters, t-(init.)=1.28 s t(norm)=0.428359, mflops=11.6724 3. FFTPACK: elapsed time t=1.83 s, 4096 iters, t-(init.)=1.81 s t(norm)=0.605727, mflops=8.25455 (err=3.4e-16) 4. FFTPACK (f2c): elapsed time t=1.81 s, 4096 iters, t-(init.)=1.79 s t(norm)=0.599034, mflops=8.34678 (err=7.1e-16) FFTW_MEASURE plan: (cost = 3.613281e-04) FFTW_TWIDDLE 9 FFTW_NOTW 12 5. FFTW: elapsed time t=1.36 s, 4096 iters, t-(init.)=1.34 s t(norm)=0.448439, mflops=11.1498 (err=3.0e-16) FFTW_ESTIMATE plan: (cost = 4.633200e+03) FFTW_TWIDDLE 9 FFTW_NOTW 12 6. FFTW_ESTIMATE: elapsed time t=1.36 s, 4096 iters, t-(init.)=1.34 s t(norm)=0.448439, mflops=11.1498 (err=3.0e-16) 7. Frigo-old: elapsed time t=1.9 s, 2048 iters, t-(init.)=1.89 s t(norm)=1.265, mflops=3.95257 (err=5.6e-16) 8. GSL: elapsed time t=1 s, 2048 iters, t-(init.)=0.99 s t(norm)=0.662618, mflops=7.54582 (err=3.2e-16) 9. Nielsen: elapsed time t=1.27 s, 2048 iters, t-(init.)=1.26 s t(norm)=0.843332, mflops=5.92886 (err=1.2e-15) 10. Singleton: elapsed time t=1.21 s, 2048 iters, t-(init.)=1.2 s t(norm)=0.803174, mflops=6.2253 (err=3.3e-16) 11. Singleton (f2c): elapsed time t=1.08 s, 2048 iters, t-(init.)=1.07 s t(norm)=0.716163, mflops=6.98165 (err=3.3e-16) 12. Temperton: elapsed time t=1.83 s, 4096 iters, t-(init.)=1.81 s t(norm)=0.605727, mflops=8.25455 (err=1.0e-07) 13. Temperton (f2c): elapsed time t=1.08 s, 2048 iters, t-(init.)=1.07 s t(norm)=0.716163, mflops=6.98165 (err=3.1e-16) 14. Valkenburg: elapsed time t=1.35 s, 1024 iters, t-(init.)=1.34 s t(norm)=1.79375, mflops=2.78745 (err=6.6e-16) Top mflops for N=108 = 11.6724 Normalized results and averages for N=108: fft 0: mflops = 3.55732 (norm. = 0.304762), norm. avg. (of 6) = 0.283469 fft 1: mflops = 9.76518 (norm. = 0.836601), norm. avg. (of 9) = 0.793573 fft 2: mflops = 11.6724 (norm. = 1), norm. avg. (of 9) = 0.651569 fft 3: mflops = 8.25455 (norm. = 0.707182), norm. avg. (of 9) = 0.669827 fft 4: mflops = 8.34678 (norm. = 0.715084), norm. avg. (of 9) = 0.639881 fft 5: mflops = 11.1498 (norm. = 0.955224), norm. avg. (of 9) = 0.97908 fft 6: mflops = 11.1498 (norm. = 0.955224), norm. avg. (of 9) = 0.974729 fft 7: mflops = 3.95257 (norm. = 0.338624), norm. avg. (of 9) = 0.360106 fft 8: mflops = 7.54582 (norm. = 0.646465), norm. avg. (of 9) = 0.537219 fft 9: mflops = 5.92886 (norm. = 0.507937), norm. avg. (of 9) = 0.37282 fft 10: mflops = 6.2253 (norm. = 0.533333), norm. avg. (of 9) = 0.43603 fft 11: mflops = 6.98165 (norm. = 0.598131), norm. avg. (of 9) = 0.479792 fft 12: mflops = 8.25455 (norm. = 0.707182), norm. avg. (of 9) = 0.61651 fft 13: mflops = 6.98165 (norm. = 0.598131), norm. avg. (of 9) = 0.528992 fft 14: mflops = 2.78745 (norm. = 0.238806), norm. avg. (of 9) = 0.221584 Benchmarking for array size = 210: 0. Brenner: elapsed time t=1.09 s, 512 iters, t-(init.)=1.09 s t(norm)=1.31415, mflops=3.80475 (err=6.2e-16) 1. CWP (min N): elapsed time t=1.56 s, 2048 iters, t-(init.)=1.54 s t(norm)=0.464171, mflops=10.7719 2. CWP (best N): elapsed time t=1.55 s, 2048 iters, t-(init.)=1.53 s t(norm)=0.461157, mflops=10.8423 3. FFTPACK: elapsed time t=1.1 s, 1024 iters, t-(init.)=1.09 s t(norm)=0.657073, mflops=7.6095 (err=3.7e-16) 4. FFTPACK (f2c): elapsed time t=1.2 s, 1024 iters, t-(init.)=1.19 s t(norm)=0.717355, mflops=6.97005 (err=4.2e-16) FFTW_MEASURE plan: (cost = 7.812500e-04) FFTW_TWIDDLE 10 FFTW_TWIDDLE 3 FFTW_NOTW 7 5. FFTW: elapsed time t=1.61 s, 2048 iters, t-(init.)=1.59 s t(norm)=0.479242, mflops=10.4332 (err=2.8e-16) FFTW_ESTIMATE plan: (cost = 9.324000e+03) FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.73 s, 2048 iters, t-(init.)=1.71 s t(norm)=0.515411, mflops=9.701 (err=3.5e-16) 7. Frigo-old: elapsed time t=1.25 s, 512 iters, t-(init.)=1.24 s t(norm)=1.49499, mflops=3.3445 (err=4.6e-16) 8. GSL: elapsed time t=1.31 s, 1024 iters, t-(init.)=1.3 s t(norm)=0.783665, mflops=6.38027 (err=3.6e-16) 9. Nielsen: elapsed time t=1.31 s, 1024 iters, t-(init.)=1.3 s t(norm)=0.783665, mflops=6.38027 (err=8.6e-15) 10. Singleton: elapsed time t=1.37 s, 1024 iters, t-(init.)=1.36 s t(norm)=0.819835, mflops=6.09879 (err=3.4e-16) 11. Singleton (f2c): elapsed time t=1.23 s, 1024 iters, t-(init.)=1.22 s t(norm)=0.73544, mflops=6.79865 (err=3.4e-16) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=1.77 s, 512 iters, t-(init.)=1.77 s t(norm)=2.13398, mflops=2.34304 (err=6.1e-16) Top mflops for N=210 = 10.8423 Normalized results and averages for N=210: fft 0: mflops = 3.80475 (norm. = 0.350917), norm. avg. (of 7) = 0.293105 fft 1: mflops = 10.7719 (norm. = 0.993506), norm. avg. (of 10) = 0.813567 fft 2: mflops = 10.8423 (norm. = 1), norm. avg. (of 10) = 0.686412 fft 3: mflops = 7.6095 (norm. = 0.701835), norm. avg. (of 10) = 0.673028 fft 4: mflops = 6.97005 (norm. = 0.642857), norm. avg. (of 10) = 0.640178 fft 5: mflops = 10.4332 (norm. = 0.962264), norm. avg. (of 10) = 0.977398 fft 6: mflops = 9.701 (norm. = 0.894737), norm. avg. (of 10) = 0.96673 fft 7: mflops = 3.3445 (norm. = 0.308468), norm. avg. (of 10) = 0.354943 fft 8: mflops = 6.38027 (norm. = 0.588462), norm. avg. (of 10) = 0.542343 fft 9: mflops = 6.38027 (norm. = 0.588462), norm. avg. (of 10) = 0.394385 fft 10: mflops = 6.09879 (norm. = 0.5625), norm. avg. (of 10) = 0.448677 fft 11: mflops = 6.79865 (norm. = 0.627049), norm. avg. (of 10) = 0.494518 fft 12: mflops = -1 (norm. = -0.0922314), norm. avg. (of 9) = 0.61651 fft 13: mflops = -1 (norm. = -0.0922314), norm. avg. (of 9) = 0.528992 fft 14: mflops = 2.34304 (norm. = 0.216102), norm. avg. (of 10) = 0.221036 Benchmarking for array size = 504: 0. Brenner: elapsed time t=1.4 s, 256 iters, t-(init.)=1.39 s t(norm)=1.20005, mflops=4.16649 (err=6.6e-16) 1. CWP (min N): elapsed time t=1.04 s, 512 iters, t-(init.)=1.03 s t(norm)=0.444623, mflops=11.2455 2. CWP (best N): elapsed time t=1.04 s, 512 iters, t-(init.)=1.03 s t(norm)=0.444623, mflops=11.2455 3. FFTPACK: elapsed time t=1.57 s, 512 iters, t-(init.)=1.56 s t(norm)=0.67341, mflops=7.4249 (err=4.6e-16) 4. FFTPACK (f2c): elapsed time t=1.64 s, 512 iters, t-(init.)=1.63 s t(norm)=0.703627, mflops=7.10604 (err=6.0e-16) FFTW_MEASURE plan: (cost = 2.187500e-03) FFTW_TWIDDLE 9 FFTW_TWIDDLE 8 FFTW_NOTW 7 5. FFTW: elapsed time t=1.12 s, 512 iters, t-(init.)=1.11 s t(norm)=0.479157, mflops=10.435 (err=4.5e-16) FFTW_ESTIMATE plan: (cost = 2.147040e+04) FFTW_TWIDDLE 6 FFTW_TWIDDLE 6 FFTW_NOTW 14 6. FFTW_ESTIMATE: elapsed time t=1.14 s, 512 iters, t-(init.)=1.13 s t(norm)=0.48779, mflops=10.2503 (err=4.5e-16) 7. Frigo-old: elapsed time t=1.47 s, 256 iters, t-(init.)=1.47 s t(norm)=1.26912, mflops=3.93974 (err=6.0e-16) 8. GSL: elapsed time t=1.63 s, 512 iters, t-(init.)=1.62 s t(norm)=0.69931, mflops=7.1499 (err=5.7e-16) 9. Nielsen: elapsed time t=1.81 s, 512 iters, t-(init.)=1.8 s t(norm)=0.777011, mflops=6.43491 (err=5.4e-15) 10. Singleton: elapsed time t=1.84 s, 512 iters, t-(init.)=1.83 s t(norm)=0.789961, mflops=6.32942 (err=6.6e-16) 11. Singleton (f2c): elapsed time t=1.62 s, 512 iters, t-(init.)=1.6 s t(norm)=0.690677, mflops=7.23928 (err=6.6e-16) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=1.17 s, 128 iters, t-(init.)=1.17 s t(norm)=2.02023, mflops=2.47497 (err=7.6e-16) Top mflops for N=504 = 11.2455 Normalized results and averages for N=504: fft 0: mflops = 4.16649 (norm. = 0.370504), norm. avg. (of 8) = 0.30278 fft 1: mflops = 11.2455 (norm. = 1), norm. avg. (of 11) = 0.830515 fft 2: mflops = 11.2455 (norm. = 1), norm. avg. (of 11) = 0.71492 fft 3: mflops = 7.4249 (norm. = 0.660256), norm. avg. (of 11) = 0.671867 fft 4: mflops = 7.10604 (norm. = 0.631902), norm. avg. (of 11) = 0.639426 fft 5: mflops = 10.435 (norm. = 0.927928), norm. avg. (of 11) = 0.972901 fft 6: mflops = 10.2503 (norm. = 0.911504), norm. avg. (of 11) = 0.961709 fft 7: mflops = 3.93974 (norm. = 0.35034), norm. avg. (of 11) = 0.354524 fft 8: mflops = 7.1499 (norm. = 0.635802), norm. avg. (of 11) = 0.550839 fft 9: mflops = 6.43491 (norm. = 0.572222), norm. avg. (of 11) = 0.410552 fft 10: mflops = 6.32942 (norm. = 0.562842), norm. avg. (of 11) = 0.459055 fft 11: mflops = 7.23928 (norm. = 0.64375), norm. avg. (of 11) = 0.508085 fft 12: mflops = -1 (norm. = -0.0889246), norm. avg. (of 9) = 0.61651 fft 13: mflops = -1 (norm. = -0.0889246), norm. avg. (of 9) = 0.528992 fft 14: mflops = 2.47497 (norm. = 0.220085), norm. avg. (of 11) = 0.220949 Benchmarking for array size = 1000: 0. Brenner: elapsed time t=1.62 s, 128 iters, t-(init.)=1.61 s t(norm)=1.26213, mflops=3.96155 (err=8.0e-16) 1. CWP (min N) (N=1001): elapsed time t=1.43 s, 256 iters, t-(init.)=1.42 s t(norm)=0.556592, mflops=8.98324 2. CWP (best N) (N=1008): elapsed time t=1.17 s, 256 iters, t-(init.)=1.16 s t(norm)=0.454681, mflops=10.9967 3. FFTPACK: elapsed time t=1.68 s, 256 iters, t-(init.)=1.66 s t(norm)=0.650664, mflops=7.68446 (err=6.1e-16) 4. FFTPACK (f2c): elapsed time t=1.78 s, 256 iters, t-(init.)=1.77 s t(norm)=0.69378, mflops=7.20689 (err=7.8e-16) FFTW_MEASURE plan: (cost = 4.843750e-03) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_NOTW 10 5. FFTW: elapsed time t=1.31 s, 256 iters, t-(init.)=1.3 s t(norm)=0.509556, mflops=9.81246 (err=6.3e-16) FFTW_ESTIMATE plan: (cost = 5.220000e+04) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_NOTW 10 6. FFTW_ESTIMATE: elapsed time t=1.31 s, 256 iters, t-(init.)=1.3 s t(norm)=0.509556, mflops=9.81246 (err=6.3e-16) 7. Frigo-old: elapsed time t=1.62 s, 128 iters, t-(init.)=1.61 s t(norm)=1.26213, mflops=3.96155 (err=6.3e-16) 8. GSL: elapsed time t=1.12 s, 128 iters, t-(init.)=1.11 s t(norm)=0.870165, mflops=5.74604 (err=6.3e-16) 9. Nielsen: elapsed time t=1.73 s, 256 iters, t-(init.)=1.72 s t(norm)=0.674182, mflops=7.4164 (err=1.3e-14) 10. Singleton: elapsed time t=1.65 s, 256 iters, t-(init.)=1.63 s t(norm)=0.638905, mflops=7.82589 (err=9.0e-16) 11. Singleton (f2c): elapsed time t=1.56 s, 256 iters, t-(init.)=1.55 s t(norm)=0.607548, mflops=8.22981 (err=9.1e-16) 12. Temperton: elapsed time t=1.64 s, 256 iters, t-(init.)=1.63 s t(norm)=0.638905, mflops=7.82589 (err=1.1e-07) 13. Temperton (f2c): elapsed time t=1.87 s, 256 iters, t-(init.)=1.85 s t(norm)=0.725137, mflops=6.89525 (err=6.4e-16) 14. Valkenburg: elapsed time t=1.31 s, 64 iters, t-(init.)=1.31 s t(norm)=2.0539, mflops=2.43439 (err=7.2e-16) Top mflops for N=1000 = 10.9967 Normalized results and averages for N=1000: fft 0: mflops = 3.96155 (norm. = 0.360248), norm. avg. (of 9) = 0.309165 fft 1: mflops = 8.98324 (norm. = 0.816901), norm. avg. (of 12) = 0.829381 fft 2: mflops = 10.9967 (norm. = 1), norm. avg. (of 12) = 0.738677 fft 3: mflops = 7.68446 (norm. = 0.698795), norm. avg. (of 12) = 0.674111 fft 4: mflops = 7.20689 (norm. = 0.655367), norm. avg. (of 12) = 0.640754 fft 5: mflops = 9.81246 (norm. = 0.892308), norm. avg. (of 12) = 0.966185 fft 6: mflops = 9.81246 (norm. = 0.892308), norm. avg. (of 12) = 0.955926 fft 7: mflops = 3.96155 (norm. = 0.360248), norm. avg. (of 12) = 0.355001 fft 8: mflops = 5.74604 (norm. = 0.522523), norm. avg. (of 12) = 0.54848 fft 9: mflops = 7.4164 (norm. = 0.674419), norm. avg. (of 12) = 0.432541 fft 10: mflops = 7.82589 (norm. = 0.711656), norm. avg. (of 12) = 0.480105 fft 11: mflops = 8.22981 (norm. = 0.748387), norm. avg. (of 12) = 0.52811 fft 12: mflops = 7.82589 (norm. = 0.711656), norm. avg. (of 10) = 0.626025 fft 13: mflops = 6.89525 (norm. = 0.627027), norm. avg. (of 10) = 0.538795 fft 14: mflops = 2.43439 (norm. = 0.221374), norm. avg. (of 12) = 0.220985 Benchmarking for array size = 1960: 0. Brenner: elapsed time t=1.81 s, 64 iters, t-(init.)=1.79 s t(norm)=1.30477, mflops=3.8321 (err=7.3e-16) 1. CWP (min N) (N=1980): elapsed time t=1.42 s, 128 iters, t-(init.)=1.38 s t(norm)=0.502955, mflops=9.94125 2. CWP (best N) (N=1980): elapsed time t=1.42 s, 128 iters, t-(init.)=1.38 s t(norm)=0.502955, mflops=9.94125 3. FFTPACK: elapsed time t=1.08 s, 64 iters, t-(init.)=1.06 s t(norm)=0.772656, mflops=6.47119 (err=5.6e-16) 4. FFTPACK (f2c): elapsed time t=1.17 s, 64 iters, t-(init.)=1.15 s t(norm)=0.838258, mflops=5.96475 (err=6.3e-16) FFTW_MEASURE plan: (cost = 1.062500e-02) FFTW_TWIDDLE 5 FFTW_TWIDDLE 8 FFTW_TWIDDLE 7 FFTW_NOTW 7 5. FFTW: elapsed time t=1.46 s, 128 iters, t-(init.)=1.42 s t(norm)=0.517534, mflops=9.66121 (err=5.6e-16) FFTW_ESTIMATE plan: (cost = 9.662800e+04) FFTW_TWIDDLE 10 FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 14 6. FFTW_ESTIMATE: elapsed time t=1.52 s, 128 iters, t-(init.)=1.48 s t(norm)=0.539401, mflops=9.26954 (err=5.6e-16) 7. Frigo-old: elapsed time t=1.94 s, 64 iters, t-(init.)=1.92 s t(norm)=1.39953, mflops=3.57264 (err=6.9e-16) 8. GSL: elapsed time t=1.14 s, 64 iters, t-(init.)=1.12 s t(norm)=0.816391, mflops=6.12452 (err=7.0e-16) 9. Nielsen: elapsed time t=1.1 s, 64 iters, t-(init.)=1.07 s t(norm)=0.779945, mflops=6.41071 (err=1.5e-14) 10. Singleton: elapsed time t=1.02 s, 64 iters, t-(init.)=1 s t(norm)=0.72892, mflops=6.85946 (err=7.7e-16) 11. Singleton (f2c): elapsed time t=1.87 s, 128 iters, t-(init.)=1.83 s t(norm)=0.666962, mflops=7.49668 (err=7.7e-16) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=1.56 s, 32 iters, t-(init.)=1.55 s t(norm)=2.25965, mflops=2.21273 (err=6.1e-16) Top mflops for N=1960 = 9.94125 Normalized results and averages for N=1960: fft 0: mflops = 3.8321 (norm. = 0.385475), norm. avg. (of 10) = 0.316796 fft 1: mflops = 9.94125 (norm. = 1), norm. avg. (of 13) = 0.842505 fft 2: mflops = 9.94125 (norm. = 1), norm. avg. (of 13) = 0.758778 fft 3: mflops = 6.47119 (norm. = 0.650943), norm. avg. (of 13) = 0.672329 fft 4: mflops = 5.96475 (norm. = 0.6), norm. avg. (of 13) = 0.637619 fft 5: mflops = 9.66121 (norm. = 0.971831), norm. avg. (of 13) = 0.966619 fft 6: mflops = 9.26954 (norm. = 0.932432), norm. avg. (of 13) = 0.954119 fft 7: mflops = 3.57264 (norm. = 0.359375), norm. avg. (of 13) = 0.355338 fft 8: mflops = 6.12452 (norm. = 0.616071), norm. avg. (of 13) = 0.553679 fft 9: mflops = 6.41071 (norm. = 0.64486), norm. avg. (of 13) = 0.448873 fft 10: mflops = 6.85946 (norm. = 0.69), norm. avg. (of 13) = 0.496251 fft 11: mflops = 7.49668 (norm. = 0.754098), norm. avg. (of 13) = 0.545494 fft 12: mflops = -1 (norm. = -0.100591), norm. avg. (of 10) = 0.626025 fft 13: mflops = -1 (norm. = -0.100591), norm. avg. (of 10) = 0.538795 fft 14: mflops = 2.21273 (norm. = 0.222581), norm. avg. (of 13) = 0.221108 Benchmarking for array size = 4725: 0. Brenner: elapsed time t=1.3 s, 16 iters, t-(init.)=1.29 s t(norm)=1.39795, mflops=3.57667 (err=1.4e-15) 1. CWP (min N) (N=5005): elapsed time t=1.1 s, 32 iters, t-(init.)=1.08 s t(norm)=0.585188, mflops=8.54427 2. CWP (best N) (N=5040): elapsed time t=1.86 s, 64 iters, t-(init.)=1.81 s t(norm)=0.490366, mflops=10.1965 3. FFTPACK: elapsed time t=1.41 s, 32 iters, t-(init.)=1.39 s t(norm)=0.753158, mflops=6.63871 (err=1.2e-15) 4. FFTPACK (f2c): elapsed time t=1.46 s, 32 iters, t-(init.)=1.44 s t(norm)=0.78025, mflops=6.4082 (err=1.3e-15) FFTW_MEASURE plan: (cost = 3.125000e-02) FFTW_TWIDDLE 5 FFTW_TWIDDLE 9 FFTW_TWIDDLE 3 FFTW_TWIDDLE 5 FFTW_NOTW 7 5. FFTW: elapsed time t=1.04 s, 32 iters, t-(init.)=1.02 s t(norm)=0.552677, mflops=9.04687 (err=1.2e-15) FFTW_ESTIMATE plan: (cost = 1.946700e+05) FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 5 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.07 s, 32 iters, t-(init.)=1.05 s t(norm)=0.568932, mflops=8.78839 (err=1.2e-15) 7. Frigo-old: elapsed time t=1.63 s, 16 iters, t-(init.)=1.62 s t(norm)=1.75556, mflops=2.84809 (err=1.3e-15) 8. GSL: elapsed time t=1.51 s, 32 iters, t-(init.)=1.49 s t(norm)=0.807342, mflops=6.19316 (err=1.3e-15) 9. Nielsen: elapsed time t=1.55 s, 32 iters, t-(init.)=1.53 s t(norm)=0.829016, mflops=6.03125 (err=4.3e-14) 10. Singleton: elapsed time t=1.51 s, 32 iters, t-(init.)=1.49 s t(norm)=0.807342, mflops=6.19316 (err=1.8e-15) 11. Singleton (f2c): elapsed time t=1.41 s, 32 iters, t-(init.)=1.39 s t(norm)=0.753158, mflops=6.63871 (err=1.8e-15) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=1 s, 8 iters, t-(init.)=1 s t(norm)=2.16736, mflops=2.30695 (err=1.3e-15) Top mflops for N=4725 = 10.1965 Normalized results and averages for N=4725: fft 0: mflops = 3.57667 (norm. = 0.350775), norm. avg. (of 11) = 0.319885 fft 1: mflops = 8.54427 (norm. = 0.837963), norm. avg. (of 14) = 0.842181 fft 2: mflops = 10.1965 (norm. = 1), norm. avg. (of 14) = 0.776008 fft 3: mflops = 6.63871 (norm. = 0.651079), norm. avg. (of 14) = 0.670811 fft 4: mflops = 6.4082 (norm. = 0.628472), norm. avg. (of 14) = 0.636966 fft 5: mflops = 9.04687 (norm. = 0.887255), norm. avg. (of 14) = 0.96095 fft 6: mflops = 8.78839 (norm. = 0.861905), norm. avg. (of 14) = 0.947532 fft 7: mflops = 2.84809 (norm. = 0.279321), norm. avg. (of 14) = 0.349908 fft 8: mflops = 6.19316 (norm. = 0.607383), norm. avg. (of 14) = 0.557515 fft 9: mflops = 6.03125 (norm. = 0.591503), norm. avg. (of 14) = 0.459061 fft 10: mflops = 6.19316 (norm. = 0.607383), norm. avg. (of 14) = 0.504189 fft 11: mflops = 6.63871 (norm. = 0.651079), norm. avg. (of 14) = 0.553035 fft 12: mflops = -1 (norm. = -0.0980731), norm. avg. (of 10) = 0.626025 fft 13: mflops = -1 (norm. = -0.0980731), norm. avg. (of 10) = 0.538795 fft 14: mflops = 2.30695 (norm. = 0.22625), norm. avg. (of 14) = 0.221475 Benchmarking for array size = 10368: 0. Brenner: elapsed time t=1.38 s, 8 iters, t-(init.)=1.36 s t(norm)=1.22914, mflops=4.06787 (err=1.1e-15) 1. CWP (min N) (N=10920): elapsed time t=1.19 s, 16 iters, t-(init.)=1.16 s t(norm)=0.524194, mflops=9.53845 2. CWP (best N) (N=11088): elapsed time t=1.18 s, 16 iters, t-(init.)=1.16 s t(norm)=0.524194, mflops=9.53845 3. FFTPACK: elapsed time t=1.46 s, 16 iters, t-(init.)=1.44 s t(norm)=0.650724, mflops=7.68375 (err=9.9e-16) 4. FFTPACK (f2c): elapsed time t=1.47 s, 16 iters, t-(init.)=1.45 s t(norm)=0.655243, mflops=7.63076 (err=1.1e-15) FFTW_MEASURE plan: (cost = 7.250000e-02) FFTW_TWIDDLE 32 FFTW_TWIDDLE 9 FFTW_TWIDDLE 3 FFTW_NOTW 12 5. FFTW: elapsed time t=1.12 s, 16 iters, t-(init.)=1.1 s t(norm)=0.497081, mflops=10.0587 (err=9.1e-16) FFTW_ESTIMATE plan: (cost = 1.254528e+05) FFTW_TWIDDLE 9 FFTW_TWIDDLE 6 FFTW_TWIDDLE 6 FFTW_NOTW 32 6. FFTW_ESTIMATE: elapsed time t=1.2 s, 16 iters, t-(init.)=1.18 s t(norm)=0.533232, mflops=9.37678 (err=9.4e-16) 7. Frigo-old: elapsed time t=1.23 s, 8 iters, t-(init.)=1.22 s t(norm)=1.10262, mflops=4.53467 (err=1.0e-15) 8. GSL: elapsed time t=1.43 s, 16 iters, t-(init.)=1.41 s t(norm)=0.637167, mflops=7.84724 (err=9.4e-16) 9. Nielsen: elapsed time t=1.9 s, 16 iters, t-(init.)=1.88 s t(norm)=0.849556, mflops=5.88543 (err=1.1e-14) 10. Singleton: elapsed time t=1.84 s, 16 iters, t-(init.)=1.82 s t(norm)=0.822442, mflops=6.07945 (err=1.3e-15) 11. Singleton (f2c): elapsed time t=1.7 s, 16 iters, t-(init.)=1.68 s t(norm)=0.759178, mflops=6.58607 (err=1.3e-15) 12. Temperton: elapsed time t=1.59 s, 16 iters, t-(init.)=1.57 s t(norm)=0.70947, mflops=7.04752 (err=2.2e-07) 13. Temperton (f2c): elapsed time t=1.83 s, 16 iters, t-(init.)=1.81 s t(norm)=0.817923, mflops=6.11304 (err=9.7e-16) 14. Valkenburg: elapsed time t=1.11 s, 4 iters, t-(init.)=1.11 s t(norm)=2.0064, mflops=2.49203 (err=1.3e-15) Top mflops for N=10368 = 10.0587 Normalized results and averages for N=10368: fft 0: mflops = 4.06787 (norm. = 0.404412), norm. avg. (of 12) = 0.326929 fft 1: mflops = 9.53845 (norm. = 0.948276), norm. avg. (of 15) = 0.849254 fft 2: mflops = 9.53845 (norm. = 0.948276), norm. avg. (of 15) = 0.787493 fft 3: mflops = 7.68375 (norm. = 0.763889), norm. avg. (of 15) = 0.677016 fft 4: mflops = 7.63076 (norm. = 0.758621), norm. avg. (of 15) = 0.645076 fft 5: mflops = 10.0587 (norm. = 1), norm. avg. (of 15) = 0.963553 fft 6: mflops = 9.37678 (norm. = 0.932203), norm. avg. (of 15) = 0.94651 fft 7: mflops = 4.53467 (norm. = 0.45082), norm. avg. (of 15) = 0.356635 fft 8: mflops = 7.84724 (norm. = 0.780142), norm. avg. (of 15) = 0.572357 fft 9: mflops = 5.88543 (norm. = 0.585106), norm. avg. (of 15) = 0.467464 fft 10: mflops = 6.07945 (norm. = 0.604396), norm. avg. (of 15) = 0.51087 fft 11: mflops = 6.58607 (norm. = 0.654762), norm. avg. (of 15) = 0.559817 fft 12: mflops = 7.04752 (norm. = 0.700637), norm. avg. (of 11) = 0.632808 fft 13: mflops = 6.11304 (norm. = 0.607735), norm. avg. (of 11) = 0.545063 fft 14: mflops = 2.49203 (norm. = 0.247748), norm. avg. (of 15) = 0.223226 Benchmarking for array size = 27000: 0. Brenner: elapsed time t=1.16 s, 2 iters, t-(init.)=1.15 s t(norm)=1.44669, mflops=3.45616 (err=3.6e-15) 1. CWP (min N) (N=27720): elapsed time t=1.67 s, 8 iters, t-(init.)=1.64 s t(norm)=0.515778, mflops=9.6941 2. CWP (best N) (N=27720): elapsed time t=1.67 s, 8 iters, t-(init.)=1.63 s t(norm)=0.512633, mflops=9.75357 3. FFTPACK: elapsed time t=1.12 s, 4 iters, t-(init.)=1.1 s t(norm)=0.691897, mflops=7.22651 (err=3.4e-15) 4. FFTPACK (f2c): elapsed time t=1.16 s, 4 iters, t-(init.)=1.15 s t(norm)=0.723347, mflops=6.91232 (err=3.5e-15) FFTW_MEASURE plan: (cost = 2.200000e-01) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_TWIDDLE 6 FFTW_TWIDDLE 9 FFTW_NOTW 5 5. FFTW: elapsed time t=1.93 s, 8 iters, t-(init.)=1.89 s t(norm)=0.594402, mflops=8.41181 (err=3.5e-15) FFTW_ESTIMATE plan: (cost = 1.231200e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 10 FFTW_TWIDDLE 9 FFTW_TWIDDLE 2 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.82 s, 8 iters, t-(init.)=1.79 s t(norm)=0.562952, mflops=8.88175 (err=3.5e-15) 7. Frigo-old: elapsed time t=1.21 s, 2 iters, t-(init.)=1.21 s t(norm)=1.52217, mflops=3.28478 (err=3.6e-15) 8. GSL: elapsed time t=1.24 s, 4 iters, t-(init.)=1.22 s t(norm)=0.767376, mflops=6.51571 (err=3.4e-15) 9. Nielsen: elapsed time t=1.27 s, 4 iters, t-(init.)=1.25 s t(norm)=0.786246, mflops=6.35933 (err=2.0e-13) 10. Singleton: elapsed time t=1.35 s, 4 iters, t-(init.)=1.33 s t(norm)=0.836566, mflops=5.97681 (err=5.0e-15) 11. Singleton (f2c): elapsed time t=1.26 s, 4 iters, t-(init.)=1.24 s t(norm)=0.779956, mflops=6.41062 (err=5.0e-15) 12. Temperton: elapsed time t=1.2 s, 4 iters, t-(init.)=1.18 s t(norm)=0.742217, mflops=6.73658 (err=1.4e-07) 13. Temperton (f2c): elapsed time t=1.38 s, 4 iters, t-(init.)=1.36 s t(norm)=0.855436, mflops=5.84497 (err=3.6e-15) 14. Valkenburg: elapsed time t=1.74 s, 2 iters, t-(init.)=1.73 s t(norm)=2.17633, mflops=2.29745 (err=3.4e-15) Top mflops for N=27000 = 9.75357 Normalized results and averages for N=27000: fft 0: mflops = 3.45616 (norm. = 0.354348), norm. avg. (of 13) = 0.329038 fft 1: mflops = 9.6941 (norm. = 0.993902), norm. avg. (of 16) = 0.858294 fft 2: mflops = 9.75357 (norm. = 1), norm. avg. (of 16) = 0.800775 fft 3: mflops = 7.22651 (norm. = 0.740909), norm. avg. (of 16) = 0.681009 fft 4: mflops = 6.91232 (norm. = 0.708696), norm. avg. (of 16) = 0.649053 fft 5: mflops = 8.41181 (norm. = 0.862434), norm. avg. (of 16) = 0.957233 fft 6: mflops = 8.88175 (norm. = 0.910615), norm. avg. (of 16) = 0.944267 fft 7: mflops = 3.28478 (norm. = 0.336777), norm. avg. (of 16) = 0.355394 fft 8: mflops = 6.51571 (norm. = 0.668033), norm. avg. (of 16) = 0.578336 fft 9: mflops = 6.35933 (norm. = 0.652), norm. avg. (of 16) = 0.478997 fft 10: mflops = 5.97681 (norm. = 0.612782), norm. avg. (of 16) = 0.517239 fft 11: mflops = 6.41062 (norm. = 0.657258), norm. avg. (of 16) = 0.565907 fft 12: mflops = 6.73658 (norm. = 0.690678), norm. avg. (of 12) = 0.63763 fft 13: mflops = 5.84497 (norm. = 0.599265), norm. avg. (of 12) = 0.549579 fft 14: mflops = 2.29745 (norm. = 0.235549), norm. avg. (of 16) = 0.223997 Benchmarking for array size = 75600: 0. Brenner: elapsed time t=1.84 s, 1 iters, t-(init.)=1.83 s t(norm)=1.49366, mflops=3.34749 (err=4.7e-15) 1. CWP (min N) (N=80080): elapsed time t=1.39 s, 2 iters, t-(init.)=1.35 s t(norm)=0.550939, mflops=9.07542 2. CWP (best N) (N=80080): elapsed time t=1.37 s, 2 iters, t-(init.)=1.33 s t(norm)=0.542777, mflops=9.21189 3. FFTPACK: elapsed time t=1.95 s, 2 iters, t-(init.)=1.92 s t(norm)=0.783558, mflops=6.38115 (err=4.7e-15) 4. FFTPACK (f2c): elapsed time t=1 s, 1 iters, t-(init.)=0.99 s t(norm)=0.808044, mflops=6.18778 (err=4.7e-15) FFTW_MEASURE plan: (cost = 6.800000e-01) FFTW_TWIDDLE 10 FFTW_TWIDDLE 6 FFTW_TWIDDLE 10 FFTW_TWIDDLE 3 FFTW_TWIDDLE 6 FFTW_NOTW 7 5. FFTW: elapsed time t=1.29 s, 2 iters, t-(init.)=1.25 s t(norm)=0.510129, mflops=9.80145 (err=4.6e-15) FFTW_ESTIMATE plan: (cost = 2.971080e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 9 FFTW_TWIDDLE 8 FFTW_TWIDDLE 7 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.42 s, 2 iters, t-(init.)=1.38 s t(norm)=0.563182, mflops=8.87812 (err=4.7e-15) 7. Frigo-old: elapsed time t=1.91 s, 1 iters, t-(init.)=1.89 s t(norm)=1.54263, mflops=3.24122 (err=4.7e-15) 8. GSL: elapsed time t=1.93 s, 2 iters, t-(init.)=1.9 s t(norm)=0.775396, mflops=6.44832 (err=4.7e-15) 9. Nielsen: elapsed time t=1.07 s, 1 iters, t-(init.)=1.05 s t(norm)=0.857016, mflops=5.8342 (err=4.8e-13) 10. Singleton: elapsed time t=1.09 s, 1 iters, t-(init.)=1.07 s t(norm)=0.87334, mflops=5.72515 (err=6.1e-15) 11. Singleton (f2c): elapsed time t=1.04 s, 1 iters, t-(init.)=1.03 s t(norm)=0.840692, mflops=5.94748 (err=6.1e-15) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=2.89 s, 1 iters, t-(init.)=2.87 s t(norm)=2.34251, mflops=2.13446 (err=4.5e-15) Top mflops for N=75600 = 9.80145 Normalized results and averages for N=75600: fft 0: mflops = 3.34749 (norm. = 0.34153), norm. avg. (of 14) = 0.32993 fft 1: mflops = 9.07542 (norm. = 0.925926), norm. avg. (of 17) = 0.862273 fft 2: mflops = 9.21189 (norm. = 0.93985), norm. avg. (of 17) = 0.808956 fft 3: mflops = 6.38115 (norm. = 0.651042), norm. avg. (of 17) = 0.679247 fft 4: mflops = 6.18778 (norm. = 0.631313), norm. avg. (of 17) = 0.648009 fft 5: mflops = 9.80145 (norm. = 1), norm. avg. (of 17) = 0.959749 fft 6: mflops = 8.87812 (norm. = 0.905797), norm. avg. (of 17) = 0.942004 fft 7: mflops = 3.24122 (norm. = 0.330688), norm. avg. (of 17) = 0.353941 fft 8: mflops = 6.44832 (norm. = 0.657895), norm. avg. (of 17) = 0.583016 fft 9: mflops = 5.8342 (norm. = 0.595238), norm. avg. (of 17) = 0.485835 fft 10: mflops = 5.72515 (norm. = 0.584112), norm. avg. (of 17) = 0.521173 fft 11: mflops = 5.94748 (norm. = 0.606796), norm. avg. (of 17) = 0.568312 fft 12: mflops = -1 (norm. = -0.102026), norm. avg. (of 12) = 0.63763 fft 13: mflops = -1 (norm. = -0.102026), norm. avg. (of 12) = 0.549579 fft 14: mflops = 2.13446 (norm. = 0.21777), norm. avg. (of 17) = 0.22363 Benchmarking for array size = 165375: 0. Brenner: elapsed time t=4.67 s, 1 iters, t-(init.)=4.63 s t(norm)=1.61502, mflops=3.09594 (err=1.2e-14) 1. CWP (min N) (N=180180): elapsed time t=1.73 s, 1 iters, t-(init.)=1.69 s t(norm)=0.589499, mflops=8.48177 2. CWP (best N) (N=180180): elapsed time t=1.73 s, 1 iters, t-(init.)=1.7 s t(norm)=0.592988, mflops=8.43188 3. FFTPACK: elapsed time t=2.76 s, 1 iters, t-(init.)=2.72 s t(norm)=0.94878, mflops=5.26992 (err=1.2e-14) 4. FFTPACK (f2c): elapsed time t=2.83 s, 1 iters, t-(init.)=2.8 s t(norm)=0.976686, mflops=5.11935 (err=1.2e-14) FFTW_MEASURE plan: (cost = 1.770000e+00) FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 3 FFTW_TWIDDLE 5 FFTW_TWIDDLE 5 FFTW_TWIDDLE 5 FFTW_NOTW 7 5. FFTW: elapsed time t=1.69 s, 1 iters, t-(init.)=1.66 s t(norm)=0.579035, mflops=8.63506 (err=1.2e-14) FFTW_ESTIMATE plan: (cost = 8.367975e+06) FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 7 FFTW_TWIDDLE 5 FFTW_TWIDDLE 5 FFTW_NOTW 15 6. FFTW_ESTIMATE: elapsed time t=1.81 s, 1 iters, t-(init.)=1.78 s t(norm)=0.620893, mflops=8.05292 (err=1.2e-14) 7. Frigo-old: elapsed time t=5.69 s, 1 iters, t-(init.)=5.66 s t(norm)=1.9743, mflops=2.53254 (err=1.2e-14) 8. GSL: elapsed time t=2.42 s, 1 iters, t-(init.)=2.38 s t(norm)=0.830183, mflops=6.02277 (err=1.2e-14) 9. Nielsen: elapsed time t=2.73 s, 1 iters, t-(init.)=2.7 s t(norm)=0.941804, mflops=5.30896 (err=1.7e-12) 10. Singleton: elapsed time t=2.61 s, 1 iters, t-(init.)=2.57 s t(norm)=0.896458, mflops=5.57751 (err=1.8e-14) 11. Singleton (f2c): elapsed time t=2.41 s, 1 iters, t-(init.)=2.37 s t(norm)=0.826695, mflops=6.04818 (err=1.8e-14) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=7.31 s, 1 iters, t-(init.)=7.27 s t(norm)=2.53589, mflops=1.97169 (err=1.2e-14) Top mflops for N=165375 = 8.63506 Normalized results and averages for N=165375: fft 0: mflops = 3.09594 (norm. = 0.358531), norm. avg. (of 15) = 0.331837 fft 1: mflops = 8.48177 (norm. = 0.982249), norm. avg. (of 18) = 0.868938 fft 2: mflops = 8.43188 (norm. = 0.976471), norm. avg. (of 18) = 0.818262 fft 3: mflops = 5.26992 (norm. = 0.610294), norm. avg. (of 18) = 0.675416 fft 4: mflops = 5.11935 (norm. = 0.592857), norm. avg. (of 18) = 0.644945 fft 5: mflops = 8.63506 (norm. = 1), norm. avg. (of 18) = 0.961985 fft 6: mflops = 8.05292 (norm. = 0.932584), norm. avg. (of 18) = 0.94148 fft 7: mflops = 2.53254 (norm. = 0.293286), norm. avg. (of 18) = 0.350571 fft 8: mflops = 6.02277 (norm. = 0.697479), norm. avg. (of 18) = 0.589375 fft 9: mflops = 5.30896 (norm. = 0.614815), norm. avg. (of 18) = 0.493001 fft 10: mflops = 5.57751 (norm. = 0.645914), norm. avg. (of 18) = 0.528103 fft 11: mflops = 6.04818 (norm. = 0.700422), norm. avg. (of 18) = 0.575652 fft 12: mflops = -1 (norm. = -0.115807), norm. avg. (of 12) = 0.63763 fft 13: mflops = -1 (norm. = -0.115807), norm. avg. (of 12) = 0.549579 fft 14: mflops = 1.97169 (norm. = 0.228336), norm. avg. (of 18) = 0.223892 Benchmarking for array size = 362880: 0. Brenner: elapsed time t=10.01 s, 1 iters, t-(init.)=9.93 s t(norm)=1.48163, mflops=3.37466 (err=7.7e-15) 1. CWP (min N) (N=720720): elapsed time t=7.5 s, 1 iters, t-(init.)=7.34 s t(norm)=1.09518, mflops=4.56545 2. CWP (best N) (N=720720): elapsed time t=7.5 s, 1 iters, t-(init.)=7.35 s t(norm)=1.09667, mflops=4.55924 3. FFTPACK: elapsed time t=5.4 s, 1 iters, t-(init.)=5.32 s t(norm)=0.793784, mflops=6.29895 (err=7.5e-15) 4. FFTPACK (f2c): elapsed time t=5.41 s, 1 iters, t-(init.)=5.33 s t(norm)=0.795276, mflops=6.28713 (err=7.5e-15) FFTW_MEASURE plan: (cost = 3.640000e+00) FFTW_TWIDDLE 32 FFTW_TWIDDLE 10 FFTW_TWIDDLE 6 FFTW_TWIDDLE 3 FFTW_TWIDDLE 9 FFTW_NOTW 7 5. FFTW: elapsed time t=3.42 s, 1 iters, t-(init.)=3.34 s t(norm)=0.498353, mflops=10.0331 (err=7.6e-15) FFTW_ESTIMATE plan: (cost = 7.511616e+06) FFTW_TWIDDLE 10 FFTW_TWIDDLE 9 FFTW_TWIDDLE 9 FFTW_TWIDDLE 7 FFTW_TWIDDLE 2 FFTW_NOTW 32 6. FFTW_ESTIMATE: elapsed time t=4.14 s, 1 iters, t-(init.)=4.06 s t(norm)=0.605782, mflops=8.25379 (err=7.6e-15) 7. Frigo-old: elapsed time t=9.93 s, 1 iters, t-(init.)=9.85 s t(norm)=1.46969, mflops=3.40207 (err=7.6e-15) 8. GSL: elapsed time t=4.9 s, 1 iters, t-(init.)=4.83 s t(norm)=0.720672, mflops=6.93797 (err=7.5e-15) 9. Nielsen: elapsed time t=6.03 s, 1 iters, t-(init.)=5.95 s t(norm)=0.887784, mflops=5.632 (err=3.5e-12) 10. Singleton: elapsed time t=6.67 s, 1 iters, t-(init.)=6.6 s t(norm)=0.984769, mflops=5.07733 (err=1.1e-14) 11. Singleton (f2c): elapsed time t=6.3 s, 1 iters, t-(init.)=6.22 s t(norm)=0.92807, mflops=5.38752 (err=1.1e-14) 12. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 13. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 14. Valkenburg: elapsed time t=16.35 s, 1 iters, t-(init.)=16.28 s t(norm)=2.4291, mflops=2.05838 (err=7.9e-15) Top mflops for N=362880 = 10.0331 Normalized results and averages for N=362880: fft 0: mflops = 3.37466 (norm. = 0.336354), norm. avg. (of 16) = 0.332119 fft 1: mflops = 4.56545 (norm. = 0.455041), norm. avg. (of 19) = 0.847154 fft 2: mflops = 4.55924 (norm. = 0.454422), norm. avg. (of 19) = 0.799112 fft 3: mflops = 6.29895 (norm. = 0.62782), norm. avg. (of 19) = 0.672911 fft 4: mflops = 6.28713 (norm. = 0.626642), norm. avg. (of 19) = 0.643982 fft 5: mflops = 10.0331 (norm. = 1), norm. avg. (of 19) = 0.963986 fft 6: mflops = 8.25379 (norm. = 0.82266), norm. avg. (of 19) = 0.935227 fft 7: mflops = 3.40207 (norm. = 0.339086), norm. avg. (of 19) = 0.349967 fft 8: mflops = 6.93797 (norm. = 0.691511), norm. avg. (of 19) = 0.594751 fft 9: mflops = 5.632 (norm. = 0.561345), norm. avg. (of 19) = 0.496598 fft 10: mflops = 5.07733 (norm. = 0.506061), norm. avg. (of 19) = 0.526943 fft 11: mflops = 5.38752 (norm. = 0.536977), norm. avg. (of 19) = 0.573616 fft 12: mflops = -1 (norm. = -0.0996706), norm. avg. (of 12) = 0.63763 fft 13: mflops = -1 (norm. = -0.0996706), norm. avg. (of 12) = 0.549579 fft 14: mflops = 2.05838 (norm. = 0.20516), norm. avg. (of 19) = 0.222906 ------------------------------------------------------ @@@@ bench.3d.p2.log Benchmarking for sizes: 4x4x4 (0.00128174 MB) 8x8x8 (0.00830078 MB) 16x16x16 (0.0633545 MB) 32x32x32 (0.501587 MB) 64x64x64 (4.00305 MB) 256x64x32 (8.01184 MB) 16x1024x64 (16.047 MB) 128x128x128 (32.006 MB) Maximum array size N = 2097152 Benchmarking FFTs: 0. FFTW 1. HARM 2. HARM (f2c) 3. NR (C) 4. NR (F) 5. PDA 6. PDA (f2c) 7. Singleton 8. Singleton (f2c) 9. Temperton 10. Temperton (f2c) Computing normalized averages (11 transforms). Benchmarking for array size = 4x4x4 (power of 2): 0. FFTW: elapsed time t=1.05 s, 8192 iters, t-(init.)=1.02 s t(norm)=0.324249, mflops=15.4202 (err=3.0e-16) 1. Skipping fft (all dimensions must be > 4 for HARM). 2. Skipping fft (all dimensions must be > 4 for HARM). 3. NR (C): elapsed time t=1.03 s, 4096 iters, t-(init.)=1.02 s t(norm)=0.648499, mflops=7.71012 (err=3.0e-16) 4. NR (F): elapsed time t=1.15 s, 4096 iters, t-(init.)=1.14 s t(norm)=0.724792, mflops=6.89853 (err=3.0e-16) 5. PDA: elapsed time t=1.32 s, 4096 iters, t-(init.)=1.3 s t(norm)=0.826518, mflops=6.04948 (err=2.9e-16) 6. PDA (f2c): elapsed time t=1.55 s, 4096 iters, t-(init.)=1.54 s t(norm)=0.979106, mflops=5.1067 (err=3.3e-16) 7. Singleton: elapsed time t=1.87 s, 8192 iters, t-(init.)=1.84 s t(norm)=0.58492, mflops=8.54817 (err=3.0e-16) 8. Singleton (f2c): elapsed time t=1.75 s, 8192 iters, t-(init.)=1.72 s t(norm)=0.546773, mflops=9.14456 (err=2.2e-16) 9. Temperton: elapsed time t=1.34 s, 8192 iters, t-(init.)=1.32 s t(norm)=0.419617, mflops=11.9156 (err=4.1e-16) 10. Temperton (f2c): elapsed time t=1.73 s, 8192 iters, t-(init.)=1.71 s t(norm)=0.543594, mflops=9.19804 (err=3.0e-16) Top mflops for N=64 = 15.4202 Normalized results and averages for N=64: fft 0: mflops = 15.4202 (norm. = 1), norm. avg. (of 1) = 1 fft 1: mflops = -1 (norm. = -0.0648499), norm. avg. (of 0) = -1 fft 2: mflops = -1 (norm. = -0.0648499), norm. avg. (of 0) = -1 fft 3: mflops = 7.71012 (norm. = 0.5), norm. avg. (of 1) = 0.5 fft 4: mflops = 6.89853 (norm. = 0.447368), norm. avg. (of 1) = 0.447368 fft 5: mflops = 6.04948 (norm. = 0.392308), norm. avg. (of 1) = 0.392308 fft 6: mflops = 5.1067 (norm. = 0.331169), norm. avg. (of 1) = 0.331169 fft 7: mflops = 8.54817 (norm. = 0.554348), norm. avg. (of 1) = 0.554348 fft 8: mflops = 9.14456 (norm. = 0.593023), norm. avg. (of 1) = 0.593023 fft 9: mflops = 11.9156 (norm. = 0.772727), norm. avg. (of 1) = 0.772727 fft 10: mflops = 9.19804 (norm. = 0.596491), norm. avg. (of 1) = 0.596491 Benchmarking for array size = 8x8x8 (power of 2): 0. FFTW: elapsed time t=1.57 s, 1024 iters, t-(init.)=1.54 s t(norm)=0.326369, mflops=15.3201 (err=2.8e-16) 1. HARM: elapsed time t=1.07 s, 512 iters, t-(init.)=1.05 s t(norm)=0.445048, mflops=11.2347 (err=3.3e-16) 2. HARM (f2c): elapsed time t=1.24 s, 512 iters, t-(init.)=1.23 s t(norm)=0.521342, mflops=9.59063 (err=3.2e-16) 3. NR (C): elapsed time t=1.32 s, 512 iters, t-(init.)=1.31 s t(norm)=0.55525, mflops=9.00495 (err=3.1e-16) 4. NR (F): elapsed time t=1.44 s, 512 iters, t-(init.)=1.43 s t(norm)=0.606113, mflops=8.24929 (err=3.1e-16) 5. PDA: elapsed time t=1.52 s, 512 iters, t-(init.)=1.51 s t(norm)=0.640021, mflops=7.81224 (err=2.5e-16) 6. PDA (f2c): elapsed time t=1.67 s, 512 iters, t-(init.)=1.66 s t(norm)=0.7036, mflops=7.10631 (err=2.6e-16) 7. Singleton: elapsed time t=1.45 s, 512 iters, t-(init.)=1.44 s t(norm)=0.610352, mflops=8.192 (err=3.4e-16) 8. Singleton (f2c): elapsed time t=1.27 s, 512 iters, t-(init.)=1.26 s t(norm)=0.534058, mflops=9.36229 (err=3.4e-16) 9. Temperton: elapsed time t=1.01 s, 512 iters, t-(init.)=1 s t(norm)=0.423855, mflops=11.7965 (err=1.1e-08) 10. Temperton (f2c): elapsed time t=1.3 s, 512 iters, t-(init.)=1.29 s t(norm)=0.546773, mflops=9.14456 (err=3.0e-16) Top mflops for N=512 = 15.3201 Normalized results and averages for N=512: fft 0: mflops = 15.3201 (norm. = 1), norm. avg. (of 2) = 1 fft 1: mflops = 11.2347 (norm. = 0.733333), norm. avg. (of 1) = 0.733333 fft 2: mflops = 9.59063 (norm. = 0.626016), norm. avg. (of 1) = 0.626016 fft 3: mflops = 9.00495 (norm. = 0.587786), norm. avg. (of 2) = 0.543893 fft 4: mflops = 8.24929 (norm. = 0.538462), norm. avg. (of 2) = 0.492915 fft 5: mflops = 7.81224 (norm. = 0.509934), norm. avg. (of 2) = 0.451121 fft 6: mflops = 7.10631 (norm. = 0.463855), norm. avg. (of 2) = 0.397512 fft 7: mflops = 8.192 (norm. = 0.534722), norm. avg. (of 2) = 0.544535 fft 8: mflops = 9.36229 (norm. = 0.611111), norm. avg. (of 2) = 0.602067 fft 9: mflops = 11.7965 (norm. = 0.77), norm. avg. (of 2) = 0.771364 fft 10: mflops = 9.14456 (norm. = 0.596899), norm. avg. (of 2) = 0.596695 Benchmarking for array size = 16x16x16 (power of 2): 0. FFTW: elapsed time t=1.18 s, 64 iters, t-(init.)=1.15 s t(norm)=0.365575, mflops=13.6771 (err=3.1e-16) 1. HARM: elapsed time t=1.5 s, 64 iters, t-(init.)=1.47 s t(norm)=0.4673, mflops=10.6998 (err=3.2e-16) 2. HARM (f2c): elapsed time t=1.76 s, 64 iters, t-(init.)=1.74 s t(norm)=0.553131, mflops=9.03945 (err=3.1e-16) 3. NR (C): elapsed time t=1.06 s, 32 iters, t-(init.)=1.05 s t(norm)=0.667572, mflops=7.48983 (err=3.5e-16) 4. NR (F): elapsed time t=1.11 s, 32 iters, t-(init.)=1.1 s t(norm)=0.699361, mflops=7.14938 (err=3.5e-16) 5. PDA: elapsed time t=1.94 s, 64 iters, t-(init.)=1.91 s t(norm)=0.607173, mflops=8.23489 (err=2.9e-16) 6. PDA (f2c): elapsed time t=1.02 s, 32 iters, t-(init.)=1.01 s t(norm)=0.642141, mflops=7.78646 (err=2.9e-16) 7. Singleton: elapsed time t=1.09 s, 32 iters, t-(init.)=1.08 s t(norm)=0.686646, mflops=7.28178 (err=3.5e-16) 8. Singleton (f2c): elapsed time t=1.03 s, 32 iters, t-(init.)=1.02 s t(norm)=0.648499, mflops=7.71012 (err=3.3e-16) 9. Temperton: elapsed time t=1.5 s, 64 iters, t-(init.)=1.47 s t(norm)=0.4673, mflops=10.6998 (err=6.0e-08) 10. Temperton (f2c): elapsed time t=1.82 s, 64 iters, t-(init.)=1.79 s t(norm)=0.569026, mflops=8.78695 (err=3.1e-16) Top mflops for N=4096 = 13.6771 Normalized results and averages for N=4096: fft 0: mflops = 13.6771 (norm. = 1), norm. avg. (of 3) = 1 fft 1: mflops = 10.6998 (norm. = 0.782313), norm. avg. (of 2) = 0.757823 fft 2: mflops = 9.03945 (norm. = 0.66092), norm. avg. (of 2) = 0.643468 fft 3: mflops = 7.48983 (norm. = 0.547619), norm. avg. (of 3) = 0.545135 fft 4: mflops = 7.14938 (norm. = 0.522727), norm. avg. (of 3) = 0.502852 fft 5: mflops = 8.23489 (norm. = 0.602094), norm. avg. (of 3) = 0.501445 fft 6: mflops = 7.78646 (norm. = 0.569307), norm. avg. (of 3) = 0.454777 fft 7: mflops = 7.28178 (norm. = 0.532407), norm. avg. (of 3) = 0.540492 fft 8: mflops = 7.71012 (norm. = 0.563725), norm. avg. (of 3) = 0.589287 fft 9: mflops = 10.6998 (norm. = 0.782313), norm. avg. (of 3) = 0.775013 fft 10: mflops = 8.78695 (norm. = 0.642458), norm. avg. (of 3) = 0.61195 Benchmarking for array size = 32x32x32 (power of 2): 0. FFTW: elapsed time t=1.59 s, 8 iters, t-(init.)=1.54 s t(norm)=0.391642, mflops=12.7668 (err=4.1e-16) 1. HARM: elapsed time t=1.08 s, 4 iters, t-(init.)=1.06 s t(norm)=0.539144, mflops=9.27396 (err=4.6e-16) 2. HARM (f2c): elapsed time t=1.27 s, 4 iters, t-(init.)=1.24 s t(norm)=0.630697, mflops=7.92774 (err=4.3e-16) 3. NR (C): elapsed time t=1.57 s, 4 iters, t-(init.)=1.55 s t(norm)=0.788371, mflops=6.34219 (err=4.2e-16) 4. NR (F): elapsed time t=1.61 s, 4 iters, t-(init.)=1.58 s t(norm)=0.80363, mflops=6.22177 (err=4.2e-16) 5. PDA: elapsed time t=1.32 s, 4 iters, t-(init.)=1.29 s t(norm)=0.656128, mflops=7.62047 (err=3.5e-16) 6. PDA (f2c): elapsed time t=1.38 s, 4 iters, t-(init.)=1.35 s t(norm)=0.686646, mflops=7.28178 (err=3.6e-16) 7. Singleton: elapsed time t=1.63 s, 4 iters, t-(init.)=1.61 s t(norm)=0.818888, mflops=6.10584 (err=4.4e-16) 8. Singleton (f2c): elapsed time t=1.55 s, 4 iters, t-(init.)=1.53 s t(norm)=0.778198, mflops=6.4251 (err=4.2e-16) 9. Temperton: elapsed time t=1.1 s, 4 iters, t-(init.)=1.08 s t(norm)=0.549316, mflops=9.10222 (err=9.6e-08) 10. Temperton (f2c): elapsed time t=1.34 s, 4 iters, t-(init.)=1.32 s t(norm)=0.671387, mflops=7.44727 (err=4.0e-16) Top mflops for N=32768 = 12.7668 Normalized results and averages for N=32768: fft 0: mflops = 12.7668 (norm. = 1), norm. avg. (of 4) = 1 fft 1: mflops = 9.27396 (norm. = 0.726415), norm. avg. (of 3) = 0.747354 fft 2: mflops = 7.92774 (norm. = 0.620968), norm. avg. (of 3) = 0.635968 fft 3: mflops = 6.34219 (norm. = 0.496774), norm. avg. (of 4) = 0.533045 fft 4: mflops = 6.22177 (norm. = 0.487342), norm. avg. (of 4) = 0.498975 fft 5: mflops = 7.62047 (norm. = 0.596899), norm. avg. (of 4) = 0.525309 fft 6: mflops = 7.28178 (norm. = 0.57037), norm. avg. (of 4) = 0.483675 fft 7: mflops = 6.10584 (norm. = 0.478261), norm. avg. (of 4) = 0.524935 fft 8: mflops = 6.4251 (norm. = 0.503268), norm. avg. (of 4) = 0.567782 fft 9: mflops = 9.10222 (norm. = 0.712963), norm. avg. (of 4) = 0.759501 fft 10: mflops = 7.44727 (norm. = 0.583333), norm. avg. (of 4) = 0.604795 Benchmarking for array size = 64x64x64 (power of 2): 0. FFTW: elapsed time t=2.03 s, 1 iters, t-(init.)=1.97 s t(norm)=0.417497, mflops=11.9761 (err=4.3e-16) 1. HARM: elapsed time t=2.54 s, 1 iters, t-(init.)=2.48 s t(norm)=0.525581, mflops=9.51329 (err=4.9e-16) 2. HARM (f2c): elapsed time t=3.09 s, 1 iters, t-(init.)=3.04 s t(norm)=0.64426, mflops=7.76084 (err=4.3e-16) 3. NR (C): elapsed time t=4.26 s, 1 iters, t-(init.)=4.2 s t(norm)=0.890096, mflops=5.61737 (err=4.9e-16) 4. NR (F): elapsed time t=4.37 s, 1 iters, t-(init.)=4.31 s t(norm)=0.913408, mflops=5.474 (err=4.9e-16) 5. PDA: elapsed time t=3.18 s, 1 iters, t-(init.)=3.12 s t(norm)=0.661214, mflops=7.56185 (err=4.4e-16) 6. PDA (f2c): elapsed time t=3.27 s, 1 iters, t-(init.)=3.21 s t(norm)=0.680288, mflops=7.34983 (err=4.4e-16) 7. Singleton: elapsed time t=3.88 s, 1 iters, t-(init.)=3.82 s t(norm)=0.809564, mflops=6.17617 (err=5.0e-16) 8. Singleton (f2c): elapsed time t=3.77 s, 1 iters, t-(init.)=3.71 s t(norm)=0.786251, mflops=6.35929 (err=4.8e-16) 9. Temperton: elapsed time t=2.64 s, 1 iters, t-(init.)=2.59 s t(norm)=0.548893, mflops=9.10925 (err=1.4e-07) 10. Temperton (f2c): elapsed time t=3.2 s, 1 iters, t-(init.)=3.14 s t(norm)=0.665453, mflops=7.51368 (err=4.4e-16) Top mflops for N=262144 = 11.9761 Normalized results and averages for N=262144: fft 0: mflops = 11.9761 (norm. = 1), norm. avg. (of 5) = 1 fft 1: mflops = 9.51329 (norm. = 0.794355), norm. avg. (of 4) = 0.759104 fft 2: mflops = 7.76084 (norm. = 0.648026), norm. avg. (of 4) = 0.638982 fft 3: mflops = 5.61737 (norm. = 0.469048), norm. avg. (of 5) = 0.520245 fft 4: mflops = 5.474 (norm. = 0.457077), norm. avg. (of 5) = 0.490595 fft 5: mflops = 7.56185 (norm. = 0.63141), norm. avg. (of 5) = 0.546529 fft 6: mflops = 7.34983 (norm. = 0.613707), norm. avg. (of 5) = 0.509682 fft 7: mflops = 6.17617 (norm. = 0.515707), norm. avg. (of 5) = 0.523089 fft 8: mflops = 6.35929 (norm. = 0.530997), norm. avg. (of 5) = 0.560425 fft 9: mflops = 9.10925 (norm. = 0.760618), norm. avg. (of 5) = 0.759724 fft 10: mflops = 7.51368 (norm. = 0.627389), norm. avg. (of 5) = 0.609314 Benchmarking for array size = 256x64x32 (power of 2): 0. FFTW: elapsed time t=4.81 s, 1 iters, t-(init.)=4.69 s t(norm)=0.470814, mflops=10.6199 (err=5.8e-16) 1. HARM: elapsed time t=5.51 s, 1 iters, t-(init.)=5.4 s t(norm)=0.542089, mflops=9.22359 (err=6.1e-16) 2. HARM (f2c): elapsed time t=6.63 s, 1 iters, t-(init.)=6.51 s t(norm)=0.653518, mflops=7.6509 (err=5.9e-16) 3. NR (C): elapsed time t=9.05 s, 1 iters, t-(init.)=8.94 s t(norm)=0.897458, mflops=5.57129 (err=5.7e-16) 4. NR (F): elapsed time t=9.28 s, 1 iters, t-(init.)=9.17 s t(norm)=0.920547, mflops=5.43156 (err=5.7e-16) 5. PDA: elapsed time t=6.85 s, 1 iters, t-(init.)=6.74 s t(norm)=0.676607, mflops=7.38982 (err=4.9e-16) 6. PDA (f2c): elapsed time t=7 s, 1 iters, t-(init.)=6.89 s t(norm)=0.691665, mflops=7.22893 (err=5.1e-16) 7. Singleton: elapsed time t=8.53 s, 1 iters, t-(init.)=8.42 s t(norm)=0.845257, mflops=5.91536 (err=7.2e-16) 8. Singleton (f2c): elapsed time t=8.31 s, 1 iters, t-(init.)=8.2 s t(norm)=0.823172, mflops=6.07407 (err=7.1e-16) 9. Temperton: elapsed time t=5.75 s, 1 iters, t-(init.)=5.64 s t(norm)=0.566181, mflops=8.83109 (err=1.5e-07) 10. Temperton (f2c): elapsed time t=6.93 s, 1 iters, t-(init.)=6.82 s t(norm)=0.684638, mflops=7.30313 (err=5.5e-16) Top mflops for N=524288 = 10.6199 Normalized results and averages for N=524288: fft 0: mflops = 10.6199 (norm. = 1), norm. avg. (of 6) = 1 fft 1: mflops = 9.22359 (norm. = 0.868519), norm. avg. (of 5) = 0.780987 fft 2: mflops = 7.6509 (norm. = 0.72043), norm. avg. (of 5) = 0.655272 fft 3: mflops = 5.57129 (norm. = 0.524609), norm. avg. (of 6) = 0.520973 fft 4: mflops = 5.43156 (norm. = 0.51145), norm. avg. (of 6) = 0.494071 fft 5: mflops = 7.38982 (norm. = 0.695846), norm. avg. (of 6) = 0.571415 fft 6: mflops = 7.22893 (norm. = 0.680697), norm. avg. (of 6) = 0.538184 fft 7: mflops = 5.91536 (norm. = 0.557007), norm. avg. (of 6) = 0.528742 fft 8: mflops = 6.07407 (norm. = 0.571951), norm. avg. (of 6) = 0.562346 fft 9: mflops = 8.83109 (norm. = 0.83156), norm. avg. (of 6) = 0.771697 fft 10: mflops = 7.30313 (norm. = 0.687683), norm. avg. (of 6) = 0.622376 Benchmarking for array size = 16x1024x64 (power of 2): 0. FFTW: elapsed time t=10.29 s, 1 iters, t-(init.)=10.07 s t(norm)=0.480175, mflops=10.4129 (err=5.7e-16) 1. HARM: elapsed time t=11.56 s, 1 iters, t-(init.)=11.33 s t(norm)=0.540257, mflops=9.25486 (err=5.8e-16) 2. HARM (f2c): elapsed time t=13.99 s, 1 iters, t-(init.)=13.77 s t(norm)=0.656605, mflops=7.61493 (err=5.5e-16) 3. NR (C): elapsed time t=19.16 s, 1 iters, t-(init.)=18.93 s t(norm)=0.902653, mflops=5.53923 (err=7.3e-16) 4. NR (F): elapsed time t=19.66 s, 1 iters, t-(init.)=19.44 s t(norm)=0.926971, mflops=5.39391 (err=7.3e-16) 5. PDA: elapsed time t=14.86 s, 1 iters, t-(init.)=14.63 s t(norm)=0.697613, mflops=7.1673 (err=6.0e-16) 6. PDA (f2c): elapsed time t=15.23 s, 1 iters, t-(init.)=15.01 s t(norm)=0.715733, mflops=6.98585 (err=6.0e-16) 7. Singleton: elapsed time t=17.65 s, 1 iters, t-(init.)=17.43 s t(norm)=0.831127, mflops=6.01593 (err=7.1e-16) 8. Singleton (f2c): elapsed time t=17.14 s, 1 iters, t-(init.)=16.92 s t(norm)=0.806808, mflops=6.19726 (err=6.9e-16) 9. Skipping fft (Temperton can't handle dimensions > 256). 10. Skipping fft (Temperton can't handle dimensions > 256). Top mflops for N=1048576 = 10.4129 Normalized results and averages for N=1048576: fft 0: mflops = 10.4129 (norm. = 1), norm. avg. (of 7) = 1 fft 1: mflops = 9.25486 (norm. = 0.888791), norm. avg. (of 6) = 0.798954 fft 2: mflops = 7.61493 (norm. = 0.7313), norm. avg. (of 6) = 0.667943 fft 3: mflops = 5.53923 (norm. = 0.53196), norm. avg. (of 7) = 0.522542 fft 4: mflops = 5.39391 (norm. = 0.518004), norm. avg. (of 7) = 0.49749 fft 5: mflops = 7.1673 (norm. = 0.688312), norm. avg. (of 7) = 0.588115 fft 6: mflops = 6.98585 (norm. = 0.670886), norm. avg. (of 7) = 0.557142 fft 7: mflops = 6.01593 (norm. = 0.57774), norm. avg. (of 7) = 0.535742 fft 8: mflops = 6.19726 (norm. = 0.595154), norm. avg. (of 7) = 0.567033 fft 9: mflops = -1 (norm. = -0.096035), norm. avg. (of 6) = 0.771697 fft 10: mflops = -1 (norm. = -0.096035), norm. avg. (of 6) = 0.622376 Benchmarking for array size = 128x128x128 (power of 2): 0. FFTW: elapsed time t=23.8 s, 1 iters, t-(init.)=23.35 s t(norm)=0.530198, mflops=9.43045 (err=6.6e-16) 1. HARM: elapsed time t=25.39 s, 1 iters, t-(init.)=24.94 s t(norm)=0.566301, mflops=8.82923 (err=6.3e-16) 2. HARM (f2c): elapsed time t=30.01 s, 1 iters, t-(init.)=29.57 s t(norm)=0.671432, mflops=7.44677 (err=6.1e-16) 3. NR (C): elapsed time t=43.56 s, 1 iters, t-(init.)=43.12 s t(norm)=0.979106, mflops=5.1067 (err=7.5e-16) 4. NR (F): elapsed time t=44.59 s, 1 iters, t-(init.)=44.14 s t(norm)=1.00227, mflops=4.98869 (err=7.5e-16) 5. PDA: elapsed time t=30.73 s, 1 iters, t-(init.)=30.29 s t(norm)=0.687781, mflops=7.26976 (err=6.7e-16) 6. PDA (f2c): elapsed time t=30.85 s, 1 iters, t-(init.)=30.41 s t(norm)=0.690506, mflops=7.24107 (err=6.7e-16) 7. Singleton: elapsed time t=46.13 s, 1 iters, t-(init.)=45.68 s t(norm)=1.03723, mflops=4.82051 (err=8.0e-16) 8. Singleton (f2c): elapsed time t=44.21 s, 1 iters, t-(init.)=43.77 s t(norm)=0.993865, mflops=5.03086 (err=7.9e-16) 9. Temperton: elapsed time t=28.81 s, 1 iters, t-(init.)=28.37 s t(norm)=0.644184, mflops=7.76175 (err=1.5e-07) 10. Temperton (f2c): elapsed time t=34.29 s, 1 iters, t-(init.)=33.84 s t(norm)=0.768389, mflops=6.50712 (err=7.1e-16) Top mflops for N=2097152 = 9.43045 Normalized results and averages for N=2097152: fft 0: mflops = 9.43045 (norm. = 1), norm. avg. (of 8) = 1 fft 1: mflops = 8.82923 (norm. = 0.936247), norm. avg. (of 7) = 0.818568 fft 2: mflops = 7.44677 (norm. = 0.789652), norm. avg. (of 7) = 0.68533 fft 3: mflops = 5.1067 (norm. = 0.541512), norm. avg. (of 8) = 0.524913 fft 4: mflops = 4.98869 (norm. = 0.528999), norm. avg. (of 8) = 0.501429 fft 5: mflops = 7.26976 (norm. = 0.770881), norm. avg. (of 8) = 0.610961 fft 6: mflops = 7.24107 (norm. = 0.76784), norm. avg. (of 8) = 0.583479 fft 7: mflops = 4.82051 (norm. = 0.511165), norm. avg. (of 8) = 0.53267 fft 8: mflops = 5.03086 (norm. = 0.53347), norm. avg. (of 8) = 0.562838 fft 9: mflops = 7.76175 (norm. = 0.823053), norm. avg. (of 7) = 0.779033 fft 10: mflops = 6.50712 (norm. = 0.690012), norm. avg. (of 7) = 0.632038 ------------------------------------------------------ @@@@ bench.3d.np2.log Benchmarking for sizes: 5x5x5 (0.0022583 MB) 6x6x6 (0.00369263 MB) 7x7x7 (0.00567627 MB) 9x9x9 (0.0116577 MB) 10x10x10 (0.0158386 MB) 11x11x11 (0.0209351 MB) 12x12x12 (0.0270386 MB) 13x13x13 (0.0342407 MB) 14x14x14 (0.0426331 MB) 15x15x15 (0.0523071 MB) 24x25x28 (0.257751 MB) 48x48x48 (1.68982 MB) 49x49x49 (1.79755 MB) 60x60x60 (3.29877 MB) 72x60x56 (3.69482 MB) 75x75x75 (6.44086 MB) 80x80x80 (7.81628 MB) 84x84x84 (9.04791 MB) 96x96x96 (13.5045 MB) 105x105x105 (17.6689 MB) 112x112x112 (21.4427 MB) 120x120x120 (26.3728 MB) 144x144x144 (45.5692 MB) Maximum array size N = 2985984 Benchmarking FFTs: 0. FFTW 1. PDA 2. PDA (f2c) 3. Singleton 4. Singleton (f2c) 5. Temperton 6. Temperton (f2c) Computing normalized averages (7 transforms). Benchmarking for array size = 5x5x5: 0. FFTW: elapsed time t=1.68 s, 4096 iters, t-(init.)=1.65 s t(norm)=0.462641, mflops=10.8075 (err=2.9e-16) 1. PDA: elapsed time t=1.4 s, 2048 iters, t-(init.)=1.38 s t(norm)=0.773872, mflops=6.46102 (err=2.5e-16) 2. PDA (f2c): elapsed time t=1.7 s, 2048 iters, t-(init.)=1.69 s t(norm)=0.947713, mflops=5.27586 (err=2.4e-16) 3. Singleton: elapsed time t=1.5 s, 4096 iters, t-(init.)=1.47 s t(norm)=0.412171, mflops=12.1309 (err=3.2e-16) 4. Singleton (f2c): elapsed time t=1.52 s, 4096 iters, t-(init.)=1.5 s t(norm)=0.420583, mflops=11.8883 (err=2.6e-16) 5. Temperton: elapsed time t=1.94 s, 4096 iters, t-(init.)=1.92 s t(norm)=0.538346, mflops=9.28771 (err=3.8e-08) 6. Temperton (f2c): elapsed time t=1.14 s, 2048 iters, t-(init.)=1.13 s t(norm)=0.633678, mflops=7.89045 (err=1.7e-16) Top mflops for N=125 = 12.1309 Normalized results and averages for N=125: fft 0: mflops = 10.8075 (norm. = 0.890909), norm. avg. (of 1) = 0.890909 fft 1: mflops = 6.46102 (norm. = 0.532609), norm. avg. (of 1) = 0.532609 fft 2: mflops = 5.27586 (norm. = 0.434911), norm. avg. (of 1) = 0.434911 fft 3: mflops = 12.1309 (norm. = 1), norm. avg. (of 1) = 1 fft 4: mflops = 11.8883 (norm. = 0.98), norm. avg. (of 1) = 0.98 fft 5: mflops = 9.28771 (norm. = 0.765625), norm. avg. (of 1) = 0.765625 fft 6: mflops = 7.89045 (norm. = 0.650442), norm. avg. (of 1) = 0.650442 Benchmarking for array size = 6x6x6: 0. FFTW: elapsed time t=1.29 s, 2048 iters, t-(init.)=1.27 s t(norm)=0.370207, mflops=13.506 (err=2.3e-16) 1. PDA: elapsed time t=1.46 s, 1024 iters, t-(init.)=1.45 s t(norm)=0.845354, mflops=5.91468 (err=3.7e-16) 2. PDA (f2c): elapsed time t=1.54 s, 1024 iters, t-(init.)=1.53 s t(norm)=0.891995, mflops=5.60542 (err=3.8e-16) 3. Singleton: elapsed time t=1.12 s, 1024 iters, t-(init.)=1.11 s t(norm)=0.647133, mflops=7.72638 (err=3.0e-16) 4. Singleton (f2c): elapsed time t=1.94 s, 2048 iters, t-(init.)=1.92 s t(norm)=0.559683, mflops=8.93363 (err=3.0e-16) 5. Temperton: elapsed time t=1.78 s, 2048 iters, t-(init.)=1.76 s t(norm)=0.513043, mflops=9.74578 (err=1.7e-08) 6. Temperton (f2c): elapsed time t=1.07 s, 1024 iters, t-(init.)=1.06 s t(norm)=0.617983, mflops=8.09084 (err=2.9e-16) Top mflops for N=216 = 13.506 Normalized results and averages for N=216: fft 0: mflops = 13.506 (norm. = 1), norm. avg. (of 2) = 0.945455 fft 1: mflops = 5.91468 (norm. = 0.437931), norm. avg. (of 2) = 0.48527 fft 2: mflops = 5.60542 (norm. = 0.415033), norm. avg. (of 2) = 0.424972 fft 3: mflops = 7.72638 (norm. = 0.572072), norm. avg. (of 2) = 0.786036 fft 4: mflops = 8.93363 (norm. = 0.661458), norm. avg. (of 2) = 0.820729 fft 5: mflops = 9.74578 (norm. = 0.721591), norm. avg. (of 2) = 0.743608 fft 6: mflops = 8.09084 (norm. = 0.599057), norm. avg. (of 2) = 0.62475 Benchmarking for array size = 7x7x7: 0. FFTW: elapsed time t=1.35 s, 1024 iters, t-(init.)=1.34 s t(norm)=0.452994, mflops=11.0377 (err=2.1e-16) 1. PDA: elapsed time t=1.92 s, 512 iters, t-(init.)=1.91 s t(norm)=1.29137, mflops=3.87186 (err=3.7e-16) 2. PDA (f2c): elapsed time t=1.08 s, 256 iters, t-(init.)=1.08 s t(norm)=1.4604, mflops=3.42373 (err=3.4e-16) 3. Singleton: elapsed time t=1.57 s, 1024 iters, t-(init.)=1.56 s t(norm)=0.527366, mflops=9.48109 (err=4.2e-16) 4. Singleton (f2c): elapsed time t=1.4 s, 1024 iters, t-(init.)=1.38 s t(norm)=0.466516, mflops=10.7177 (err=4.2e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=343 = 11.0377 Normalized results and averages for N=343: fft 0: mflops = 11.0377 (norm. = 1), norm. avg. (of 3) = 0.963636 fft 1: mflops = 3.87186 (norm. = 0.350785), norm. avg. (of 3) = 0.440442 fft 2: mflops = 3.42373 (norm. = 0.310185), norm. avg. (of 3) = 0.38671 fft 3: mflops = 9.48109 (norm. = 0.858974), norm. avg. (of 3) = 0.810349 fft 4: mflops = 10.7177 (norm. = 0.971014), norm. avg. (of 3) = 0.870824 fft 5: mflops = -1 (norm. = -0.0905987), norm. avg. (of 2) = 0.743608 fft 6: mflops = -1 (norm. = -0.0905987), norm. avg. (of 2) = 0.62475 Benchmarking for array size = 9x9x9: 0. FFTW: elapsed time t=1.66 s, 512 iters, t-(init.)=1.65 s t(norm)=0.464854, mflops=10.7561 (err=4.1e-16) 1. PDA: elapsed time t=1.25 s, 256 iters, t-(init.)=1.25 s t(norm)=0.704324, mflops=7.09901 (err=3.2e-16) 2. PDA (f2c): elapsed time t=1.34 s, 256 iters, t-(init.)=1.33 s t(norm)=0.7494, mflops=6.672 (err=3.6e-16) 3. Singleton: elapsed time t=1.1 s, 256 iters, t-(init.)=1.09 s t(norm)=0.61417, mflops=8.14107 (err=3.6e-16) 4. Singleton (f2c): elapsed time t=1.95 s, 512 iters, t-(init.)=1.94 s t(norm)=0.546555, mflops=9.14821 (err=3.6e-16) 5. Temperton: elapsed time t=1.06 s, 256 iters, t-(init.)=1.05 s t(norm)=0.591632, mflops=8.4512 (err=5.3e-08) 6. Temperton (f2c): elapsed time t=1.25 s, 256 iters, t-(init.)=1.24 s t(norm)=0.698689, mflops=7.15626 (err=3.5e-16) Top mflops for N=729 = 10.7561 Normalized results and averages for N=729: fft 0: mflops = 10.7561 (norm. = 1), norm. avg. (of 4) = 0.972727 fft 1: mflops = 7.09901 (norm. = 0.66), norm. avg. (of 4) = 0.495331 fft 2: mflops = 6.672 (norm. = 0.620301), norm. avg. (of 4) = 0.445107 fft 3: mflops = 8.14107 (norm. = 0.756881), norm. avg. (of 4) = 0.796982 fft 4: mflops = 9.14821 (norm. = 0.850515), norm. avg. (of 4) = 0.865747 fft 5: mflops = 8.4512 (norm. = 0.785714), norm. avg. (of 3) = 0.757643 fft 6: mflops = 7.15626 (norm. = 0.665323), norm. avg. (of 3) = 0.638274 Benchmarking for array size = 10x10x10: 0. FFTW: elapsed time t=1.05 s, 256 iters, t-(init.)=1.04 s t(norm)=0.407645, mflops=12.2656 (err=2.6e-16) 1. PDA: elapsed time t=1.75 s, 256 iters, t-(init.)=1.74 s t(norm)=0.682021, mflops=7.33115 (err=2.8e-16) 2. PDA (f2c): elapsed time t=1 s, 128 iters, t-(init.)=0.99 s t(norm)=0.776093, mflops=6.44253 (err=3.1e-16) 3. Singleton: elapsed time t=1.6 s, 256 iters, t-(init.)=1.59 s t(norm)=0.623226, mflops=8.02277 (err=3.8e-16) 4. Singleton (f2c): elapsed time t=1.48 s, 256 iters, t-(init.)=1.47 s t(norm)=0.57619, mflops=8.67769 (err=3.7e-16) 5. Temperton: elapsed time t=1.3 s, 256 iters, t-(init.)=1.29 s t(norm)=0.505636, mflops=9.88853 (err=2.5e-08) 6. Temperton (f2c): elapsed time t=1.59 s, 256 iters, t-(init.)=1.58 s t(norm)=0.619307, mflops=8.07355 (err=2.8e-16) Top mflops for N=1000 = 12.2656 Normalized results and averages for N=1000: fft 0: mflops = 12.2656 (norm. = 1), norm. avg. (of 5) = 0.978182 fft 1: mflops = 7.33115 (norm. = 0.597701), norm. avg. (of 5) = 0.515805 fft 2: mflops = 6.44253 (norm. = 0.525253), norm. avg. (of 5) = 0.461136 fft 3: mflops = 8.02277 (norm. = 0.654088), norm. avg. (of 5) = 0.768403 fft 4: mflops = 8.67769 (norm. = 0.707483), norm. avg. (of 5) = 0.834094 fft 5: mflops = 9.88853 (norm. = 0.806202), norm. avg. (of 4) = 0.769783 fft 6: mflops = 8.07355 (norm. = 0.658228), norm. avg. (of 4) = 0.643262 Benchmarking for array size = 11x11x11: 0. FFTW: elapsed time t=1.92 s, 256 iters, t-(init.)=1.89 s t(norm)=0.534463, mflops=9.35518 (err=2.4e-16) 1. PDA: elapsed time t=1.18 s, 64 iters, t-(init.)=1.17 s t(norm)=1.32343, mflops=3.77805 (err=4.2e-16) 2. PDA (f2c): elapsed time t=1.28 s, 64 iters, t-(init.)=1.27 s t(norm)=1.43655, mflops=3.48057 (err=4.3e-16) 3. Singleton: elapsed time t=1.06 s, 128 iters, t-(init.)=1.04 s t(norm)=0.588192, mflops=8.50062 (err=3.8e-16) 4. Singleton (f2c): elapsed time t=1.95 s, 256 iters, t-(init.)=1.93 s t(norm)=0.545775, mflops=9.16129 (err=3.8e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1331 = 9.35518 Normalized results and averages for N=1331: fft 0: mflops = 9.35518 (norm. = 1), norm. avg. (of 6) = 0.981818 fft 1: mflops = 3.77805 (norm. = 0.403846), norm. avg. (of 6) = 0.497145 fft 2: mflops = 3.48057 (norm. = 0.372047), norm. avg. (of 6) = 0.446288 fft 3: mflops = 8.50062 (norm. = 0.908654), norm. avg. (of 6) = 0.791778 fft 4: mflops = 9.16129 (norm. = 0.979275), norm. avg. (of 6) = 0.858291 fft 5: mflops = -1 (norm. = -0.106893), norm. avg. (of 4) = 0.769783 fft 6: mflops = -1 (norm. = -0.106893), norm. avg. (of 4) = 0.643262 Benchmarking for array size = 12x12x12: 0. FFTW: elapsed time t=1.73 s, 256 iters, t-(init.)=1.69 s t(norm)=0.35522, mflops=14.0758 (err=3.3e-16) 1. PDA: elapsed time t=1.59 s, 128 iters, t-(init.)=1.57 s t(norm)=0.659994, mflops=7.57582 (err=3.1e-16) 2. PDA (f2c): elapsed time t=1.68 s, 128 iters, t-(init.)=1.66 s t(norm)=0.697828, mflops=7.16509 (err=3.5e-16) 3. Singleton: elapsed time t=1.61 s, 128 iters, t-(init.)=1.59 s t(norm)=0.668402, mflops=7.48053 (err=3.7e-16) 4. Singleton (f2c): elapsed time t=1.45 s, 128 iters, t-(init.)=1.43 s t(norm)=0.601141, mflops=8.31751 (err=3.7e-16) 5. Temperton: elapsed time t=1.11 s, 128 iters, t-(init.)=1.09 s t(norm)=0.458212, mflops=10.912 (err=1.6e-08) 6. Temperton (f2c): elapsed time t=1.31 s, 128 iters, t-(init.)=1.29 s t(norm)=0.542288, mflops=9.22019 (err=3.9e-16) Top mflops for N=1728 = 14.0758 Normalized results and averages for N=1728: fft 0: mflops = 14.0758 (norm. = 1), norm. avg. (of 7) = 0.984416 fft 1: mflops = 7.57582 (norm. = 0.538217), norm. avg. (of 7) = 0.503013 fft 2: mflops = 7.16509 (norm. = 0.509036), norm. avg. (of 7) = 0.455252 fft 3: mflops = 7.48053 (norm. = 0.531447), norm. avg. (of 7) = 0.754588 fft 4: mflops = 8.31751 (norm. = 0.590909), norm. avg. (of 7) = 0.820094 fft 5: mflops = 10.912 (norm. = 0.775229), norm. avg. (of 5) = 0.770872 fft 6: mflops = 9.22019 (norm. = 0.655039), norm. avg. (of 5) = 0.645618 Benchmarking for array size = 13x13x13: 0. FFTW: elapsed time t=1.81 s, 128 iters, t-(init.)=1.78 s t(norm)=0.570171, mflops=8.76929 (err=2.3e-16) 1. PDA: elapsed time t=1.13 s, 32 iters, t-(init.)=1.12 s t(norm)=1.43504, mflops=3.48423 (err=8.7e-16) 2. PDA (f2c): elapsed time t=1.17 s, 32 iters, t-(init.)=1.16 s t(norm)=1.48629, mflops=3.36408 (err=8.4e-16) 3. Singleton: elapsed time t=1 s, 64 iters, t-(init.)=0.99 s t(norm)=0.634236, mflops=7.88351 (err=5.3e-16) 4. Singleton (f2c): elapsed time t=1.82 s, 128 iters, t-(init.)=1.79 s t(norm)=0.573375, mflops=8.7203 (err=5.3e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=2197 = 8.76929 Normalized results and averages for N=2197: fft 0: mflops = 8.76929 (norm. = 1), norm. avg. (of 8) = 0.986364 fft 1: mflops = 3.48423 (norm. = 0.397321), norm. avg. (of 8) = 0.489801 fft 2: mflops = 3.36408 (norm. = 0.383621), norm. avg. (of 8) = 0.446298 fft 3: mflops = 7.88351 (norm. = 0.89899), norm. avg. (of 8) = 0.772638 fft 4: mflops = 8.7203 (norm. = 0.994413), norm. avg. (of 8) = 0.841884 fft 5: mflops = -1 (norm. = -0.114034), norm. avg. (of 5) = 0.770872 fft 6: mflops = -1 (norm. = -0.114034), norm. avg. (of 5) = 0.645618 Benchmarking for array size = 14x14x14: 0. FFTW: elapsed time t=1.79 s, 128 iters, t-(init.)=1.75 s t(norm)=0.436214, mflops=11.4623 (err=2.6e-16) 1. PDA: elapsed time t=1.89 s, 64 iters, t-(init.)=1.87 s t(norm)=0.932251, mflops=5.36336 (err=3.7e-16) 2. PDA (f2c): elapsed time t=1 s, 32 iters, t-(init.)=0.99 s t(norm)=0.987089, mflops=5.0654 (err=3.6e-16) 3. Singleton: elapsed time t=1.42 s, 64 iters, t-(init.)=1.4 s t(norm)=0.697942, mflops=7.16392 (err=4.1e-16) 4. Singleton (f2c): elapsed time t=1.24 s, 64 iters, t-(init.)=1.22 s t(norm)=0.608207, mflops=8.22089 (err=4.1e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=2744 = 11.4623 Normalized results and averages for N=2744: fft 0: mflops = 11.4623 (norm. = 1), norm. avg. (of 9) = 0.987879 fft 1: mflops = 5.36336 (norm. = 0.467914), norm. avg. (of 9) = 0.487369 fft 2: mflops = 5.0654 (norm. = 0.441919), norm. avg. (of 9) = 0.445812 fft 3: mflops = 7.16392 (norm. = 0.625), norm. avg. (of 9) = 0.756234 fft 4: mflops = 8.22089 (norm. = 0.717213), norm. avg. (of 9) = 0.828031 fft 5: mflops = -1 (norm. = -0.0872428), norm. avg. (of 5) = 0.770872 fft 6: mflops = -1 (norm. = -0.0872428), norm. avg. (of 5) = 0.645618 Benchmarking for array size = 15x15x15: 0. FFTW: elapsed time t=1.1 s, 64 iters, t-(init.)=1.07 s t(norm)=0.422647, mflops=11.8302 (err=3.7e-16) 1. PDA: elapsed time t=1.56 s, 64 iters, t-(init.)=1.53 s t(norm)=0.604345, mflops=8.27342 (err=3.5e-16) 2. PDA (f2c): elapsed time t=1.79 s, 64 iters, t-(init.)=1.76 s t(norm)=0.695195, mflops=7.19223 (err=3.4e-16) 3. Singleton: elapsed time t=1.77 s, 64 iters, t-(init.)=1.75 s t(norm)=0.691245, mflops=7.23333 (err=4.2e-16) 4. Singleton (f2c): elapsed time t=1.64 s, 64 iters, t-(init.)=1.62 s t(norm)=0.639895, mflops=7.81378 (err=4.1e-16) 5. Temperton: elapsed time t=1.37 s, 64 iters, t-(init.)=1.35 s t(norm)=0.533246, mflops=9.37654 (err=1.8e-08) 6. Temperton (f2c): elapsed time t=1.6 s, 64 iters, t-(init.)=1.58 s t(norm)=0.624095, mflops=8.0116 (err=3.8e-16) Top mflops for N=3375 = 11.8302 Normalized results and averages for N=3375: fft 0: mflops = 11.8302 (norm. = 1), norm. avg. (of 10) = 0.989091 fft 1: mflops = 8.27342 (norm. = 0.699346), norm. avg. (of 10) = 0.508567 fft 2: mflops = 7.19223 (norm. = 0.607955), norm. avg. (of 10) = 0.462026 fft 3: mflops = 7.23333 (norm. = 0.611429), norm. avg. (of 10) = 0.741753 fft 4: mflops = 7.81378 (norm. = 0.660494), norm. avg. (of 10) = 0.811278 fft 5: mflops = 9.37654 (norm. = 0.792593), norm. avg. (of 6) = 0.774492 fft 6: mflops = 8.0116 (norm. = 0.677215), norm. avg. (of 6) = 0.650884 Benchmarking for array size = 24x25x28: 0. FFTW: elapsed time t=1.06 s, 8 iters, t-(init.)=1.04 s t(norm)=0.551297, mflops=9.06953 (err=3.3e-16) 1. PDA: elapsed time t=1.34 s, 8 iters, t-(init.)=1.32 s t(norm)=0.699723, mflops=7.14569 (err=3.8e-16) 2. PDA (f2c): elapsed time t=1.45 s, 8 iters, t-(init.)=1.43 s t(norm)=0.758033, mflops=6.59602 (err=3.8e-16) 3. Singleton: elapsed time t=1.4 s, 8 iters, t-(init.)=1.38 s t(norm)=0.731528, mflops=6.83501 (err=4.5e-16) 4. Singleton (f2c): elapsed time t=1.28 s, 8 iters, t-(init.)=1.26 s t(norm)=0.667917, mflops=7.48596 (err=4.4e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=16800 = 9.06953 Normalized results and averages for N=16800: fft 0: mflops = 9.06953 (norm. = 1), norm. avg. (of 11) = 0.990083 fft 1: mflops = 7.14569 (norm. = 0.787879), norm. avg. (of 11) = 0.533959 fft 2: mflops = 6.59602 (norm. = 0.727273), norm. avg. (of 11) = 0.486139 fft 3: mflops = 6.83501 (norm. = 0.753623), norm. avg. (of 11) = 0.742832 fft 4: mflops = 7.48596 (norm. = 0.825397), norm. avg. (of 11) = 0.812561 fft 5: mflops = -1 (norm. = -0.110259), norm. avg. (of 6) = 0.774492 fft 6: mflops = -1 (norm. = -0.110259), norm. avg. (of 6) = 0.650884 Benchmarking for array size = 48x48x48: 0. FFTW: elapsed time t=1.86 s, 2 iters, t-(init.)=1.82 s t(norm)=0.491107, mflops=10.1811 (err=4.0e-16) 1. PDA: elapsed time t=1.29 s, 1 iters, t-(init.)=1.27 s t(norm)=0.685391, mflops=7.2951 (err=4.0e-16) 2. PDA (f2c): elapsed time t=1.31 s, 1 iters, t-(init.)=1.28 s t(norm)=0.690788, mflops=7.23811 (err=4.2e-16) 3. Singleton: elapsed time t=1.61 s, 1 iters, t-(init.)=1.59 s t(norm)=0.858088, mflops=5.82691 (err=3.7e-16) 4. Singleton (f2c): elapsed time t=1.55 s, 1 iters, t-(init.)=1.53 s t(norm)=0.825707, mflops=6.05541 (err=3.6e-16) 5. Temperton: elapsed time t=1.03 s, 1 iters, t-(init.)=1 s t(norm)=0.539678, mflops=9.26478 (err=1.1e-07) 6. Temperton (f2c): elapsed time t=1.2 s, 1 iters, t-(init.)=1.18 s t(norm)=0.63682, mflops=7.85151 (err=4.8e-16) Top mflops for N=110592 = 10.1811 Normalized results and averages for N=110592: fft 0: mflops = 10.1811 (norm. = 1), norm. avg. (of 12) = 0.990909 fft 1: mflops = 7.2951 (norm. = 0.716535), norm. avg. (of 12) = 0.549174 fft 2: mflops = 7.23811 (norm. = 0.710938), norm. avg. (of 12) = 0.504873 fft 3: mflops = 5.82691 (norm. = 0.572327), norm. avg. (of 12) = 0.728624 fft 4: mflops = 6.05541 (norm. = 0.594771), norm. avg. (of 12) = 0.794412 fft 5: mflops = 9.26478 (norm. = 0.91), norm. avg. (of 7) = 0.793851 fft 6: mflops = 7.85151 (norm. = 0.771186), norm. avg. (of 7) = 0.66807 Benchmarking for array size = 49x49x49: 0. FFTW: elapsed time t=1.15 s, 1 iters, t-(init.)=1.13 s t(norm)=0.570219, mflops=8.76856 (err=4.3e-16) 1. PDA: elapsed time t=1.62 s, 1 iters, t-(init.)=1.59 s t(norm)=0.802343, mflops=6.23175 (err=5.7e-16) 2. PDA (f2c): elapsed time t=1.84 s, 1 iters, t-(init.)=1.82 s t(norm)=0.918406, mflops=5.44422 (err=5.7e-16) 3. Singleton: elapsed time t=1.54 s, 1 iters, t-(init.)=1.52 s t(norm)=0.76702, mflops=6.51873 (err=6.8e-16) 4. Singleton (f2c): elapsed time t=1.41 s, 1 iters, t-(init.)=1.39 s t(norm)=0.70142, mflops=7.1284 (err=6.8e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=117649 = 8.76856 Normalized results and averages for N=117649: fft 0: mflops = 8.76856 (norm. = 1), norm. avg. (of 13) = 0.991608 fft 1: mflops = 6.23175 (norm. = 0.710692), norm. avg. (of 13) = 0.561598 fft 2: mflops = 5.44422 (norm. = 0.620879), norm. avg. (of 13) = 0.513796 fft 3: mflops = 6.51873 (norm. = 0.743421), norm. avg. (of 13) = 0.729762 fft 4: mflops = 7.1284 (norm. = 0.81295), norm. avg. (of 13) = 0.795838 fft 5: mflops = -1 (norm. = -0.114044), norm. avg. (of 7) = 0.793851 fft 6: mflops = -1 (norm. = -0.114044), norm. avg. (of 7) = 0.66807 Benchmarking for array size = 60x60x60: 0. FFTW: elapsed time t=2.05 s, 1 iters, t-(init.)=2 s t(norm)=0.522512, mflops=9.56916 (err=4.6e-16) 1. PDA: elapsed time t=2.65 s, 1 iters, t-(init.)=2.61 s t(norm)=0.681878, mflops=7.33269 (err=4.4e-16) 2. PDA (f2c): elapsed time t=2.85 s, 1 iters, t-(init.)=2.81 s t(norm)=0.734129, mflops=6.81079 (err=4.4e-16) 3. Singleton: elapsed time t=3.58 s, 1 iters, t-(init.)=3.53 s t(norm)=0.922233, mflops=5.42162 (err=5.4e-16) 4. Singleton (f2c): elapsed time t=3.4 s, 1 iters, t-(init.)=3.36 s t(norm)=0.87782, mflops=5.69593 (err=5.4e-16) 5. Temperton: elapsed time t=2.09 s, 1 iters, t-(init.)=2.05 s t(norm)=0.535575, mflops=9.33577 (err=1.9e-08) 6. Temperton (f2c): elapsed time t=2.43 s, 1 iters, t-(init.)=2.39 s t(norm)=0.624402, mflops=8.00767 (err=5.4e-16) Top mflops for N=216000 = 9.56916 Normalized results and averages for N=216000: fft 0: mflops = 9.56916 (norm. = 1), norm. avg. (of 14) = 0.992208 fft 1: mflops = 7.33269 (norm. = 0.766284), norm. avg. (of 14) = 0.576219 fft 2: mflops = 6.81079 (norm. = 0.711744), norm. avg. (of 14) = 0.527935 fft 3: mflops = 5.42162 (norm. = 0.566572), norm. avg. (of 14) = 0.718106 fft 4: mflops = 5.69593 (norm. = 0.595238), norm. avg. (of 14) = 0.781509 fft 5: mflops = 9.33577 (norm. = 0.97561), norm. avg. (of 8) = 0.81657 fft 6: mflops = 8.00767 (norm. = 0.83682), norm. avg. (of 8) = 0.689164 Benchmarking for array size = 72x60x56: 0. FFTW: elapsed time t=2.28 s, 1 iters, t-(init.)=2.23 s t(norm)=0.515424, mflops=9.70076 (err=3.8e-16) 1. PDA: elapsed time t=3.13 s, 1 iters, t-(init.)=3.08 s t(norm)=0.711885, mflops=7.0236 (err=4.7e-16) 2. PDA (f2c): elapsed time t=3.31 s, 1 iters, t-(init.)=3.25 s t(norm)=0.751178, mflops=6.65621 (err=4.7e-16) 3. Singleton: elapsed time t=4.32 s, 1 iters, t-(init.)=4.27 s t(norm)=0.986932, mflops=5.0662 (err=5.2e-16) 4. Singleton (f2c): elapsed time t=4.05 s, 1 iters, t-(init.)=4 s t(norm)=0.924527, mflops=5.40817 (err=5.2e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=241920 = 9.70076 Normalized results and averages for N=241920: fft 0: mflops = 9.70076 (norm. = 1), norm. avg. (of 15) = 0.992727 fft 1: mflops = 7.0236 (norm. = 0.724026), norm. avg. (of 15) = 0.586072 fft 2: mflops = 6.65621 (norm. = 0.686154), norm. avg. (of 15) = 0.538483 fft 3: mflops = 5.0662 (norm. = 0.522248), norm. avg. (of 15) = 0.705048 fft 4: mflops = 5.40817 (norm. = 0.5575), norm. avg. (of 15) = 0.766575 fft 5: mflops = -1 (norm. = -0.103085), norm. avg. (of 8) = 0.81657 fft 6: mflops = -1 (norm. = -0.103085), norm. avg. (of 8) = 0.689164 Benchmarking for array size = 75x75x75: 0. FFTW: elapsed time t=4.44 s, 1 iters, t-(init.)=4.35 s t(norm)=0.551796, mflops=9.06132 (err=5.2e-16) 1. PDA: elapsed time t=5.04 s, 1 iters, t-(init.)=4.95 s t(norm)=0.627906, mflops=7.96298 (err=5.8e-16) 2. PDA (f2c): elapsed time t=5.64 s, 1 iters, t-(init.)=5.55 s t(norm)=0.704016, mflops=7.10212 (err=5.8e-16) 3. Singleton: elapsed time t=6.29 s, 1 iters, t-(init.)=6.2 s t(norm)=0.786468, mflops=6.35754 (err=6.6e-16) 4. Singleton (f2c): elapsed time t=6.06 s, 1 iters, t-(init.)=5.97 s t(norm)=0.757292, mflops=6.60247 (err=6.6e-16) 5. Temperton: elapsed time t=4.85 s, 1 iters, t-(init.)=4.76 s t(norm)=0.603804, mflops=8.28083 (err=1.4e-07) 6. Temperton (f2c): elapsed time t=5.67 s, 1 iters, t-(init.)=5.58 s t(norm)=0.707821, mflops=7.06393 (err=7.0e-16) Top mflops for N=421875 = 9.06132 Normalized results and averages for N=421875: fft 0: mflops = 9.06132 (norm. = 1), norm. avg. (of 16) = 0.993182 fft 1: mflops = 7.96298 (norm. = 0.878788), norm. avg. (of 16) = 0.604367 fft 2: mflops = 7.10212 (norm. = 0.783784), norm. avg. (of 16) = 0.553814 fft 3: mflops = 6.35754 (norm. = 0.701613), norm. avg. (of 16) = 0.704834 fft 4: mflops = 6.60247 (norm. = 0.728643), norm. avg. (of 16) = 0.764205 fft 5: mflops = 8.28083 (norm. = 0.913866), norm. avg. (of 9) = 0.827381 fft 6: mflops = 7.06393 (norm. = 0.77957), norm. avg. (of 9) = 0.699209 Benchmarking for array size = 80x80x80: 0. FFTW: elapsed time t=5.56 s, 1 iters, t-(init.)=5.45 s t(norm)=0.561249, mflops=8.9087 (err=3.8e-16) 1. PDA: elapsed time t=6.72 s, 1 iters, t-(init.)=6.62 s t(norm)=0.681738, mflops=7.3342 (err=3.8e-16) 2. PDA (f2c): elapsed time t=7.17 s, 1 iters, t-(init.)=7.06 s t(norm)=0.727049, mflops=6.87711 (err=3.7e-16) 3. Singleton: elapsed time t=8.23 s, 1 iters, t-(init.)=8.12 s t(norm)=0.83621, mflops=5.97936 (err=4.8e-16) 4. Singleton (f2c): elapsed time t=7.98 s, 1 iters, t-(init.)=7.87 s t(norm)=0.810464, mflops=6.1693 (err=4.7e-16) 5. Temperton: elapsed time t=5.42 s, 1 iters, t-(init.)=5.31 s t(norm)=0.546832, mflops=9.14358 (err=1.7e-07) 6. Temperton (f2c): elapsed time t=6.34 s, 1 iters, t-(init.)=6.23 s t(norm)=0.641575, mflops=7.79332 (err=5.2e-16) Top mflops for N=512000 = 9.14358 Normalized results and averages for N=512000: fft 0: mflops = 8.9087 (norm. = 0.974312), norm. avg. (of 17) = 0.992072 fft 1: mflops = 7.3342 (norm. = 0.802115), norm. avg. (of 17) = 0.615999 fft 2: mflops = 6.87711 (norm. = 0.752125), norm. avg. (of 17) = 0.56548 fft 3: mflops = 5.97936 (norm. = 0.653941), norm. avg. (of 17) = 0.70184 fft 4: mflops = 6.1693 (norm. = 0.674714), norm. avg. (of 17) = 0.75894 fft 5: mflops = 9.14358 (norm. = 1), norm. avg. (of 10) = 0.844643 fft 6: mflops = 7.79332 (norm. = 0.852327), norm. avg. (of 10) = 0.714521 Benchmarking for array size = 84x84x84: 0. FFTW: elapsed time t=6.18 s, 1 iters, t-(init.)=6.05 s t(norm)=0.532277, mflops=9.3936 (err=4.8e-16) 1. PDA: elapsed time t=8.92 s, 1 iters, t-(init.)=8.79 s t(norm)=0.773342, mflops=6.46545 (err=4.7e-16) 2. PDA (f2c): elapsed time t=9.62 s, 1 iters, t-(init.)=9.49 s t(norm)=0.834928, mflops=5.98854 (err=5.0e-16) 3. Singleton: elapsed time t=11.47 s, 1 iters, t-(init.)=11.34 s t(norm)=0.99769, mflops=5.01158 (err=5.4e-16) 4. Singleton (f2c): elapsed time t=10.63 s, 1 iters, t-(init.)=10.5 s t(norm)=0.923787, mflops=5.4125 (err=5.4e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=592704 = 9.3936 Normalized results and averages for N=592704: fft 0: mflops = 9.3936 (norm. = 1), norm. avg. (of 18) = 0.992512 fft 1: mflops = 6.46545 (norm. = 0.688282), norm. avg. (of 18) = 0.620015 fft 2: mflops = 5.98854 (norm. = 0.637513), norm. avg. (of 18) = 0.569482 fft 3: mflops = 5.01158 (norm. = 0.53351), norm. avg. (of 18) = 0.692488 fft 4: mflops = 5.4125 (norm. = 0.57619), norm. avg. (of 18) = 0.748788 fft 5: mflops = -1 (norm. = -0.106455), norm. avg. (of 10) = 0.844643 fft 6: mflops = -1 (norm. = -0.106455), norm. avg. (of 10) = 0.714521 Benchmarking for array size = 96x96x96: 0. FFTW: elapsed time t=8.94 s, 1 iters, t-(init.)=8.75 s t(norm)=0.500633, mflops=9.98735 (err=4.8e-16) 1. PDA: elapsed time t=12.4 s, 1 iters, t-(init.)=12.21 s t(norm)=0.698598, mflops=7.15719 (err=4.9e-16) 2. PDA (f2c): elapsed time t=12.51 s, 1 iters, t-(init.)=12.32 s t(norm)=0.704892, mflops=7.09329 (err=4.8e-16) 3. Singleton: elapsed time t=17.31 s, 1 iters, t-(init.)=17.12 s t(norm)=0.979525, mflops=5.10452 (err=5.5e-16) 4. Singleton (f2c): elapsed time t=16.64 s, 1 iters, t-(init.)=16.45 s t(norm)=0.941191, mflops=5.31242 (err=5.5e-16) 5. Temperton: elapsed time t=10.58 s, 1 iters, t-(init.)=10.39 s t(norm)=0.594466, mflops=8.4109 (err=1.6e-07) 6. Temperton (f2c): elapsed time t=12.34 s, 1 iters, t-(init.)=12.15 s t(norm)=0.695165, mflops=7.19254 (err=5.2e-16) Top mflops for N=884736 = 9.98735 Normalized results and averages for N=884736: fft 0: mflops = 9.98735 (norm. = 1), norm. avg. (of 19) = 0.992906 fft 1: mflops = 7.15719 (norm. = 0.716626), norm. avg. (of 19) = 0.6251 fft 2: mflops = 7.09329 (norm. = 0.710227), norm. avg. (of 19) = 0.576889 fft 3: mflops = 5.10452 (norm. = 0.511098), norm. avg. (of 19) = 0.682941 fft 4: mflops = 5.31242 (norm. = 0.531915), norm. avg. (of 19) = 0.737373 fft 5: mflops = 8.4109 (norm. = 0.842156), norm. avg. (of 11) = 0.844417 fft 6: mflops = 7.19254 (norm. = 0.720165), norm. avg. (of 11) = 0.715034 Benchmarking for array size = 105x105x105: 0. FFTW: elapsed time t=13.92 s, 1 iters, t-(init.)=13.68 s t(norm)=0.586678, mflops=8.52256 (err=4.8e-16) 1. PDA: elapsed time t=16.67 s, 1 iters, t-(init.)=16.42 s t(norm)=0.704185, mflops=7.10041 (err=5.5e-16) 2. PDA (f2c): elapsed time t=19.4 s, 1 iters, t-(init.)=19.16 s t(norm)=0.821692, mflops=6.085 (err=5.5e-16) 3. Singleton: elapsed time t=21.15 s, 1 iters, t-(init.)=20.91 s t(norm)=0.896742, mflops=5.57574 (err=6.5e-16) 4. Singleton (f2c): elapsed time t=19.75 s, 1 iters, t-(init.)=19.5 s t(norm)=0.836273, mflops=5.97891 (err=6.5e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1157625 = 8.52256 Normalized results and averages for N=1157625: fft 0: mflops = 8.52256 (norm. = 1), norm. avg. (of 20) = 0.993261 fft 1: mflops = 7.10041 (norm. = 0.83313), norm. avg. (of 20) = 0.635501 fft 2: mflops = 6.085 (norm. = 0.713987), norm. avg. (of 20) = 0.583744 fft 3: mflops = 5.57574 (norm. = 0.654232), norm. avg. (of 20) = 0.681506 fft 4: mflops = 5.97891 (norm. = 0.701538), norm. avg. (of 20) = 0.735582 fft 5: mflops = -1 (norm. = -0.117336), norm. avg. (of 11) = 0.844417 fft 6: mflops = -1 (norm. = -0.117336), norm. avg. (of 11) = 0.715034 Benchmarking for array size = 112x112x112: 0. FFTW: elapsed time t=15.02 s, 1 iters, t-(init.)=14.72 s t(norm)=0.513043, mflops=9.74576 (err=5.3e-16) 1. PDA: elapsed time t=22.08 s, 1 iters, t-(init.)=21.78 s t(norm)=0.759109, mflops=6.58667 (err=5.6e-16) 2. PDA (f2c): elapsed time t=23.85 s, 1 iters, t-(init.)=23.55 s t(norm)=0.8208, mflops=6.09162 (err=5.7e-16) 3. Singleton: elapsed time t=25.48 s, 1 iters, t-(init.)=25.18 s t(norm)=0.877611, mflops=5.69729 (err=6.3e-16) 4. Singleton (f2c): elapsed time t=24.07 s, 1 iters, t-(init.)=23.77 s t(norm)=0.828467, mflops=6.03524 (err=6.3e-16) 5. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). 6. Skipping fft (Temperton only handles N = 2^m 3^n 5^q). Top mflops for N=1404928 = 9.74576 Normalized results and averages for N=1404928: fft 0: mflops = 9.74576 (norm. = 1), norm. avg. (of 21) = 0.993582 fft 1: mflops = 6.58667 (norm. = 0.675849), norm. avg. (of 21) = 0.637423 fft 2: mflops = 6.09162 (norm. = 0.625053), norm. avg. (of 21) = 0.585711 fft 3: mflops = 5.69729 (norm. = 0.584591), norm. avg. (of 21) = 0.676891 fft 4: mflops = 6.03524 (norm. = 0.619268), norm. avg. (of 21) = 0.730043 fft 5: mflops = -1 (norm. = -0.102609), norm. avg. (of 11) = 0.844417 fft 6: mflops = -1 (norm. = -0.102609), norm. avg. (of 11) = 0.715034 Benchmarking for array size = 120x120x120: 0. FFTW: elapsed time t=18.91 s, 1 iters, t-(init.)=18.54 s t(norm)=0.5178, mflops=9.65624 (err=4.7e-16) 1. PDA: elapsed time t=25.37 s, 1 iters, t-(init.)=25 s t(norm)=0.69822, mflops=7.16106 (err=4.8e-16) 2. PDA (f2c): elapsed time t=26.46 s, 1 iters, t-(init.)=26.09 s t(norm)=0.728663, mflops=6.86189 (err=4.9e-16) 3. Singleton: elapsed time t=38.5 s, 1 iters, t-(init.)=38.13 s t(norm)=1.06493, mflops=4.69516 (err=5.6e-16) 4. Singleton (f2c): elapsed time t=36.64 s, 1 iters, t-(init.)=36.26 s t(norm)=1.0127, mflops=4.9373 (err=5.6e-16) 5. Temperton: elapsed time t=21.04 s, 1 iters, t-(init.)=20.67 s t(norm)=0.577289, mflops=8.66118 (err=2.0e-08) 6. Temperton (f2c): elapsed time t=24.27 s, 1 iters, t-(init.)=23.9 s t(norm)=0.667499, mflops=7.49065 (err=5.7e-16) Top mflops for N=1728000 = 9.65624 Normalized results and averages for N=1728000: fft 0: mflops = 9.65624 (norm. = 1), norm. avg. (of 22) = 0.993874 fft 1: mflops = 7.16106 (norm. = 0.7416), norm. avg. (of 22) = 0.642158 fft 2: mflops = 6.86189 (norm. = 0.710617), norm. avg. (of 22) = 0.591389 fft 3: mflops = 4.69516 (norm. = 0.486231), norm. avg. (of 22) = 0.668225 fft 4: mflops = 4.9373 (norm. = 0.511307), norm. avg. (of 22) = 0.7201 fft 5: mflops = 8.66118 (norm. = 0.896952), norm. avg. (of 12) = 0.848795 fft 6: mflops = 7.49065 (norm. = 0.775732), norm. avg. (of 12) = 0.720092 Benchmarking for array size = 144x144x144: 0. FFTW: elapsed time t=36.11 s, 1 iters, t-(init.)=35.47 s t(norm)=0.552253, mflops=9.05383 (err=5.8e-16) 1. PDA: elapsed time t=46.48 s, 1 iters, t-(init.)=45.84 s t(norm)=0.713709, mflops=7.00565 (err=5.7e-16) 2. PDA (f2c): elapsed time t=46.64 s, 1 iters, t-(init.)=45.99 s t(norm)=0.716045, mflops=6.98281 (err=5.8e-16) 3. Singleton: elapsed time t=63.18 s, 1 iters, t-(init.)=62.54 s t(norm)=0.973721, mflops=5.13494 (err=6.0e-16) 4. Singleton (f2c): elapsed time t=60.66 s, 1 iters, t-(init.)=60.02 s t(norm)=0.934486, mflops=5.35054 (err=5.8e-16) 5. Temperton: elapsed time t=39.22 s, 1 iters, t-(init.)=38.58 s t(norm)=0.600674, mflops=8.32398 (err=1.8e-07) 6. Temperton (f2c): elapsed time t=46.28 s, 1 iters, t-(init.)=45.64 s t(norm)=0.710595, mflops=7.03635 (err=6.6e-16) Top mflops for N=2985984 = 9.05383 Normalized results and averages for N=2985984: fft 0: mflops = 9.05383 (norm. = 1), norm. avg. (of 23) = 0.99414 fft 1: mflops = 7.00565 (norm. = 0.773778), norm. avg. (of 23) = 0.647881 fft 2: mflops = 6.98281 (norm. = 0.771255), norm. avg. (of 23) = 0.599209 fft 3: mflops = 5.13494 (norm. = 0.567157), norm. avg. (of 23) = 0.66383 fft 4: mflops = 5.35054 (norm. = 0.59097), norm. avg. (of 23) = 0.714486 fft 5: mflops = 8.32398 (norm. = 0.919388), norm. avg. (of 13) = 0.854225 fft 6: mflops = 7.03635 (norm. = 0.777169), norm. avg. (of 13) = 0.724483 ------------------------------------------------------ @@@@ bench.1d.p2.dat N, Arndt DIF, Arndt DIT, Arndt Split-Radix, Arndt 4-step, Bailey, Beauregard, Bergland, Brenner, Burrus, CWP (min N), CWP (best N), Edelblute, FFTPACK, FFTPACK (f2c), FFTW, FFTW_ESTIMATE, Frigo-old, Green, GSL, GSL DIT, GSL DIF, Krukar, Mayer (Buneman), Mayer (simple), Mayer (lookup), Monro, NAPACK (f2c), Nielsen, NR (C), NR (F), Ooura (C), Ooura (F), QFT, Ransom, SCIPORT, Singleton, Singleton (f2c), Sorensen, Sorensen DIT, Temperton, Temperton (f2c), Valkenburg 2, 8.88624, 8.45626, 6.35501, 0.612486, 2.22156, 1.51528, 2.54509, 2.47306, 8.0044, 2.29951, 1.68041, , 4.16102, 3.36082, 9.53251, 9.70904, 10.9227, , 2.57004, 1.57918, 1.33747, 6.20459, , , , , 0.824352, 0.65536, 1.54202, 1.44035, 8.66592, 8.0044, , , , 1.43248, 1.55115, 7.82519, 7.3327, 1.9563, 1.69125, 2.44994 4, 12.3362, 11.6508, 5.51882, 1.91346, 3.15836, 2.01649, 6.47269, 4.80998, 5.40503, 6.31672, 1.22497, 5.60736, 8.59489, 7.43671, 14.9797, 15.0874, 16.257, , 5.24288, 2.75941, 2.31986, 12.1927, 6.59482, 6.39376, 6.47269, 2.20289, 1.54202, 2.42726, 2.70252, 2.49661, 10.9227, 10.2802, , 1.37971, 3.74491, 3.85506, 4.06425, 8.192, 5.69878, 5.19097, 4.55903, 2.42726 8, 12.7875, 12.8923, 5.82542, 2.25986, 4.4939, 2.53688, 6.83854, 5.27807, 5.1067, 9.59063, 3.78092, 5.17389, 9.59063, 8.93673, 15.4202, 15.4202, 15.1237, 9.03945, 6.34219, 4.25098, 3.44926, 10.999, 8.32203, 8.192, 8.23489, 4.096, 2.18453, 4.54585, 4.13912, 3.81763, 12.0066, 11.4808, , 1.85479, 3.22308, 3.74491, 4.25098, 9.47508, 5.27807, 6.72164, 6.04948, 2.47306 16, 7.48983, 7.82519, 5.99186, 3.03057, 4.94611, 2.68866, 7.54371, 6.39376, 4.80998, 11.6508, 5.14008, 4.99322, 10.4858, 9.98644, 14.7687, 14.5636, 13.7069, 7.65384, 7.59838, 5.57753, 4.44312, 10.9227, 7.18203, 7.65384, 8.192, 5.40503, 2.89662, 4.76625, 5.46133, 5.04123, 11.2147, 10.9227, 11.9837, 3.56659, 3.1023, 6.06113, 6.43298, 9.79978, 5.24288, 7.88403, 6.89853, 2.54509 32, 8.04123, 8.2957, 6.24152, 3.39565, 5.85143, 2.6859, 8.5668, 6.7913, 4.92752, 8.79678, 8.85622, 5.16031, 9.5673, 9.16587, 14.247, 14.0938, 13.2396, 8.04123, 7.1624, 6.48871, 5.24288, 10.4858, 7.57642, 8.40205, 8.85622, 6.5536, 3.2768, 6.06815, 6.6198, 5.95782, 11.1078, 10.7436, 11.0145, 4.02061, 2.95207, 6.5536, 7.12348, 10.0825, 5.3718, 7.9922, 6.4251, 2.56 64, 7.48983, 7.98408, 6.49944, 4.13912, 6.09637, 2.60408, 8.78695, 7.34983, 5.14008, 8.98779, 8.192, 5.27807, 9.76934, 9.30689, 13.5592, 12.1927, 11.3156, 8.68986, 7.78646, 7.21497, 5.69878, 10.0182, 7.41917, 8.27823, 8.88624, 7.41917, 3.67492, 6.89853, 7.28178, 6.60867, 10.999, 10.773, 10.2802, 5.53825, 2.95651, 7.28178, 7.71012, 10.4163, 5.61737, 8.83631, 7.34983, 2.62144 128, 7.9783, 8.34095, 6.79633, 4.02414, 6.64858, 2.56286, 8.90781, 7.58268, 5.36552, 9.50781, 10.2514, 5.52713, 9.97287, 9.86563, 12.3155, 12.1524, 11.2577, 8.192, 7.77546, 7.64587, 5.99676, 9.50781, 7.84191, 8.6557, 9.4103, 7.45938, 3.88773, 7.00385, 7.84191, 7.168, 10.731, 10.6687, 10.0274, 5.56063, 2.8672, 7.11243, 7.71012, 10.4858, 5.95782, 8.41747, 7.28178, 2.60655 256, 7.88403, 8.38861, 7.03742, 4.096, 6.59482, 2.54509, 9.27943, 7.94376, 5.66798, 10.2802, 10.4858, 5.79324, 9.89223, 9.70904, 12.1223, 11.9156, 11.096, 8.192, 8.192, 7.88403, 6.24152, 9.27943, 7.94376, 8.81156, 9.36229, 8.0044, 4.096, 7.43671, 8.192, 7.38434, 10.5917, 10.5917, 9.70904, 6.39376, 2.83399, 7.76723, 8.06597, 10.3819, 6.06113, 8.73813, 7.28178, 2.64792 512, 8.24929, 8.67388, 7.28178, 4.21303, 6.48158, 2.52062, 9.51329, 7.97059, 5.89824, 10.2578, 10.4394, 6.01861, 8.93673, 8.73813, 12.0372, 11.7965, 10.1694, 8.42606, 7.51368, 8.07978, 6.37648, 9.216, 8.24929, 9.28857, 9.74916, 8.1355, 4.12464, 7.56185, 8.42606, 7.56185, 10.6275, 10.4394, 9.216, 6.44616, 2.70561, 7.76084, 8.1355, 10.4394, 6.48158, 8.48668, 6.77959, 2.63314 1024, 7.9922, 8.68026, 7.28178, 4.51972, 5.4162, 2.46376, 8.97753, 7.84862, 5.85143, 10.1606, 10.4025, 6.01248, 8.5668, 8.24352, 11.3976, 10.7436, 9.42964, 7.84862, 6.93503, 7.9922, 6.4251, 8.45626, 8.24352, 9.03945, 9.42964, 8.192, 3.97188, 7.08497, 8.51117, 7.62047, 10.4025, 10.4025, 8.73813, 6.93503, 2.50137, 7.8019, 8.192, 9.70904, 6.72164, 8.09086, 6.86241, 2.56 2048, 7.31874, 7.9657, 6.37961, 4.36907, 5.33997, 2.37137, 8.53131, 7.39381, 5.50302, 8.89995, 9.48547, 5.58834, 8.48113, 8.33406, 10.6799, 10.4478, 9.36229, 7.35608, 7.06761, 6.80091, 5.30071, 8.23881, 8.43153, 9.36229, 9.42348, 7.13758, 3.98285, 6.49456, 7.54865, 7.06761, 9.94339, 9.74184, 8.84535, 6.43657, 2.53837, 6.61372, 6.99899, 8.53131, 6.26866, 7.62853, 6.61372, 2.55637 4096, 6.49944, 7.41917, 5.8689, 4.39347, 5.04123, 2.32672, 8.02482, 7.21497, 5.1067, 8.88624, 9.53251, 5.20816, 8.27823, 8.10755, 10.9227, 10.6998, 8.73813, 7.08497, 6.83854, 6.19238, 4.9152, 8.23489, 7.78646, 8.41104, 8.54817, 6.66468, 4.096, 6.29146, 6.89853, 6.39376, 9.53251, 9.47508, 8.02482, 6.95958, 2.42726, 6.77959, 6.95958, 7.71012, 5.8689, 7.28178, 6.39376, 2.52062 8192, 7.04106, 7.60686, 6.12927, 4.23865, 5.07124, 2.32778, 8.192, 7.28178, 5.3248, 9.16095, 8.87467, 5.46133, 7.88859, 7.67539, 10.5835, 10.6496, 8.51968, 7.09973, 6.50357, 6.40577, 5.01158, , 7.96232, 8.51968, 8.35263, 6.76165, 4.05699, 6.21874, 7.15939, 6.656, 9.5192, 9.5192, 7.88859, 6.87071, 2.39317, 6.70841, 7.04106, 7.81622, 6.08549, 7.15939, 5.91644, 2.39317 16384, 6.69711, 7.58268, 5.99676, 4.68114, 4.68114, 2.29376, 8.2658, 7.168, 5.30349, 9.17504, 9.26772, 5.39708, 7.52052, 7.39923, 10.546, 10.0825, 8.192, 6.79633, 6.60075, 6.24152, 4.90644, , 7.71012, 8.1195, 8.192, 6.79633, 4.096, 6.15774, 6.89853, 6.41611, 9.50781, 9.50781, 7.45938, 7.22444, 2.29376, 6.79633, 6.95079, 7.45938, 6.07619, 7.22444, 6.15774, 2.30529 32768, 6.4251, 7.22824, 5.64966, 4.38857, 4.63698, 2.25468, 7.9922, 6.97191, 5.06722, 9.45231, 9.36229, 5.14681, 7.62047, 7.50412, 10.1869, 10.031, 7.74047, 6.68735, 6.59758, 5.88647, 4.55111, , 7.44727, 8.192, 8.192, 6.34219, 4.02885, 5.88647, 6.5536, 6.144, 9.18729, 9.10222, 7.07223, 6.92282, 2.25468, 5.99415, 6.22177, 6.87441, 5.78259, 6.77959, 5.92193, 2.25468 65536, 6.06113, 6.80894, 5.40503, 4.72332, 4.44312, 2.25986, 7.76723, 6.72164, 4.89989, 8.66592, 8.52501, 5.04123, 7.59838, 7.38434, 10.0825, 10.1803, 7.59838, 6.59482, 6.59482, 5.60736, 4.44312, , 7.54371, 7.88403, 7.82519, 6.09637, 3.85506, 5.24288, 5.85797, 5.60736, 8.88624, 9.11805, 6.67883, 6.80894, 2.15756, 6.39376, 6.47269, 6.72164, 5.57753, 6.76501, 5.85797, 2.21219 131072, 6.12149, 6.67133, 5.23057, 4.47435, 4.38627, 2.23717, 7.6309, 6.63162, 4.80221, 8.63653, 8.63653, 4.86512, 7.09625, 6.91995, 9.94743, 9.94743, 7.28178, 6.25906, 6.33018, 5.35631, 4.38627, , 7.14174, 7.52778, 7.42741, 6.05496, 4.0076, 5.51541, 5.80267, 5.59855, 8.9129, 8.84216, 6.4774, 6.63162, 2.21054, 6.12149, 6.22409, 6.51527, 5.43469, 6.33018, 5.59855, 2.16332 262144, 5.85433, 6.48158, 5.1289, 4.9565, 4.25098, 2.21738, 7.685, 6.6459, 4.74707, 6.68356, 6.68356, 4.80508, 6.95958, 6.83854, 9.59063, 9.28857, 7.10631, 6.27472, 6.34219, 5.30179, 4.31316, , 6.91876, 7.28178, 7.06376, 5.95782, 4.10312, 5.59075, 5.83984, 5.53825, 8.86953, 8.903, 5.95782, 7.06376, 2.16847, 6.20867, 6.29146, 6.25808, 5.41123, 6.51739, 5.71258, 2.09715 Norm. Avg., 0.652152, 0.692367, 0.523522, 0.333051, 0.424406, 0.200227, 0.672446, 0.575478, 0.460883, 0.74307, 0.684451, 0.45526, 0.705158, 0.678748, 0.987685, 0.969498, 0.855941, 0.642225, 0.567662, 0.511991, 0.407181, 0.734683, 0.653163, 0.702867, 0.718801, 0.555533, 0.304223, 0.48754, 0.543466, 0.50237, 0.851639, 0.838849, 0.741585, 0.504817, 0.221719, 0.531091, 0.555498, 0.721652, 0.506689, 0.604681, 0.516904, 0.207833 ------------------------------------------------------ @@@@ bench.1d.np2.dat N, Brenner, CWP (min N), CWP (best N), FFTPACK, FFTPACK (f2c), FFTW, FFTW_ESTIMATE, Frigo-old, GSL, Nielsen, Singleton, Singleton (f2c), Temperton, Temperton (f2c), Valkenburg 6, , 6.91462, 2.42012, 7.41933, 6.96198, 13.7358, 13.4629, 4.27079, 5.67848, 2.0329, 3.21661, 3.60443, 5.08224, 4.3438, 2.93771 9, 2.4345, 7.85587, 4.49447, 7.41944, 7.24689, 11.1292, 11.1292, 3.22362, 4.9726, 3.20154, 4.58259, 5.10846, 5.66575, 4.92026, 2.71758 12, , 10.1415, 6.64936, 9.09461, 8.59552, 15.3224, 15.1577, 5.22098, 6.52623, 3.9821, 4.69888, 5.22098, 7.96421, 6.77723, 2.62997 15, 2.94527, 8.80878, 9.23228, 8.3492, 7.44308, 12.3891, 12.4696, 3.26584, 5.33421, 4.36435, 4.89876, 5.27559, 7.50123, 6.66776, 2.5135 18, 2.82055, 9.17732, 6.08793, 7.4082, 7.23389, 10.1633, 9.76001, 3.66, 6.61162, 3.68192, 5.16706, 5.856, 6.75693, 5.69334, 2.745 24, , 11.4833, 8.92515, 8.58515, 8.27009, 12.52, 12.3485, 6.04993, 7.57513, 5.49659, 5.18069, 5.85351, 8.6677, 7.57513, 2.66698 36, 3.34358, 10.8133, 10.8133, 8.15333, 7.98258, 11.9115, 11.8192, 4.30699, 7.85914, 5.25749, 6.14787, 6.68716, 8.47041, 7.12464, 2.72263 80, 4.6657, 12.1857, 10.8459, 9.16624, 8.3531, 10.5156, 11.0779, 6.022, 6.81437, 6.81437, 7.39846, 7.67248, 9.41622, 8.09207, 2.58946 108, 3.55732, 9.76518, 11.6724, 8.25455, 8.34678, 11.1498, 11.1498, 3.95257, 7.54582, 5.92886, 6.2253, 6.98165, 8.25455, 6.98165, 2.78745 210, 3.80475, 10.7719, 10.8423, 7.6095, 6.97005, 10.4332, 9.701, 3.3445, 6.38027, 6.38027, 6.09879, 6.79865, , , 2.34304 504, 4.16649, 11.2455, 11.2455, 7.4249, 7.10604, 10.435, 10.2503, 3.93974, 7.1499, 6.43491, 6.32942, 7.23928, , , 2.47497 1000, 3.96155, 8.98324, 10.9967, 7.68446, 7.20689, 9.81246, 9.81246, 3.96155, 5.74604, 7.4164, 7.82589, 8.22981, 7.82589, 6.89525, 2.43439 1960, 3.8321, 9.94125, 9.94125, 6.47119, 5.96475, 9.66121, 9.26954, 3.57264, 6.12452, 6.41071, 6.85946, 7.49668, , , 2.21273 4725, 3.57667, 8.54427, 10.1965, 6.63871, 6.4082, 9.04687, 8.78839, 2.84809, 6.19316, 6.03125, 6.19316, 6.63871, , , 2.30695 10368, 4.06787, 9.53845, 9.53845, 7.68375, 7.63076, 10.0587, 9.37678, 4.53467, 7.84724, 5.88543, 6.07945, 6.58607, 7.04752, 6.11304, 2.49203 27000, 3.45616, 9.6941, 9.75357, 7.22651, 6.91232, 8.41181, 8.88175, 3.28478, 6.51571, 6.35933, 5.97681, 6.41062, 6.73658, 5.84497, 2.29745 75600, 3.34749, 9.07542, 9.21189, 6.38115, 6.18778, 9.80145, 8.87812, 3.24122, 6.44832, 5.8342, 5.72515, 5.94748, , , 2.13446 165375, 3.09594, 8.48177, 8.43188, 5.26992, 5.11935, 8.63506, 8.05292, 2.53254, 6.02277, 5.30896, 5.57751, 6.04818, , , 1.97169 362880, 3.37466, 4.56545, 4.55924, 6.29895, 6.28713, 10.0331, 8.25379, 3.40207, 6.93797, 5.632, 5.07733, 5.38752, , , 2.05838 Norm. Avg., 0.332119, 0.847154, 0.799112, 0.672911, 0.643982, 0.963986, 0.935227, 0.349967, 0.594751, 0.496598, 0.526943, 0.573616, 0.63763, 0.549579, 0.222906 ------------------------------------------------------ @@@@ bench.3d.p2.dat Array Dimensions, FFTW, HARM, HARM (f2c), NR (C), NR (F), PDA, PDA (f2c), Singleton, Singleton (f2c), Temperton, Temperton (f2c) 4x4x4, 15.4202, , , 7.71012, 6.89853, 6.04948, 5.1067, 8.54817, 9.14456, 11.9156, 9.19804 8x8x8, 15.3201, 11.2347, 9.59063, 9.00495, 8.24929, 7.81224, 7.10631, 8.192, 9.36229, 11.7965, 9.14456 16x16x16, 13.6771, 10.6998, 9.03945, 7.48983, 7.14938, 8.23489, 7.78646, 7.28178, 7.71012, 10.6998, 8.78695 32x32x32, 12.7668, 9.27396, 7.92774, 6.34219, 6.22177, 7.62047, 7.28178, 6.10584, 6.4251, 9.10222, 7.44727 64x64x64, 11.9761, 9.51329, 7.76084, 5.61737, 5.474, 7.56185, 7.34983, 6.17617, 6.35929, 9.10925, 7.51368 256x64x32, 10.6199, 9.22359, 7.6509, 5.57129, 5.43156, 7.38982, 7.22893, 5.91536, 6.07407, 8.83109, 7.30313 16x1024x64, 10.4129, 9.25486, 7.61493, 5.53923, 5.39391, 7.1673, 6.98585, 6.01593, 6.19726, , 128x128x128, 9.43045, 8.82923, 7.44677, 5.1067, 4.98869, 7.26976, 7.24107, 4.82051, 5.03086, 7.76175, 6.50712 Norm. Avg., 1, 0.818568, 0.68533, 0.524913, 0.501429, 0.610961, 0.583479, 0.53267, 0.562838, 0.779033, 0.632038 ------------------------------------------------------ @@@@ bench.3d.np2.dat Array Dimensions, FFTW, PDA, PDA (f2c), Singleton, Singleton (f2c), Temperton, Temperton (f2c) 5x5x5, 10.8075, 6.46102, 5.27586, 12.1309, 11.8883, 9.28771, 7.89045 6x6x6, 13.506, 5.91468, 5.60542, 7.72638, 8.93363, 9.74578, 8.09084 7x7x7, 11.0377, 3.87186, 3.42373, 9.48109, 10.7177, , 9x9x9, 10.7561, 7.09901, 6.672, 8.14107, 9.14821, 8.4512, 7.15626 10x10x10, 12.2656, 7.33115, 6.44253, 8.02277, 8.67769, 9.88853, 8.07355 11x11x11, 9.35518, 3.77805, 3.48057, 8.50062, 9.16129, , 12x12x12, 14.0758, 7.57582, 7.16509, 7.48053, 8.31751, 10.912, 9.22019 13x13x13, 8.76929, 3.48423, 3.36408, 7.88351, 8.7203, , 14x14x14, 11.4623, 5.36336, 5.0654, 7.16392, 8.22089, , 15x15x15, 11.8302, 8.27342, 7.19223, 7.23333, 7.81378, 9.37654, 8.0116 24x25x28, 9.06953, 7.14569, 6.59602, 6.83501, 7.48596, , 48x48x48, 10.1811, 7.2951, 7.23811, 5.82691, 6.05541, 9.26478, 7.85151 49x49x49, 8.76856, 6.23175, 5.44422, 6.51873, 7.1284, , 60x60x60, 9.56916, 7.33269, 6.81079, 5.42162, 5.69593, 9.33577, 8.00767 72x60x56, 9.70076, 7.0236, 6.65621, 5.0662, 5.40817, , 75x75x75, 9.06132, 7.96298, 7.10212, 6.35754, 6.60247, 8.28083, 7.06393 80x80x80, 8.9087, 7.3342, 6.87711, 5.97936, 6.1693, 9.14358, 7.79332 84x84x84, 9.3936, 6.46545, 5.98854, 5.01158, 5.4125, , 96x96x96, 9.98735, 7.15719, 7.09329, 5.10452, 5.31242, 8.4109, 7.19254 105x105x105, 8.52256, 7.10041, 6.085, 5.57574, 5.97891, , 112x112x112, 9.74576, 6.58667, 6.09162, 5.69729, 6.03524, , 120x120x120, 9.65624, 7.16106, 6.86189, 4.69516, 4.9373, 8.66118, 7.49065 144x144x144, 9.05383, 7.00565, 6.98281, 5.13494, 5.35054, 8.32398, 7.03635 Norm. Avg., 0.99414, 0.647881, 0.599209, 0.66383, 0.714486, 0.854225, 0.724483 @@@@ end