Doesn't fftw 3.3.3 use neon if available? Might be worth trying instead of sticking with 3.3.2.
Even 3.3.2 uses NEON (look for "neon" in the wisdom posted above). The current version is 3.3.4 to which we might upgrade with the next app release I guess.
Doesn't fftw 3.3.3 use neon if available? Might be worth trying instead of sticking with 3.3.2.
Even 3.3.2 uses NEON (look for "neon" in the wisdom posted above). The current version is 3.3.4 to which we might upgrade with the next app release I guess.
Apparently NEON support got added with 3.3.1 beta1. However 3.3.3 introduced 128 bit NEON instructions which are said to provide a speed up even on 64 bit. FFTW release notes page.
Doesn't fftw 3.3.3 use neon if available? Might be worth trying instead of sticking with 3.3.2.
Even 3.3.2 uses NEON (look for "neon" in the wisdom posted above). The current version is 3.3.4 to which we might upgrade with the next app release I guess.
Apparently NEON support got added with 3.3.1 beta1. However 3.3.3 introduced 128 bit NEON instructions which are said to provide a speed up even on 64 bit. FFTW release notes page.
The release version would be better.
I remember I tested 3.3.3 for Raspi and Android some time ago and found no significant improvement back then, but I should try 3.3.4 now. It will take some time before I have meaningful results because to make a apples-to-apples comparison, one needs to generate new wisdom first, which can take some time.
For users who are running the non-beta version of the BRP app on the RPI3, here's some wisdom file to try : store this into a file /etc/fftw/wisdomf (create directory as needed) and restart BOINC.
Unfortunately this will not work for the BETA version of the app, which comes with pre-canned wisdom that cannot be replaced. I'll change that in the next version of the app.
Seems to help a bit and brings per task (CPU) runtime down below 9.5h for the first 4 tasks I've tried it with.
Hi, can you please share the parameters that you passed to "fftwf-wisdom" when you generate the wisdom file?
I compiled fftw3.3.4 and want to try it, but I don't know the parameter for generating wisdom file.
I looked into source code and figured out "-x rif4194304".
But the generated file contains fewer lines so I think there must be more combinations.
Hi, can you please share the parameters that you passed to "fftwf-wisdom" when you generate the wisdom file?
I compiled fftw3.3.4 and want to try it, but I don't know the parameter for generating wisdom file.
I looked into source code and figured out "-x rif4194304".
But the generated file contains fewer lines so I think there must be more combinations.
I tried "-x" ("exhaustive") first but after more than 10 days (sic!) I stopped that and set a timeout of 10hrs instead. The FFT performed is a 3 * 2^22 real-to-complex in-place transform (where the factor 3 comes from the command line options and the 2^22 is dictated by the length of the data (number of samples) ), so
./fftwf-wisdom -v -t 10 -n -o wisdomf rif12582912
"-n" ignores any existing system wisdom and starts wisdom generation from scratch.
I ran fftw-wisdom in parallel to E@h tasks to simulate similar loads on the processor, cache and memory bus.
Thank you for supporting the Raspberry Pi and continuing to develop the app.
You're welcome, and thank you for contributing to E@H.
I love the idea behind the Raspberry Pi, so in my spare time I try to make them productive on E@H. I have a total of 11 Raspis at home now, most of them doing E@H plus some other more or less useful (but fun) stuff, ranging from astronomy to cat surveillance:
RPi 1 A : 2 x
RPi 1 B : 3 x
RPi 1 A+: 1 x
RPi 1 B+: 1 x
RPi 2 B : 2 x
RPi 3 B : 2 x
I'd like to have a few PiZeros as well but they are currently sold out.
Just under 6 days. This was mostly a fun exercise, but at least has some usefulness to test the code path in the app that is for non-Intel-SSE2 hosts, otherwise unused on E@H.
EDIT: In this configuration the Raspi 3 should draw less than 5W, so we are talking about ca 0.4 kWh per task, which should be in the same ballpark as moderately modern desktop systems.
EDIT^2: running 4 GW tasks in parallel, it will take ca 7 days to finish, still well within the 14 days deadline. It requires active cooling to keep it from throttling down tho, so let's say 6 W in total, or less than 0.3 kWh per task, or ca 0.1 EUR per task in (German) electricity cost per task.
Quote:
Good to keep an eye on felines. Never know when they will decide you are no longer useful. ;-)
LOL! I also decided to add motion detection to the setup so you can catch the few moments while I'm away when the cats actually do something other than sleeping ;-)
RE: Doesn't fftw 3.3.3 use
)
Even 3.3.2 uses NEON (look for "neon" in the wisdom posted above). The current version is 3.3.4 to which we might upgrade with the next app release I guess.
RE: RE: Doesn't fftw
)
Apparently NEON support got added with 3.3.1 beta1. However 3.3.3 introduced 128 bit NEON instructions which are said to provide a speed up even on 64 bit. FFTW release notes page.
The release version would be better.
BOINC blog
RE: RE: RE: Doesn't
)
I remember I tested 3.3.3 for Raspi and Android some time ago and found no significant improvement back then, but I should try 3.3.4 now. It will take some time before I have meaningful results because to make a apples-to-apples comparison, one needs to generate new wisdom first, which can take some time.
Thank you for supporting the
)
Thank you for supporting the Raspberry Pi and continuing to develop the app.
RE: For users who are
)
Hi, can you please share the parameters that you passed to "fftwf-wisdom" when you generate the wisdom file?
I compiled fftw3.3.4 and want to try it, but I don't know the parameter for generating wisdom file.
I looked into source code and figured out "-x rif4194304".
But the generated file contains fewer lines so I think there must be more combinations.
RE: Hi, can you please
)
I tried "-x" ("exhaustive") first but after more than 10 days (sic!) I stopped that and set a timeout of 10hrs instead. The FFT performed is a 3 * 2^22 real-to-complex in-place transform (where the factor 3 comes from the command line options and the 2^22 is dictated by the length of the data (number of samples) ), so
./fftwf-wisdom -v -t 10 -n -o wisdomf rif12582912
"-n" ignores any existing system wisdom and starts wisdom generation from scratch.
I ran fftw-wisdom in parallel to E@h tasks to simulate similar loads on the processor, cache and memory bus.
RE: Thank you for
)
You're welcome, and thank you for contributing to E@H.
I love the idea behind the Raspberry Pi, so in my spare time I try to make them productive on E@H. I have a total of 11 Raspis at home now, most of them doing E@H plus some other more or less useful (but fun) stuff, ranging from astronomy to cat surveillance:
RPi 1 A : 2 x
RPi 1 B : 3 x
RPi 1 A+: 1 x
RPi 1 B+: 1 x
RPi 2 B : 2 x
RPi 3 B : 2 x
I'd like to have a few PiZeros as well but they are currently sold out.
The Pis are fun to play with.
)
The Pis are fun to play with. I only have three now but plan on ordering my first Pi 3 next week. A few more will follow later.
I love this project and the Raspberry Pi lets me continue to contribute during the summer when I have to shut down the GPUs due to heat.
Good to keep an eye on felines. Never know when they will decide you are no longer useful. ;-)
Cheers!
I finally managed to get some
)
I finally managed to get some GW tasks (here running 2 in parallel) to finish successfully on the Pi3:
https://einsteinathome.org/task/550861762
Just under 6 days. This was mostly a fun exercise, but at least has some usefulness to test the code path in the app that is for non-Intel-SSE2 hosts, otherwise unused on E@H.
EDIT: In this configuration the Raspi 3 should draw less than 5W, so we are talking about ca 0.4 kWh per task, which should be in the same ballpark as moderately modern desktop systems.
EDIT^2: running 4 GW tasks in parallel, it will take ca 7 days to finish, still well within the 14 days deadline. It requires active cooling to keep it from throttling down tho, so let's say 6 W in total, or less than 0.3 kWh per task, or ca 0.1 EUR per task in (German) electricity cost per task.
LOL! I also decided to add motion detection to the setup so you can catch the few moments while I'm away when the cats actually do something other than sleeping ;-)
Cheers
HB
RE: EDIT: some pictures of
)
HB,
I'll see your bet and raise you 3 more tubes/fans. :>)