pb with -ffast-math
Alain Baeckeroot
alain.baeckeroot at laposte.net
Fri Apr 17 03:58:41 CDT 2009
Le 17/04/2009 à 10:28, Jaroslav Hajek a écrit :
>
> Hmm, this looks really too complicated to vectorize. I guess a C++
> computation "kernel" taking care just of this operation is the way to
> go. Since there are no func calls, it should be fairly straightforward
> to write, using essentially the same syntax.
yes, Octave is nice :-)
> Are all the intermediate T1...T20 results really needed? They sound
> like temporaries to me.
>
Part of them are results. So far we kept everything, to check the
results were identical to the one given by the previous tool.
I'll try to eliminate the unneeded ones.
I have tested 3.1.55 compiled with "-O3 -march=native -funroll-loops"
it seems 10-20% faster.
we are also considering to add one dimension to all our variables,
corresponding to differents independant input params, in order
to vectorize wrt this dimension, but i'm not sure this is useful
for our end-users !
Alain
More information about the Help-octave
mailing list