[Changeset] Re: Faster Array transpose

John W. Eaton jwe at bevo.che.wisc.edu
Tue May 6 12:49:58 CDT 2008


On  5-May-2008, David Bateman wrote:

| I got previous without the patch
| 
| N =  128, time = 0.0016666 sec
| N =  129, time = 0.0003333 sec
| N = 1024, time = 0.0439971 sec
| N = 1025, time = 0.0336645 sec
| N = 2048, time = 0.276649 sec
| N = 2049, time = 0.172656 sec
| N = 4096, time = 1.19492 sec
| N = 4097, time = 0.700954 sec
| 
| and with it
| 
| N =  128, time = 0 sec
| N =  129, time = 0 sec
| N = 1024, time = 0.0319979 sec
| N = 1025, time = 0.0243318 sec
| N = 2048, time = 0.170655 sec
| N = 2049, time = 0.117326 sec
| N = 4096, time = 0.745285 sec
| N = 4097, time = 0.558964 sec
| 
| and similar improvements for the hermitian operator.

That's a lot of work for a fairly modest improvement...

Is there a ChangeLog entry for this?  With that, I'd apply the change
(but skipping the diffs for the float matrix files I don't have yet).

Also, how about some tests for this since the algorithm is now more
complex?

Thanks,

jwe


More information about the Octave-maintainers mailing list