11-14-07 - 1

I put up the assembly for ConvertColorsGamma, which does 4-float -> dword color conversion with gamma correction. It's super non-optimal assembly, and you can do this a lot faster on video cards these days (in fact they pretty much do the whole thing for you). I'm not very good at writing assembly anymore; I'm not even sure what the major performance issues are, other than memory accesses.

I've looked a little at the ASM output from the VC intrinsics and it looks pretty good. The compiler does the instruction reordering and all that good stuff for you, so dumb assembly like this should pretty much be written with the intrinsics, letting the compiler do the little stuff.
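To make the intrinsics idea concrete, here's a minimal SSE2 sketch of the same kind of conversion. It is not the posted routine: the function name is made up, the layout (r in the low byte) is an assumption, and I've swapped the general gamma curve for a gamma-2.0 approximation (a plain square root), since SSE has no pow instruction.

```c
#include <emmintrin.h> /* SSE2 intrinsics */
#include <stdint.h>

/* Hypothetical example: 4 floats (r,g,b,a) -> packed dword, gamma 2.0 via sqrt.
   Assumed layout: r in the low byte, a in the high byte; alpha stays linear. */
uint32_t convert_color_gamma2_sse2(const float rgba[4])
{
    __m128 v = _mm_loadu_ps(rgba); /* lanes: r g b a */

    /* clamp all four lanes to [0,1] */
    v = _mm_min_ps(_mm_max_ps(v, _mm_setzero_ps()), _mm_set1_ps(1.0f));

    /* sqrt on every lane, then put the untouched alpha back into lane 3 */
    __m128 g = _mm_sqrt_ps(v);
    __m128 alpha_mask = _mm_castsi128_ps(_mm_set_epi32(-1, 0, 0, 0));
    g = _mm_or_ps(_mm_and_ps(alpha_mask, v), _mm_andnot_ps(alpha_mask, g));

    /* scale to 0..255 and convert to int (rounds per the current SSE rounding mode) */
    __m128i i = _mm_cvtps_epi32(_mm_mul_ps(g, _mm_set1_ps(255.0f)));

    /* narrow 32 -> 16 -> 8 bits with saturation; low dword holds r,g,b,a bytes */
    i = _mm_packs_epi32(i, i);
    i = _mm_packus_epi16(i, i);
    return (uint32_t)_mm_cvtsi128_si32(i);
}
```

There are no branches and no hand-picked registers here; scheduling and register allocation are left to the compiler, which is the whole point.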
