I suspect that SSE is being used for the arrays because they happen to be appropriately sized. If I remember correctly, GCC 4 was to introduce some autovectorization. Perhaps that's what's going on.
Brian
SSE is being used because I told the compiler to use it (-mfpmath=sse).
That's not the issue. I don't want to insult your intelligence, and I hope you won't be offended if I ask you to re-read the original post.