Re: What CPU extensions can we assume are available by arch?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sun, Apr 26, 2020 at 5:25 PM Dave Love <loveshack@xxxxxxxxxxxxxxxxx> wrote:
Kevin Kofler <kevin.kofler@xxxxxxxxx> writes:

> Has anyone (upstream or elsewhere) ever looked into doing an SSE2 version of
> the vector code? It should be faster than scalar (especially considering
> that the "scalar" floating-point code (under the default -mfpmath=sse)
> actually loads everything into SSE2 registers as well, but does not actually
> make use of the vectorization) and it would match the baseline of many
> distributions and upstreams out there.

What's preventing vectorization with sse2 (or other architecture' base
SIMD) anyhow, if anything?  Use something
like

gcc -Ofast -fopt-info-vec-missed

I can't comment on the exact command line used, but I did experiment with a recent pull request adding SSE 4.1 

Full details here:

https://github.com/drowe67/LPCNet/pull/25

Thanks,
Richard
_______________________________________________
devel mailing list -- devel@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe send an email to devel-leave@xxxxxxxxxxxxxxxxxxxxxxx
Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: https://lists.fedoraproject.org/archives/list/devel@xxxxxxxxxxxxxxxxxxxxxxx

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [Fedora Announce]     [Fedora Users]     [Fedora Kernel]     [Fedora Testing]     [Fedora Formulas]     [Fedora PHP Devel]     [Kernel Development]     [Fedora Legacy]     [Fedora Maintainers]     [Fedora Desktop]     [PAM]     [Red Hat Development]     [Gimp]     [Yosemite News]

  Powered by Linux