]> git.lizzy.rs Git - rust.git/commit
Define AVX broadcast intrinsics
authorRuud van Asseldonk <ruuda@google.com>
Tue, 8 Mar 2016 20:41:18 +0000 (21:41 +0100)
committerRuud van Asseldonk <ruuda@google.com>
Wed, 9 Mar 2016 00:18:46 +0000 (01:18 +0100)
commit37efeae8866c3c4d9827d0ca271b8e27f731c3e1
tree09f10eaabf92db271df0b618cad0af4b26051541
parent8f0479b2a5fc16814b68efe9e87dddf231b8941d
Define AVX broadcast intrinsics

This defines `_mm256_broadcast_ps` and `_mm256_broadcast_pd`. The `_ss`
and `_sd` variants are not supported by LLVM. In Clang these intrinsics
are implemented as inline functions in C++.

Intel reference: https://software.intel.com/en-us/node/514144.

Note: the argument type should really be "0hPc" (a pointer to a vector
of half the width), but internally the LLVM intrinsic takes a pointer to
a signed integer, and for any other type LLVM will complain. This means
that a transmute is required to call these intrinsics.

The AVX2 broadcast intrinsics `_mm256_broadcastss_ps` and
`_mm256_broadcastsd_pd` are not available as LLVM intrinsics. In Clang
they are implemented using the shufflevector builtin.
src/etc/platform-intrinsics/x86/avx.json