C-9
COMPILER INTRINSICS AND FUNCTIONAL EQUIVALENTS
PUNPCKLWD__m64 _m_punpcklwd (__m64 m1, __m64 m2)
__m64 _mm_unpacklo_pi16(__m64 m1, __m64 m2)
Interleave the two 16-bit values from the
low half of m1 with the two values from the
low half of m2 and take the least significant
element from m1.
PUNPCKLDQ__m64 _m_punpckldq (__m64 m1, __m64 m2)
__m64 _mm_unpacklo_pi32(__m64 m1, __m64 m2)
Interleave the 32-bit value from the low half
of m1 with the 32-bit value from the low half
of m2 and take the least significant element
from m1.
PXOR
__m64 _m_pxor(__m64 m1, __m64 m2)
__m64 _mm_xor_si64(__m64 m1, __m64 m2)
Perform a bitwise XOR of the 64-bit value
in m1 with the 64-bit value in m2.
RCPPS
__m128 _mm_rcp_ps(__m128 a)
Computes the approximations of the
reciprocals of the four SP FP values of a.
RCPSS
__m128 _mm_rcp_ss(__m128 a)
Computes the approximation of the
reciprocal of the lower SP FP value of a;
the upper three SP FP values are passed
through.
RSQRTPS__m128 _mm_rsqrt_ps(__m128 a)
Computes the approximations of the
reciprocals of the square roots of the four
SP FP values of a.
RSQRTSS__m128 _mm_rsqrt_ss(__m128 a)
Computes the approximation of the
reciprocal of the square root of the lower
SP FP value of a; the upper three SP FP
values are passed through.
SFENCEvoid_mm_sfence(void)
Guarantees that every preceding store is
globally visible before any subsequent
store.
SHUFPS__m128 _mm_shuffle_ps(__m128 a, __m128 b,
unsigned int imm8)
Selects four specific SP FP values from a
and b, based on the mask i. The mask must
be an immediate.
SQRTPS__m128 _mm_sqrt_ps(__m128 a)
Computes the square roots of the four SP
FP values of a.
SQRTSS__m128 _mm_sqrt_ss(__m128 a)
Computes the square root of the lower SP
FP value of a; the upper three SP FP
values are passed through.
STMXCSR_mm_getcsr(void)
Returns the contents of the control register.
SUBPS
__m128 _mm_sub_ps(__m128 a, __m128 b)
Subtracts the four SP FP values of a and b.
SUBSS
__m128 _mm_sub_ss(__m128 a, __m128 b)
Subtracts the lower SP FP values of a and
b. The upper three SP FP values are
passed through from a.
Table C-1. Simple Intrinsics
Mnemonic
Intrinsic
Description