libsimdpp
0.9.3
|
Functions | |
template<unsigned id> | |
basic_int8x16 | simdpp::insert (basic_int8x16 a, uint8_t x) |
Inserts an element into int8x16 vector at the position identified by id. More... | |
template<unsigned id> | |
basic_int16x8 | simdpp::insert (basic_int16x8 a, uint16_t x) |
Inserts an element into int16x8 vector at the position identified by id. More... | |
template<unsigned id> | |
basic_int32x4 | simdpp::insert (basic_int32x4 a, uint32_t x) |
Inserts an element into int32x4 vector at the position identified by id. More... | |
template<unsigned id> | |
basic_int64x2 | simdpp::insert (basic_int64x2 a, uint64_t x) |
Inserts an element into int64x2 vector at the position identified by id. More... | |
template<unsigned id> | |
float32x4 | simdpp::insert (float32x4 a, float x) |
Inserts an element into float32x4 vector at the position identified by id. More... | |
template<unsigned id> | |
float64x2 | simdpp::insert (float64x2 a, double x) |
Inserts an element into float64x2 vector at the position identified by id. More... | |
template<unsigned id> | |
uint8_t | simdpp::extract (basic_int8x16 a) |
Extracts the id-th element from int8x16 vector. More... | |
template<unsigned id> | |
int8_t | simdpp::extract (int8x16 a) |
Extracts the id-th element from int8x16 vector. More... | |
int256 | simdpp::combine (int128 a, int128 b) |
Combines two 128-bit vectors into a 256-bit vector. More... | |
float32x8 | simdpp::combine (float32x4 a, float32x4 b) |
Combines two 128-bit vectors into a 256-bit vector. More... | |
float64x4 | simdpp::combine (float64x2 a, float64x2 b) |
Combines two 128-bit vectors into a 256-bit vector. More... | |
Detailed Description
Function Documentation
|
inline |
Combines two 128-bit vectors into a 256-bit vector.
- In AVX2 this intrinsic results in at least 1 instructions.
- In SSE2-AVX, NEON and ALTIVEC this intrinsic results in at least 0 instructions.
|
inline |
Combines two 128-bit vectors into a 256-bit vector.
- In AVX2 this intrinsic results in at least 1 instructions.
- In SSE2-AVX, NEON and ALTIVEC this intrinsic results in at least 0 instructions.
|
inline |
Combines two 128-bit vectors into a 256-bit vector.
- In AVX2 this intrinsic results in at least 1 instructions.
- In SSE2-AVX, NEON and ALTIVEC this intrinsic results in at least 0 instructions.
uint8_t simdpp::extract | ( | basic_int8x16 | a | ) |
Extracts the id-th element from int8x16 vector.
This function may have very high latency.
- In SSE2-SSSE3 this intrinsic results in at least 1-2 instructions.
- In SSE4.1-AVX this intrinsic results in at least 1 instructions.
- In ALTIVEC this intrinsic results in at least 2 instructions.
int8_t simdpp::extract | ( | int8x16 | a | ) |
Extracts the id-th element from int8x16 vector.
This function may have very high latency.
- In SSE2-SSSE3 this intrinsic results in at least 1-2 instructions.
- In SSE4.1-AVX this intrinsic results in at least 1 instructions.
- In ALTIVEC this intrinsic results in at least 2 instructions.
basic_int8x16 simdpp::insert | ( | basic_int8x16 | a, |
uint8_t | x | ||
) |
Inserts an element into int8x16 vector at the position identified by id.
This function may have very high latency.
- In SSE2-SSSE3 this intrinsic results in at least 4-5 instructions.
- In ALTIVEC this intrinsic results in at least 3 instructions.
basic_int16x8 simdpp::insert | ( | basic_int16x8 | a, |
uint16_t | x | ||
) |
Inserts an element into int16x8 vector at the position identified by id.
This function may have very high latency.
- In ALTIVEC this intrinsic results in at least 3 instructions.
basic_int32x4 simdpp::insert | ( | basic_int32x4 | a, |
uint32_t | x | ||
) |
Inserts an element into int32x4 vector at the position identified by id.
This function may have very high latency.
- In SSE2-SSSE3 this intrinsic results in at least 4 instructions.
- In ALTIVEC this intrinsic results in at least 3 instructions.
basic_int64x2 simdpp::insert | ( | basic_int64x2 | a, |
uint64_t | x | ||
) |
Inserts an element into int64x2 vector at the position identified by id.
This function may have very high latency.
- In SSE2, SSE3 and SSSE3 this intrinsic results in at least 2 instructions.
- In SSE4_1 this intrinsic results in at least 1 instructions.
- In SSE2_32bit, SSE3_32bit and SSSE3_32bit this intrinsic results in at least 4 instructions.
- In SSE4_1_32bit this intrinsic results in at least 2 instructions.
- In ALTIVEC this intrinsic results in at least 3 instructions.
float32x4 simdpp::insert | ( | float32x4 | a, |
float | x | ||
) |
Inserts an element into float32x4 vector at the position identified by id.
This function may have very high latency.
- In SSE2-SSSE3 this intrinsic results in at least 4 instructions.
- In ALTIVEC this intrinsic results in at least 3 instructions.
float64x2 simdpp::insert | ( | float64x2 | a, |
double | x | ||
) |
Inserts an element into float64x2 vector at the position identified by id.
This function potentially
This function may have very high latency.
- In SSE2-SSSE3 this intrinsic results in at least 2 instructions.
- In ALTIVEC this intrinsic results in at least 3 instructions.
Generated on Thu Oct 31 2013 04:08:50 for libsimdpp by 1.8.3.1