1.7. Single Precision Intrinsics
This section describes single precision intrinsic functions that are only supported in device code. To use these functions you do not need to include any additional header files in your program.
Functions
- __device__  float __cosf ( float x )
- Calculate the fast approximate cosine of the input argument.
- __device__  float __exp10f ( float x )
- Calculate the fast approximate base 10 exponential of the input argument.
- __device__  float __expf ( float x )
- Calculate the fast approximate base exponential of the input argument.
- __device__  float __fadd_rd ( float x, float y )
- Add two floating-point values in round-down mode.
- __device__  float __fadd_rn ( float x, float y )
- Add two floating-point values in round-to-nearest-even mode.
- __device__  float __fadd_ru ( float x, float y )
- Add two floating-point values in round-up mode.
- __device__  float __fadd_rz ( float x, float y )
- Add two floating-point values in round-towards-zero mode.
- __device__  float __fdiv_rd ( float x, float y )
- Divide two floating-point values in round-down mode.
- __device__  float __fdiv_rn ( float x, float y )
- Divide two floating-point values in round-to-nearest-even mode.
- __device__  float __fdiv_ru ( float x, float y )
- Divide two floating-point values in round-up mode.
- __device__  float __fdiv_rz ( float x, float y )
- Divide two floating-point values in round-towards-zero mode.
- __device__  float __fdividef ( float x, float y )
- Calculate the fast approximate division of the input arguments.
- __device__  float __fmaf_ieee_rd ( float x, float y, float z )
- Compute fused multiply-add operation in round-down mode, ignore -ftz=true compiler flag.
- __device__  float __fmaf_ieee_rn ( float x, float y, float z )
- Compute fused multiply-add operation in round-to-nearest-even mode, ignore -ftz=true compiler flag.
- __device__  float __fmaf_ieee_ru ( float x, float y, float z )
- Compute fused multiply-add operation in round-up mode, ignore -ftz=true compiler flag.
- __device__  float __fmaf_ieee_rz ( float x, float y, float z )
- Compute fused multiply-add operation in round-towards-zero mode, ignore -ftz=true compiler flag.
- __device__  float __fmaf_rd ( float x, float y, float z )
- Compute as a single operation, in round-down mode.
- __device__  float __fmaf_rn ( float x, float y, float z )
- Compute as a single operation, in round-to-nearest-even mode.
- __device__  float __fmaf_ru ( float x, float y, float z )
- Compute as a single operation, in round-up mode.
- __device__  float __fmaf_rz ( float x, float y, float z )
- Compute as a single operation, in round-towards-zero mode.
- __device__  float __fmul_rd ( float x, float y )
- Multiply two floating-point values in round-down mode.
- __device__  float __fmul_rn ( float x, float y )
- Multiply two floating-point values in round-to-nearest-even mode.
- __device__  float __fmul_ru ( float x, float y )
- Multiply two floating-point values in round-up mode.
- __device__  float __fmul_rz ( float x, float y )
- Multiply two floating-point values in round-towards-zero mode.
- __device__  float __frcp_rd ( float x )
- Compute in round-down mode.
- __device__  float __frcp_rn ( float x )
- Compute in round-to-nearest-even mode.
- __device__  float __frcp_ru ( float x )
- Compute in round-up mode.
- __device__  float __frcp_rz ( float x )
- Compute in round-towards-zero mode.
- __device__  float __frsqrt_rn ( float x )
- Compute in round-to-nearest-even mode.
- __device__  float __fsqrt_rd ( float x )
- Compute in round-down mode.
- __device__  float __fsqrt_rn ( float x )
- Compute in round-to-nearest-even mode.
- __device__  float __fsqrt_ru ( float x )
- Compute in round-up mode.
- __device__  float __fsqrt_rz ( float x )
- Compute in round-towards-zero mode.
- __device__  float __fsub_rd ( float x, float y )
- Subtract two floating-point values in round-down mode.
- __device__  float __fsub_rn ( float x, float y )
- Subtract two floating-point values in round-to-nearest-even mode.
- __device__  float __fsub_ru ( float x, float y )
- Subtract two floating-point values in round-up mode.
- __device__  float __fsub_rz ( float x, float y )
- Subtract two floating-point values in round-towards-zero mode.
- __device__  float __log10f ( float x )
- Calculate the fast approximate base 10 logarithm of the input argument.
- __device__  float __log2f ( float x )
- Calculate the fast approximate base 2 logarithm of the input argument.
- __device__  float __logf ( float x )
- Calculate the fast approximate base logarithm of the input argument.
- __device__  float __powf ( float x, float y )
- Calculate the fast approximate of .
- __device__  float __saturatef ( float x )
- Clamp the input argument to [+0.0, 1.0].
- __device__  void __sincosf ( float x, float* sptr, float* cptr )
- Calculate the fast approximate of sine and cosine of the first input argument.
- __device__  float __sinf ( float x )
- Calculate the fast approximate sine of the input argument.
- __device__  float __tanf ( float x )
- Calculate the fast approximate tangent of the input argument.
Functions
- __device__  float __cosf ( float x )
- 
                           Calculate the fast approximate cosine of the input argument.ReturnsReturns the approximate cosine of x. DescriptionCalculate the fast approximate cosine of the input argument x, measured in radians. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Input and output in the denormal range is flushed to sign preserving 0.0. 
 
- 
                                          
- __device__  float __exp10f ( float x )
- 
                           Calculate the fast approximate base 10 exponential of the input argument.ReturnsReturns an approximation to . DescriptionCalculate the fast approximate base 10 exponential of the input argument x, . Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Most input and output values around denormal range are flushed to sign preserving 0.0. 
 
- 
                                          
- __device__  float __expf ( float x )
- 
                           Calculate the fast approximate base exponential of the input argument.ReturnsReturns an approximation to . DescriptionCalculate the fast approximate base exponential of the input argument x, . Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Most input and output values around denormal range are flushed to sign preserving 0.0. 
 
- 
                                          
- __device__  float __fadd_rd ( float x, float y )
- 
                           Add two floating-point values in round-down mode.ReturnsReturns x + y. DescriptionCompute the sum of x and y in round-down (to negative infinity) mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fadd_rn ( float x, float y )
- 
                           Add two floating-point values in round-to-nearest-even mode.ReturnsReturns x + y. DescriptionCompute the sum of x and y in round-to-nearest-even rounding mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fadd_ru ( float x, float y )
- 
                           Add two floating-point values in round-up mode.ReturnsReturns x + y. DescriptionCompute the sum of x and y in round-up (to positive infinity) mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fadd_rz ( float x, float y )
- 
                           Add two floating-point values in round-towards-zero mode.ReturnsReturns x + y. DescriptionCompute the sum of x and y in round-towards-zero mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fdiv_rd ( float x, float y )
- 
                           Divide two floating-point values in round-down mode.ReturnsReturns x / y. DescriptionDivide two floating-point values x by y in round-down (to negative infinity) mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fdiv_rn ( float x, float y )
- 
                           Divide two floating-point values in round-to-nearest-even mode.ReturnsReturns x / y. DescriptionDivide two floating-point values x by y in round-to-nearest-even mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fdiv_ru ( float x, float y )
- 
                           Divide two floating-point values in round-up mode.ReturnsReturns x / y. DescriptionDivide two floating-point values x by y in round-up (to positive infinity) mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fdiv_rz ( float x, float y )
- 
                           Divide two floating-point values in round-towards-zero mode.ReturnsReturns x / y. DescriptionDivide two floating-point values x by y in round-towards-zero mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fdividef ( float x, float y )
- 
                           Calculate the fast approximate division of the input arguments.ReturnsReturns x / y. - __fdividef( , y) returns NaN for .
- __fdividef(x, y) returns 0 for and finite .
 DescriptionCalculate the fast approximate division of x by y. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- __device__  float __fmaf_ieee_rd ( float x, float y, float z )
- 
                           Compute fused multiply-add operation in round-down mode, ignore -ftz=true compiler flag.DescriptionBehavior is the same as __fmaf_rd(x, y, z), the difference is in handling denormalized inputs and outputs: -ftz compiler flag has no effect. 
- __device__  float __fmaf_ieee_rn ( float x, float y, float z )
- 
                           Compute fused multiply-add operation in round-to-nearest-even mode, ignore -ftz=true compiler flag.DescriptionBehavior is the same as __fmaf_rn(x, y, z), the difference is in handling denormalized inputs and outputs: -ftz compiler flag has no effect. 
- __device__  float __fmaf_ieee_ru ( float x, float y, float z )
- 
                           Compute fused multiply-add operation in round-up mode, ignore -ftz=true compiler flag.DescriptionBehavior is the same as __fmaf_ru(x, y, z), the difference is in handling denormalized inputs and outputs: -ftz compiler flag has no effect. 
- __device__  float __fmaf_ieee_rz ( float x, float y, float z )
- 
                           Compute fused multiply-add operation in round-towards-zero mode, ignore -ftz=true compiler flag.DescriptionBehavior is the same as __fmaf_rz(x, y, z), the difference is in handling denormalized inputs and outputs: -ftz compiler flag has no effect. 
- __device__  float __fmaf_rd ( float x, float y, float z )
- 
                           Compute as a single operation, in round-down mode.ReturnsReturns the rounded value of as a single operation. - fmaf( , , z) returns NaN.
- fmaf( , , z) returns NaN.
- fmaf(x, y, ) returns NaN if is an exact .
- fmaf(x, y, ) returns NaN if is an exact .
 DescriptionComputes the value of as a single ternary operation, rounding the result once in round-down (to negative infinity) mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fmaf_rn ( float x, float y, float z )
- 
                           Compute as a single operation, in round-to-nearest-even mode.ReturnsReturns the rounded value of as a single operation. - fmaf( , , z) returns NaN.
- fmaf( , , z) returns NaN.
- fmaf(x, y, ) returns NaN if is an exact .
- fmaf(x, y, ) returns NaN if is an exact .
 DescriptionComputes the value of as a single ternary operation, rounding the result once in round-to-nearest-even mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fmaf_ru ( float x, float y, float z )
- 
                           Compute as a single operation, in round-up mode.ReturnsReturns the rounded value of as a single operation. - fmaf( , , z) returns NaN.
- fmaf( , , z) returns NaN.
- fmaf(x, y, ) returns NaN if is an exact .
- fmaf(x, y, ) returns NaN if is an exact .
 DescriptionComputes the value of as a single ternary operation, rounding the result once in round-up (to positive infinity) mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fmaf_rz ( float x, float y, float z )
- 
                           Compute as a single operation, in round-towards-zero mode.ReturnsReturns the rounded value of as a single operation. - fmaf( , , z) returns NaN.
- fmaf( , , z) returns NaN.
- fmaf(x, y, ) returns NaN if is an exact .
- fmaf(x, y, ) returns NaN if is an exact .
 DescriptionComputes the value of as a single ternary operation, rounding the result once in round-towards-zero mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fmul_rd ( float x, float y )
- 
                           Multiply two floating-point values in round-down mode.ReturnsReturns x * y. DescriptionCompute the product of x and y in round-down (to negative infinity) mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fmul_rn ( float x, float y )
- 
                           Multiply two floating-point values in round-to-nearest-even mode.ReturnsReturns x * y. DescriptionCompute the product of x and y in round-to-nearest-even mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fmul_ru ( float x, float y )
- 
                           Multiply two floating-point values in round-up mode.ReturnsReturns x * y. DescriptionCompute the product of x and y in round-up (to positive infinity) mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fmul_rz ( float x, float y )
- 
                           Multiply two floating-point values in round-towards-zero mode.ReturnsReturns x * y. DescriptionCompute the product of x and y in round-towards-zero mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __frcp_rd ( float x )
- 
                           Compute in round-down mode.ReturnsReturns . DescriptionCompute the reciprocal of x in round-down (to negative infinity) mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __frcp_rn ( float x )
- 
                           Compute in round-to-nearest-even mode.ReturnsReturns . DescriptionCompute the reciprocal of x in round-to-nearest-even mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __frcp_ru ( float x )
- 
                           Compute in round-up mode.ReturnsReturns . DescriptionCompute the reciprocal of x in round-up (to positive infinity) mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __frcp_rz ( float x )
- 
                           Compute in round-towards-zero mode.ReturnsReturns . DescriptionCompute the reciprocal of x in round-towards-zero mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __frsqrt_rn ( float x )
- 
                           Compute in round-to-nearest-even mode.ReturnsReturns . DescriptionCompute the reciprocal square root of x in round-to-nearest-even mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fsqrt_rd ( float x )
- 
                           Compute in round-down mode.ReturnsReturns . DescriptionCompute the square root of x in round-down (to negative infinity) mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fsqrt_rn ( float x )
- 
                           Compute in round-to-nearest-even mode.ReturnsReturns . DescriptionCompute the square root of x in round-to-nearest-even mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fsqrt_ru ( float x )
- 
                           Compute in round-up mode.ReturnsReturns . DescriptionCompute the square root of x in round-up (to positive infinity) mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fsqrt_rz ( float x )
- 
                           Compute in round-towards-zero mode.ReturnsReturns . DescriptionCompute the square root of x in round-towards-zero mode. Note:For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- __device__  float __fsub_rd ( float x, float y )
- 
                           Subtract two floating-point values in round-down mode.ReturnsReturns x - y. DescriptionCompute the difference of x and y in round-down (to negative infinity) mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fsub_rn ( float x, float y )
- 
                           Subtract two floating-point values in round-to-nearest-even mode.ReturnsReturns x - y. DescriptionCompute the difference of x and y in round-to-nearest-even rounding mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fsub_ru ( float x, float y )
- 
                           Subtract two floating-point values in round-up mode.ReturnsReturns x - y. DescriptionCompute the difference of x and y in round-up (to positive infinity) mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __fsub_rz ( float x, float y )
- 
                           Subtract two floating-point values in round-towards-zero mode.ReturnsReturns x - y. DescriptionCompute the difference of x and y in round-towards-zero mode. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.1, Table 6. 
- 
                                          This operation will never be merged into a single multiply-add instruction. 
 
- 
                                          
- __device__  float __log10f ( float x )
- 
                           Calculate the fast approximate base 10 logarithm of the input argument.ReturnsReturns an approximation to . DescriptionCalculate the fast approximate base 10 logarithm of the input argument x. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Most input and output values around denormal range are flushed to sign preserving 0.0. 
 
- 
                                          
- __device__  float __log2f ( float x )
- 
                           Calculate the fast approximate base 2 logarithm of the input argument.ReturnsReturns an approximation to . DescriptionCalculate the fast approximate base 2 logarithm of the input argument x. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Input and output in the denormal range is flushed to sign preserving 0.0. 
 
- 
                                          
- __device__  float __logf ( float x )
- 
                           Calculate the fast approximate base logarithm of the input argument.ReturnsReturns an approximation to . DescriptionCalculate the fast approximate base logarithm of the input argument x. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Most input and output values around denormal range are flushed to sign preserving 0.0. 
 
- 
                                          
- __device__  float __powf ( float x, float y )
- 
                           Calculate the fast approximate of .ReturnsReturns an approximation to . DescriptionCalculate the fast approximate of x, the first input argument, raised to the power of y, the second input argument, . Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Most input and output values around denormal range are flushed to sign preserving 0.0. 
 
- 
                                          
- __device__  float __saturatef ( float x )
- 
                           Clamp the input argument to [+0.0, 1.0].Returns- __saturatef(x) returns 0 if x < 0.
- __saturatef(x) returns 1 if x > 1.
- __saturatef(x) returns x if .
- __saturatef(NaN) returns 0.
 DescriptionClamp the input argument x to be within the interval [+0.0, 1.0]. 
- __device__  void __sincosf ( float x, float* sptr, float* cptr )
- 
                           Calculate the fast approximate of sine and cosine of the first input argument.Returns- none
 DescriptionCalculate the fast approximate of sine and cosine of the first input argument x (measured in radians). The results for sine and cosine are written into the second argument, sptr, and, respectively, third argument, cptr. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Denorm input/output is flushed to sign preserving 0.0. 
 
- __device__  float __sinf ( float x )
- 
                           Calculate the fast approximate sine of the input argument.ReturnsReturns the approximate sine of x. DescriptionCalculate the fast approximate sine of the input argument x, measured in radians. Note:- 
                                          For accuracy information for this function see the CUDA C++ Programming Guide, Appendix E.2, Table 9. 
- 
                                          Input and output in the denormal range is flushed to sign preserving 0.0. 
 
- 
                                          
- __device__  float __tanf ( float x )
- 
                           Calculate the fast approximate tangent of the input argument.ReturnsReturns the approximate tangent of x. DescriptionCalculate the fast approximate tangent of the input argument x, measured in radians.