Releases: NVIDIA-RTX/MathLib
v11
Fixed v10:
- fixed compilation on ARM
- fixed conversion bugs introduced in v10
- removed float2_to_float16_t2, float4_to_float16_t4, float16_t2_to_float2 and float16_t4_to_float4, use "type cast" instead
- allowed ML_WINDOW_ORIGIN_OGL customization in ml.hlsli
- marked unused variables (PR #12)
- improved README
v10
Small float support for DL/ML (no breaking changes):
- fixed and significantly accelerated "to/from" scalar conversions
- implemented accelerated path for 4-component vectors (3x-4x faster than scalar, SSE4 required)
- implemented accelerated path for 8-component vectors (4x-6x faster than scalar, AVX2 required)
- added emulation for some missing AVX2 intrinsics
- added new types float8_e4m3_t[2, 4, 8], float8_e5m2_t[2, 4, 8] and float16_t8
- added conversion operators to convert to/from a small float
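
For readers unfamiliar with the 8-bit formats named above, the following is a minimal scalar sketch of how an E4M3 or E5M2 byte could be decoded to a 32-bit float. It is not MathLib code: `DecodeSmallFloat`, `DecodeE4M3` and `DecodeE5M2` are hypothetical helper names, and the bit layouts are assumed to follow the common OCP FP8 conventions (E4M3 with bias 7, no infinities and a single NaN pattern per sign; E5M2 with bias 15 and IEEE-style Inf/NaN). The release notes do not state which conventions the library uses, so treat this only as an illustration of the idea.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Hypothetical helper (not part of MathLib): decodes an 8-bit float laid out as
// [1 sign bit | expBits exponent bits | manBits mantissa bits].
// ieeeSpecials selects how an all-ones exponent is interpreted (assumption:
// OCP FP8 conventions, see the lead-in).
inline float DecodeSmallFloat(uint8_t bits, int expBits, int manBits, bool ieeeSpecials)
{
    const int bias = (1 << (expBits - 1)) - 1;
    const int expMask = (1 << expBits) - 1;
    const int manMask = (1 << manBits) - 1;

    const float sign = (bits & 0x80) ? -1.0f : 1.0f;
    const int e = (bits >> manBits) & expMask;
    const int m = bits & manMask;

    if (e == expMask)
    {
        if (ieeeSpecials)
        {
            // E5M2-style: all-ones exponent encodes Inf (m == 0) or NaN
            return m == 0 ? sign * std::numeric_limits<float>::infinity()
                          : std::numeric_limits<float>::quiet_NaN();
        }

        // E4M3-style: only the all-ones mantissa is NaN, other codes are finite
        if (m == manMask)
            return std::numeric_limits<float>::quiet_NaN();
    }

    // Zero or subnormal: no implicit leading 1
    if (e == 0)
        return sign * std::ldexp(float(m), 1 - bias - manBits);

    // Normal value: restore the implicit leading 1 and apply the exponent
    return sign * std::ldexp(float(m + (1 << manBits)), e - bias - manBits);
}

inline float DecodeE4M3(uint8_t bits) { return DecodeSmallFloat(bits, 4, 3, false); }
inline float DecodeE5M2(uint8_t bits) { return DecodeSmallFloat(bits, 5, 2, true); }
```

Under these assumptions, `DecodeE4M3(0x38)` and `DecodeE5M2(0x3C)` both yield 1.0, while the largest finite values are 448 for E4M3 (`0x7E`) and 57344 for E5M2 (`0x7B`). The vector paths mentioned in the notes would process 4 or 8 such values at once with SIMD instructions rather than one at a time.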