I have now managed to create a working microbenchmark of my vector-normalization program, which shows the difference between the non-SSE (x87 FPU) and SSE hardware sqrt implementations. The x87 FPU on x86 systems seems to handle IEEE 754 special values such as infinity very slowly, whereas the SSE unit processes them at full speed. This matters for me because I have been using INFINITY to represent undefined values in my code.
Results of the benchmark (in MFLOPS, higher is better):
- normal values, no SSE: 237
- normal values, SSE: 280
- INFINITY, no SSE: 12
- INFINITY, SSE: 1380