Consider deprecating SSE3 code #260
Labels
comp-Logic
Related to internal code logic
maintainability
Simplifies further code development (standardization, robustness)
performance
Simulation speed, memory consumption
pri-Medium
Worth assigning to a milestone
sparse
Sparse (non-FFT) mode
Recent optimizations of imExp (#169) made the SSE3 code only marginally faster than the standard C99 code (built with compiler optimizations). They also showed that part of the SSE3 speedup comes from unsafe optimizations that can lead to precision loss.
Removing the SSE3 code altogether would therefore improve maintainability. However, it is worth studying where the remaining 10% speedup due to SSE3 comes from and optimizing the standard code accordingly.
/cc @jleinonen