Investigate potential x86 varint optimization #279

danburkert · 2020-02-14T02:42:43Z

https://www.reddit.com/r/rust/comments/f36j05/comment/fhhwqp9

danburkert · 2020-02-14T17:55:04Z

see also https://github.com/gnzlbg/bitintr for safe and cross platform wrappers over the intrinsics

danburkert · 2020-11-23T20:56:22Z

https://news.ycombinator.com/item?id=25183811

danburkert · 2020-12-27T23:14:54Z

https://www.reddit.com/r/rust/comments/klck6a/i_published_my_first_crate_varintsimd/

as-com · 2021-01-02T04:52:12Z

So I did some quick and dirty prototyping with varint-simd v0.3.0, and here's what I found:

Microbenchmark varint performance is only improved for encoding and decoding larger numbers
Encoding performance generally fares better than decoding performance, depending on where I place the branch that uses varint-simd
Macrobenchmark performance is mostly a wash (tested on Coffee Lake), with some larger wins and some smaller losses

This is probably because the only encode/decode function is for single u64's, which is currently a weak point for varint-simd (it's not that much faster than other implementations when decoding/encoding tiny u64's).

I suspect there will need to be some larger-scale refactoring to take full advantage of varint-simd. For example, protobuf tags are up to 32 bits long, so a lot of cycles can be saved when encoding/decoding those.

My library also just added support for quickly decoding two, four, and eight adjacent varints in parallel (subject to size limitations), with some really good throughput figures - most of the time, protobufs will be a 32 bit tag followed by a 32 bit number or length, and decode requests can be shrunk based on how large the data field is in the .proto file. So there's likely a lot more gains to be had.

danburkert mentioned this issue Nov 9, 2020

Update bytes to 0.6 #381

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate potential x86 varint optimization #279

Investigate potential x86 varint optimization #279

danburkert commented Feb 14, 2020

danburkert commented Feb 14, 2020

danburkert commented Nov 23, 2020

danburkert commented Dec 27, 2020

as-com commented Jan 2, 2021

Investigate potential x86 varint optimization #279

Investigate potential x86 varint optimization #279

Comments

danburkert commented Feb 14, 2020

danburkert commented Feb 14, 2020

danburkert commented Nov 23, 2020

danburkert commented Dec 27, 2020

as-com commented Jan 2, 2021