A while back, I wrote a UTF-8 decoder in Common Lisp, targeting SBCL (it already...

whartung · on July 6, 2023

What's some examples of the code changes that you made? And did you just do repeated disassemblies of the functions to see that it was using the correct instructions, or did you do some benchmarking to show your changes were actual improvements?

aidenn0 · on July 6, 2023

Gosh, I'd have to see if I can dig it up this was a few years ago.

I did all of the above, plus profiling (sb-sprof combined with disassemble will show assembly level profiling).

moonchild · on July 6, 2023

Branches are prone to be faster than conditional moves if they are correctly predicted, because they do not increase the critical path length. And utf-8 decoders are commonly run on all-ascii input. What were you benchmarking on?

aidenn0 · on July 6, 2023

I ran separate benchmarks on all-ASCII, BMP-only, and ascii with non-BMP. ASCII was not slower on the low-branch version.