tl;dr: joiner is twice as slow as a foreach loop
A first attempt:
https://gist.github.com/wilzbach/ffd5d20639766a831fd4b1962572897a
While it's quite an improvement for LDC, it reduces the performance for DMD.
Moreover, the `alias this` trick, used to keep the changes as small as possible, adds overhead too.
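
For readers who want to see what is being compared: below is a minimal sketch of the two iteration styles from the tl;dr, flattening a range of ranges with `joiner` versus writing the nested `foreach` by hand. The function names `sumWithJoiner` and `sumWithForeach` and the sample data are made up for illustration; this is not the benchmark or the patch from the gist, and it makes no timing claims.

```d
import std.algorithm.iteration : joiner;

// Hypothetical helper: sums all elements by flattening with joiner.
long sumWithJoiner(int[][] data)
{
    long total = 0;
    foreach (x; data.joiner)
        total += x;
    return total;
}

// Hypothetical helper: sums all elements with an explicit nested loop.
long sumWithForeach(int[][] data)
{
    long total = 0;
    foreach (row; data)
        foreach (x; row)
            total += x;
    return total;
}

void main()
{
    // Sample data (made up), including an empty inner array,
    // which joiner has to skip over.
    auto data = [[1, 2, 3], [], [4, 5]];
    assert(sumWithJoiner(data) == sumWithForeach(data));
}
```

Both functions visit the same elements in the same order, so any speed difference comes from the range machinery around `joiner` (its `empty`/`front`/`popFront` bookkeeping) and how well a given compiler inlines it.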