I have to say that i've only read the phoronix article and not your blog post. I Should have read yours as well before claiming unfair comparison by mozilla (you).
I do now get why you compared against clang.
Good luck squeezing out the last part to be at native (clang) speeds. I do performance optimization quite often (in completely different areas) and know that the last bit towards a set goal are usually the most difficult parts to get.