I wondered if this image below is from the Anandtech review of the processor helps clear up some of the confusion with how the processor can be optimised. The issue seems to be getting 'turbo core' enabled and directing threads with shared data to the same core.
Is the situation the same under Linux?
Coupled with this it should interesting. Reassigning thread - core priorities?