Tuesday, 15 November 2016

multithreading - Java memory model - volatile and x86



I am trying to understand the intrinsics of java volatile and its semantics, and its transaltion to the underlying architecture and its instructions. If we consider the following blogs and resourses



fences generated for volatile, What gets generated for read/write of volatile and question on fences




here is what I gather:




  • volatile read inserts loadStore/LoadLoad barriers after it (LFENCE instruction on x86)

  • It prevents the reordering of loads with subsequent writes/loads

  • It is supposed to guarantee loading of the global state that was modified by other threads i.e. after LFENCE the state modifications done by other threads are visible to the current thread on its CPU.



WHat I am struggling to understand is this: Java does not emit LFENCE on x86
i.e. read of volatile does not cause LFENCE
.... I know that memory ordering of x86 prevent reording of loads with lods/stored, so second bullet point is taken care of. However, I would assume that in order for the state to be visible by this thread, LFENCE instruction should be issued to guarantee that all LOAD buffers are drained before the next instruction after the fence is executed (as per Intel manual). I understand there is cahce coherence protocol on x86, but volatile read should still drain any LOADs in the buffers, no?



Answer



On x86, the buffers are pinned to the cache line. If the cache line is lost, the value in the buffer isn't used. So there's no need to fence or drain the buffers; the value they contain must be current because another core can't modify the data without first invalidating the cache line.


No comments:

Post a Comment

c++ - Does curly brackets matter for empty constructor?

Those brackets declare an empty, inline constructor. In that case, with them, the constructor does exist, it merely does nothing more than t...