Deutsch   English   Français   Italiano  
<vkamr2$11jv0$3@dont-email.me>

View for Bookmarking (what is this?)
Look up another Usenet article

Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: "Chris M. Thomasson" <chris.m.thomasson.1@gmail.com>
Newsgroups: comp.arch
Subject: Re: Strange asm generated by GCC...
Date: Sun, 22 Dec 2024 19:53:07 -0800
Organization: A noiseless patient Spider
Lines: 60
Message-ID: <vkamr2$11jv0$3@dont-email.me>
References: <vk2ek2$33ckv$3@dont-email.me> <vk4ahk$3i1bs$1@dont-email.me>
 <At6cnfsh1ZzpB_v6nZ2dnZfqn_adnZ2d@supernews.com>
 <vkamjt$11jv0$2@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Mon, 23 Dec 2024 04:53:07 +0100 (CET)
Injection-Info: dont-email.me; posting-host="f0e44193300486c78d7a7f6e740e8e90";
	logging-data="1101792"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX1+Liw4iSWbZKbDjs8lXPvINn7N97yZ0quQ="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:GjBE4qIsa5VAgp1RLATWThhbTFE=
In-Reply-To: <vkamjt$11jv0$2@dont-email.me>
Content-Language: en-US
Bytes: 3475

On 12/22/2024 7:49 PM, Chris M. Thomasson wrote:
> On 12/21/2024 2:37 AM, aph@littlepinkcloud.invalid wrote:
>> jseigh <jseigh_es00@xemaps.com> wrote:
>>> On 12/19/24 19:43, Chris M. Thomasson wrote:
>>>> Why in the world would GCC use an XCHG instruction for the following
>>>> code. The damn XCHG has an implied LOCK prefix! Yikes!
>>>>
>>>
>>> Speaking of strange code
>>>
>>> #include <atomic>
>>>
>>> bool test1(std::atomic<int> var, int addend)
>>> {
>>>      int expected = var.load(std::memory_order_relaxed);
>>>      int update = expected + addend;
>>>      return var.compare_exchange_weak(expected, update,
>>> std::memory_order_acq_rel, std::memory_order_seq_cst);
>>> }
>>>
>>> This is asm for armv8-a clang 9.0.0
>>>
>>> test1(std::atomic<int>, int):
>>>          ldr     w8, [x0]
>>>          ldaxr   w9, [x0]
>>>          cmp     w9, w8
>>>          b.ne    .LBB0_3
>>>          add     w8, w8, w1
>>>          stlxr   w9, w8, [x0]
>>>          cbz     w9, .LBB0_4
>>>          mov     w0, wzr
>>>          ret
>>> .LBB0_3:
>>>          clrex
>>>          mov     w0, wzr
>>>          ret
>>> .LBB0_4:
>>>          mov     w0, #1
>>>          ret
>>>
>>> I picked a version that just did ll/sc to avoid
>>> the question of whether a failed CASAL did a store or not.
>>>
>>> I don't see anything that forces a store memory barrier
>>> on all the fail paths.  I could be missing something.
>>
>> Why would there be one? If the store does not take place, there's no
>> need for a memory barrier because there's no store for anyone to
>> synchronize with. The only effect of a failed weak CAS is a load. If
>> you really need a store on failure because of its side effect you can
>> always add one.
> 
> Iirc, the membars for the success and failure can be "useful" for 
> popping from a lock-free stack. Wrt the C++ API the CAS can give you the 
> updated value on a failure. So, there is a load. Depending on what you 
> are doing, it might require an acquire.

Loading the head of the lock-free stack would be an acquire at the start 
of the CAS loop. The CAS can use relaxed for the success and an acquire 
for the failure.