Path: ...!2.eu.feeder.erje.net!feeder.erje.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: "Chris M. Thomasson" Newsgroups: comp.arch Subject: Re: Is Intel exceptionally unsuccessful as an architecture designer? Date: Sat, 21 Sep 2024 20:58:34 -0700 Organization: A noiseless patient Spider Lines: 40 Message-ID: References: <21028ed32d20f0eea9a754fafdb64e45@www.novabbs.org> <20240918190027.00003e4e@yahoo.com> <920c561c4e39e91d3730b6aab103459b@www.novabbs.org> <%dAHO.54667$S9Vb.39628@fx45.iad> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit Injection-Date: Sun, 22 Sep 2024 05:58:35 +0200 (CEST) Injection-Info: dont-email.me; posting-host="0029950ff4e92ba21a7d99fa35b943c5"; logging-data="2144150"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18xPju2Q6RQMdOaypd9dbBrhodcGsMKhUs=" User-Agent: Mozilla Thunderbird Cancel-Lock: sha1:SCL7RNWqkIA7z6LoAesfVsbxRvA= In-Reply-To: Content-Language: en-US Bytes: 3419 On 9/21/2024 3:49 PM, jseigh wrote: > On 9/21/24 16:45, MitchAlsup1 wrote: >> On Sat, 21 Sep 2024 20:26:13 +0000, Chris M. Thomasson wrote: >> >>> On 9/21/2024 6:54 AM, Scott Lurndal wrote: >>>> mitchalsup@aol.com (MitchAlsup1) writes: >>>> https://www.marvell.com/products/cxl.html >>> >>> What about a weak coherency where a programmer has to use the correct >>> membars to get the coherency required for their specific needs? Along >>> the lines of UltraSPARC in RMO mode? >> >> In my case, I suffered through enough of these to implement a >> memory hierarchy free from the need of any MemBars yet provide >> the performance of relaxed memory order, except when >> certain kinds of addresses are touched {MMI/O, configuration >> space, ATOMIC accesses,...} In these cases, the core becomes >> {sequentially consistent, or strongly ordered} depending on the >> touched address. > > Well, we have asymmetric memory barriers now (membarrier() in linux) > so we can get rid of memory barriers in some cases.  For hazard > pointers which used to be a (load, store, mb, load) are now just > a (load, store, load).  Much faster,  from 8.02 nsecs to 0.79 nsecs. > So much so that other things which has heretofore been considered > to add negligible overhead are not so much by comparison.  Which can > be a little annoying because some like using those a lot. Getting rid of a #StoreLoad memory barrier is very nice, indeed. This actually helps x86. Well, "original" SMR (aka, hazard pointers) on x86 needs a membar. MFENCE or a locked rmw. WRT SPARC in RMO mode, well, iirc, it (SMR) needed a MEMBAR #StoreLoad | #LoadStore | #LoadLoad | #StoreStore not just a #StoreLoad | #LoadStore ?