Deutsch   English   Français   Italiano  
<2024Oct12.102737@mips.complang.tuwien.ac.at>

View for Bookmarking (what is this?)
Look up another Usenet article

Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: anton@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Is Intel exceptionally unsuccessful as an architecture designer?
Date: Sat, 12 Oct 2024 08:27:37 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 62
Message-ID: <2024Oct12.102737@mips.complang.tuwien.ac.at>
References: <memo.20240913205156.19028s@jgd.cix.co.uk> <d3b9fc944f708546e4fbe5909c748ba3@www.novabbs.org> <%dAHO.54667$S9Vb.39628@fx45.iad> <vcna56$1nlod$2@dont-email.me> <a7708487530552a53732070fe08d9458@www.novabbs.org> <vcprkv$2asrd$1@dont-email.me> <e2c993172c11a221c4dcb9973f9cdb86@www.novabbs.org> <vcqe6f$2d8oa$1@dont-email.me> <4f84910a01d7db353eedadd7c471d7d3@www.novabbs.org> <20240923105336.0000119b@yahoo.com> <6577e60bd63883d1a7bd51c717531f38@www.novabbs.org> <20240924124944.00006d86@yahoo.com>
Injection-Date: Sat, 12 Oct 2024 10:44:24 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="29d5d06e203f002c66a9502ffb11d396";
	logging-data="109941"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX1/epDyKC7mqdryt2VZlXDTe"
Cancel-Lock: sha1:4CjUaWHT5FIu7FJimX8Z3FphnX0=
X-newsreader: xrn 10.11
Bytes: 4566

Michael S <already5chosen@yahoo.com> writes:
>Even if 99% is correct, there were still 6-7 figures worth of
>dual-processor x86 systems sold each year and starting from 1997 at
>least tens of thousands of quads. 
>Absence of ordering definitions should have been a problem for a lot of
>people. But somehow, it was not.

I remember Andy Glew posting here about the strong ordering that Intel
had at the time, and that it leads to superior performance compared to
weak ordering.

>mitchalsup@aol.com (MitchAlsup1) wrote:
>> Also note: this was just after the execution pipeline went
>> Great Big Our of Order, and thus made the lack of order
>> problems much more visible to applications. {Pentium Pro}

Nonsense.  Stores are published in architectural order, and loads have
to be architecturally ordered wrt local stores already in a
single-core system.  And once you have that, why should the ordering
wrt. remote stores be any worse than on an in-order machine?

Note that the weak ordering advocacy (such as [adve&gharachorloo95)
arose in companies with (at the time) only in-order CPUs.

Actually OoO technology offers a way to make the ordering strong
without having to pay for barriers and somesuch; we may not yet have
enough buffers for implementing sequential consistency efficiently,
though, but maybe if we ask for sequential consistency, hardware
designers will find a way to provide enough buffers for that.

@TechReport{adve&gharachorloo95,
  author =       {Sarita V. Adve and Kourosh Gharachorloo},
  title =        {Shared Memory Consistency Models: A Tutorial},
  institution =  {Digital Western Research Lab},
  year =         {1995},
  type =         {WRL Research Report},
  number =       {95/7},
  annote =       {Gives an overview of architectural features of
                  shared-memory computers such as independent memory
                  banks and per-CPU caches, and how they make the (for
                  programmers) most natural consistency model hard to
                  implement, giving examples of programs that can fail
                  with weaker consistency models.  It then discusses
                  several categories of weaker consistency models and
                  actual consistency models in these categories, and
                  which ``safety net'' (e.g., memory barrier
                  instructions) programmers need to use to work around
                  the deficiencies of these models.  While the authors
                  recognize that programmers find it difficult to use
                  these safety nets correctly and efficiently, it
                  still advocates weaker consistency models, claiming
                  that sequential consistency is too inefficient, by
                  outlining an inefficient implementation (which is of
                  course no proof that no efficient implementation
                  exists).  Still the paper is a good introduction to
                  the issues involved.}
}

- anton
-- 
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
  Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>