Article <2024May30.145038@mips.complang.tuwien.ac.at>

Path: ...!news.nobody.at!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: anton@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Byte Addressability And Beyond
Date: Thu, 30 May 2024 12:50:38 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 38
Message-ID: <2024May30.145038@mips.complang.tuwien.ac.at>
References: <v0s17o$2okf4$2@dont-email.me> <v31c4r$3u28v$1@dont-email.me> <v327n3$1use$1@gal.iecc.com> <BM25O.40665$HBac.4762@fx15.iad> <v32lpv$1u25$1@gal.iecc.com> <v33bqg$9cst$11@dont-email.me> <v34v62$ln01$1@dont-email.me> <v36bva$10k3v$2@dont-email.me> <2024May29.090435@mips.complang.tuwien.ac.at> <v38opv$1gsj2$3@dont-email.me>
Injection-Date: Thu, 30 May 2024 15:07:01 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="46db1f935b2b8b941d470a80861909b8";
	logging-data="1799719"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX19ysXwgZ8830KBPCw0f97XJ"
Cancel-Lock: sha1:FI1V70fzUYxhZ8e/fe6m28BgMn0=
X-newsreader: xrn 10.11
Bytes: 2797

Lawrence D'Oliveiro <ldo@nz.invalid> writes:
>On Wed, 29 May 2024 07:04:35 GMT, Anton Ertl wrote:
>
>> Lawrence D'Oliveiro <ldo@nz.invalid> writes:
>>
>>>Isn’t the point of RISC that these complex operations are
>>>more efficiently performed by a sequence of simpler instructions?
>> 
>> The IBM z series are not RISCs.
>
>Doesn’t matter. The principles of designing high-performance architectures 
>still apply: simpler instructions are better than more complex ones.

Is IBM z a high-performance architecture?

In the present case, the principles of designing high-performance
architectures will tell you that you don't need these instructions.

But if we forget about that for a minute, the block-copy-style
approach of IBM's CU14 instruction means that it could use a stream
buffer internally to avoid the performance snag that I mentioned in
another posting.

However, there is a big difference between what performance features
one can imagine and what is actually implemented.  I think that's the
marketing attraction of providing some feature as an instruction: it
lets the sales victim's imagination do the marketing/selling.

Concerning reality: When I looked at block copying a while ago
(Skylake/Zen1 days), I found that my code using a loop of AVX moves
outperformed REP MOVSB (where Intel's and AMD's microcode should have
done at least as well) in many cases, and that despite Intel adding
"fast string moves" in, IIRC, Sandy Bridge.

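To make the comparison concrete, here is a minimal sketch of the two
kinds of copy routine, assuming GCC or Clang on x86-64.  It is not the
code I measured back then; the names copy_rep_movsb, copy_avx_loop and
same_result are made up for this illustration, and on other targets
both routines simply fall back to memcpy.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#if defined(__x86_64__) && (defined(__GNUC__) || defined(__clang__))
#include <immintrin.h>

/* Copy n bytes with REP MOVSB; the copy strategy is left to the CPU. */
void copy_rep_movsb(void *dst, const void *src, size_t n)
{
    __asm__ volatile("rep movsb"
                     : "+D"(dst), "+S"(src), "+c"(n)
                     :
                     : "memory");
}

/* Loop of unaligned 32-byte AVX loads and stores, then a byte loop
   for the remaining 0..31 bytes. */
__attribute__((target("avx")))
static void copy_avx_impl(unsigned char *d, const unsigned char *s, size_t n)
{
    while (n >= 32) {
        __m256i v = _mm256_loadu_si256((const __m256i *)s);
        _mm256_storeu_si256((__m256i *)d, v);
        d += 32;
        s += 32;
        n -= 32;
    }
    while (n--)
        *d++ = *s++;
}

/* Use the AVX loop only if the CPU reports AVX support. */
void copy_avx_loop(void *dst, const void *src, size_t n)
{
    if (__builtin_cpu_supports("avx"))
        copy_avx_impl(dst, src, n);
    else
        memcpy(dst, src, n);
}
#else
/* Assumption: on non-x86-64 targets there is nothing to compare. */
void copy_rep_movsb(void *dst, const void *src, size_t n) { memcpy(dst, src, n); }
void copy_avx_loop(void *dst, const void *src, size_t n) { memcpy(dst, src, n); }
#endif

enum { MAX_N = 4096 };

/* Returns 1 if both routines reproduce the first n source bytes. */
int same_result(size_t n)
{
    static unsigned char src[MAX_N], a[MAX_N], b[MAX_N];
    assert(n <= MAX_N);
    for (size_t i = 0; i < n; i++)
        src[i] = (unsigned char)(i * 31u + 7u);
    memset(a, 0, n);
    memset(b, 0, n);
    copy_rep_movsb(a, src, n);
    copy_avx_loop(b, src, n);
    return memcmp(a, src, n) == 0 && memcmp(b, src, n) == 0;
}
```

The sketch only checks that both routines copy correctly.  Which one
is faster depends on block size, alignment, and microarchitecture, so
any timing loop around them has to be run on the machine of interest.
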
- anton
-- 
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
  Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>