Deutsch English Français Italiano |
<2024May30.145038@mips.complang.tuwien.ac.at> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!news.nobody.at!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: anton@mips.complang.tuwien.ac.at (Anton Ertl) Newsgroups: comp.arch Subject: Re: Byte Addressability And Beyond Date: Thu, 30 May 2024 12:50:38 GMT Organization: Institut fuer Computersprachen, Technische Universitaet Wien Lines: 38 Message-ID: <2024May30.145038@mips.complang.tuwien.ac.at> References: <v0s17o$2okf4$2@dont-email.me> <v31c4r$3u28v$1@dont-email.me> <v327n3$1use$1@gal.iecc.com> <BM25O.40665$HBac.4762@fx15.iad> <v32lpv$1u25$1@gal.iecc.com> <v33bqg$9cst$11@dont-email.me> <v34v62$ln01$1@dont-email.me> <v36bva$10k3v$2@dont-email.me> <2024May29.090435@mips.complang.tuwien.ac.at> <v38opv$1gsj2$3@dont-email.me> Injection-Date: Thu, 30 May 2024 15:07:01 +0200 (CEST) Injection-Info: dont-email.me; posting-host="46db1f935b2b8b941d470a80861909b8"; logging-data="1799719"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19ysXwgZ8830KBPCw0f97XJ" Cancel-Lock: sha1:FI1V70fzUYxhZ8e/fe6m28BgMn0= X-newsreader: xrn 10.11 Bytes: 2797 Lawrence D'Oliveiro <ldo@nz.invalid> writes: >On Wed, 29 May 2024 07:04:35 GMT, Anton Ertl wrote: > >> Lawrence D'Oliveiro <ldo@nz.invalid> writes: >> >>>Isn’t the point of RISC that these complex operations are >>>more efficiently performed by a sequence of simpler instructions? >> >> The IBM z series are not RISCs. > >Doesn’t matter. The principles of designing high-performance architectures >still apply: simpler instructions are better than more complex ones. Is IBM z a high-performance architecture? In the present case, the principles of designing high-performance architectures will tell you that you don't need these instructions. But if we forget about that for a minute, the block-copy-style approach of IBM's CU14 instruction means that it could use a stream buffer internally to avoid the performance snag that I mentioned in another posting. However, there is a big difference between what performance features one can imagine and what is actually implemented. I think that's the marketing attraction of providing some feature as an instruction: it lets the sales victim's imagination do the marketing/selling. Concerning reality: When I looked at block copying a while ago (Skylake/Zen1 days), I found that my code using a loop of AVX moves outperformed REP MOVSB (where Intel and AMD's microcode should have done at least as well) in many cases, and that despite Intel adding "fast string moves" in IIRC Sandy Bridge. - anton -- 'Anyone trying for "industrial quality" ISA should avoid undefined behavior.' Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>