Deutsch   English   Français   Italiano  
<2024Aug12.131045@mips.complang.tuwien.ac.at>

View for Bookmarking (what is this?)
Look up another Usenet article

Path: ...!weretis.net!feeder8.news.weretis.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: anton@mips.complang.tuwien.ac.at (Anton Ertl)
Newsgroups: comp.arch
Subject: Re: Instruction Tracing
Date: Mon, 12 Aug 2024 11:10:45 GMT
Organization: Institut fuer Computersprachen, Technische Universitaet Wien
Lines: 36
Message-ID: <2024Aug12.131045@mips.complang.tuwien.ac.at>
References: <v970s3$flpo$1@dont-email.me> <2024Aug10.121802@mips.complang.tuwien.ac.at> <v995pm$1cni$2@gal.iecc.com> <2024Aug11.164438@mips.complang.tuwien.ac.at> <v9bg6n$2u0ud$2@dont-email.me> <v9c912$35the$1@dont-email.me>
Injection-Date: Mon, 12 Aug 2024 13:31:10 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="64d6ff755a5474c66a4e598c60741fd5";
	logging-data="3424105"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX18Q2AGCHHDAkExHObrdarfE"
Cancel-Lock: sha1:qDwBX89AmdGjrOtBsyxLmqCNYlI=
X-newsreader: xrn 10.11
Bytes: 2436

Terje Mathisen <terje.mathisen@tmsw.no> writes:
>Lawrence D'Oliveiro wrote:
>> On Sun, 11 Aug 2024 14:44:38 GMT, Anton Ertl wrote:
>>=20
>>> Power (IIRC) and Alpha don't have delayed branches.
>>=20
>> Not only does POWER not have delayed branches, but I recall the IBM fol=
>ks
>> claiming in the initial publicity that branches could often execute in
>> zero clock cycles--that is, fully overlapped with surrounding
>> instructions.
>
>Afair, the original POWER had 3 chips, with branches in a separate unit=20
>from integer/logic ops, right?

Looking at <https://en.wikipedia.org/wiki/POWER1>, the RIOS-1
configuration has 9 chips: ICU, FPU, FXU (integer unit), SCU (storage
control), 4xDCU (data cache), I/O Unit.  The RIOS-9 configuration has
only 2 DCUs (7 chips total).

>It also had multiple (8?) sets of compare result flags in order to avoid =
>making them a speed limiter.

I wonder how much that is used.  There is only one carry bit.

>Yeah, the R part was intended to make latency a single cycle for _most_=20
>instructions.

It was mainly meant to increase the *throughput* to one instruction
per cycle; that includes instructions like loads that have a
latency > 1 cycle.

- anton
-- 
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
  Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>