Deutsch English Français Italiano |
<2024Aug12.131045@mips.complang.tuwien.ac.at> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!weretis.net!feeder8.news.weretis.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: anton@mips.complang.tuwien.ac.at (Anton Ertl) Newsgroups: comp.arch Subject: Re: Instruction Tracing Date: Mon, 12 Aug 2024 11:10:45 GMT Organization: Institut fuer Computersprachen, Technische Universitaet Wien Lines: 36 Message-ID: <2024Aug12.131045@mips.complang.tuwien.ac.at> References: <v970s3$flpo$1@dont-email.me> <2024Aug10.121802@mips.complang.tuwien.ac.at> <v995pm$1cni$2@gal.iecc.com> <2024Aug11.164438@mips.complang.tuwien.ac.at> <v9bg6n$2u0ud$2@dont-email.me> <v9c912$35the$1@dont-email.me> Injection-Date: Mon, 12 Aug 2024 13:31:10 +0200 (CEST) Injection-Info: dont-email.me; posting-host="64d6ff755a5474c66a4e598c60741fd5"; logging-data="3424105"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18Q2AGCHHDAkExHObrdarfE" Cancel-Lock: sha1:qDwBX89AmdGjrOtBsyxLmqCNYlI= X-newsreader: xrn 10.11 Bytes: 2436 Terje Mathisen <terje.mathisen@tmsw.no> writes: >Lawrence D'Oliveiro wrote: >> On Sun, 11 Aug 2024 14:44:38 GMT, Anton Ertl wrote: >>=20 >>> Power (IIRC) and Alpha don't have delayed branches. >>=20 >> Not only does POWER not have delayed branches, but I recall the IBM fol= >ks >> claiming in the initial publicity that branches could often execute in >> zero clock cycles--that is, fully overlapped with surrounding >> instructions. > >Afair, the original POWER had 3 chips, with branches in a separate unit=20 >from integer/logic ops, right? Looking at <https://en.wikipedia.org/wiki/POWER1>, the RIOS-1 configuration has 9 chips: ICU, FPU, FXU (integer unit), SCU (storage control), 4xDCU (data cache), I/O Unit. The RIOS-9 configuration has only 2 DCUs (7 chips total). >It also had multiple (8?) sets of compare result flags in order to avoid = >making them a speed limiter. I wonder how much that is used. There is only one carry bit. >Yeah, the R part was intended to make latency a single cycle for _most_=20 >instructions. It was mainly meant to increase the *throughput* to one instruction per cycle; that includes instructions like loads that have a latency > 1 cycle. - anton -- 'Anyone trying for "industrial quality" ISA should avoid undefined behavior.' Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>