Deutsch English Français Italiano |
<20240728130819.00006111@yahoo.com> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Michael S <already5chosen@yahoo.com> Newsgroups: comp.arch Subject: Re: Non-pipelined FDIV/SQRT Date: Sun, 28 Jul 2024 13:08:19 +0300 Organization: A noiseless patient Spider Lines: 36 Message-ID: <20240728130819.00006111@yahoo.com> References: <v6tbki$3g9rg$1@dont-email.me> <47689j5gbdg2runh3t7oq2thodmfkalno6@4ax.com> <v71vqu$gomv$9@dont-email.me> <116d9j5651mtjmq4bkjaheuf0pgpu6p0m8@4ax.com> <f8c6c5b5863ecfc1ad45bb415f0d2b49@www.novabbs.org> <7u7e9j5dthm94vb2vdsugngjf1cafhu2i4@4ax.com> <0f7b4deb1761f4c485d1dc3b21eb7cb3@www.novabbs.org> <v78soj$1tn73$1@dont-email.me> <4bbc6af7baab612635eef0de4847ba5b@www.novabbs.org> <v792kn$1v70t$1@dont-email.me> <ef12aa647464a3ebe3bd208c13a3c40c@www.novabbs.org> <tD7mO.11270$Z2s2.1953@fx05.iad> <jwvjzhibrcf.fsf-monnier+comp.arch@gnu.org> <HPbmO.59320$q2da.48454@fx18.iad> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Injection-Date: Sun, 28 Jul 2024 12:07:48 +0200 (CEST) Injection-Info: dont-email.me; posting-host="bd9af8c4f59d3d3264d37e8e2341df28"; logging-data="4064499"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+XmdAhx44ROlffPlhJY2qS4iYQ+KffeSg=" Cancel-Lock: sha1:qXZJKWwQHbNwEYS+Uq5x2HZmuIA= X-Newsreader: Claws Mail 3.19.1 (GTK+ 2.24.33; x86_64-w64-mingw32) Bytes: 2932 On Thu, 18 Jul 2024 12:56:46 -0400 EricP <ThatWouldBeTelling@thevillage.com> wrote: > Stefan Monnier wrote: > >> If the FP multiplier is a 4-stage pipeline, and FDIV is iterating > >> using the multiplier, can the pipeline get a mix of multiple > >> operations going at once? FDIV for both Newton=E2=80=93Raphson and > >> Goldschmidt iterates serially so each can only use one of the 4 > >> pipeline slots. =20 > >=20 > > Something I've been wondering for a while, indeed. > > IOW, is there enough parallelism inside the FDIV/SQRT "microcode" > > to keep the FMAC fully busy (my naive understanding is that there > > isn't)? If not, do current CPU make the FMAC available for other > > operations while an FDIV/SQRT is in progress? If not, how hard > > would it be? > >=20 > >=20 > > Stefan =20 >=20 > And if they can't mix then to what extent can the end of one op, > as it drains from the pipeline, overlap with the start of the next? > Obviously FMUL can pipeline with FMUL but can the next FMUL overlap > with the end of a prior FDIV? An EXP? >=20 > I was thinking about reservation station schedulers and wondering > what they might have to optimize. >=20 >=20 On most (all ?) modern GBOoO cores FDIV can overlap with the next independent FDIV and sometimes with more than one. In that light it would be hard to believe that FDIV can not overlap with FMUL or FMAC.