Deutsch English Français Italiano |
<v792kn$1v70t$1@dont-email.me> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: "Stephen Fuld" <SFuld@alumni.cmu.edu.invalid> Newsgroups: comp.arch Subject: Re: Continuations Date: Wed, 17 Jul 2024 18:30:47 -0000 (UTC) Organization: A noiseless patient Spider Lines: 48 Message-ID: <v792kn$1v70t$1@dont-email.me> References: <v6tbki$3g9rg$1@dont-email.me> <47689j5gbdg2runh3t7oq2thodmfkalno6@4ax.com> <v71vqu$gomv$9@dont-email.me> <116d9j5651mtjmq4bkjaheuf0pgpu6p0m8@4ax.com> <f8c6c5b5863ecfc1ad45bb415f0d2b49@www.novabbs.org> <7u7e9j5dthm94vb2vdsugngjf1cafhu2i4@4ax.com> <0f7b4deb1761f4c485d1dc3b21eb7cb3@www.novabbs.org> <v78soj$1tn73$1@dont-email.me> <4bbc6af7baab612635eef0de4847ba5b@www.novabbs.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Date: Wed, 17 Jul 2024 20:30:47 +0200 (CEST) Injection-Info: dont-email.me; posting-host="d02dd0d7b737f1e0e4f7b53102fc2b6f"; logging-data="2071581"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/+F+Hhu3NiQ1mzgZnn8PJDmtPbo1rytaY=" User-Agent: XanaNews/1.21-f3fb89f (x86; Portable ISpell) Cancel-Lock: sha1:CPwq8kD6kcYSzLO+Aszfc3hwMy0= Bytes: 2996 MitchAlsup1 wrote: > On Wed, 17 Jul 2024 16:50:27 +0000, Thomas Koenig wrote: > > >MitchAlsup1 <mitchalsup@aol.com> schrieb: > > > > > What I am talking about is to improve their performance until a > > > sin() takes about the same number of cycles of FDIV, not 10× more. > > > > Maybe time for a little story. > > > > Some unspecified time ago, a colleague did CFD calculations which > > included fluid flow (including turbulence modelling and diffusion) > > and quite a few chemical reactions together. So, he evaluated a > > huge number of Arrhenius equations, > > > > k = A * exp(-E_a/(R*T)) > > > > and because some of the reactions he looked at were highly > > exothermic or endothermic, he needed tiny relaxation factors (aka > > small steps). His calculaiton spent most of the time evaluating > > the Arrhenius equation above many, many, many, many times. > > > > A single calculation took months, and he didn't use weak hardware. > > > > A fully pipelined evaluation of, let's say, four parallel exp and > > four parallel fdiv instructions would have reduced his calculation > > time by orders of magnitude, and allowed him to explore the design > > space instead of just scratching the surface. > > > > (By the way, if I had found a reasonable way to incorporate the > > Arrhenius equation into your ISA, I would have done so already :-) > > FMUL Rt,RR,RT > FDIV Rt,-RE,Rt > EXP Rt,Rt > FMUL Rk,RA,Rt > > Does not look "all that bad" to me. So for your GbOoO CPU, how many of the various FP operations, and the EXP instruction can be done in parallel? -- - Stephen Fuld (e-mail address disguised to prevent spam)