Deutsch English Français Italiano |
<277c774f1eb48be79cd148dfc25c4367@www.novabbs.org> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!weretis.net!feeder9.news.weretis.net!news.nk.ca!rocksolid2!i2pn2.org!.POSTED!not-for-mail From: mitchalsup@aol.com (MitchAlsup1) Newsgroups: comp.arch Subject: Re: Continuations Date: Fri, 19 Jul 2024 16:32:43 +0000 Organization: Rocksolid Light Message-ID: <277c774f1eb48be79cd148dfc25c4367@www.novabbs.org> References: <v6tbki$3g9rg$1@dont-email.me> <47689j5gbdg2runh3t7oq2thodmfkalno6@4ax.com> <v71vqu$gomv$9@dont-email.me> <116d9j5651mtjmq4bkjaheuf0pgpu6p0m8@4ax.com> <f8c6c5b5863ecfc1ad45bb415f0d2b49@www.novabbs.org> <7u7e9j5dthm94vb2vdsugngjf1cafhu2i4@4ax.com> <0f7b4deb1761f4c485d1dc3b21eb7cb3@www.novabbs.org> <v78soj$1tn73$1@dont-email.me> <v7dsf2$3139m$1@dont-email.me> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Injection-Info: i2pn2.org; logging-data="3844975"; mail-complaints-to="usenet@i2pn2.org"; posting-account="65wTazMNTleAJDh/pRqmKE7ADni/0wesT78+pyiDW8A"; User-Agent: Rocksolid Light X-Rslight-Site: $2y$10$g2wDRA17Eje.mh3ydDMM5eX4S80AEs97jc42RGLqXTKwHoDwViE4O X-Spam-Checker-Version: SpamAssassin 4.0.0 X-Rslight-Posting-User: ac58ceb75ea22753186dae54d967fed894c3dce8 Bytes: 3394 Lines: 49 On Fri, 19 Jul 2024 14:16:01 +0000, Terje Mathisen wrote: > Thomas Koenig wrote: >> MitchAlsup1 <mitchalsup@aol.com> schrieb: >> >>> What I am talking about is to improve their performance until a >>> sin() takes about the same number of cycles of FDIV, not 10× more. >> >> Maybe time for a little story. >> >> Some unspecified time ago, a colleague did CFD calculations which >> included fluid flow (including turbulence modelling and diffusion) >> and quite a few chemical reactions together. So, he evaluated a >> huge number of Arrhenius equations, >> >> k = A * exp(-E_a/(R*T)) >> >> and because some of the reactions he looked at were highly >> exothermic or endothermic, he needed tiny relaxation factors (aka >> small steps). His calculaiton spent most of the time evaluating >> the Arrhenius equation above many, many, many, many times. >> >> A single calculation took _months_, and he didn't use weak hardware. >> >> A fully pipelined evaluation of, let's say, four parallel exp and >> four parallel fdiv instructions would have reduced his calculation >> time by orders of magnitude, and allowed him to explore the design >> space instead of just scratching the surface. > > Back when I first looked at Invsqrt(), I did so because an Computation > Fluid Chemistry researcher from Sweden asked for help speeding up his > reciprocal calculations (sqrt(1/(dx^2+dy^2+dz^2))), I found that by > combining the 1/x and the sqrt and doing three of them pipelind together > (all the water molecules having three atoms), his weeklong simulation > runs ran in half the time, on both PentiumPro and Alpha hardware. I, personally, have found many Newton-Raphson iterators that converge faster using 1/SQRT(x) than using the SQRT(x) equivalent. > >> >> (By the way, if I had found a reasonable way to incorporate the >> Arrhenius equation into your ISA, I would have done so already :-) >> > > I'm guessing that you would do the exp(-E_a/(R*T)) as > exp(-E-a)-exp(R*T), since that should give the same result and now you > could interleave the two exp) calculations, and/or hoist the exp(-E_a) > term? > > Terje