Article <v2etr0$3s9r0$1@dont-email.me>

Path: ...!news.mixmin.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Terje Mathisen <terje.mathisen@tmsw.no>
Newsgroups: comp.arch
Subject: Re: Making Lemonade (Floating-point format changes)
Date: Mon, 20 May 2024 09:24:16 +0200
Organization: A noiseless patient Spider
Lines: 39
Message-ID: <v2etr0$3s9r0$1@dont-email.me>
References: <abe04jhkngt2uun1e7ict8vmf1fq8p7rnm@4ax.com>
 <memo.20240512203459.16164W@jgd.cix.co.uk> <v1rab7$2vt3u$1@dont-email.me>
 <20240513151647.0000403f@yahoo.com> <v1to2h$3km86$1@dont-email.me>
 <20240514221659.00001094@yahoo.com> <v234nr$12p27$1@dont-email.me>
 <20240516001628.00001031@yahoo.com> <v2cn4l$3bpov$1@dont-email.me>
 <v2d9sv$3fda0$1@dont-email.me> <20240519203403.00003e9b@yahoo.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Mon, 20 May 2024 09:24:16 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="2442f757fe0e90d7c629db09088092de";
	logging-data="4073312"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX18XWAG7943DL1lUa4rrN695ac3QvZVlfEi3vtIEg2+ICA=="
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101
 Firefox/91.0 SeaMonkey/2.53.18.2
Cancel-Lock: sha1:v3ZFOiHEBDPL0fGWBKCGvxPL3Bo=
In-Reply-To: <20240519203403.00003e9b@yahoo.com>
Bytes: 2954

Michael S wrote:
> On Sun, 19 May 2024 18:37:51 +0200
> Terje Mathisen <terje.mathisen@tmsw.no> wrote:
> 
>> Thomas Koenig wrote:
>>> So, I did some more measurements on the POWER9 machine, and it came
>>> to around 18 cycles per FMA.  Compared to the 13 cycles for the
>>> FMA instruction, this actually sounds reasonable.
>>>
>>> The big problem appears to be that, in this particular
>>> implementation, multiplication is not pipelined, but done
>>> piecewise by addition.  This can be explained by the fact that
>>> this is mostly a decimal unit, with the 128-bit QP just added as
>>> an afterthought, and decimal multiplication does not happen all
>>> that often.
>>>
>>> A fully pipelined FMA unit capable of 128-bit arithmetic would be
>>> an entirely different beast, I would expect a throughput of 1 per
>>> cycle and a latency of (maybe) one cycle more than 64-bit FMA.
>>>    
>> The FMA normalizer has to handle a maximally bad cancellation, so it
>> needs to be around 350 bits wide. Mitch knows of course but I'm
>> guessing that this could at least be close to needing an extra cycle
>> on its own and/or heroic hardware?
>>
>> Terje
>>
> 
> Why so wide?
> Assuming that subnormal multiplier inputs are normalized before

They are not; this is part of what you do to make subnormal numbers 
exactly the same speed as normal inputs.
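
For a rough sense of where a figure near 350 bits comes from, here is a
back-of-the-envelope sketch. It assumes the textbook FMA datapath, where
the p-bit addend is aligned against the 2p-bit product with a couple of
guard bits, for about 3p+2 bits in total. The function name and the
choice of 2 extra bits are illustrative assumptions, not a description
of any particular implementation.

```python
# Significand precision p (including the implicit leading bit)
# for the IEEE 754 binary interchange formats.
PRECISION = {"binary32": 24, "binary64": 53, "binary128": 113}


def fma_datapath_bits(p, extra=2):
    """Approximate adder/normalizer width for a*b+c.

    2*p bits hold the exact product, p more bits hold an addend that
    can sit entirely above the product, and `extra` is an assumed
    small number of guard bits.
    """
    return 2 * p + p + extra


if __name__ == "__main__":
    for name, p in PRECISION.items():
        print(name, fma_datapath_bits(p))
```

With p = 113 this gives 341 bits for binary128, against 161 bits for
binary64. If subnormal inputs are not pre-normalized, the leading one
can sit anywhere in that range, so the normalizer has to cover the full
width.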

Terje

-- 
- <Terje.Mathisen at tmsw.no>
"almost all programming can be viewed as an exercise in caching"