Article <v1u6oi$3o53t$1@dont-email.me>

Warning: mysqli::__construct(): (HY000/1203): User howardkn already has more than 'max_user_connections' active connections in D:\Inetpub\vhosts\howardknight.net\al.howardknight.net\includes\artfuncs.php on line 21
Failed to connect to MySQL: (1203) User howardkn already has more than 'max_user_connections' active connections
Warning: mysqli::query(): Couldn't fetch mysqli in D:\Inetpub\vhosts\howardknight.net\al.howardknight.net\index.php on line 66
Article <v1u6oi$3o53t$1@dont-email.me>

Deutsch English Français Italiano

<v1u6oi$3o53t$1@dont-email.me>

View for Bookmarking (what is this?)
Look up another Usenet article

Path: ...!weretis.net!feeder9.news.weretis.net!feeder8.news.weretis.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: BGB <cr88192@gmail.com>
Newsgroups: comp.arch
Subject: Re: Making Lemonade (Floating-point format changes)
Date: Mon, 13 May 2024 18:12:12 -0500
Organization: A noiseless patient Spider
Lines: 84
Message-ID: <v1u6oi$3o53t$1@dont-email.me>
References: <abe04jhkngt2uun1e7ict8vmf1fq8p7rnm@4ax.com>
 <memo.20240512203459.16164W@jgd.cix.co.uk> <v1rab7$2vt3u$1@dont-email.me>
 <20240513151647.0000403f@yahoo.com> <v1tre1$3leqn$1@dont-email.me>
 <9c79fb24a0cf92c5fac633a409712691@www.novabbs.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 14 May 2024 01:12:19 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="f7bb63f21ae876478e2f78c3252a6146";
	logging-data="3937405"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX19gGZ6YPBw+MUlvwBtTEnsJrLMYBweuQCc="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:6PF3OeD2xXHf1cl2tJPCjFu5GXE=
Content-Language: en-US
In-Reply-To: <9c79fb24a0cf92c5fac633a409712691@www.novabbs.org>
Bytes: 4419

On 5/13/2024 4:16 PM, MitchAlsup1 wrote:
> BGB wrote:
> 
>>
>> Emulation via traps is very slow, but typical for many ISA's is to 
>> just quietly turn the soft-float operations into runtime calls.
> 
> I recall that MIPS could emulate a TLB table walk in something like
> 19 cycles. That is:: a few cycles to get there, a hash table access,
> a check, a TLB install, and a few cycles to get back.
> 
> On an x86 this would be at least 200 cycles just getting there and back.
> 

I guess there are different possibilities here...

Trap cost can be reduced, say, by having banked registers.
But, not so good with explicit save/restore and a large register file.

For example, I can note that a MSP430 at 16MHz can service a 32kHz 
timer... (with a budget of 488 cycles per interrupt).

But, my BJX2 core (at 50MHz) would have a harder time here, with around 
a 1.5k cycle budget...

Then again, it is possible the per-interrupt overhead would go down 
slightly, since most likely the ISR stack will still be in the L1 cache 
between interrupts (and save/restore overhead should drop to ~ 100 
cycles in the absence of L1 misses).

MSP430 had a slight advantage here (besides fewer registers) in that L1 
misses are not a thing (so, memory access has constant latency).

> So, to revisit your statement::
> 
> Emulation is slow when trap overhead is large and not-slow when trap 
> overhead
> is small.

Possible, but I would not expect trap overhead to be lower than runtime 
call overhead...

Also (in my case):
Debugging is rather annoying in cases where dealing with bugs 
appear/disappear/move around at random or with the slightest perturbation...

But, given for the most part behavior is consistently buggy (and 
manifesting in seemingly the same ways) between both the emulator and 
Verilog implementation, this implies the causal factors are in software.

I guess in this case, either I figure it out, or will need to again go 
back to cooperative scheduling. Seemingly, using preemptive scheduling 
and virtual memory at the same time is particularly unstable (programs 
tend to crash on startup or soon after).

Also I may need to rework how page-in/page-out is handled (and or how IO 
is handled in general) since if a page swap needs to happen while IO is 
already in progress (such as a page-miss in the system-call process), at 
present, the OS is dead in the water (one can't access the SDcard in the 
middle of a different access to the SDcard).

Though, this issue is reduced if all memory used for kernel-level 
operation is either physical or direct-mapped (well, and/or doing "TLB 
pokes" for each cluster before reading/writing it).

But, needing to use memory probes in some cases (before certain 
operations, such as inter-ISA branches, etc; because of bad things that 
can happen if a TLB miss or page-miss happens at that moment), is kinda 
stupid...

Doing a test:
Seems the bugs with preemption + virtual memory are independent of the 
usage of traditional nested page tables or hybrid B-Tree based page 
tables, ...