<vrh1s2$37rio$1@dont-email.me>

Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail
From: Mikko <mikko.levanto@iki.fi>
Newsgroups: comp.theory
Subject: Re: neos Universal Compiler
Date: Thu, 20 Mar 2025 14:32:02 +0200
Organization: -
Lines: 47
Message-ID: <vrh1s2$37rio$1@dont-email.me>
References: <UX6BP.512735$Kb9a.408485@fx16.ams4> <vr3ir8$39jtg$1@dont-email.me> <3KgBP.513160$Kb9a.94584@fx16.ams4> <vr6948$1jnlo$1@dont-email.me> <KWEBP.424367$NN2a.82900@fx15.ams4> <vr8pi1$3r8lv$1@dont-email.me> <NrYBP.191725$C8m7.108955@fx11.ams4> <vrbu8h$2km30$1@dont-email.me> <S7fCP.743699$nb1.627959@fx01.ams4> <vre64b$lcu3$1@dont-email.me> <vre88p$7kb$1@news.muc.de> <vreh3f$v5ih$1@dont-email.me> <vrgtej$a85$1@news.muc.de>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Thu, 20 Mar 2025 13:32:03 +0100 (CET)
Injection-Info: dont-email.me; posting-host="60f4bef9f01dfedd0a7e4274e98fdf68";
	logging-data="3403352"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX18+9el1UJqco00Kx731NEEX"
User-Agent: Unison/2.2
Cancel-Lock: sha1:qgqYeOxDlcRX5JR1jLceg4Qu08E=
Bytes: 2892

On 2025-03-20 11:16:35 +0000, Alan Mackenzie said:

> Mikko <mikko.levanto@iki.fi> wrote:
>> On 2025-03-19 11:02:49 +0000, Alan Mackenzie said:
> 
>>> Mikko <mikko.levanto@iki.fi> wrote:
>>>> On 2025-03-18 14:08:50 +0000, Mr Flibble said:
> 
>>>>> On Tue, 18 Mar 2025 15:59:45 +0200, Mikko wrote:
> 
>>> [ .... ]
> 
>>>>>> Is there a neosBNF schema that describes the tokens of FORTRAN 66 or
>>>>>> Algol 60?
> 
>>>>> Not yet.
> 
>>>> The definition of string literal of Algol 60 would be a good example
>>>> of something that cannot be defined with a regular expression and is
>>>> therefore impossible or at least complicated with an ordinary tokenizer.
> 
>>> Would you please be more specific about just what in an Algol 60 string
>>> literal prevents a regexp from parsing it.  Not for any special reason,
>>> just that I'm curious.  Maybe an example of such a string would be
>>> interesting.  Thanks!
> 
>> Algol 60 has different characters for opening and closing quotes (something
>> like U+2018 and U+2019 of Unicode) ....
> 
> Most current languages, including C, have different openers and closers
> for comments, which is surely analogous.
> 
>> .... and allows any number of nested quotes.
> 
> Ah OK.  Regular expressions can't parse arbitrarily nested structures.
> But Backus-Naur Form can express them, and a push-down automaton can
> process them.
> 
> Are you sure about ordinary tokenizers not being able to handle such
> arbitrarily nested things in a non-complicated way?

Yes, if "ordinary" is defined as one that can only recognize a regular
language.
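
To make the point concrete, here is a minimal sketch of a hand-written
scanner for such nested string literals. It assumes U+2018 and U+2019 as
stand-ins for the Algol 60 opening and closing quotes, and the name
scan_string is hypothetical. The only thing it adds to a finite-state
scanner is a depth counter, and that unbounded counter is exactly what a
regular expression lacks.

```python
# Assumed stand-ins for Algol 60's distinct opening and closing string quotes.
OPEN, CLOSE = "\u2018", "\u2019"


def scan_string(text, start=0):
    """Return the index just past the string literal that begins at text[start].

    Raises ValueError if text[start] is not an opening quote or if the
    literal is never closed.
    """
    if start >= len(text) or text[start] != OPEN:
        raise ValueError("no opening quote at position %d" % start)
    depth = 0  # current nesting level of quotes
    for i in range(start, len(text)):
        ch = text[i]
        if ch == OPEN:
            depth += 1
        elif ch == CLOSE:
            depth -= 1
            if depth == 0:
                # The outermost quote has just been closed.
                return i + 1
    raise ValueError("unterminated string literal")
```

For example, scan_string("\u2018a \u2018b\u2019 c\u2019 rest") returns 9, the
index just past the outer closing quote.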

-- 
Mikko