Deutsch English Français Italiano |
<vrh1s2$37rio$1@dont-email.me> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail From: Mikko <mikko.levanto@iki.fi> Newsgroups: comp.theory Subject: Re: neos Universal Compiler Date: Thu, 20 Mar 2025 14:32:02 +0200 Organization: - Lines: 47 Message-ID: <vrh1s2$37rio$1@dont-email.me> References: <UX6BP.512735$Kb9a.408485@fx16.ams4> <vr3ir8$39jtg$1@dont-email.me> <3KgBP.513160$Kb9a.94584@fx16.ams4> <vr6948$1jnlo$1@dont-email.me> <KWEBP.424367$NN2a.82900@fx15.ams4> <vr8pi1$3r8lv$1@dont-email.me> <NrYBP.191725$C8m7.108955@fx11.ams4> <vrbu8h$2km30$1@dont-email.me> <S7fCP.743699$nb1.627959@fx01.ams4> <vre64b$lcu3$1@dont-email.me> <vre88p$7kb$1@news.muc.de> <vreh3f$v5ih$1@dont-email.me> <vrgtej$a85$1@news.muc.de> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Injection-Date: Thu, 20 Mar 2025 13:32:03 +0100 (CET) Injection-Info: dont-email.me; posting-host="60f4bef9f01dfedd0a7e4274e98fdf68"; logging-data="3403352"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18+9el1UJqco00Kx731NEEX" User-Agent: Unison/2.2 Cancel-Lock: sha1:qgqYeOxDlcRX5JR1jLceg4Qu08E= Bytes: 2892 On 2025-03-20 11:16:35 +0000, Alan Mackenzie said: > Mikko <mikko.levanto@iki.fi> wrote: >> On 2025-03-19 11:02:49 +0000, Alan Mackenzie said: > >>> Mikko <mikko.levanto@iki.fi> wrote: >>>> On 2025-03-18 14:08:50 +0000, Mr Flibble said: > >>>>> On Tue, 18 Mar 2025 15:59:45 +0200, Mikko wrote: > >>> [ .... ] > >>>>>> Is there a neosBNF schema that describes the tokens of FORtRAN 66 or >>>>>> Algol 60? > >>>>> Not yet. > >>>> The definition of string literal of Algol 60 would be a good example >>>> of something that cannot be defined with a regular expression and is >>>> therefore impossible or at least complicated with an ordinary tokenizer. > >>> Would you please be more specific about just what in an Algol 60 string >>> literal prevents a regexp from parsing it. Not for any special reason, >>> just that I'm curious. Maybe an example of such a string would be >>> interesting. Thanks! > >> Algol 60 has different characters for opening and closing quotes (something >> like 2018 and 2019 of Unicode) .... > > Most current languages, including C, have different openers and closers > for comments, which is surely analogous. > >> .... and allows any number of nested quotes. > > Ah OK. Regular expressions can't parse arbitrarily nested structures. > But Backus-Nauer Form can express them, and a push-down automaton can > process them. > > Are you sure about ordinary tokenizers not being able to handle such > arbitrarily nested things in a non-complicated way? Yes if "ordinary" is defined as one that can only parse a regular language. -- Mikko