Path: ...!news.mixmin.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: anton@mips.complang.tuwien.ac.at (Anton Ertl) Newsgroups: comp.arch Subject: Re: text in programming languages, Unicode in strings Date: Mon, 20 May 2024 17:24:03 GMT Organization: Institut fuer Computersprachen, Technische Universitaet Wien Lines: 30 Message-ID: <2024May20.192403@mips.complang.tuwien.ac.at> References: <2024May20.145316@mips.complang.tuwien.ac.at> Injection-Date: Mon, 20 May 2024 19:30:55 +0200 (CEST) Injection-Info: dont-email.me; posting-host="481c197852760d18650e1e4ce71128a0"; logging-data="120268"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19s4AsFDVCtGHAI2oeyftzB" Cancel-Lock: sha1:6Uh7g2+AyNExSEY+ozK+AGB7iIA= X-newsreader: xrn 10.11 Bytes: 2171 jgd@cix.co.uk (John Dallman) writes: >In article <2024May20.145316@mips.complang.tuwien.ac.at>, >anton@mips.complang.tuwien.ac.at (Anton Ertl) wrote: > >> I am not convinced that the locale-specific input is a good idea, >> though. > >You look pretty silly if your input function can't read the products of >your output function, and figuring out what separators have been used >automatically is not foolproof. Yes and yes. Especially given the "," vs. "." roles in various locales. But OTOH, not being able to read or, worse, misinterpreting the output produced by someone else just because that output was produced under a different locale is pretty silly, too. For reserved words and builtin names of programming languages, the solution has been to make them independent of the locale and ignore Algol 60 and Algol 68 for programming, which suggested something else. We already do the same for the decimal separator in the usual output functions (it uses "."), we should introduce thousands separators that are also locale-independent. - anton -- 'Anyone trying for "industrial quality" ISA should avoid undefined behavior.' Mitch Alsup,