Path: ...!weretis.net!feeder9.news.weretis.net!news.quux.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail From: Lawrence D'Oliveiro Newsgroups: comp.lang.c Subject: Re: Simple string conversion from UCS2 to ISO8859-1 Date: Sat, 22 Feb 2025 21:23:42 -0000 (UTC) Organization: A noiseless patient Spider Lines: 11 Message-ID: References: <7bf2c66d1f1ef9e92c00f44320bb998f3cea2183@i2pn2.org> <87frk7m6h5.fsf@nosuchdomain.example.com> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Injection-Date: Sat, 22 Feb 2025 22:23:43 +0100 (CET) Injection-Info: dont-email.me; posting-host="66c510652e0b77746a9402731df3f37e"; logging-data="156229"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19ga1DqXKGVDb0/hQXUpkYU" User-Agent: Pan/0.162 (Pokrosvk) Cancel-Lock: sha1:DXAShxDiIBrMexRC7aMC7BIY4wA= Bytes: 1722 On Sat, 22 Feb 2025 13:11:34 +0100, Janis Papanagnou wrote: > UTF-8 is an _encoding_ (as I wrote), as opposed to a direct > representation of a fixed width character (either 8 bit width ISO 8859-X > or 16 bit with UCS-2). Conversions to/from UTF-8 are not as > straightforward as fixed width character representations are. Unicode is not, and never has been, a fixed-width character set. UCS-2 was a fixed-width set of code points. Even that idea has been abandoned.