Path: ...!weretis.net!feeder9.news.weretis.net!news.quux.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail
From: Lawrence D'Oliveiro <ldo@nz.invalid>
Newsgroups: comp.lang.c
Subject: Re: Simple string conversion from UCS2 to ISO8859-1
Date: Sat, 22 Feb 2025 21:23:42 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 11
Message-ID: <vpdf8u$4oi5$2@dont-email.me>
References: <vp9oml$3a0k5$1@dont-email.me> <7bf2c66d1f1ef9e92c00f44320bb998f3cea2183@i2pn2.org> <vp9sb4$3a0k4$5@dont-email.me> <vp9tnr$3dca2$1@dont-email.me> <87frk7m6h5.fsf@nosuchdomain.example.com> <vpav4f$3jdl6$1@dont-email.me> <vpccfn$3to51$1@dont-email.me> <vpceto$3uccs$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Sat, 22 Feb 2025 22:23:43 +0100 (CET)
Injection-Info: dont-email.me; posting-host="66c510652e0b77746a9402731df3f37e"; logging-data="156229"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19ga1DqXKGVDb0/hQXUpkYU"
User-Agent: Pan/0.162 (Pokrosvk)
Cancel-Lock: sha1:DXAShxDiIBrMexRC7aMC7BIY4wA=
Bytes: 1722

On Sat, 22 Feb 2025 13:11:34 +0100, Janis Papanagnou wrote:

> UTF-8 is an _encoding_ (as I wrote), as opposed to a direct
> representation of a fixed width character (either 8 bit width ISO 8859-X
> or 16 bit with UCS-2). Conversions to/from UTF-8 are not as
> straightforward as fixed width character representations are.

Unicode is not, and never has been, a fixed-width character set. UCS-2 was
a fixed-width set of code points. Even that idea has been abandoned.
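
To make the fixed-width point concrete, here is a minimal sketch in C of the conversion named in the subject line. It assumes the input is an array of 16-bit UCS-2 code units already in host byte order. It relies on the fact that the first 256 Unicode code points coincide with ISO 8859-1. The names ucs2_to_latin1 and is_surrogate are hypothetical, chosen only for this illustration, and replacing unrepresentable characters with a caller-supplied byte is just one possible policy.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: true if the code unit lies in the range that
   UTF-16 reserves for surrogates (0xD800..0xDFFF). Such a unit is not
   a character on its own, which is why 16-bit data can no longer be
   treated as one unit per character. */
int is_surrogate(uint16_t unit)
{
    return unit >= 0xD800 && unit <= 0xDFFF;
}

/* Hypothetical helper: convert n UCS-2 code units (host byte order)
   to ISO 8859-1 bytes. dst must have room for n bytes. Code units
   above 0xFF have no ISO 8859-1 equivalent and are replaced by subst.
   Returns the number of code units that were replaced. */
size_t ucs2_to_latin1(const uint16_t *src, size_t n,
                      unsigned char *dst, unsigned char subst)
{
    size_t replaced = 0;

    assert(n == 0 || (src != NULL && dst != NULL));

    for (size_t i = 0; i < n; i++) {
        if (src[i] <= 0xFF) {
            /* Code points 0x00..0xFF map one-to-one onto ISO 8859-1. */
            dst[i] = (unsigned char)src[i];
        } else {
            dst[i] = subst;
            replaced++;
        }
    }
    return replaced;
}
```

With the input { 0x0048, 0x00E9, 0x20AC, 0xD83D } and '?' as the substitute, the first two units pass through unchanged, and the euro sign and the lone surrogate are both replaced. A caller that needs to tell those two cases apart could check is_surrogate() on each unit first.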