Path: ...!weretis.net!feeder9.news.weretis.net!news.quux.org!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail
From: Lawrence D'Oliveiro <ldo@nz.invalid>
Newsgroups: comp.lang.c
Subject: Re: Simple string conversion from UCS2 to ISO8859-1
Date: Sat, 22 Feb 2025 21:23:42 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 11
Message-ID: <vpdf8u$4oi5$2@dont-email.me>
References: <vp9oml$3a0k5$1@dont-email.me>
	<7bf2c66d1f1ef9e92c00f44320bb998f3cea2183@i2pn2.org>
	<vp9sb4$3a0k4$5@dont-email.me> <vp9tnr$3dca2$1@dont-email.me>
	<87frk7m6h5.fsf@nosuchdomain.example.com> <vpav4f$3jdl6$1@dont-email.me>
	<vpccfn$3to51$1@dont-email.me> <vpceto$3uccs$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Sat, 22 Feb 2025 22:23:43 +0100 (CET)
Injection-Info: dont-email.me; posting-host="66c510652e0b77746a9402731df3f37e";
	logging-data="156229"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX19ga1DqXKGVDb0/hQXUpkYU"
User-Agent: Pan/0.162 (Pokrosvk)
Cancel-Lock: sha1:DXAShxDiIBrMexRC7aMC7BIY4wA=
Bytes: 1722

On Sat, 22 Feb 2025 13:11:34 +0100, Janis Papanagnou wrote:

> UTF-8 is an _encoding_ (as I wrote), as opposed to a direct
> representation of a fixed width character (either 8 bit width ISO 8859-X
> or 16 bit width UCS-2). Conversions to/from UTF-8 are not as
> straightforward as fixed width character representations are.

Unicode is not, and never has been, a fixed-width character set.

UCS-2 was a fixed-width set of code points. Even that idea has been 
abandoned.
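
For reference, the conversion named in the subject line can be sketched as follows. This is only a minimal illustration, assuming the input is an array of 16-bit code units in host byte order; the function name ucs2_to_latin1 and the choice of '?' as the replacement byte are hypothetical, not taken from the thread. It relies on ISO 8859-1 coinciding with the first 256 Unicode code points. Anything above 0xFF has no Latin-1 equivalent, and that includes the range 0xD800..0xDFFF, which UTF-16 uses for surrogate pairs to reach code points beyond U+FFFF.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not from the thread): convert n UCS-2 code units
 * to ISO 8859-1 bytes. dst must have room for n bytes.
 *
 * Code units 0x0000..0x00FF map one-to-one onto Latin-1. Everything
 * else, including the UTF-16 surrogate range 0xD800..0xDFFF, cannot be
 * represented and is replaced with '?'.
 *
 * Returns the number of code units that had to be replaced. */
static size_t ucs2_to_latin1(const uint16_t *src, size_t n,
                             unsigned char *dst)
{
    size_t replaced = 0;

    for (size_t i = 0; i < n; i++) {
        if (src[i] <= 0xFF) {
            dst[i] = (unsigned char)src[i];
        } else {
            dst[i] = '?';
            replaced++;
        }
    }
    return replaced;
}
```

A caller that must not lose data can treat a nonzero return value as an error instead of accepting the '?' substitutions. Note that the output is not NUL-terminated unless the input contains a terminating zero code unit within the n units converted.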