Deutsch English Français Italiano |
<v2acqr$2qj9l$1@dont-email.me> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!news.mixmin.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Thomas Koenig <tkoenig@netcologne.de> Newsgroups: comp.arch Subject: Re: Unicode in strings Date: Sat, 18 May 2024 14:09:31 -0000 (UTC) Organization: A noiseless patient Spider Lines: 37 Message-ID: <v2acqr$2qj9l$1@dont-email.me> References: <v0s17o$2okf4$2@dont-email.me> <4e0557bec2acda4df76f1ed01ebcbdf6@www.novabbs.org> <v1ep4i$1ptf$1@gal.iecc.com> <v1eruj$3o1r8$1@dont-email.me> <v1h8l6$1ttd$1@gal.iecc.com> <v1kifk$17qh0$1@dont-email.me> <2024May10.182047@mips.complang.tuwien.ac.at> <v1ns43$2260p$1@dont-email.me> <2024May11.173149@mips.complang.tuwien.ac.at> <v1preb$2jn47$1@dont-email.me> <2024May12.110053@mips.complang.tuwien.ac.at> <6124140226e28fd4afec0b435bdbeca1@www.novabbs.org> <2024May18.104040@mips.complang.tuwien.ac.at> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Injection-Date: Sat, 18 May 2024 16:09:31 +0200 (CEST) Injection-Info: dont-email.me; posting-host="fa291d9ac1b2dd6b1af2e22bd1896fd5"; logging-data="2968885"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+j2Zd/hkEKCoW/yI/aUwESRyfh259I2O8=" User-Agent: slrn/1.0.3 (Linux) Cancel-Lock: sha1:ZlFniGOoBHzNp7lt6xycfEAv+jw= Bytes: 3033 Anton Ertl <anton@mips.complang.tuwien.ac.at> schrieb: > mitchalsup@aol.com (MitchAlsup1) writes: >>It seems to me (in my vast ignorance) that names for things should be >>written in the most appropriate set of characters in the language of >>the person/thing being named. >> >>Then when such a name is "sent out to be displayed" that it is a property >>of the display what character set(s) it can properly emit, and thereby >>alter the string of characters as appropriate to its capabilities. >> >>For example:: Take > "K\u0316\u0308nig" cr type ==> K̖̈nig >>When displayed on a ASCII only line printer it would be written Koenig >>When displayed on a enhanced ASCII printer it would be written König >>When displayed on a full functional printer it would be written K̖̈nig > > Why do you think that K̖̈nig should be written as Koenig or König? On my display, this read K, n with a diacritic and something close to a cedille under the n. > > However, for König Again, the diaresis is over the n, not the o. > Unicode specifies that the precomposed form is > König. And if you want a transcription into ASCII with the knowledge > that it's German, the result would be Koenig. This is actually sometimes a (fairly minor) problem because the name on my passport actually reads "König" (o-diacritic), but people without knowledge of German tend to translscribe this as "Konig", whereas I transcribe it as "Koenig" on offical forms such as the one I need to fill out prior to entering the US. This is why modern EU passports have a canonical form of the name, which then is "KOENIG".