Deutsch English Français Italiano |
<uuagm1$1f2kh$1@dont-email.me> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!news.mixmin.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Rich <rich@example.invalid> Newsgroups: misc.news.internet.discuss,comp.misc Subject: Re: China: Government Starts Phasing Out American Processors, Operating Systems on Government Computers Date: Sun, 31 Mar 2024 02:10:09 -0000 (UTC) Organization: A noiseless patient Spider Lines: 36 Message-ID: <uuagm1$1f2kh$1@dont-email.me> References: <utvu0p$8nhi$1@news.mixmin.net> <uu2skc$3cc0r$2@dont-email.me> <t0va0jtgjrrlfbtdmsv61fssb98oh376bm@4ax.com> <fk2b0jdv0c18pvhuihlco0ors44idpo5a5@4ax.com> <uu95bi$ak6$1@panix2.panix.com> <87a5mfo100.fsf@enoch.nodomain.nowhere> Injection-Date: Sun, 31 Mar 2024 02:10:14 +0200 (CEST) Injection-Info: dont-email.me; posting-host="78fc1234267bd7aa8f7e558201269377"; logging-data="1542801"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX192j3lzAN8EAvM0BWnVEsna" User-Agent: tin/2.6.1-20211226 ("Convalmore") (Linux/5.15.139 (x86_64)) Cancel-Lock: sha1:1eiXG/sBn8Q+cqggbXfxPmtZE2k= Bytes: 2905 In comp.misc Mike Spencer <mds@bogus.nodomain.nowhere> wrote: > > kludge@panix.com (Scott Dorsey) writes: > >> You also would see a lot of weird ligatures back then which are seldom >> seen today. Some of that stuff didn't actually disappear until automated >> typesetting came along, as there are only so many keys on the linotype. > > ... > > The PDF author had used used the ff ligature from whatever > $CURRENTLY-KEWL-CHARSET which was rendered readably. But the xpdf > author wasn't clueful enough to realize that no user ever enters a > ligature character code from the keyboard as a search target and write > compensating translations into the source code. It may not be xpdf's author's fault. If the pdf creator did not provide a proper reverse map table from the code point used for the ff ligature to its actual character (or characters) then there's nothing a pdf reader can do to fix the problem. The problem is that the PDF specification allows for the PDF creator to create arbitrary mappings from byte values used in the PDF file to any given glyph in a font file. But it makes optional the reverse mapping table which would define to a PDF reader program that "byte value 0x32 in this portion of this PDF [1] represents the 'ff' litgature". Without that reverse table, PDF is effectively a "write only medium". It will print a perfect document, but you can't search, nor copy out, anything from it. [1] 0x32 can be made to represent any number of different glyphs within a single given PDF. In fact, if one were so devious as to do so, every byte in the pdf representing a text character could be 0x32, and each one could "print" to the electronic sheet of paper a different font glyph.