Deutsch English Français Italiano |
<v3t9hf$1m1oh$1@dont-email.me> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!news.mixmin.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Malcolm McLean <malcolm.arthur.mclean@gmail.com> Newsgroups: comp.lang.c Subject: Re: ASCII to ASCII compression. Date: Thu, 6 Jun 2024 22:26:07 +0100 Organization: A noiseless patient Spider Lines: 39 Message-ID: <v3t9hf$1m1oh$1@dont-email.me> References: <v3snu1$1io29$2@dont-email.me> <v3t2bn$1ksfn$1@dont-email.me> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit Injection-Date: Thu, 06 Jun 2024 23:26:08 +0200 (CEST) Injection-Info: dont-email.me; posting-host="fd1a4517109bed24a981b559f1527ee3"; logging-data="1771281"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+viVfz2LBGBDMmB2UCV/ZVC7gOtjCMBDo=" User-Agent: Mozilla Thunderbird Cancel-Lock: sha1:v/Wwp87qVG+CZwYyRZ8ZTnfHvE4= Content-Language: en-GB In-Reply-To: <v3t2bn$1ksfn$1@dont-email.me> Bytes: 2691 On 06/06/2024 20:23, Paul wrote: > On 6/6/2024 12:25 PM, Malcolm McLean wrote: >> >> Not strictly a C programming question, but smart people will see the relavance to the topicality, which is portability. >> >> Is there a compresiion algorthim which converts human language ASCII text to compressed ASCII, preferably only "isgraph" characters? >> >> So "Mary had a little lamb, its fleece was white as snow". >> >> Would become >> >> QWE£$543GtT£$"||x|VVBB? >> > > The purpose of doing this, is to satisfy transmission through a 7 bit channel. > In the history of networking, not all channels were eight-bit transparent. > (On the equipment in question, this was called "robbed-bit signaling.) > For example, BASE64 is valued for its 7 bit channel properties, the ability > to pass through a pipe which is not 8 bit transparent. Even to this day, > your email attachments may traverse the network in BASE64 format. > > That is one reason, that email or USENET clients to this day, have > both 7 bit and 8 bit content encoding methods. It's to handle the > unlikely possibility that 7 bit transmission channels still exist. > They likely do exist. > Yes. If yiu stire data as 8 but binaries then it's inherently risky. There's usually no recovery froma single bit gett corrupted. Whilst if you store as ASCII, the data can usually be recovered very easly if something goes wrong wit the phsyical storage. A "And God said" becomes "And G$d said", an even with this tiny text, you can still read it perfectly well. -- Check out Basic Algorithms and my other books: https://www.lulu.com/spotlight/bgy1mm