Path: ...!news.mixmin.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Malcolm McLean <malcolm.arthur.mclean@gmail.com>
Newsgroups: comp.lang.c
Subject: Re: ASCII to ASCII compression.
Date: Mon, 10 Jun 2024 01:22:45 +0100
Organization: A noiseless patient Spider
Lines: 49
Message-ID: <v45h0l$3t9dr$1@dont-email.me>
References: <v3snu1$1io29$2@dont-email.me> <v3t2bn$1ksfn$1@dont-email.me> <v3t9hf$1m1oh$1@dont-email.me> <44h6e14sb3.fsf@be-well.ilk.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Mon, 10 Jun 2024 02:22:46 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="6dd644d904799cc70f95f2de10652995"; logging-data="4105659"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+qrygxFbGTJGf2bOcUalDXKP/16XkruCk="
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:cbAvjqSHpiAIuTkvL9m7NpDgmdA=
In-Reply-To: <44h6e14sb3.fsf@be-well.ilk.org>
Content-Language: en-GB
Bytes: 3284

On 10/06/2024 00:20, Lowell Gilbert wrote:
> Malcolm McLean <malcolm.arthur.mclean@gmail.com> writes:
>
>> On 06/06/2024 20:23, Paul wrote:
>>> On 6/6/2024 12:25 PM, Malcolm McLean wrote:
>>>>
>>>> Not strictly a C programming question, but smart people will see
>>>> the relevance to the topicality, which is portability.
>>>>
>>>> Is there a compression algorithm which converts human language
>>>> ASCII text to compressed ASCII, preferably only "isgraph" characters?
>>>>
>>>> So "Mary had a little lamb, its fleece was white as snow".
>>>>
>>>> Would become
>>>>
>>>> QWE£$543GtT£$"||x|VVBB?
>>>>
>>> The purpose of doing this, is to satisfy transmission through a 7
>>> bit channel.
>>> In the history of networking, not all channels were eight-bit transparent.
>>> (On the equipment in question, this was called "robbed-bit signaling.")
>>> For example, BASE64 is valued for its 7 bit channel properties, the ability
>>> to pass through a pipe which is not 8 bit transparent. Even to this day,
>>> your email attachments may traverse the network in BASE64 format.
>>> That is one reason, that email or USENET clients to this day, have
>>> both 7 bit and 8 bit content encoding methods. It's to handle the
>>> unlikely possibility that 7 bit transmission channels still exist.
>>> They likely do exist.
>>>
>> Yes. If you store data as 8 bit binaries then it's inherently
>> risky. There's usually no recovery from a single bit getting corrupted.
>>
>> Whilst if you store as ASCII, the data can usually be recovered very
>> easily if something goes wrong with the physical storage. An "And God
>> said" becomes "And G$d said", and even with this tiny text, you can
>> still read it perfectly well.
>
> That example only works because it doesn't include compression.
>

Yes, so the ASCII to ASCII compression scheme needs to be almost as robust. Any corruption should affect only a single line. And you can examine near-nonsense ASCII in a way you can't examine binary. You can load it into a text editor and look for the difference between two versions.

-- 
Check out my hobby project.
http://malcolmmclean.github.io/babyxrc
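
To make the "only a single line" property concrete, here is a rough C sketch. It is not a compressor, and nothing like it was proposed in the thread: it only shows one way damage could be pinned to a line. The assumption is that each output line carries its own two-character tag drawn from the 94 "isgraph" characters, so a checker can tell which lines were hit without any state carried between lines. The names line_tag, frame_lines and count_damaged_lines are made up for this illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define TAG_BASE 94u /* count of ASCII isgraph characters, '!' to '~' */

/* Hypothetical helper: fold one line into two printable tag characters. */
static void line_tag(const char *line, size_t len, char tag[2])
{
    unsigned sum = 0;
    for (size_t i = 0; i < len; i++)
        sum = (sum * 31u + (unsigned char)line[i]) % (TAG_BASE * TAG_BASE);
    tag[0] = (char)('!' + sum / TAG_BASE);
    tag[1] = (char)('!' + sum % TAG_BASE);
}

/* Copy text into out, prefixing every line with its own tag.
   Returns the number of lines written, or -1 if out is too small. */
int frame_lines(const char *text, char *out, size_t outsize)
{
    size_t used = 0;
    int lines = 0;
    assert(text != NULL && out != NULL && outsize > 0);
    while (*text != '\0') {
        size_t len = strcspn(text, "\n");
        if (used + len + 4 > outsize) /* tag + payload + '\n' + NUL */
            return -1;
        line_tag(text, len, out + used);
        memcpy(out + used + 2, text, len);
        used += len + 2;
        out[used++] = '\n';
        lines++;
        text += len;
        if (*text == '\n')
            text++;
    }
    out[used] = '\0';
    return lines;
}

/* Count the framed lines whose tag no longer matches their payload. */
int count_damaged_lines(const char *framed)
{
    int damaged = 0;
    assert(framed != NULL);
    while (*framed != '\0') {
        size_t len = strcspn(framed, "\n");
        char tag[2];
        if (len < 2) {
            damaged++; /* too short to hold a tag at all */
        } else {
            line_tag(framed + 2, len - 2, tag);
            if (memcmp(tag, framed, 2) != 0)
                damaged++;
        }
        framed += len;
        if (*framed == '\n')
            framed++;
    }
    return damaged;
}
```

Because every line is tagged and checked independently, turning "God" into "G$d" in one line flags that line and leaves the rest verifiable. A real scheme would also have to compress each line independently (no dictionary shared across lines) to keep that property, and a tag like this only detects damage, it does not repair it.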