From: Malcolm McLean <malcolm.arthur.mclean@gmail.com>
Newsgroups: comp.lang.c
Subject: Re: ASCII to ASCII compression.
Date: Fri, 7 Jun 2024 10:03:46 +0100
Organization: A noiseless patient Spider
Message-ID: <v3uidi$20jte$2@dont-email.me>
References: <v3snu1$1io29$2@dont-email.me> <v3u3c4$1ubqm$1@dont-email.me>
In-Reply-To: <v3u3c4$1ubqm$1@dont-email.me>

On 07/06/2024 05:47, Mikko wrote:
> On 2024-06-06 16:25:37 +0000, Malcolm McLean said:
>
>> Not strictly a C programming question, but smart people will see the
>> relevance to the topicality, which is portability.
>>
>> Is there a compression algorithm which converts human language ASCII
>> text to compressed ASCII, preferably only "isgraph" characters?
>>
>> So "Mary had a little lamb, its fleece was white as snow".
>>
>> Would become
>>
>> QWE£$543GtT£$"||x|VVBB?
>
> There are compression algorithms that can be adapted to any possible
> size of input and output character sets, including that both are
> ASCII and that the output character set is a subset of the input set.
>
> Restricting the input set to ASCII may be too strong. Files that should
> be ASCII files sometimes contain non-ASCII bytes. The output should be
> restricted to the 94 visible characters but the decompressor should
> accept at least full ASCII and skip the invalid characters as
> insignificant.
> That permits addition of line breaks and perhaps other spaces that could
> be useful for example when the file is printed for debugging.
>

That's exactly the idea. The system is robust to white space. You can add
spaces to your heart's content, and they are just skipped.

-- 
Check out Basic Algorithms and my other books:
https://www.lulu.com/spotlight/bgy1mm
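
A minimal sketch of the whitespace-robust input stage discussed above,
assuming the payload alphabet is exactly the 94 visible ASCII characters
'!' to '~'. The names is_payload_char and strip_insignificant are made up
for illustration. The compression scheme itself is not specified in this
thread, so only the "skip anything that is not payload" step is shown.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: true for the 94 visible ASCII characters '!'..'~'.
   A plain range test is used instead of isgraph() so the result does not
   depend on the current locale or on bytes above 0x7F. */
static int is_payload_char(unsigned char c)
{
    return c >= 0x21 && c <= 0x7E;
}

/* Copy only payload characters from src to dst, NUL-terminate dst, and
   return the number of characters kept. Spaces, tabs, line breaks and
   non-ASCII bytes are dropped as insignificant.
   dst must have room for strlen(src) + 1 bytes. */
static size_t strip_insignificant(const char *src, char *dst)
{
    size_t n = 0;

    assert(src != NULL && dst != NULL);

    for (; *src != '\0'; src++) {
        if (is_payload_char((unsigned char)*src))
            dst[n++] = *src;
    }
    dst[n] = '\0';
    return n;
}
```

Calling strip_insignificant on "QWE $543\nGtT" leaves "QWE$543GtT" in the
output buffer, so a decompressor fed from it sees the same payload however
the text was wrapped or indented for printing.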