Warning: mysqli::__construct(): (HY000/1203): User howardkn already has more than 'max_user_connections' active connections in D:\Inetpub\vhosts\howardknight.net\al.howardknight.net\includes\artfuncs.php on line 21
Failed to connect to MySQL: (1203) User howardkn already has more than 'max_user_connections' active connectionsPath: ...!feeds.phibee-telecom.net!news.mixmin.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Richard Harnden Newsgroups: comp.lang.c Subject: Re: ASCII to ASCII compression. Date: Fri, 7 Jun 2024 20:06:09 +0100 Organization: A noiseless patient Spider Lines: 37 Message-ID: References: <874ja657s9.fsf@bsb.me.uk> Reply-To: nospam.harnden@invalid.com MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Injection-Date: Fri, 07 Jun 2024 21:06:11 +0200 (CEST) Injection-Info: dont-email.me; posting-host="459b11261669d4f8bcbaef599a03cd81"; logging-data="2304710"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19/Puyn558DKqxizXpu63K8ntSfePMEi/4=" User-Agent: Mozilla Thunderbird Cancel-Lock: sha1:/2cSneGHuyg5MXjf1g1hYB+q3HQ= X-Antivirus-Status: Clean In-Reply-To: Content-Language: en-US X-Antivirus: AVG (VPS 240607-6, 6/7/2024), Outbound message Bytes: 3082 On 07/06/2024 19:49, Malcolm McLean wrote: > On 07/06/2024 13:52, Mikko wrote: >> On 2024-06-07 09:00:57 +0000, Malcolm McLean said: >> >>> Yes, but Huffman is easy to decode. It's the sort of project you give >>> to people who have just got past the beginner stage but aren't very >>> experienced programmers yet, whilst implementing Lempel-Ziv is a job >>> for someone who knows what he is doing. >>> >>> Because the lines will often be very short, adaptive Huffman coding >>> is no good. I need a fixed Huffman table with 128 entries for each 7 >>> bit value plus one for "stop". I wonder if any such standard table >>> exists. >> >> You don't need a standard table. You need statistics. Once you have the >> statistics the table is easy to costruct with Huffman's algorithm. >> > No you do. The text might be very short, like "Mary had a little lamb", > and you will compress it because you know that you are being fed > meaningful ASCII. For example even this tiny fragment contains the > letter "e", which would have a short Huffman code. And four a's and two > t's, which are the third and the second most commn letters. So it should > compress. > > And we're compressing each line independently, and choosing a visually > distinctive ASCII character as the line break. So anyone seeing the > compressed data will immediately be able to home in on the line breaks, > and will be able to fix any corruption without special tools. > > And you have a standard table which never changes. And so that makes the > decompressor much easier to write. Will your babyx be able to handle, say, utf-8? -- This email has been checked for viruses by AVG antivirus software. www.avg.com