Deutsch English Français Italiano |
<874ja657s9.fsf@bsb.me.uk> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!2.eu.feeder.erje.net!feeder.erje.net!news.swapon.de!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Ben Bacarisse <ben@bsb.me.uk> Newsgroups: comp.lang.c Subject: Re: ASCII to ASCII compression. Date: Thu, 06 Jun 2024 17:56:54 +0100 Organization: A noiseless patient Spider Lines: 31 Message-ID: <874ja657s9.fsf@bsb.me.uk> References: <v3snu1$1io29$2@dont-email.me> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Injection-Date: Thu, 06 Jun 2024 18:56:58 +0200 (CEST) Injection-Info: dont-email.me; posting-host="44b6ff77d00ee25ed348c35c6fa65b42"; logging-data="1668567"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+9Nd5tisMvol9YsWVTepVfcarnPEeYHvU=" User-Agent: Gnus/5.13 (Gnus v5.13) Cancel-Lock: sha1:mvx0KpmK183zrdl4fDzF27QGvkA= sha1:XJTluA4ZZWs2UuKtiTR/5dmFMwE= X-BSB-Auth: 1.5f65eca2766262ec03e8.20240606175654BST.874ja657s9.fsf@bsb.me.uk Bytes: 2246 Malcolm McLean <malcolm.arthur.mclean@gmail.com> writes: > Not strictly a C programming question, but smart people will see the > relavance to the topicality, which is portability. I must not be smart as I can't see any connection to the topic of this group! > Is there a compresiion algorthim which converts human language ASCII text > to compressed ASCII, preferably only "isgraph" characters? > > So "Mary had a little lamb, its fleece was white as snow". > > Would become > > QWE£$543GtT£$"||x|VVBB? Obviously such algorithms exist. One that is used a lot is just base64 encoding of binary compressed text, but that won't beat something specifically crafted for the task which is presumably what you are asking for. I don't know of anything aimed at that task specifically. One thing you should specify is whether you need it to work on small texts, or, even better, at what sort of size you want the pay-off to start to kick in. For example, the xz+base64 encoding of the complete works of Shakespeare is still less than 40% of the size of the original but your single line will end up much larger using that off-the-shelf scheme. -- Ben.