Deutsch   English   Français   Italiano  
<m3zfrv5qll.fsf@leonis4.robolove.meer.net>

View for Bookmarking (what is this?)
Look up another Usenet article

Path: ...!weretis.net!feeder8.news.weretis.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Madhu <enometh@meer.net>
Newsgroups: comp.lang.lisp
Subject: Re: From JoyceUlysses.txt -- words occurring exactly once
Date: Sat, 08 Jun 2024 22:17:18 +0530
Organization: Motzarella
Lines: 31
Message-ID: <m3zfrv5qll.fsf@leonis4.robolove.meer.net>
References: <v3ame4$1qf6m$5@dont-email.me>
	<jwvzfs6ncq0.fsf-monnier+comp.lang.lisp@gnu.org>
	<20240530161942.627@kylheku.com>
MIME-Version: 1.0
Content-Type: text/plain
Injection-Date: Sat, 08 Jun 2024 18:47:18 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="148b5a38ade18e885d5bb98692d69b25";
	logging-data="2828901"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX18T6NI7i3BvD1MLzA1pfolnvd3s++QFWTo="
Cancel-Lock: sha1:fGb6ERQDjRLnfE3JcYbKmNMXu9c=
	sha1:rpjB4GpuHwHVoZ0omUNrcntl/eU=
Bytes: 2261

* Kaz Kylheku <20240530161942.627@kylheku.com> :
Wrote on Thu, 30 May 2024 23:20:08 -0000 (UTC):

> On 2024-05-30, Stefan Monnier <monnier@iro.umontreal.ca> wrote:
>>> Given a text file of a novel (JoyceUlysses.txt) ...
>>> could someone give me a pretty fast (and simple) program that'd give me
>>> a list of all words occurring exactly once?
>>
>>     tr ' .;:,?!' '\n' | sort | uniq -u
>
> Yep, that's pretty much how Doug McIlroy famously shut down Knuth.

https://www.cs.tufts.edu/~nr/cs257/archive/don-knuth/pearls-2.pdf

(how do you cite this?)

Knuth didn't invent the "hash trie" data structure for this the article,
it was already there in TeX, in this article knuth credits Frank Liang's
phd thesis for the data structure.

This was one of the first things things I coded up at the time of the
article.  The fun was in designing how to best modify the structure
without sacrificing space

Phil Bagwell's paper "Ideal Hash Trees" described its invention
correctly as Hash Array Mapped Tries. However at some point, (probably
after the coming from clojure developers with "functional" pretensions?)
the "hash trie" was appropriated meaning something else,
something"immutable" and all that.

At least there isn't a wiki page for it.