From: Mild Shock
Newsgroups: comp.lang.prolog
Subject: ZebraLogic for evaluating LLMs (Was: Alan Kay's Dynabook fueled by Prolog?)
Date: Thu, 1 Aug 2024 18:24:25 +0200

Hi,

A Prolog appendix could give a nice boost to
your Edge Device Artificial Intelligence, and it
would probably not have much trouble with Zebra puzzles:

A week ago, I posted that I was cooking a logical
reasoning benchmark as a side project. Now it's
finally ready! Introducing 🦓 𝙕𝙚𝙗𝙧𝙖𝙇𝙤𝙜𝙞𝙘, designed for
evaluating LLMs with Logic Puzzles.
https://x.com/billyuchenlin/status/1814254565128335705

LoL

Bye

Mild Shock wrote:
> Hi,
>
> The paper mentions: "Huge computing power
> of our modern laptops". If I look at my
> new iPad Pro M4 2024, I would say
>
> "Huge computing power of tablets": measurements
> have shown it is almost twice as fast as my
> laptop from ca. 2020. So let's do the following:
>
> Bring LPTP to Dogelog Player?
>
> Bye
>
> P.S.: This would give a new spin on Alan
> Kay's vision of the Dynabook. Can we run the
> Dynabook idea on Prolog?
>
> Joe Armstrong interviews Alan Kay
> https://www.youtube.com/watch?v=fhOHn9TClXY
>
> Mild Shock wrote:
>> Hi,
>>
>> I remember Robert Stärk suddenly disappearing
>> from academic life at ETH Zurich.
>> Has Ulrich Neumerkel now also disappeared, not
>>
>> because of the Scryer Prolog disaster, but after
>> he figured out that failure slices are not hip
>> enough? What could be more hip? Are the modalities
>>
>> of Robert Stärk's logic more hip now, and even useful?
>>
>> Automated Theorem Proving for Prolog Verification
>> Fred Mesnard et al., May 2024
>> https://lim.univ-reunion.fr/staff/fred/Publications/24-MesnardMP-slides.pdf
>>
>>
>> Disclaimer: I am not deep into this theory.
>> It has some ingredients that were floating around
>> in the 80's, not only in the milieu of ETH Zurich
>>
>> but also in the vicinity of Gerhard Jaeger, Bern.
>> There are many alternative formalizations that
>> can express termination etc. But maybe LPTP is
>>
>> especially suited for Prolog?
>