Path: news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail From: Janis Papanagnou Newsgroups: comp.lang.awk Subject: Re: The new matcher (Was: Experiences with match() subexpressions?) Date: Fri, 11 Apr 2025 15:50:11 +0200 Organization: A noiseless patient Spider Lines: 37 Message-ID: References: <67f8b7af$0$705$14726298@news.sunsite.dk> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Injection-Date: Fri, 11 Apr 2025 15:50:14 +0200 (CEST) Injection-Info: dont-email.me; posting-host="a8e511e26582239175fc7649ace8aa93"; logging-data="1844782"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/SavbuYJyWd+3eqzyuCqn1" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.8.0 Cancel-Lock: sha1:RkMYa2FS9Z1Ovx4BweIb3k6GMkI= In-Reply-To: X-Enigmail-Draft-Status: N1110 On 11.04.2025 10:57, Kenny McCormack wrote: > In article <67f8b7af$0$705$14726298@news.sunsite.dk>, > Aharon Robbins wrote: > ... >> Mike Haertel is writing a new regexp matcher for gawk; it was announced >> here some time agao: https://github.com/mikehaertel/minrx. The code is >> in the feature/minrx branch of the gawk Git repository. > > Just out of curiosity, does the new matcher address the issue raised by > Janis? I read his post as if he put it under discussion ("I just opened an issue, [...] about this question. We shall see what develops.") and the provided link shows this as well.[*] (I don't see the answers, though, since my browser obviously doesn't support the web-page's (dynamic?) format. - So I cannot tell what the state of that discussion is.) > It sounds like you are implying that it does, but do not say so explicitly. > > [...] Janis [*] From https://github.com/mikehaertel/minrx/issues/43: So there are two questions. Is it theoretically possible to capture all the instances of subexpressions matched by the interval expression? Can this be brought out into the code? I understand it would take an extended API with a richer data structure in order to do this. gawk's extended version of the match() function could then be (somehow) extended to take advantage of this feature.