Path: ...!weretis.net!feeder8.news.weretis.net!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Richard Owlett Newsgroups: comp.editors Subject: Re: Automating an atypical search & replace Date: Sun, 14 Jul 2024 02:47:02 -0500 Organization: A noiseless patient Spider Lines: 20 Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Injection-Date: Sun, 14 Jul 2024 09:47:04 +0200 (CEST) Injection-Info: dont-email.me; posting-host="46abaa0d5260ada45c5313798b70835f"; logging-data="71231"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+qQetRbfU7/Dj6L4bEiMkI7+HbXMOWi/g=" User-Agent: Mozilla/5.0 (X11; Linux i686; rv:52.0) Gecko/20100101 Firefox/52.0 SeaMonkey/2.49.4 Cancel-Lock: sha1:zzrepGLJ4voT38m6Fekz5vzlShY= In-Reply-To: Bytes: 1845 On 07/13/2024 06:39 PM, Lawrence D'Oliveiro wrote: > On Sat, 13 Jul 2024 11:08:48 -0500, Richard Owlett wrote: > >> These occurrences are consistently of the form >> arbitrary_text >> >> I wish to delete "" and *ASSOCIATED* "". > > This is beyond the abilities of regular expressions. This is the point > where you need to use an actual HTML/XML-parsing library. > > See also > . > Thank you for the reference. Also I've begun perusing https://docs.kde.org/stable5/en/kate/katepart/regular-expressions.html . One of my motivations for this project is education.