From: saito
Newsgroups: comp.lang.tcl
Subject: Re: unicode text
Date: Sat, 9 Nov 2024 12:57:27 -0500

On 11/8/2024 10:15 PM, Michael Soyka wrote:
> On 11/08/2024 9:28 PM, saito wrote:
>> Is there a way to remove emojis, non-printable and other graphic
>> characters from a string? I can use a regexp with a-zA-Z and such but
>> this doesn't account for valid characters from non-ascii/non-Western
>> languages, right?
>>
> I've found that this regular expression works for emojis:
>    [^[:print:][:cntrl:]]

Thanks! That is a good start.
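
For reference, a minimal sketch of how the quoted expression could be applied with regsub. The proc name strip_unprintable is hypothetical, and how [:print:] classifies a given character (especially characters outside the Basic Multilingual Plane) may depend on the Tcl version, so this is a starting point to test against real data, not a verified filter:

```tcl
# Hypothetical helper (name is an assumption, not from the thread):
# delete every character that is neither printable nor a control
# character, using the expression quoted above.
proc strip_unprintable {text} {
    # -all replaces every match; the empty substitution deletes it.
    # regsub stores the result in the variable "cleaned".
    regsub -all {[^[:print:][:cntrl:]]} $text {} cleaned
    return $cleaned
}

# Usage: pass any string and inspect what survives.
puts [strip_unprintable "abc123"]
```

Because [:cntrl:] is inside the negated set, tabs and newlines are kept; dropping [:cntrl:] from the bracket expression would strip those as well.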