Deutsch   English   Français   Italiano  
<v5fvp7$1ut7v$1@dont-email.me>

View for Bookmarking (what is this?)
Look up another Usenet article

Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Lawrence D'Oliveiro <ldo@nz.invalid>
Newsgroups: comp.os.linux.advocacy
Subject: Re: Need Assistance -- Network Programming
Date: Wed, 26 Jun 2024 02:52:23 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 10
Message-ID: <v5fvp7$1ut7v$1@dont-email.me>
References: <17da6bead1f52684$159717$3694546$802601b3@news.usenetexpress.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Wed, 26 Jun 2024 04:52:23 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="d3e70b79bcb23d33c66caba9f2139d60";
	logging-data="2061567"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX1/cQyUbZjMafEAtYY2rk9yD"
User-Agent: Pan/0.158 (Avdiivka; )
Cancel-Lock: sha1:TzyTxE0d3P5aJAr80NK7MUxTwwU=
Bytes: 1345

On Wed, 19 Jun 2024 13:47:44 +0000, Lester Thorpe wrote:

> I need to read through an HTML file, find all external HTTP(S) links,
> and then determine if those external links are still viable, i.e.
> if the pages to which they link still exist.
> 
> Perl is the language of choice.

Does Perl have anything like BeautifulSoup
<https://www.crummy.com/software/BeautifulSoup/bs4/doc/>?