Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Lawrence D'Oliveiro <ldo@nz.invalid>
Newsgroups: comp.os.linux.advocacy
Subject: Re: Need Assistance -- Network Programming
Date: Wed, 26 Jun 2024 02:52:23 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 10
Message-ID: <v5fvp7$1ut7v$1@dont-email.me>
References: <17da6bead1f52684$159717$3694546$802601b3@news.usenetexpress.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Wed, 26 Jun 2024 04:52:23 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="d3e70b79bcb23d33c66caba9f2139d60"; logging-data="2061567"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/cQyUbZjMafEAtYY2rk9yD"
User-Agent: Pan/0.158 (Avdiivka; )
Cancel-Lock: sha1:TzyTxE0d3P5aJAr80NK7MUxTwwU=
Bytes: 1345

On Wed, 19 Jun 2024 13:47:44 +0000, Lester Thorpe wrote:

> I need to read through an HTML file, find all external HTTP(S) links,
> and then determine if those external links are still viable, i.e.
> if the pages to which they link still exist.
>
> Perl is the language of choice.

Does Perl have anything like BeautifulSoup
<https://www.crummy.com/software/BeautifulSoup/bs4/doc/>?