minimal_web_scraper.main#

Overview#

Functions#
`download`(target_url, timeout)	Download the HTML content of the URL using the requests library.
`scrape`(url)	Orchestrate the download and parsing of the resource at the URL.

Attributes#
`headers`	-

minimal_web_scraper.main.download(target_url: str, timeout: int = 1) → tuple[bytes, str | None]#

Download the HTML content of the URL using the requests library.

The request is sent with custom headers (see the module-level `headers` attribute).
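
A minimal sketch of how such a download function might look, not the module's actual implementation. The header values, the optional `session` parameter, and the choice to return the response's declared encoding as the second tuple element are all assumptions; the documentation above only states the signature and that custom headers are used.

```python
from __future__ import annotations

import requests

# Hypothetical header values; the real module-level `headers` attribute is undocumented.
EXAMPLE_HEADERS = {"User-Agent": "minimal-web-scraper-example/0.1"}


def download(target_url: str, timeout: int = 1, session=None) -> tuple[bytes, str | None]:
    """Fetch a URL and return (raw body, declared encoding or None).

    `session` is an assumption added for testability: any object exposing a
    requests-style `get(url, headers=..., timeout=...)` method can be passed in.
    """
    http = session if session is not None else requests
    response = http.get(target_url, headers=EXAMPLE_HEADERS, timeout=timeout)
    # Turn 4xx/5xx responses into exceptions instead of returning an error page.
    response.raise_for_status()
    # `content` is the undecoded body; `encoding` may be None if it cannot be determined.
    return response.content, response.encoding
```

Returning the raw bytes together with the encoding leaves the decoding decision to the caller, which is one plausible reading of the `tuple[bytes, str | None]` return type.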

minimal_web_scraper.main.scrape(url: str) → Any#

Orchestrate the download and parsing of the resource at the URL.

Parameters: url – URL to parse
Returns: the information extracted by an implementation of parsers.BaseParser.parse()
Raises: parsers.exceptions.ParserNotFound
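
The orchestration described above can be sketched as follows. This is an illustration under stated assumptions, not the package's code: `ParserNotFound` and `BaseParser` are local stand-ins for the names in `parsers`, while the `domain` attribute, the `PARSERS` registry, the `ExampleParser` class, and the injected `download` callable are hypothetical. The sketch also assumes a parser is chosen by host name before anything is downloaded.

```python
from __future__ import annotations

from typing import Any, Callable
from urllib.parse import urlparse


class ParserNotFound(Exception):
    """Local stand-in for parsers.exceptions.ParserNotFound."""


class BaseParser:
    """Local stand-in for parsers.BaseParser; `domain` is a hypothetical attribute."""

    domain = ""

    def parse(self, content: bytes, encoding: str | None) -> Any:
        raise NotImplementedError


class ExampleParser(BaseParser):
    """Hypothetical parser that extracts the page title from example.com pages."""

    domain = "example.com"

    def parse(self, content: bytes, encoding: str | None) -> Any:
        text = content.decode(encoding or "utf-8")
        start, end = text.find("<title>"), text.find("</title>")
        if start == -1 or end == -1:
            return {"title": None}
        return {"title": text[start + len("<title>"):end]}


# Hypothetical registry of available parsers.
PARSERS: list[BaseParser] = [ExampleParser()]


def scrape(url: str, download: Callable[[str], tuple[bytes, str | None]]) -> Any:
    """Pick a parser for the URL's host, download the page, and parse it."""
    host = urlparse(url).hostname
    for parser in PARSERS:
        if parser.domain == host:
            content, encoding = download(url)
            return parser.parse(content, encoding)
    # No registered parser claims this host.
    raise ParserNotFound(url)
```

Passing the download function in as an argument keeps the sketch runnable without network access; the documented `scrape(url)` takes only the URL and presumably calls `download` itself.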