html2skim
html2skim OPTIONS SKIM_DB_FILE URL [CSS_SELECTOR]
html2skim provides a way to treat a webpage containing links as a
feed storing the results in a skimmer database. A SKIM_DB_FILE will be
created if it does not already exist. Feed channel information can be
specified via html2skim options. The URL should be for an HTML page that
html2skim will scrape for links. An optional CSS_SELECTOR can be
included to filter a specific section of the HTML document for links. If
none is provided then the selector a[href]
will be
used.
This is an example of scaping a web page identified as “https://example.edu/” filter the links found in the “.college-news” element.
html2skim \
myfeeds.skim \
https://example.edu/ \
'.college-news > a[href]'