Blog Archive for November 23, 2012

PhantomJS Site Scrape

November 23, 2012

PhantomJS is a standalone headless webkit based browser that can run from the command line. It runs scripts written in JavaScript, which can also run in the context of a remote web page.

This script goes to the Rothwell Temperance Band website at http://rtb.org.uk/ and finds all the H3 tags …