Blog Archive for January 5, 2014

Fetching HTML via Beautiful Soup

January 5, 2014

I needed to pull news stories from a website that didn't have an RSS feed, so I used Beautiful Soup from Python to parse the HTML and find the appropriate section of the page.
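As a rough illustration of the general idea (not the code from this post), here is a minimal sketch that parses a page and pulls the links out of one section. The sample markup, the `news` id, and the `extract_story_links` helper are all hypothetical stand-ins; a real site will need its own selectors.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for a downloaded news index page.
SAMPLE_HTML = """
<html><body>
  <div id="header">Site header</div>
  <div id="news">
    <a href="/story/1">First story</a>
    <a href="/story/2">Second story</a>
  </div>
</body></html>
"""


def extract_story_links(html, section_id="news"):
    """Return (title, href) pairs for links inside the element with the given id."""
    soup = BeautifulSoup(html, "html.parser")
    # Locate the section of the page that holds the stories (assumed id).
    section = soup.find(id=section_id)
    if section is None:
        return []
    # Collect every anchor in that section that actually carries an href.
    return [(a.get_text(strip=True), a["href"])
            for a in section.find_all("a", href=True)]


links = extract_story_links(SAMPLE_HTML)
for title, href in links:
    print(title, href)
```

In practice the HTML string would come from fetching the page first, for example with `urllib.request.urlopen(url).read()`, and the resulting links could then be followed one by one.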

In the following code, lNewsIndexUrl contains the full URL to …