news-extractor
news-please - an integrated web crawler and information extractor for news that just works
A fork of Dragnet that also extracts the author, headline, date, and keywords from context, with built-in metadata extraction, all in one package.
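
To give a rough idea of what metadata extraction means here, the following is a minimal sketch using only the Python standard library. It is not this package's API: the names `MetaExtractor`, `extract_metadata`, and the `FIELDS` mapping are hypothetical, and the choice of `<meta>` keys (`og:title`, `author`, `article:published_time`, `keywords`) is an assumption about what a typical news page exposes.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Collects selected <meta> tags from an HTML page (illustrative only)."""

    # Hypothetical mapping from <meta> name/property values to output fields.
    FIELDS = {
        "og:title": "headline",
        "author": "author",
        "article:published_time": "date",
        "keywords": "keywords",
    }

    def __init__(self):
        super().__init__()
        self.result = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attrs = dict(attrs)
        # Open Graph tags use "property"; classic meta tags use "name".
        key = attrs.get("property") or attrs.get("name")
        field = self.FIELDS.get(key)
        content = attrs.get("content")
        if field and content:
            # Keep the first value found for each field.
            self.result.setdefault(field, content.strip())


def extract_metadata(html):
    """Return a dict of whichever fields are present in the page's meta tags."""
    parser = MetaExtractor()
    parser.feed(html)
    parser.close()
    return parser.result
```

Calling `extract_metadata(page_html)` on a page's HTML string returns a dictionary containing only the fields that were found. Extracting the same fields from context, as described above, would need the page body as well and is not covered by this sketch.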