blogmarks.net

Adrian Holovaty releases templatemaker, a Python library for smart screen scraping

18 months ago

Andy Baio : Adrian Holovaty releases templatemaker, a Python library for smart screen scraping - given a large set of HTML documents, intelligently extracts the strings that change between them

Matthew M. Boedicker : templatemaker, Python screenscraping library - (via waxy)

joshua : Introducing templatemaker - back out templates from similar documents

Rod Begbie : Introducing templatemaker - Python library that analyses a corpus of web pages, works out where the dynamic values are in the template, then allows you to scrape out the juicy details. I can think of oh, so many uses for this.

philgyford : Introducing templatemaker | Holovaty.com - Python thing. Point it at some HTML files and it will make a template with holes for the unique strings in the pages. (via Daring Fireball)
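
The idea described in the comments above (learn a template with holes from similar documents, then pull the changing strings out of a new one) can be sketched in a few lines of Python. This is only an illustration, not templatemaker's code or API: `learn`, `extract`, and `HOLE` are hypothetical names, and the standard-library `difflib` is swapped in for whatever matching the library actually performs.

```python
import re
from difflib import SequenceMatcher

# Hypothetical hole marker; assumes the documents never contain a NUL character.
HOLE = "\x00"

def learn(template, doc):
    """Merge doc into template, replacing every differing run with one hole."""
    matcher = SequenceMatcher(None, template, doc, autojunk=False)
    parts = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag == "equal":
            parts.append(template[i1:i2])   # text shared by both documents
        elif not parts or parts[-1] != HOLE:
            parts.append(HOLE)              # text that changes between them
    return "".join(parts)

def extract(template, doc):
    """Return the strings that fill the holes in doc, or None if doc does not fit."""
    pattern = "(.*?)".join(re.escape(piece) for piece in template.split(HOLE))
    match = re.fullmatch(pattern, doc, re.DOTALL)
    return match.groups() if match else None

template = learn("<b>this and that</b>", "<b>alex and sue</b>")
print(template.replace(HOLE, "!"))                    # <b>! and !</b>
print(extract(template, "<b>larry and curly</b>"))    # ('larry', 'curly')
```

Character-level diffing like this can produce spurious matches on real pages, so a sturdier version would probably compare tokens or tags and learn from many documents rather than two.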

Tags : dev python web adrianholovaty screenscraping html scraping templates templating top via:daringfireball webdevelopment
