Skip to content
vuamitom edited this page Aug 3, 2012 · 1 revision
  • Beautiful Soup
  • lxml
  • regex

pyGoose uses lxml to parse HTML text and falls back to Beautiful Soup for text that lxml can't parse.

Clone this wiki locally