A jQuery-like, Pythonic web crawler library based on BeautifulSoup and wget.

NStal/pywebquery

pywebquery is a web crawler library inspired by jQuery. It is built on BeautifulSoup, the well-known XML/HTML parsing library, and its built-in download method is based on wget.

You can use it jQuery-style or in a more Pythonic way, for example:

for item in Page("http://www.stackoverflow.com").find(".question-hyperlink:title[android]").attr("href"):
    print(item)

This gives you every question link about Android on the Stack Overflow home page.
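To make the selector concrete without touching the network, here is a minimal sketch of the same filter written with the standard library's html.parser rather than pywebquery or BeautifulSoup. It assumes that `.question-hyperlink:title[android]` means "elements with class question-hyperlink whose title attribute contains android", which is a reading inferred from the example and not documented behavior. The case-insensitive match, the `LinkCollector` class, the `collect_links` helper, and the sample HTML are all hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from tags with a given class and a title keyword.

    Hypothetical helper: it mimics the assumed meaning of the selector,
    not pywebquery's actual implementation.
    """

    def __init__(self, css_class, keyword):
        super().__init__()
        self.css_class = css_class
        self.keyword = keyword.lower()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        # Attributes without a value arrive as None, so fall back to "".
        classes = (attr.get("class") or "").split()
        title = (attr.get("title") or "").lower()
        if self.css_class in classes and self.keyword in title and attr.get("href"):
            self.hrefs.append(attr["href"])


# Made-up markup standing in for a downloaded page.
SAMPLE_HTML = """
<a class="question-hyperlink" title="Android layout bug" href="/questions/1">Q1</a>
<a class="question-hyperlink" title="Python GIL" href="/questions/2">Q2</a>
<a class="other" title="android tips" href="/questions/3">Q3</a>
"""


def collect_links(html, css_class="question-hyperlink", keyword="android"):
    """Return the hrefs of matching tags, in document order."""
    parser = LinkCollector(css_class, keyword)
    parser.feed(html)
    parser.close()
    return parser.hrefs


print(collect_links(SAMPLE_HTML))  # ['/questions/1']
```

Only the first link matches: the second has the right class but the wrong title, and the third has the right title but the wrong class.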

What about the content?

for item in Page("http://www.stackoverflow.com").find(".question-hyperlink:title[android]").follow().find(".question.post-text").text():
    print(item)

Now you have the text of every matching question.
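Following links from a page usually means turning relative hrefs such as /questions/1 into absolute URLs before downloading them. The sketch below shows that step with the standard library's `urllib.parse.urljoin`. Whether `follow()` resolves links this way is an assumption, and the `resolve_links` helper and the sample hrefs are hypothetical.

```python
from urllib.parse import urljoin


def resolve_links(base_url, hrefs):
    """Resolve each href against base_url; absolute URLs pass through unchanged."""
    return [urljoin(base_url, href) for href in hrefs]


# Hypothetical hrefs of the kind the first example might return.
links = resolve_links(
    "http://www.stackoverflow.com",
    ["/questions/1", "http://example.com/q/2"],
)
for link in links:
    print(link)
# http://www.stackoverflow.com/questions/1
# http://example.com/q/2
```

Each resolved URL could then be handed to whatever downloader is in use.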

