Skip to content

Latest commit

 

History

History
39 lines (23 loc) · 1.49 KB

README.md

File metadata and controls

39 lines (23 loc) · 1.49 KB

naver-software-crawler

naver-software-crawler downloads every single data from one category that you select from Naver Software.

Prerequisites

Usage

Select one category and copy the main address like below.

Clone this repo and enter

$ python git clone https://github.com/sjang42/naver-software-crawler.git 
$ cd naver-software-crawler

Execute crawler.py with needed options

$ python crawler.py --help
$ python crawler.py --address=[your_wanted_address] --id=[your_naver_id] --password=[your_naver_password] --driver=[path_to_chrome_driver]

Example & Screenshot

This code will crawl all the fonts on Naver Software.

$ python crawler.py --address=http://software.naver.com/software/fontList.nhn?categoryId=I0100000 --id=your_naver_id --password=your_naver_password --driver=path_to_chrome_driver

Caution

Naver Software sometimes stops your action and let you go dump url. when this is happend, crawler will stop and will display the address that need to be preceed from. When you get this address, you execute crawler.py with --goingon=True. See crawler.py --help for more information. Auto re-starts feature will be added. Pull request is always welcome.