Skip to content

This is a crawler for Tecent news,Netease News and some news website

Notifications You must be signed in to change notification settings

xvlvzhu/Search_Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Search_Engine

#版本:
version 0.1
#介绍:
This is a crawler for Tecent news,Netease News and some news website
#预备工作:
1、在工程中导入searchengine.jar
2、SearchEngine_zxl.jar -> add to build path
3、添加https://github.com/zxl1994/Search_Engine 目录中的Jar依赖包
4、c3p0-config.xml中填入数据库信息
#使用方法:
News_Crawler_2 news_Crawler = News_Crawler_2.getInstance(); //创建实例
news_Crawler.start_crawler(); //启动爬虫
news_Crawler.user_defined_url(); //自定义爬虫
news_Crawler.stop_crawler(); //停止爬虫

ReverseIndex.reverseIndex("XXX","XXX",false);
参数1:创建/存放索引的目录
参数2:检索关键字
参数3:true:更新/创建索引 false:使用已建立的索引

About

This is a crawler for Tecent news,Netease News and some news website

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages