#python selenium is an automated testing framework; everything else is a Python Scrapy crawler.

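影评分析 (movie review analysis) does three main things: it scrapes web page data, cleans the data, and displays the result as a word cloud. It targets Python 2.7.

1. Fetch the web page; in Python this uses the urllib2 library.
2. Parse the returned HTML and extract the data we need, using BeautifulSoup.
3. Segment the text with jieba and count word frequencies (numpy).
4. Display the result as a word cloud with wordcloud.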
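The steps above can be sketched roughly as follows. This is a minimal, standard-library-only illustration in Python 3, not the code in the 影评分析 folder: `urllib.request` stands in for urllib2, `html.parser` stands in for BeautifulSoup, and a regex split with `collections.Counter` stands in for jieba and numpy. The `<span class="short">` markup, the sample HTML, and every function and class name are assumptions made for the example.

```python
import re
from collections import Counter
from html.parser import HTMLParser
from urllib.request import Request, urlopen


def fetch_html(url, timeout=10):
    """Step 1: download a page (urllib.request replaces urllib2 in Python 3)."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


class ReviewExtractor(HTMLParser):
    """Step 2: collect the text of each <span class="short"> (hypothetical markup)."""

    def __init__(self):
        super().__init__()
        self.reviews = []
        self._depth = 0  # greater than 0 while inside a matching span

    def handle_starttag(self, tag, attrs):
        if tag != "span":
            return
        if self._depth:
            self._depth += 1  # nested span inside a review
        elif "short" in (dict(attrs).get("class") or "").split():
            self._depth = 1
            self.reviews.append("")

    def handle_endtag(self, tag):
        if tag == "span" and self._depth:
            self._depth -= 1

    def handle_data(self, data):
        if self._depth:
            self.reviews[-1] += data


def word_frequencies(texts, stopwords=()):
    """Step 3: drop punctuation and stopwords, then count the remaining tokens."""
    counts = Counter()
    for text in texts:
        # Assumption: a plain regex split; real Chinese text needs a segmenter.
        for token in re.findall(r"\w+", text.lower()):
            if token not in stopwords:
                counts[token] += 1
    return counts


# Offline demonstration on made-up HTML, so no network access is needed.
sample_html = (
    '<div><span class="short">Great film, great cast</span>'
    '<span class="votes">12</span>'
    '<span class="short">Slow but great</span></div>'
)
extractor = ReviewExtractor()
extractor.feed(sample_html)
frequencies = word_frequencies(extractor.reviews, stopwords={"but"})
print(extractor.reviews)
print(frequencies["great"])
```

For step 4, a frequency mapping like `frequencies` is the kind of input that the wordcloud package's `WordCloud.generate_from_frequencies` accepts. For Chinese reviews, the regex split would need to be replaced by a real segmenter such as jieba, because `\w+` treats an unbroken run of Chinese characters as a single token.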

https://pan.baidu.com/s/17JimPl3bA2ShVHPZK1z9ow

cacha is a caching crawler.
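As a rough illustration of what a caching crawler does, the sketch below stores each downloaded page on disk and reuses it on later requests. It is not taken from the cacha folder: the `CachedDownloader` class, the SHA-256 file naming, and the fake fetch function are all hypothetical choices made for the example.

```python
import hashlib
import os
import tempfile


class CachedDownloader:
    """Wrap a fetch function so each URL is downloaded at most once."""

    def __init__(self, fetch, cache_dir):
        self.fetch = fetch          # any callable mapping a URL to HTML text
        self.cache_dir = cache_dir
        os.makedirs(cache_dir, exist_ok=True)

    def _path(self, url):
        # Hash the URL so it is always a safe, fixed-length file name.
        digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
        return os.path.join(self.cache_dir, digest + ".html")

    def get(self, url):
        path = self._path(url)
        if os.path.exists(path):    # cache hit: read from disk
            with open(path, encoding="utf-8") as handle:
                return handle.read()
        html = self.fetch(url)      # cache miss: download, then store
        with open(path, "w", encoding="utf-8") as handle:
            handle.write(html)
        return html


# Demonstration with a fake fetcher that records how often it is called.
calls = []

def fake_fetch(url):
    calls.append(url)
    return "<html>" + url + "</html>"

with tempfile.TemporaryDirectory() as cache_dir:
    downloader = CachedDownloader(fake_fetch, cache_dir)
    first = downloader.get("http://example.com/a")
    second = downloader.get("http://example.com/a")
    print(first == second, len(calls))
```

Injecting the fetch function keeps the cache logic independent of the HTTP library, so the same wrapper could sit in front of whichever downloader the crawler uses. This sketch never expires entries; a real crawler would likely also want a maximum age for cached pages.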

Folders and files (64 commits):

- .idea
- CS
- Linux
- cacha
- movietop250
- photo
- practice
- selenium
- todayfirst
- weather
- weibo
- zhidao
- 影评分析 (movie review analysis)
- 智联招聘网站爬取 (Zhaopin job site crawl)
- 股票数据定向爬取 (targeted stock data crawl)
- README.md


chaoyifei/python
