This program was created for the course "Team project - development of data analysis system", run by @pbiecek at Warsaw University of Technology.
The program downloads tweets from Twitter for a specified user or keyword. Downloading can be done in two modes:
- stream
- history
The full specification and a more detailed description of the summarization features (in Polish) can be found in this file.
- Download the binary file from the latest release
- Prepare `config.yaml`
- Run the binaries `twitter-saver` and `web`
The programs can be run from the command line with the following arguments:
- `--config`: path to the config file
The config file is stored in YAML format. It contains the following information:
- `db`:
  - `host`
  - `port`
  - `user`
  - `password`
  - `dbName`: name of the database in which data will be stored
  - `sslMode`: `enable` or `disable`
- `web`: web interface parameters:
  - `port`: port on which the server will listen
  - `secret`: private key used to sign session identifiers. It should be random and renewed periodically. Keys shorter than 256 bits are not recommended.
- `users`: sequence of user accounts. Each account consists of:
  - `username`
  - `password`
- `twitter`: Twitter API keys:
  - `consumerKey`
  - `consumerSecret`
  - `token`
  - `tokenSecret`
- `json`: defines which additional tweet fields are saved in the database:
  - `all`: saves the whole tweet content
  - `fields`: sequence of field names to be stored
- `autoDeleteDays`: number of days after which data will be automatically removed
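
The keys listed above could be combined into a `config.yaml` along these lines. This is only a sketch: every value is a placeholder, and three things are assumptions not confirmed by this description: the top-level placement of `users`, the boolean type of `json.all`, and the example field names under `json.fields`.

```yaml
# Hypothetical config.yaml; all values are placeholders to be replaced.
db:
  host: localhost
  port: 5432
  user: twitter_saver
  password: change-me
  dbName: tweets            # database in which data will be stored
  sslMode: disable          # "enable" or "disable"

web:
  port: 8080                # port on which the web interface listens
  # Random private key used to sign session identifiers.
  # Use at least 256 bits and renew it periodically.
  secret: replace-with-a-long-random-string

# Assumption: user accounts are a top-level sequence.
users:
  - username: admin
    password: change-me

twitter:
  consumerKey: YOUR_CONSUMER_KEY
  consumerSecret: YOUR_CONSUMER_SECRET
  token: YOUR_ACCESS_TOKEN
  tokenSecret: YOUR_ACCESS_TOKEN_SECRET

json:
  all: false                # assumption: boolean; true would save the whole tweet
  fields:                   # assumption: example tweet field names
    - lang
    - source

autoDeleteDays: 30          # data older than this many days is removed
```

With such a file in place, each binary would presumably be started by passing its path, for example `twitter-saver --config config.yaml`.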
- Piotr Krzeszewski
- Łukasz Ławniczak
- Artur Minorczyk