git, JDK 11 (or later) and Apache Maven are required.
Execute:
$ cd ~/foo/bar/
$ git clone https://github.com/edamontology/pubfetcher.git
$ cd pubfetcher/
$ git checkout develop
$ mvn clean install
PubFetcher can now be run with:
$ java -jar ~/foo/bar/pubfetcher/target/pubfetcher-cli-<version>.jar -h
A packaged version of PubFetcher can be found as ~/foo/bar/pubfetcher/dist/target/pubfetcher-<version>.zip
.
Same as previous section, except git checkout develop
must be replaced with git checkout master
.
Pre-built releases can be found from https://github.com/edamontology/pubfetcher/releases. A downloaded release package can be unzipped in the desired location, where pubfetcher-cli-<version>.jar
can again be run with java -jar
.
By default, Selenium WebDriver is used to get the content of web pages that require JavaScript. Currently, only the Firefox driver is supported, meaning that Firefox needs to be installed (or the firefox executable available somewhere and pointed to with --seleniumFirefox /path/to/firefox
), otherwise PubFetcher will fail to start. If this not desirable, then usage of Selenium can be disabled with --selenium false
, in which case the included HtmlUnit library will be used instead (however, this will be slower and with worse quality results).