This project is about easily finding out how much a rental property's price had declined since its peak price prior to pandemic lockdowns.
To run this project, open crawler.ipynb
as a Jupyter Notebook, and run reach cell.
A cell near the top of crawler.ipynb
contains the following parameters:
__from_cache_file
- ifTrue
, the crawler will takehtml
from the specified cache file rather than request html from the internet.__targets
- the source of target addresses__cache_dir
- the location of cached html__data_output_file
- name of the output file__output_dir
= directory for any and all output that is written to file
crawler.ipynb
does all the work- the file specified by
__targets
should contain a line for each of the properties you are interested in checking. Data is the format:
<unit #>/<street number> <street name>,
<suburb>,
<state(2-3 letter abbreviation)> <postcode>;
<advertised price($)>;
<number of bedrooms>;
<is this an NRAS property? (y/n)>
-
- Note: Each entry should be on a single line. The above is broken over several lines for readability
- Example:
301/4 Pretend Place, Newtown, NSW 2067; 290; 1; y