Charity seeding experiment code and process

FORK of Toby's original code to interact with the JustGiving API. We (@daaronr and @oskasf) have adjusted it slightly (see commits) and are using this to do data pulls and capture data used elsewhere.Adapted by Oska to automate the process of data pulling and munging.

Charity seeding experiment code and process

Pulls

Files

bash Folder:

Contains scripts to pull new pages from JustGiving. Each file is fairly self explanatory; each contains explicit command line arguments for the list of charities and output folders. See the below explanation of justgiving_data_pull.R for more info. These scripts should be run from the repo root directory.

data_munge.sh combines all data using R scripts (script from R/process_data/main.R, calling a bunch of other scripts); saves 3 RDS files
move_final_dfs.sh: A script to move dataframes which are exported from running combine_available_data.R into the fundraising_data_pull repo. (This code presumes that both repos are stored in the same folder on your system. Note this may be redundant to code on the other side doing this.)
pull_effective.sh pulls data on effective charities (using R/justgiving_data_pull.R and parameters), and pushes to Github
pull_new_effective.sh pulls data on new effective charities and pushes to Github
pull_top_50_guardian_nonrelig_noncollege.sh
pull_new_top_50_guardian_nonrelig_noncollege.sh

R Folder:

just_giving_data_pull.R pulls data on charities which can be specified using command line arguments. Running rscript just_giving_data_pull.R --help gives further detail on all available arguments. (See optparse tool)
functions.R defines functions for data pulls.
functions.R defines functions for data pulls
get_current_state_and_randomise.R defines the randomisation process, outputs a file listing all new treatment groups, and saves the current state

R/process_data:

clean_data.R performs some light column type adjustments and deals with some missing data
combine_available_data.R combines all the donation and fundraising data into 3 key dataframes which have summary statistics
folder_funcs.R defines functions which are used in monthly_sum.R
main.R ties together all R files in this folder
monthly_sum.R is a file to speed up the process of combining. Data on donations and fundaisers are aggregated by the month in which they were pulled and written to a total dataframe for the month. This removes the need to re-read and re-combine all past data when running combine_available_data.R. Instead only the monthly dataframes are combined, of which there are much fewer (around 20-30 instead of 300). Note that monthly_sum.R will ignore previous months data but combine the current months.

Lists of charities

data/guardian_top_50.csv: data/guardian_top_50_nonrelig_noncollege.csv: removes 5 charities from the above, one college and four largely funding religious activities

data/effective_charities.csv: all recommended by one or more organisations associated with effective altruism (may need updating)

extend_charities/top_1000_list.csv

These orgs are: GiveWell https://www.thelifeyoucansave.org/ https://ea-foundation.org/ https://animalcharityevaluators.org

How do I make the code run? [NEEDS UPDATING -- partially out of date]

First you need to register and create an app id on JustGiving, and save this as in a file you call a file "my_app_id.R", containing a single line of text

my_app_id <- "/[ID]"

replacing "[ID]" with your ID, without the brackets.

Next... /

What are the files created?

2 files are created (?and 4 are updated) each time data is drawn from the API, and these are stored to the folders mentioned below.

Note that we must run this regularly to retain data from expired pages (which can't be accessed through the api).

DR: The above needs clarification. The 'all_' files are no longer being created I think

Created:

A table of currently live (not yet expired) JG fundraising pages is created in

{data\just_giving_data_snapshots\fundraisers}

...with the current date appended. This contains only those "effective" charities that have a just giving id in the effective_charities.csv file (the effective_charities.csv is in the data folder of the project).

A table of donations to currently live pages is created in

{data\just_giving_data_snapshots\donations}

..with the current date appended.

These files are created as a record of the state of the full sample of pages. This is done:

in case we find issues with the code or our data collection methodology during the experiment, and
for transparency - this data can be published as a way of allowing our entire process to be visible.

Updated:

data_pulls.csv is updated after every pull with the date and the file paths of the two files created (fundraisers and donations). The most recent files referenced in this table are used to update the other files.

Note: additional details for our project moved to private page

my_app_id <- {/your app id}

You may need to add quotes around this, i.e.: my_app_id <- "/id_number"

12th September

Data pulls were previously only pulling the effective charities, this has been changed now.

Name		Name	Last commit message	Last commit date
Latest commit History 536 Commits
R		R
bash		bash
data		data
extend_charities		extend_charities
rds		rds
renv		renv
.Rprofile		.Rprofile
.gitignore		.gitignore
README.md		README.md
fundraising_data_pull.Rproj		fundraising_data_pull.Rproj
my_app_id.R		my_app_id.R
packages_on_linode.txt		packages_on_linode.txt
renv.lock		renv.lock
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Charity seeding experiment code and process

Pulls

Files

Lists of charities

How do I make the code run? [NEEDS UPDATING -- partially out of date]

What are the files created?

Created:

Updated:

12th September

About

Releases

Packages

Languages

daaronr/fundraising_data_pull

Folders and files

Latest commit

History

Repository files navigation

Charity seeding experiment code and process

Pulls

Files

Lists of charities

How do I make the code run? [NEEDS UPDATING -- partially out of date]

What are the files created?

Created:

Updated:

12th September

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages