Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor Muckrock Scraper #111

Open
wants to merge 15 commits into
base: main
Choose a base branch
from

Conversation

maxachis
Copy link
Collaborator

@maxachis maxachis commented Dec 16, 2024

Supports #105

Refactors Muckrock Scraper, getting rid of redundant code, modularizing, and otherwise doing quality of life improvements.

* Create explicit "main" method
* Extract logic to functions
* Add detailed docstrings
* Add some comments
* Create explicit "main" method
* Extract logic to functions
* Add detailed docstrings
* Add some comments and TODOs
* Add detailed docstrings
* Add some comments and TODOs
* Create explicit main function and `__main__` section
* Add detailed docstrings
* Add some comments and TODOs
* Extract logic from `muck_get.py` and `download_muckrock_foia.py`
* Create constants for base muckrock api url and foia extension of base url
* Extract logic for loading from and saving to json files to separate functions
* Add TODOs
* Extract `muck_get.py` logic to FOIA searcher
* Remove deprecated `download_muckrock_foia.py`
* Create MuckrockFetcher base class
* Implement in FOIAFetcher
* Create JurisdictionFetcher and AgencyFetcher
* Replace relevant logic in `generate_detailed_muckrock_csv.py`
* Create Enum Class
* Simplify Agency Info data creation
* Extract logic to separate functions
* Create Enum Class
* Simplify Agency Info data creation
* Extract logic to separate functions
* Create LoopFetcher classes
* Implement in `get_allegheny_foias`
* Create SQLClient classes
* Add custom exception handling to Muckrock Fetcher.
* Clean up comments
* Extract some logic to separate functions.
* Create FOIA DB Searcher class, incorporate into module
* Extract logic to functions
* Move all class files into `/classes` module
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant