forked from HHS/TANF-app
Commit
Merge branch 'develop' into sprint-100-summary
Showing 40 changed files with 2,867 additions and 37,482 deletions.
@@ -0,0 +1,11 @@
# High Level Parsing Flow

Parsing begins after a user submits one or more datafiles via the frontend. The submission generates one or more
new Celery tasks, which are enqueued to Redis. As work becomes available, the Celery workers dequeue tasks from
Redis and begin working on them. The parsing task gets the Datafile Django model and iterates over each line in
the file. For each line, the task parses the line into a new record, performs category 1-3 validation on the
record, performs exact duplicate and partial duplicate detection, performs category 4 validation, and stores the
record in a cache to be bulk created/serialized to the database and Elasticsearch. The image below provides a
high-level flow of these steps.

![Parsing Flow](./diagrams/parsing.png)
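
The per-line loop described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the project's implementation: `Record`, `parse_datafile`, `bulk_create`, `batch_size`, and `key_len` are hypothetical names, and the validation and duplicate rules are stand-ins chosen only to make the ordering of the steps visible. Celery, Django, and Elasticsearch are deliberately left out so the sketch runs on its own.

```python
from dataclasses import dataclass, field


@dataclass
class Record:
    # Hypothetical record type; the real models are not shown in the source.
    line_number: int
    raw: str
    errors: list = field(default_factory=list)


def parse_datafile(lines, bulk_create, batch_size=1000, key_len=4):
    """Hypothetical per-line loop; every rule below is a stand-in."""
    seen_lines = set()  # raw lines already seen (exact duplicates)
    seen_keys = set()   # key prefixes already seen (partial duplicates)
    cache = []          # records waiting to be bulk created

    for line_number, line in enumerate(lines, start=1):
        # 1. Parse the line into a new record.
        record = Record(line_number, line.rstrip("\n"))

        # 2. Category 1-3 validation (stand-in rule: line must not be blank).
        if not record.raw.strip():
            record.errors.append("cat1-3: blank line")

        # 3. Exact and partial duplicate detection (assumed: a partial
        #    duplicate shares the first key_len characters).
        key = record.raw[:key_len]
        if record.raw in seen_lines:
            record.errors.append("exact duplicate")
        elif key in seen_keys:
            record.errors.append("partial duplicate")
        seen_lines.add(record.raw)
        seen_keys.add(key)

        # 4. Category 4 validation (stand-in rule: maximum line length).
        if len(record.raw) > 80:
            record.errors.append("cat4: line too long")

        # 5. Cache the record and flush once the batch is full.
        cache.append(record)
        if len(cache) >= batch_size:
            bulk_create(cache)
            cache = []

    if cache:  # flush whatever remains after the last line
        bulk_create(cache)
```

Passing a callable such as `batches.append` as `bulk_create` collects each flushed batch in a list, which makes the batching behavior easy to inspect; in a real system that callable would write to the database and the search index instead.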