Skip to content

Commit

Permalink
Update README.MD
Browse files Browse the repository at this point in the history
  • Loading branch information
matt-farmer authored Apr 19, 2018
1 parent 5050e64 commit a5f4f2e
Showing 1 changed file with 10 additions and 12 deletions.
22 changes: 10 additions & 12 deletions nap-writing-print/README.MD
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ The full process to go from receiveing a NAPLAN results data file is:
1. invoke nap-writing-print with the following command
> ./nap-writing-print
The nap-writing-print tool will then run and create individual pdf files for each writing response that it finds
The nap-writing-print tool will then run and create individual html files for each writing response that it finds
in the provided writing_extract.csv file.

When the tool runs it creates two folder strucures in its /out directory
Expand All @@ -39,10 +39,13 @@ in each of the folders you will find the following sub-folder strucure

/script

The files are named with a prefix of the student participation code, followed by the anonymous id given
The files are named with a prefix of the state identifier and the student participation code, followed by the anonymous id given
to that student's script. The participation prefix is to allow users to ignore
scripts or non-scripts from students who did not complete the writing test.

writing script files end with the suffix script
audit files end with the suffix audit

The files have the same name, for cross validation, in each folder

Files in the /script directory contain
Expand All @@ -53,24 +56,19 @@ the meta-data associated with the student's script; such as PSI, Local identifie

This means scripts can be distributed independently, but manually reconciled if needed.

As the pdf outout is constructed entirely from the contents of the input file, for safety at the end of a run a timestamped
folder is created in the /backup directory of the nap-writing-print folder so that the same pdfs can be
As the html outout is constructed entirely from the contents of the input file, for safety at the end of a run a timestamped
folder is created in the /backup directory of the nap-writing-print folder so that the same html files can be
generated at any time in the future even if the working .csv file has been over-written.

The pdf files are all sized as A4, and are currently only created in black and white, the printing tool
does not maintain any text colouring added by the user.

The output file does maintain the paragraphing of the original input from the user, but does not maintain formatting
such as bold, italics or sub/superscripting.

All input will be rendered as text icluding the best textual approximation of extended chars such as emojis.
The output file maintains the paragraphing of the original input from the user, bold text, underlined text, italic text, and ordered and unordered lists as available in the online naplan editor component.

It goes without saying that this tool will generate a lot of files, so is best run on 64-bit environments where constraints
on number of files in a directory or folder are not an issue if large input files are being processed.

It is advised for best performance to run on a machine with an SSD hard disk.

Current performance on a i5 macbook creates 200k pdfs in around 2 minutes.
Current performance on a i5 macbook creates 200,000 html files (4 files for each of 50k students) in around 1 minute.




Expand Down

0 comments on commit a5f4f2e

Please sign in to comment.