CRUK Trailblazer image processor project

Basic idea

The Python scripts here form the basis of the pre-upload image processing needed to get pathology images into PyBossa.

The main functions carried out are:

rename the files to ensure they are anonymised
copy the output files to a different target directory
compress them
convert them to jpg if necessary
create a PyBossa image upload CSV that contains image metadata
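
As a rough illustration of the rename and copy steps listed above, here is a minimal sketch using only the standard library. The function name anonymise_and_copy, the use of random UUIDs as new names, and the list of extensions are all assumptions made for this example; the actual script may anonymise files differently. Compression and jpg conversion are left out because they depend on an imaging library that this README does not name.

```python
import shutil
import uuid
from pathlib import Path


def anonymise_and_copy(source_dir, target_dir,
                       extensions=(".jpg", ".jpeg", ".png", ".tif", ".tiff")):
    """Copy each image in source_dir into target_dir under a random name.

    Hypothetical helper: returns a dict mapping the original file name
    to the anonymised file name so the mapping can be kept offline.
    """
    source = Path(source_dir)
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)

    mapping = {}
    for path in sorted(source.iterdir()):
        # Skip sub-directories and anything that is not a known image type.
        if not path.is_file() or path.suffix.lower() not in extensions:
            continue
        # A random hex name carries no trace of the original file name.
        new_name = uuid.uuid4().hex + path.suffix.lower()
        # copyfile copies the contents only, not timestamps or permissions.
        shutil.copyfile(path, target / new_name)
        mapping[path.name] = new_name
    return mapping
```

The originals are left untouched in the source directory, so a run can be repeated safely against a fresh target directory.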

How to use

There is a set of constants defined at the top of the script; update these to match the image locations and formats
A second set of constants refers to common metadata for this image set; update as required
Set 'processing_annotated_images' to true or false according to the type of image being processed
Update 'get_core_id_from_filename' according to how the core_id is embedded in the filename
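
For orientation, the configuration steps above might look something like the sketch below. Only 'processing_annotated_images' and 'get_core_id_from_filename' are names taken from this README; the other constant names, their values, and the file naming scheme matched by the regular expression are assumptions for illustration and will need adapting to the real image set.

```python
import re

# Hypothetical location and format constants; the script's real names may differ.
SOURCE_DIR = "images/raw"
TARGET_DIR = "images/processed"
SOURCE_FORMAT = ".tif"

# Named in the README: True when the batch consists of annotated images.
processing_annotated_images = False

# Assumed naming scheme: "<slide>_core<digits>_<anything>.<ext>".
CORE_ID_PATTERN = re.compile(r"_core(\d+)", re.IGNORECASE)


def get_core_id_from_filename(filename):
    """Return the core id embedded in filename, or None if none is found."""
    match = CORE_ID_PATTERN.search(filename)
    return match.group(1) if match else None
```

With this assumed scheme, a file called slideA_core042_stain.tif yields the core id "042" (kept as a string so leading zeros survive), and a file with no core marker yields None.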

The CSV metadata file produced as output serves as a Google spreadsheet import for PyBossa
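
A minimal sketch of how such a CSV could be written with Python's csv module is shown here. The function name write_upload_csv and the column names (image_file, core_id, and whatever keys the common metadata contains) are assumptions for this example; the columns PyBossa actually expects depend on the project's task template.

```python
import csv


def write_upload_csv(rows, common_metadata, out_file):
    """Write one CSV row per image, with the common metadata repeated on each.

    rows: list of dicts with the hypothetical keys "image_file" and "core_id".
    common_metadata: dict of values shared by every image in the set.
    out_file: a text file object opened for writing (use newline="" on disk).
    """
    # Per-image columns first, then the shared columns in a stable order.
    fieldnames = ["image_file", "core_id"] + sorted(common_metadata)
    writer = csv.DictWriter(out_file, fieldnames=fieldnames)
    writer.writeheader()
    for row in rows:
        writer.writerow({**row, **common_metadata})
```

A typical call would open the output with open("upload.csv", "w", newline="") and pass the resulting file object as out_file.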

Note that annotated images are not renamed, but they are compressed and converted to jpg format. The CSV file is still created for annotated images but is not needed or used.

To get images onto the server, use git (please change the commit message!): git add --all && git commit -m 'adding images for xxxxx' && git push origin master
