Contains an experiment which checks how well a GP classifier fits from a handful of labelled high-dimensional samples.

boykovdn/experiment_active_learning_performance

Aim

This experiment compares the performance of active learning (AL) querying against random subset selection. The expected result is that classifier performance improves more rapidly with AL querying than with random selection. The experiment is carried out on generated Gaussian clusters and on images, using Cellpose as the feature extractor.

Current state

The diagram below summarizes the input/output structure of the experiment: rectangular nodes are functions, and elliptical nodes are inputs or outputs.

Diagram of the experiment structure.

The gp_al algorithm (Gaussian Process Active Learning) is at the centre of the experiment. It handles the querying process, which simulates the input received from an oracle by revealing labels to the classifier in stages. For active learning, samples are selected based on the entropy of the classifier's predictions. The comparison with non-informative querying is made by passing in a querying function that simply selects a random subset of the data.

Handwritten draft of the gp_al algorithm.
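
The staged querying described above can be sketched roughly as follows. This is a minimal illustration, not the repository's gp_al implementation: it assumes scikit-learn's GaussianProcessClassifier with an RBF kernel, synthetic Gaussian clusters from make_blobs, and accuracy on the full data set as the performance measure. The names gp_al_sketch, entropy_query and random_query, and all parameter values, are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.gaussian_process import GaussianProcessClassifier
from sklearn.gaussian_process.kernels import RBF


def entropy_query(model, X_pool, n_query, rng):
    """Pick the pool samples whose predicted class distribution has the highest entropy."""
    proba = model.predict_proba(X_pool)
    entropy = -np.sum(proba * np.log(proba + 1e-12), axis=1)
    return np.argsort(entropy)[-n_query:]


def random_query(model, X_pool, n_query, rng):
    """Non-informative baseline: pick a random subset of the pool."""
    return rng.choice(len(X_pool), size=n_query, replace=False)


def gp_al_sketch(X, y, query_fn, per_class_init=2, n_query=5, n_rounds=5, seed=0):
    """Hypothetical stand-in for gp_al: reveal labels to the classifier in stages."""
    rng = np.random.default_rng(seed)
    # Seed the labelled set with a few samples per class so the GP sees every class.
    labelled = np.concatenate([
        rng.choice(np.flatnonzero(y == c), size=per_class_init, replace=False)
        for c in np.unique(y)
    ])
    pool = np.setdiff1d(np.arange(len(X)), labelled)
    scores = []
    for _ in range(n_rounds):
        model = GaussianProcessClassifier(kernel=RBF(length_scale=1.0), random_state=seed)
        model.fit(X[labelled], y[labelled])
        # Accuracy on the full data set, for simplicity (an assumption of this sketch).
        scores.append(model.score(X, y))
        # The "oracle" reveals the labels of the queried pool samples.
        picked = pool[query_fn(model, X[pool], n_query, rng)]
        labelled = np.concatenate([labelled, picked])
        pool = np.setdiff1d(pool, picked)
    return np.array(scores), labelled


# Synthetic Gaussian clusters in 10 dimensions (sizes chosen only for illustration).
X, y = make_blobs(n_samples=200, centers=3, n_features=10, cluster_std=3.0, random_state=0)
al_scores, al_idx = gp_al_sketch(X, y, entropy_query)
rnd_scores, rnd_idx = gp_al_sketch(X, y, random_query)
print("entropy querying:", al_scores)
print("random querying: ", rnd_scores)
```

Because both strategies share the same signature, the only thing that changes between the two runs is the querying function passed in, which mirrors how the comparison is set up in the text.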

Running

Edit run_container.sh so that it uses your directories, then launch the experiment with Docker through that script. Uncomment the Jupyter section of the script if you want to run a notebook on localhost:8008.

To use the fluorescent cells dataset, download and unpack https://zenodo.org/record/6645803 into /data, then use the run_container.sh script to mount this directory.
