Automated subset selection Python code for Masters Thesis on proteomics search engine parameter optimization. Code includes automated selection, controlled and randomized, for creating subset mgf files from a full mgf file. Requires the Pyteomics toolkit.