How to handle missing `MSMS` data #38

hechth · 2023-02-23T09:50:07Z

Currently, when data is read from a CSV and no MS2 data is present, the MS1 data is copied into the MS2 slot, whereas the slot is left empty when reading from xcms. Should this be made the general case?


cbroeckl · 2023-02-23T15:04:04Z

If I recall, the clustering algorithm is written to expect data in the MS2 slot as well. This is sloppy coding, frankly: it was written that way as a shortcut to avoid having to change the similarity scoring. If there is only MS1 data, there is in theory no reason to calculate MS2 similarity or MS1 vs. MS2 correlational similarity.

To move away from this we would need to ensure that the calculate.similarity function behaviour is different when no MS2 data is available - currently there is no condition written to deal with this situation:
    max_value <- pmax(
      cor(data1[, start_row:stop_row], data1[, start_col:stop_col],
          method = cor.method, use = "everything"),
      cor(data1[, start_row:stop_row], data2[, start_col:stop_col],
          method = cor.method, use = "everything"),
      cor(data2[, start_row:stop_row], data2[, start_col:stop_col],
          method = cor.method, use = "everything")
      #, na.rm = TRUE )
    )
    # correlational similarity
    corr_sim <- round(exp(-((1 - max_value) ^ 2) / (2 * (sr ^ 2))), digits = 20)
    }

I think it is better to remedy this situation than to leave it as it was written. There would also be fewer calculations to do.
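
For illustration only, here is a minimal sketch of what such a condition might look like. `corr_similarity` and its arguments `rows` and `cols` are hypothetical names, not the existing `calculate.similarity` code, and the sketch assumes that missing MS2 data is represented as `NULL`. When `data2` is absent, only the MS1 vs. MS1 correlation is computed; otherwise the same three-way `pmax` and Gaussian transform as in the snippet above are applied.

```r
# Hypothetical helper (not the package's calculate.similarity): correlational
# similarity for one block of features, skipping the MS2 terms when no MS2
# data is supplied. Assumes missing MS2 data is passed as NULL.
corr_similarity <- function(data1, data2 = NULL, rows, cols, sr,
                            cor.method = "pearson") {
  # MS1 vs. MS1 correlation is always available
  max_value <- cor(data1[, rows, drop = FALSE], data1[, cols, drop = FALSE],
                   method = cor.method, use = "everything")

  if (!is.null(data2)) {
    # Only compute the MS1 vs. MS2 and MS2 vs. MS2 terms when MS2 data exists
    max_value <- pmax(
      max_value,
      cor(data1[, rows, drop = FALSE], data2[, cols, drop = FALSE],
          method = cor.method, use = "everything"),
      cor(data2[, rows, drop = FALSE], data2[, cols, drop = FALSE],
          method = cor.method, use = "everything")
    )
  }

  # Same Gaussian transform of the correlation as in the original snippet
  round(exp(-((1 - max_value)^2) / (2 * (sr^2))), digits = 20)
}
```

If the MS1 matrix is passed as `data2` (the current copy-into-the-MS2-slot behaviour), all three correlations are identical, so the result should match the `data2 = NULL` call while doing three times the correlation work.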

hechth · 2023-02-24T08:28:29Z

@cbroeckl I agree - then let's keep an eye on this. Let's make a list of places in the code where this behaviour will need to be adapted and resolve them step by step.
