use `xtal2png` with `imagen-pytorch` and `matbench-genmetrics` #204

sgbaird · 2022-08-20T07:21:53Z

matbench-genmetrics is in a usable state now #12 (comment)

I think imagen-pytorch can be used with TPU, but I'm not sure how much custom configuration is required https://github.com/sparks-baird/xtal2png/blob/main/notebooks/3.1-imagen-pytorch.ipynb

I might just need to try it on Colab, switch to TPU, and see what happens. I think the latest version uses the 🤗 Accelerate library, which should make it easier to switch over. I'm unsure whether I should focus more on hyperparameter tuning or just pick some reasonable defaults and train for as long as seems reasonable (a week or two, for example). If I go with my university HPC instead of TPU time, I can still do checkpointing in either case.

sgbaird · 2022-10-22T04:54:03Z

@ ~2000 epochs (4x4 tile)

kjappelbaum · 2022-10-22T16:40:39Z

> @ ~2000 epochs (4x4 tile)

do they decode to some reasonable materials? :D

sgbaird · 2022-10-28T18:53:38Z

> do they decode to some reasonable materials? :D

I'm going to go with a pretty confident "no" 😬

I think I'm also going to say the metrics need some work (note this is for 1000 generated structures):

{0: {'validity': 0.4092998941577499, 'coverage': 0.0, 'novelty': 1.0, 'uniqueness': 1.0}}
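
For intuition on how a novelty and uniqueness of 1.0 can sit alongside a coverage of 0.0, here is a minimal toy sketch. It is not the matbench-genmetrics implementation: `toy_metrics` is a hypothetical helper, plain strings stand in for crystal structures, and the set-based definitions are simplifying assumptions (the real benchmark compares structures, and its exact definitions may differ). Validity is left out because its definition is not given in this thread.

```python
def toy_metrics(generated, train, test):
    """Toy, set-based stand-ins for three of the reported metrics.

    Assumption: items are hashable labels (strings here), so "matching"
    is plain equality rather than crystal-structure comparison.
    """
    gen = list(generated)
    gen_set = set(gen)
    train_set, test_set = set(train), set(test)

    # Fraction of held-out items that the generator reproduced.
    coverage = len(test_set & gen_set) / len(test_set)
    # Fraction of generated items that do not appear in the training data.
    novelty = sum(g not in train_set for g in gen) / len(gen)
    # Fraction of generated items that are distinct from one another.
    uniqueness = len(gen_set) / len(gen)

    return {"coverage": coverage, "novelty": novelty, "uniqueness": uniqueness}


# Hypothetical labels: every generated item is distinct and matches nothing known.
result = toy_metrics(
    generated=["gen-1", "gen-2", "gen-3"],
    train=["train-1"],
    test=["test-1"],
)
print(result)  # {'coverage': 0.0, 'novelty': 1.0, 'uniqueness': 1.0}
```

Under these toy definitions, a generator that outputs items matching nothing in either the training or held-out set scores perfectly on novelty and uniqueness while scoring zero on coverage, so those two numbers alone say little about quality.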

sgbaird · 2022-10-28T18:54:42Z

While I'm sure there's a lot to be done with the hyperparameters, I think I'll take another shot at running CDVAE for comparison.

HarshaSatyavardhan · 2023-10-09T07:15:06Z

I am seeing a coverage of 0. Is that not concerning? I myself have tried and got a coverage of 0 with different variations of DDPM.

sgbaird · 2023-10-13T01:51:39Z

@HarshaSatyavardhan thanks for the great question. Concerning - yes, though the idea of rediscovery is quite difficult. To make the point, see what the authors of PGCGM needed to do before "moving the needle" past 0 in https://www.nature.com/articles/s41524-023-01059-8:

Notice how the first bar along the horizontal axis starts at 50*10000 = 500,000. Also, the coverage benchmark from matbench-genmetrics is even more difficult to succeed at than what PGCGM did because it uses time-based splits (i.e., not just can we discover something that was held out, but can we discover something "in the future" based only on training data before some calendar year).
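
To make the time-based split idea concrete, here is a minimal sketch under stated assumptions. The `time_split` helper, the `label` and `year` field names, and the example records are all hypothetical and are not taken from matbench-genmetrics; the point is only that the held-out set contains entries dated at or after a cutoff year, while training sees only earlier ones.

```python
def time_split(records, cutoff_year):
    """Split records into (train, test) by a calendar-year cutoff.

    Assumption: each record is a dict with a "year" key (hypothetical schema).
    """
    train = [r for r in records if r["year"] < cutoff_year]
    test = [r for r in records if r["year"] >= cutoff_year]
    return train, test


# Placeholder records, not real discovery dates.
records = [
    {"label": "compound-1", "year": 2015},
    {"label": "compound-2", "year": 2018},
    {"label": "compound-3", "year": 2019},
    {"label": "compound-4", "year": 2021},
]

train, test = time_split(records, cutoff_year=2019)
print([r["label"] for r in train])  # ['compound-1', 'compound-2']
print([r["label"] for r in test])   # ['compound-3', 'compound-4']
```

With a random split, held-out items come from the same period as the training data; with a split like this one, the generator has to produce items from a later period than anything it was trained on.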

> I myself have tried and got a coverage of 0 with different variations of DDPM.

Thank you for sharing this.

Open to thoughts or suggestions you have. I think both xtal2png and the benchmarks themselves could be improved. Aside: matbench-genmetrics is under review at openjournals/joss-reviews#5618.

cc @hasan-sayeed @sp8rks @michaeldalverson

sgbaird mentioned this issue Oct 22, 2022

Some elements dropped while encoding to mod_pettifor representation kjappelbaum/element-coder#21

Open

sgbaird mentioned this issue Oct 28, 2022

Run matbench-genmetrics on the latest imagen-pytorch run (fixup mod-petti featurizer) #207

Closed
