
[WIP] Adding Docker version of Prism #68

Closed · wants to merge 1 commit

Conversation

danieldeutsch (Contributor)

Here is what it would look like to add a Dockerized version of a metric.


class Prism(ReferencedMetric):
    def __init__(self, device: int, language: str = "en"):
        self.prism = _Prism(device=device, language=language)
danieldeutsch (Contributor, Author)

The implementation in the GEM metrics library would wrap the one in Repro.


# Example `micro` output for 3 inputs
# [{'prism': -1.1578280925750732}, {'prism': -1.3325390815734863}, {'prism': -2.730839729309082}]
_, micro = self.prism.predict_batch(inputs)
danieldeutsch (Contributor, Author)

The actual implementation would reformat the input data into the format required by the metric, run predict_batch, and then reformat the outputs to fit the GEM framework.
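
To make that concrete, here is a rough sketch of what the full wrapper could look like. The compute method name, the import paths, and the {"candidate": ..., "references": [...]} input format are assumptions for illustration; the real GEM interface and Repro paths may differ.

from typing import Dict, List

from gem_metrics.metric import ReferencedMetric  # assumed import path
from repro.models.thompson2020 import Prism as _Prism  # assumed import path


class Prism(ReferencedMetric):
    """Sketch only; names and signatures are illustrative, not final."""

    def __init__(self, device: int, language: str = "en"):
        self.prism = _Prism(device=device, language=language)

    def compute(self, predictions: List[str], references: List[List[str]]) -> Dict:
        # Reformat the GEM-style inputs into the format the metric expects.
        inputs = [
            {"candidate": pred, "references": refs}
            for pred, refs in zip(predictions, references)
        ]

        # Run the Dockerized metric; `micro` holds one score dict per input,
        # e.g. [{"prism": -1.16}, {"prism": -1.33}, {"prism": -2.73}].
        _, micro = self.prism.predict_batch(inputs)

        # Reformat the outputs to fit the GEM framework: per-instance scores
        # plus a corpus-level average.
        scores = [output["prism"] for output in micro]
        return {
            "prism": sum(scores) / len(scores),
            "prism_per_instance": scores,
        }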

@danieldeutsch (Contributor, Author)

If you have a machine with Docker installed, you should be able to run test_prism.py directly. The first time it runs, the Docker image will automatically be downloaded to your machine, which may take 1-2 minutes. After that, you can run the metric without downloading the Docker image again.
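
For reference, a hypothetical direct invocation (the module path and compute signature are assumed, matching the sketch above):

from gem_metrics.prism import Prism  # hypothetical module path

# The first call pulls the Docker image; later calls reuse the local copy.
metric = Prism(device=0, language="en")
predictions = ["The cat sat on the mat."]
references = [["A cat was sitting on the mat."]]
scores = metric.compute(predictions, references)
print(scores)  # corpus-level score plus per-instance scores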

@danieldeutsch (Contributor, Author)

@sebastianGehrmann @tuetschek @ndaheim Please take a look and let me know what you think.

@sebastianGehrmann (Contributor)

Hey, given the discussions in the chat, should we merge this so we can proceed with adding the other implementations?

@danieldeutsch (Contributor, Author)

I think it should be ok. My only concern is how to deal with the GPU device. It seems like the other GPU-based metrics don't manually control the device, but this is necessary for the Dockerized metrics or else they will just use GPU 0.

When you run something in Docker, the code runs in its own process, so the CUDA_VISIBLE_DEVICES environment variable you might set from the command line is ignored. My library sets that variable specifically for the Docker processes.
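
The general pattern (purely illustrative, not Repro's actual code) is to set the variable on the container itself rather than relying on the host shell:

import subprocess

def run_in_container(image: str, command: str, device: int) -> str:
    # CUDA_VISIBLE_DEVICES must be set inside the container; a value exported
    # in the host shell is not inherited by the containerized process.
    docker_cmd = [
        "docker", "run", "--rm", "--gpus", "all",
        "-e", f"CUDA_VISIBLE_DEVICES={device}",
        image, "/bin/sh", "-c", command,
    ]
    result = subprocess.run(docker_cmd, check=True, capture_output=True, text=True)
    return result.stdout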

I was actually thinking about this last night. Since the classes are instantiated here without any arguments to the constructor, there isn't an obvious way right now to pass the device ID to the Dockerized metrics.

logger.info(f"Computing {metric_class.__name__} for {outs.filename}...")
metric = metric_class()
result = metric.compute_cached(cache, outs)
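
One possible workaround (just a sketch, not part of this PR) would be to let the Dockerized metrics fall back to an environment variable when no device is passed, so the no-argument instantiation above keeps working:

import os
from typing import Optional

class Prism(ReferencedMetric):
    def __init__(self, device: Optional[int] = None, language: str = "en"):
        if device is None:
            # GEM_METRICS_DEVICE is a hypothetical variable name.
            device = int(os.environ.get("GEM_METRICS_DEVICE", 0))
        self.prism = _Prism(device=device, language=language)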

Any thoughts?

@danieldeutsch deleted the docker branch on January 29, 2022.
Linked issues: Add support for docker and add all the metrics from repro · Add PRISM