Add GDN algorithm #16

SebastianSchmidl · 2023-06-30T13:52:29Z

Adds the GDN algorithm from https://github.com/d-ailin/GDN (Paper: https://doi.org/10.1609/aaai.v35i5.16523)

Fixes #15

SebastianSchmidl · 2023-06-30T13:56:44Z

@2er0 can you take it from here?

add source code within the given template (transform data and set the parameters)
check parameter definitions in algorithm.py (name, type, default value, and their completeness)
check parameter definitions in manifest.json (consistent with algorithm.py and descriptions)
adapt implementation to correctly save the model
check if the algorithm successfully trains and executes (I can help with that)
does the algorithm need reverse windowing? add a note in the README

2er0 · 2023-09-21T09:53:50Z

@CodeLionX please test - I found some time to make the first full working draft.

docker build -t registry.gitlab.hpi.de/akita/i/python3-base:0.2.6 ./0-base-images/python3-base 
docker build -t registry.gitlab.hpi.de/akita/i/python3-torch:0.2.6 ./0-base-images/python3-torch
docker build -t registry.gitlab.hpi.de/akita/i/gdn:0.2.6 ./gdn

docker run --rm \                                                                               
    -v $(pwd)/1-data:/data:ro \
    -v $(pwd)/2-results:/results:rw \
  registry.gitlab.hpi.de/akita/i/gdn:0.2.6 execute-algorithm '{
    "executionType": "train",
    "dataInput": "/data/multi-dataset.csv",
    "dataOutput": "/results/anomaly_scores.ts",
    "modelInput": "/results/model.pkl",
    "modelOutput": "/results/model.pkl",
    "customParameters": {}
  }'

docker run --rm \                                                                               
    -v $(pwd)/1-data:/data:ro \
    -v $(pwd)/2-results:/results:rw \
  registry.gitlab.hpi.de/akita/i/gdn:0.2.6 execute-algorithm '{
    "executionType": "execute",
    "dataInput": "/data/multi-dataset.csv",
    "dataOutput": "/results/test_anomaly_scores.ts",
    "modelInput": "/results/model.pkl",
    "modelOutput": "/results/model.pkl",
    "customParameters": {}
  }'

SebastianSchmidl

I took a brief look and can execute the algorithm locally. 👍🏼
However, I found some issues in the current draft:

See inline comments (changes to base image)
The input handling is too tight: You assume that the feature columns are labeled value*, but many datasets use other feature names.
All other algorithms store a single model-file. TimeEval currently does not support multiple files. I would suggest that you create an archive when storing the model. You can take a look at Torsk as an example for that.
Do you perform train-test-splitting within the code? The anomaly score files are very short … TimeEval requires that the scoring has the same length as the input TS. You could use a postprocessing step to perform this transformation (allows you to use Code from TimeEval).

SebastianSchmidl · 2023-09-22T10:08:23Z

0-base-images/python3-base/requirements.txt

+numpy>=1.20.0
+pandas>=1.2.1
+matplotlib>=3.3.4
+scipy>=1.6.0
+scikit-learn>=0.24.1


We deliberately pinned the dependencies to ensure reproducibility!

If the new algorithm actually needs a new Python version and different dependencies, we should create a new base image. Otherwise, we have to check if all the other algorithms still work with the new base image.

SebastianSchmidl · 2023-09-22T10:09:23Z

README.md

-   #    -e LOCAL_UID=<current user id> \
-   #    -e LOCAL_GID=<current groupid> \
-     registry.gitlab.hpi.de/akita/i/<your_algorithm>:latest execute-algorithm '{
+     registry.gitlab.hpi.de/akita/i/gdn:0.2.6 execute-algorithm '{
       "executionType": "train",
-       "dataInput": "/data/dataset.csv",
+       "dataInput": "/data/multi-dataset.csv",


Please revert those changes to the README. They only apply to your algorithm and not to the others.

SebastianSchmidl · 2023-09-22T10:10:45Z

gdn/Dockerfile

@@ -0,0 +1,18 @@
+FROM registry.gitlab.hpi.de/akita/i/python3-torch:0.2.6


There is no version 0.2.6 of the base images yet.

SebastianSchmidl · 2023-09-22T10:11:28Z

gdn/Dockerfile

+# fixing six.py dataloader issue
+COPY GDN/dataloader_fix.py /usr/local/lib/python3.10/site-packages/torch_geometric/data/dataloader.py


Do you have a link to the bug report/issue? This looks like a dirty hack.

SebastianSchmidl added 2 commits June 30, 2023 15:46

feat: prepare skeleton for GDN algorithm

f1508be

doc(gdn): add GDN to global README and describe the basic in its README

3dc4006

SebastianSchmidl added 🏅 medium MoSCoW: Should-have comp: algorithms labels Jun 30, 2023

SebastianSchmidl assigned SebastianSchmidl and unassigned SebastianSchmidl Jun 30, 2023

WIP not tested draft of GDN integration

91c0fb1

SebastianSchmidl assigned 2er0 Jul 5, 2023

2er0 added 3 commits July 11, 2023 23:01

WIP major, minor fixes and adaptions

daf8e81

align model and asset storing to TimeEval structure

652683b

first full draft of GDN with python-base update to python version 3.10

d704a3d

SebastianSchmidl commented Sep 22, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add GDN algorithm #16

Add GDN algorithm #16

SebastianSchmidl commented Jun 30, 2023

SebastianSchmidl commented Jun 30, 2023 •

edited by 2er0

Loading

2er0 commented Sep 21, 2023

SebastianSchmidl left a comment •

edited

Loading

SebastianSchmidl Sep 22, 2023

SebastianSchmidl Sep 22, 2023

SebastianSchmidl Sep 22, 2023

SebastianSchmidl Sep 22, 2023

		@@ -0,0 +1,18 @@
		FROM registry.gitlab.hpi.de/akita/i/python3-torch:0.2.6

		# fixing six.py dataloader issue
		COPY GDN/dataloader_fix.py /usr/local/lib/python3.10/site-packages/torch_geometric/data/dataloader.py

Add GDN algorithm #16

Are you sure you want to change the base?

Add GDN algorithm #16

Conversation

SebastianSchmidl commented Jun 30, 2023

SebastianSchmidl commented Jun 30, 2023 • edited by 2er0 Loading

2er0 commented Sep 21, 2023

SebastianSchmidl left a comment • edited Loading

Choose a reason for hiding this comment

SebastianSchmidl Sep 22, 2023

Choose a reason for hiding this comment

SebastianSchmidl Sep 22, 2023

Choose a reason for hiding this comment

SebastianSchmidl Sep 22, 2023

Choose a reason for hiding this comment

SebastianSchmidl Sep 22, 2023

Choose a reason for hiding this comment

SebastianSchmidl commented Jun 30, 2023 •

edited by 2er0

Loading

SebastianSchmidl left a comment •

edited

Loading