DataRobot User Models

What is this repository?

This repository contains tools, templates, and information for assembling, debugging, testing, and running your custom inference models and custom tasks with DataRobot.

For further documentation on this and all other features please visit our comprehensive documentation at: https://docs.datarobot.com/

Terminology

DataRobot has 2 mechanisms for bringing custom ML code:

Custom task: an ML algorithm, for example, XGBoost or One-hot encoding, that can be used as a step in an ML pipeline (blueprint) inside DataRobot.
Custom inference model: a pre-trained model or user code prepared for inference. An inference model can have a predefined input/output schema or be unstructured. Learn more here

Custom Tasks Reference

Materials for getting started:

Demo Video
Code examples:
- Custom task templates
- Environment Templates
- Building blueprints programmatically from tasks like lego blocks
Quick walk-through
Detailed documentation

Other resources:

There is a chance that the task you are looking for has already been implemented. Check custom tasks community Github to see some off-the-shelf examples
- Note: The community repo above is NOT the place to start learning the basic concepts. The examples tend to have more complex logic and are meant to be used as-is rather than as a reference.
- This repo is the appropriate place to start with tutorial examples.

Custom Inference Models Reference

Materials for getting started:

Custom models walk-through
Code examples:
- Custom inference models templates
- Environment Templates
References for defining a custom inference model:

Other sources:

There is a chance that the model you are looking for has already been implemented. Check custom inference models community Github to see some off-the-shelf examples

Contribution & development

Prerequisites for development

Note: Only reference this section if you plan to work with DRUM.

To build it, the following packages are required: make, Java 11, maven, docker, R E.g. for Ubuntu 18.04
apt-get install build-essential openjdk-11-jdk openjdk-11-jre maven python3-dev docker apt-utils curl gpg-agent software-properties-common dirmngr libssl-dev ca-certificates locales libcurl4-openssl-dev libxml2-dev libgomp1 gcc libc6-dev pandoc

R

Ubuntu 18.04
apt-key adv --keyserver keyserver.ubuntu.com --recv-keys E298A3A825C0D65DFD57CBB651716619E084DAB9
add-apt-repository 'deb https://cloud.r-project.org/bin/linux/ubuntu bionic-cran35/'
apt-get install r-cran-littler r-base r-base-dev

R packages

Rscript -e "install.packages(c('devtools', 'tidyverse', 'caret', 'recipes', 'glmnet', 'plumber', 'Rook', 'rjson', 'e1071'), Ncpus=4)"
Rscript -e 'library(caret); install.packages(unique(modelLookup()[modelLookup()$forReg, c(1)]), Ncpus=4)'
Rscript -e 'library(caret); install.packages(unique(modelLookup()[modelLookup()$forClass, c(1)]), Ncpus=4)'

DRUM developers

Setting Up Local Env For Testing

create Py 3.7 or 3.8 venv
pip install -r requirements_dev.txt
pip install -e custom_model_runner/
pytest to your heart's content

DataRobot Confluence

To get more information, search for custom models and datarobot user models in DataRobot Confluence.

Committing into the repo

Ask repository admin for write access.
Develop your contribution in a separate branch run tests and push to the repository.
Create a pull request.

Testing changes to drum in DR app

There is a script called create-drum-dev-image.sh which will build and save an image with your latest local changes to the DRUM codebase. You can test new changes to drum in the DR app by running this script with an argument for which dropin env to modify, and uploading the image which gets built as an execution environment.

Non-DataRobot developers

To contribute to the project, use a regular GitHub process: fork the repo and create a pull request to the original repository.

Tests

Test artifacts

Artifacts used in tests are located here: ./tests/fixtures/drop_in_model_artifacts.
There is also the code in (*.ipynb, Pytorch.py, Rmodel.R, etc files) to generate those artifacts.
Check for generate* scripts in ./tests/fixtures/drop_in_model_artifacts and ./tests/fixtures/artifacts.py

Model examples in ./model_templates are also used in functional testing. In the most cases, artifacts for those models are the same as in the ./tests/fixtures/drop_in_model_artifacts and can be simply copied accordingly. If artifact for model template is not in the ./tests/fixtures/drop_in_model_artifacts, check template's README for more instructions.

Communication

Some places to ask for help are:

open an issue through the GitHub board.

Name		Name	Last commit message	Last commit date
Latest commit History 386 Commits
.github		.github
custom_model_runner		custom_model_runner
docker		docker
jenkins		jenkins
model_templates		model_templates
public_dropin_environments		public_dropin_environments
task_templates		task_templates
tests		tests
tools		tools
.gitattributes		.gitattributes
.gitignore		.gitignore
.ignore		.ignore
BRANCHING.yaml		BRANCHING.yaml
DEFINE-INFERENCE-MODEL.md		DEFINE-INFERENCE-MODEL.md
DRCODEOWNERS		DRCODEOWNERS
LICENSE		LICENSE
MODEL-METADATA.md		MODEL-METADATA.md
README.md		README.md
VALIDATION-SCHEMA.md		VALIDATION-SCHEMA.md
custom_model_tutorial.ipynb		custom_model_tutorial.ipynb
dependbot.yml		dependbot.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
requirements_dev.txt		requirements_dev.txt
requirements_lint.txt		requirements_lint.txt
requirements_test.txt		requirements_test.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DataRobot User Models

What is this repository?

Terminology

Content

Custom Tasks Reference

Custom Inference Models Reference

Contribution & development

Prerequisites for development

R

R packages

DRUM developers

Setting Up Local Env For Testing

DataRobot Confluence

Committing into the repo

Testing changes to drum in DR app

Non-DataRobot developers

Tests

Test artifacts

Communication

About

Releases

Packages

Languages

License

popovaana123/datarobot-user-models

Folders and files

Latest commit

History

Repository files navigation

DataRobot User Models

What is this repository?

Terminology

Content

Custom Tasks Reference

Custom Inference Models Reference

Contribution & development

Prerequisites for development

R

R packages

DRUM developers

Setting Up Local Env For Testing

DataRobot Confluence

Committing into the repo

Testing changes to drum in DR app

Non-DataRobot developers

Tests

Test artifacts

Communication

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages