Walkthrough

This repo contains the code presented at Pytech Summit 2021. It is an example of using Feast with the Azure provider.

Creating and preparing virtual environment

python -m venv .venv
source .venv/bin/activate
pip install pandas feast-azure-provider azureml-core

If the ODBC driver is missing, please follow this tutorial.

Generating data

This step creates the artificial time series that were used during the live demo.

cd data_utils
python generator.py
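
generator.py itself is not reproduced in this walkthrough. As a rough illustration of what generating an artificial time series can look like, the sketch below writes a small payments-style CSV using only the standard library. The column names follow the dbo.payments table defined later in this walkthrough; the function name, value ranges, start date, and output path are assumptions made for illustration, not the demo's actual code.

```python
import csv
import random
from datetime import datetime, timedelta


def generate_payments(path, n_rows=100, n_players=5, seed=0):
    """Write a synthetic payments CSV (hypothetical stand-in for generator.py)."""
    rng = random.Random(seed)  # fixed seed keeps the output reproducible
    start = datetime(2021, 1, 1)  # assumed start of the series
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        # Header row; the COPY INTO statements skip it with FIRSTROW = 2.
        writer.writerow(["event_id", "player_id", "ts", "amount", "transactions"])
        for event_id in range(n_rows):
            writer.writerow([
                event_id,
                f"p{rng.randrange(n_players)}",  # short id, fits NVARCHAR(5)
                (start + timedelta(hours=event_id)).isoformat(sep=" "),
                round(rng.uniform(0.0, 100.0), 2),
                rng.randint(1, 10),
            ])
    return path
```

Calling `generate_payments("payments.csv")` would produce one event per hour, each assigned to a random player.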

Creating cloud environment

The cloud environment can be created from template.json.

One of the required parameters is the Principal ID, which can be found in Cloud Shell with the command:

az ad signed-in-user show --query objectId -o tsv

For more information, see the Feast Azure provider repo.

Creating the cloud environment takes about 20-30 minutes.

Copying generated data to data lake

The generated data has to be copied to the data lake linked to the Synapse resource.

Preparing azure ml config

First, download config.json from the Azure Machine Learning resource. Then set the CONFIG_PATH variable in data_utils/load_data.py and utils/prepare_feast.py to the path of that file (next time this will be done through a config file :D).
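
One way to avoid editing CONFIG_PATH in two scripts is to resolve the path from a single place. The sketch below is only an illustration of that idea: the environment variable name AZUREML_CONFIG_PATH and both function names are hypothetical and do not exist in this repo. The three required keys are the ones a config.json downloaded from an Azure Machine Learning workspace normally contains.

```python
import json
import os
from pathlib import Path

# Keys normally present in a config.json downloaded from an Azure ML workspace.
REQUIRED_KEYS = ("subscription_id", "resource_group", "workspace_name")


def resolve_config_path(default="config.json"):
    """Return the path from AZUREML_CONFIG_PATH (hypothetical name), else a default."""
    return Path(os.environ.get("AZUREML_CONFIG_PATH", default))


def load_workspace_config(path):
    """Read config.json and fail early if an expected key is missing."""
    with open(path) as f:
        config = json.load(f)
    missing = [key for key in REQUIRED_KEYS if key not in config]
    if missing:
        raise KeyError(f"config.json is missing keys: {missing}")
    return config
```

Both scripts could then call `resolve_config_path()` and pass the result on, for example to `Workspace.from_config(path=...)` from azureml-core.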

Moving data from data lake to synapse

This can be done with a script:

cd data_utils
python load_data.py

Alternatively, run the queries below directly on Synapse.

Creating payments table:

CREATE TABLE dbo.payments (
    event_id INT,
    player_id NVARCHAR(5),
    ts DATETIME2,
    amount FLOAT,
    transactions INT
)

Creating stats table:

CREATE TABLE dbo.stats (
    event_id INT,
    player_id NVARCHAR(5),
    ts DATETIME2,
    win_loss_ratio FLOAT,
    games_played INT,
    time_in_game FLOAT
)
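
Because the COPY INTO statements in this walkthrough skip the first row (FIRSTROW = 2) and, by default, load columns by position, the column order of each CSV has to match its table. The sketch below checks that before loading. It assumes the generated CSV files start with a header row whose names equal the table column names; the function name and the dictionary are hypothetical helpers, not part of the repo.

```python
import csv

# Column order copied from the CREATE TABLE statements above.
TABLE_COLUMNS = {
    "dbo.payments": ["event_id", "player_id", "ts", "amount", "transactions"],
    "dbo.stats": [
        "event_id", "player_id", "ts",
        "win_loss_ratio", "games_played", "time_in_game",
    ],
}


def header_matches(csv_path, table):
    """Return True if the CSV header lists the table's columns in the same order."""
    with open(csv_path, newline="") as f:
        header = next(csv.reader(f), [])  # empty file yields an empty header
    return [name.strip() for name in header] == TABLE_COLUMNS[table]
```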

Moving payments data (replace {URL} with the URL of the data lake linked to Synapse):

COPY INTO dbo.payments
FROM '{URL}.csv'
WITH
(
    FILE_TYPE = 'CSV'
    ,MAXERRORS = 0
    ,FIRSTROW = 2
)

Moving stats data (replace {URL} with the URL of the data lake linked to Synapse):

COPY INTO dbo.stats
FROM '{URL}.csv'
WITH
(
    FILE_TYPE = 'CSV'
    ,MAXERRORS = 0
    ,FIRSTROW = 2
)
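
The two COPY INTO statements differ only in the table name and the file URL, so they can be rendered from one template. The helper below is a minimal sketch of that; its name is hypothetical, and the URL used in the usage line is a placeholder rather than a real storage account. It only builds the SQL text and does not connect to Synapse.

```python
ALLOWED_TABLES = {"dbo.payments", "dbo.stats"}


def build_copy_statement(table, url, first_row=2, max_errors=0):
    """Render a COPY INTO statement with the same options as the ones above."""
    if table not in ALLOWED_TABLES:
        raise ValueError(f"unexpected table: {table}")
    if "'" in url:
        # A quote would break out of the SQL string literal.
        raise ValueError("URL must not contain single quotes")
    return (
        f"COPY INTO {table}\n"
        f"FROM '{url}'\n"
        "WITH\n"
        "(\n"
        "    FILE_TYPE = 'CSV'\n"
        f"    ,MAXERRORS = {int(max_errors)}\n"
        f"    ,FIRSTROW = {int(first_row)}\n"
        ")"
    )


# Placeholder URL; replace it with the data lake URL linked to Synapse.
print(build_copy_statement("dbo.payments", "https://example.invalid/payments.csv"))
```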

Apply features

python apply_feast.py

Materialize store

python materialize_stores.py

Show stores

python show_stores.py

TSienki/feast-demo-pytech
