[Bug] state:modified not working for windows machine #10574

abrown-calix · 2024-08-15T18:28:55Z

Is this a new bug in dbt-core?

I believe this is a new bug in dbt-core
I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

We have set environment variables DBT_DEFER=true and DBT_STATE= path to prod manifest file. When we run dbt run -m <model_name> and look at the manifest.json the deferred is true (deferred: true). However when we run command: dbt run -m state:modified or dbt run --select state:modified the deferred is false (deferred: false). This causes dbt to run ALL models as the database listed in the manifest.json file is not correct.

This seems to only happen for those with windows machines. We have a few folks on our team that have macs and everything works fine.

Expected Behavior

When we run: dbt run -m state:modified, it should ONLY run the modified model not all models. The deferred flag in the manifest.json should be deferred: true.

Steps To Reproduce

Set up environment variables in the system config (System->Advanced system settings-> Advanced -> Environment Variables. Add DBT_DEFER true, DBT_STATE /path/to/production/manifest
Close terminal
Open terminal
Pull up master branch
run command: dbt ls -m state:modified. It should state: Got a state selector method, but no comparison manifest. Instead it lists ALL our models

Relevant log output

Incorrect manifest:

"nodes": {
        "model.analytics_edw.dim_product_oracle": {
            "database": "dbt_dev_agoncalv",
            "schema": "audit",
            "name": "dim_product_oracle",
            "resource_type": "model",
            "package_name": "analytics_edw",
            "path": "audit\\dim_product_oracle.sql",
            "original_file_path": "models\\audit\\dim_product_oracle.sql",
            "unique_id": "model.analytics_edw.dim_product_oracle",
            "fqn": [
                "analytics_edw",
                "audit",
                "dim_product_oracle"
            ],
            "alias": "dim_product_oracle",
            "checksum": {
                "name": "sha256",
                "checksum": "47a67bbdf5791a7dc7767c39dfbd0cfc151a6f7d406bee06c23665fdb48d5032"
            },
            "config": {
                "enabled": true,
                "alias": null,
                "schema": "audit",
                "database": null,
                "tags": [
                    "audit",
                    "prod_ops_multi_source"
                ],
                "meta": {},
                "group": null,
                "materialized": "table",
                "incremental_strategy": null,
                "persist_docs": {},
                "post-hook": [],
                "pre-hook": [],
                "quoting": {},
                "column_types": {},
                "full_refresh": null,
                "unique_key": "item_master_id",
                "on_schema_change": "ignore",
                "on_configuration_change": "apply",
                "grants": {},
                "packages": [],
                "docs": {
                    "show": true,
                    "node_color": null
                },
                "contract": {
                    "enforced": false,
                    "alias_types": true
                },
                "access": "protected"
            },
            "tags": [
                "audit",
                "prod_ops_multi_source"
            ],
            "description": "The data from this table is derived from PC_FIVETRAN_DB.PROD_OPS.ITEM_MASTER_110.",
            "columns": {
                "item_master_id": {
                    "name": "item_master_id",
                    "description": "",
                    "meta": {},
                    "data_type": null,
                    "constraints": [],
                    "quote": null,
                    "tags": []
                }
            },
            "meta": {},
            "group": null,
            "docs": {
                "show": true,
                "node_color": null
            },
            "patch_path": "analytics_edw://models\\edw\\product_ops\\_schema.yml",
            "build_path": null,
            "deferred": false,

Correct manifest:

"nodes": {
        "model.analytics_edw.dim_product_oracle": {
            "database": "dbt_dev_agoncalv",
            "schema": "audit",
            "name": "dim_product_oracle",
            "resource_type": "model",
            "package_name": "analytics_edw",
            "path": "audit\\dim_product_oracle.sql",
            "original_file_path": "models\\audit\\dim_product_oracle.sql",
            "unique_id": "model.analytics_edw.dim_product_oracle",
            "fqn": [
                "analytics_edw",
                "audit",
                "dim_product_oracle"
            ],
            "alias": "dim_product_oracle",
            "checksum": {
                "name": "sha256",
                "checksum": "47a67bbdf5791a7dc7767c39dfbd0cfc151a6f7d406bee06c23665fdb48d5032"
            },
            "config": {
                "enabled": true,
                "alias": null,
                "schema": "audit",
                "database": null,
                "tags": [
                    "audit",
                    "prod_ops_multi_source"
                ],
                "meta": {},
                "group": null,
                "materialized": "table",
                "incremental_strategy": null,
                "persist_docs": {},
                "post-hook": [],
                "pre-hook": [],
                "quoting": {},
                "column_types": {},
                "full_refresh": null,
                "unique_key": "item_master_id",
                "on_schema_change": "ignore",
                "on_configuration_change": "apply",
                "grants": {},
                "packages": [],
                "docs": {
                    "show": true,
                    "node_color": null
                },
                "contract": {
                    "enforced": false,
                    "alias_types": true
                },
                "access": "protected"
            },
            "tags": [
                "audit",
                "prod_ops_multi_source"
            ],
            "description": "The data from this table is derived from PC_FIVETRAN_DB.PROD_OPS.ITEM_MASTER_110.",
            "columns": {
                "item_master_id": {
                    "name": "item_master_id",
                    "description": "",
                    "meta": {},
                    "data_type": null,
                    "constraints": [],
                    "quote": null,
                    "tags": []
                }
            },
            "meta": {},
            "group": null,
            "docs": {
                "show": true,
                "node_color": null
            },
            "patch_path": "analytics_edw://models\\edw\\product_ops\\_schema.yml",
            "build_path": null,
            "deferred": true,

Environment

- OS:Windows
- Python:3.12
- dbt:1.7.9
Plugins
    - snowflake: 1.7.2

Which database adapter are you using with dbt?

snowflake

Additional Context

dbeatty10 · 2024-08-20T00:32:52Z

Thanks for reaching out @abrown-calix !

Could you double-check that the DBT_DEFER and DBT_STATE environment variables are available in their Windows terminal?

The precise instructions to check those environment variables will vary depending on the Windows terminal they are using. But I'll try to include a few of the most common ones.

Command Prompt (CMD)

echo %DBT_DEFER%
echo %DBT_STATE%

PowerShell

$env:DBT_DEFER
$env:DBT_STATE

Windows Subsystem for Linux (WSL) or another Unix-like shells on Windows

echo $DBT_DEFER
echo $DBT_STATE

abrown-calix · 2024-08-21T19:47:21Z

in each they are reflecting accurately.

dbeatty10 · 2024-08-21T21:16:00Z

Could you try a few other things and see if any of them work for the Windows users?

DBT_DEFER=1 instead of DBT_DEFER=true
Python 3.10 or Python 3.11 instead of Python 3.12
dbt 1.8 instead of dbt 1.7

abrown-calix · 2024-08-26T21:38:16Z

We have already tried to upgrade to dbt 1.8 and it always is running all the models unfortunately.

abrown-calix · 2024-08-26T21:46:42Z

We have tried with quotes
. One of our windows users has python 3.11.8 and it is behaving the same.

ybryzTE · 2024-10-24T20:50:18Z

Ran into the same issue. Running dbt build --select state:modified+ --defer --state prod_artifacts/ built all models on my Windows machine. GitHub CI running on Linux was working as expected - only modified models were built. Check your line breaks on Windows. Mine were set to 'CRLF', whereas the files in my repo had 'LF' instead. Converting 'CRLF' to 'LF' in my editor solved the issue.

dbeatty10 · 2024-12-15T00:11:22Z

@ybryzTE Could you share more details about the solution you used? e.g., which editor you are using, how to changed the line breaks, which files it affected, etc? Were you using the DBT_DEFER and/or
DBT_STATE environment variables at all or just the --defer and --state CLI flags?

@abrown-calix did you try specifying --defer --state on the command line to confirm that the issue was related to the DBT_DEFER and/or DBT_STATE environment variables? I'd be curious if you could try to isolate the issue to either one environment variable or the other.

If it is isolated to the DBT_DEFER variable, then we can try to focus on troubleshooting that particular piece.

If if is isolated to the DBT_STATE variable, then maybe the path is specified in a format that doesn't work for dbt on Windows? e.g., instead of .\prod_manifest\, maybe it needs to be something like just prod_manifest instead?

You could try troubleshooting by doing commands like this and see if they work the way you expect or not (both in relation to the contents in manifest.json as well as the nodes that are selected):

dbt run --select state:modified --defer --state prod_manifest

If that works, then you could do each of these and see if they work or not (making sure that your desired environment variables are set in both cases):

dbt run --select state:modified --state prod_manifest
dbt run --select state:modified --defer

abrown-calix added bug Something isn't working triage labels Aug 15, 2024

dbeatty10 added windows Everyone's favorite OS that's sometimes a little weird state: modified labels Aug 15, 2024

dbeatty10 added awaiting_response and removed triage labels Aug 20, 2024

github-actions bot added triage and removed awaiting_response labels Aug 21, 2024

dbeatty10 added awaiting_response and removed triage labels Aug 21, 2024

github-actions bot added triage and removed awaiting_response labels Aug 26, 2024

dbeatty10 added awaiting_response and removed triage labels Dec 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] state:modified not working for windows machine #10574

[Bug] state:modified not working for windows machine #10574

abrown-calix commented Aug 15, 2024 •

edited by dbeatty10

Loading

dbeatty10 commented Aug 20, 2024

abrown-calix commented Aug 21, 2024

dbeatty10 commented Aug 21, 2024

abrown-calix commented Aug 26, 2024

abrown-calix commented Aug 26, 2024

ybryzTE commented Oct 24, 2024 •

edited

Loading

dbeatty10 commented Dec 15, 2024

[Bug] state:modified not working for windows machine #10574

[Bug] state:modified not working for windows machine #10574

Comments

abrown-calix commented Aug 15, 2024 • edited by dbeatty10 Loading

Is this a new bug in dbt-core?

Current Behavior

Expected Behavior

Steps To Reproduce

Relevant log output

Environment

Which database adapter are you using with dbt?

Additional Context

dbeatty10 commented Aug 20, 2024

Command Prompt (CMD)

PowerShell

Windows Subsystem for Linux (WSL) or another Unix-like shells on Windows

abrown-calix commented Aug 21, 2024

dbeatty10 commented Aug 21, 2024

abrown-calix commented Aug 26, 2024

abrown-calix commented Aug 26, 2024

ybryzTE commented Oct 24, 2024 • edited Loading

dbeatty10 commented Dec 15, 2024

abrown-calix commented Aug 15, 2024 •

edited by dbeatty10

Loading

ybryzTE commented Oct 24, 2024 •

edited

Loading