Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] Runtime Error 'Lexer' object has no attribute '_SQL_REGEX' #710

Closed
2 tasks done
Tonayya opened this issue Feb 5, 2024 · 15 comments · Fixed by #711
Closed
2 tasks done

[Bug] Runtime Error 'Lexer' object has no attribute '_SQL_REGEX' #710

Tonayya opened this issue Feb 5, 2024 · 15 comments · Fixed by #711
Labels
bug Something isn't working

Comments

@Tonayya
Copy link

Tonayya commented Feb 5, 2024

Is this a new bug in dbt-redshift?

  • I believe this is a new bug in dbt-redshift
  • I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

dbt Cloud users experiencing an intermittent runtime error in random job runs which stems from the sqlparse library that dbt uses(link). But this appears to only be affecting Redshift connected projects. Issue was also reported on Discourse here.

Some notes from the internal correspondence with Core team:

  • sqlparse is a library that we use to combine the SQL from ephemeral models with the non-ephemeral model SQL. It looks like this error would be an even more edge-casey failure than the one that we addressed in the referenced ticket.
  • This is likely a variation on the original issue, which is that the sqlparse lexer isn't initialized when we use it to parse the SQL. They set the "default_instance" before it's initialized so another thread checking if the default instance is set in that tiny tiny window will see that it is, but it hasn't been initialized. In order to fix this we'd probably need to write an internal version of their initialization code and pin the code. Or vendor it.
  • Looks like dbt-redshift directly uses sqlparse but just for splitting. It's the only adapter that does it though and it looks like it went in as of 1.5 so lines up. It also (on main) no longer does probably with dbt-adapter being pulled out (link).

Expected Behavior

Job runs not failing with the aforementioned error.

Steps To Reproduce

There's no specific step to reproduce this as it happens on random intervals and so far the only consistent thing found in the behaviour has been the dbt-redshift adapter. Error has occurred on dbt version 1.5, 1.6 and 1.7.

Relevant log output

an error:
Runtime Error
  'Lexer' object has no attribute '_SQL_REGEX'

Environment

- OS:
- Python:
- core: Running with dbt=1.6.9
-  adapter: redshift=1.6.7

Additional Context

No response

@mikealfare
Copy link
Contributor

Re-opening until this is deployed.

@mikealfare mikealfare reopened this Feb 6, 2024
@mikealfare
Copy link
Contributor

User confirmed this is resolved.

@Tonayya
Copy link
Author

Tonayya commented Feb 13, 2024

Hiya @mikealfare , user just got back to us saying this has occurred again today and asked if this was backported to 1.6 (which I can see in the PR that it was). Should this still be occurring?

@mikealfare
Copy link
Contributor

Hiya @mikealfare , user just got back to us saying this has occurred again today and asked if this was backported to 1.6 (which I can see in the PR that it was). Should this still be occurring?

If it's still occurring in the latest version, than this is a different issue than what we saw in dbt-core. Can you confirm that the user is using dbt-redshift==1.6.7?

@mikealfare mikealfare reopened this Feb 14, 2024
@Tonayya
Copy link
Author

Tonayya commented Feb 14, 2024

Hi @mikealfare confirming that they are running with the following:

02:06:34  Running with dbt=1.6.9
02:06:34  Registered adapter: redshift=1.6.7

It's the same error:

02:06:39  Encountered an error:
Runtime Error
  'Lexer' object has no attribute '_SQL_REGEX'

@mikealfare
Copy link
Contributor

Thanks for confirming. We'll need to look into this more then as it appears to be an unknown issue.

@mikealfare mikealfare removed their assignment Feb 14, 2024
@Tonayya
Copy link
Author

Tonayya commented Feb 19, 2024

Hiya @mikealfare hope you're well. Just wanted to check in on this one and see whether we have any idea what's happening? :/ user just reached out to follow-up on an ETA. I have asked them whether this has occurred again since they last reported this on 13th Feb. Thank you! :)

@martynydbt
Copy link
Contributor

@Tonayya is there currently a work around? Sorry, just joined in here and getting up to speed.

@Tonayya
Copy link
Author

Tonayya commented Feb 21, 2024

@martynydbt thanks for getting back to me. So basically re-running the jobs usually does the trick but it is causing an issue intermittently and disrupting their flow. It has been impacting a few of our customers for quite some time now.

@Tonayya
Copy link
Author

Tonayya commented Feb 26, 2024

Hi @martynydbt hope you're well. Just checking in to see if we know anything further about this? Was there any info needed from this side at all?

@Tonayya
Copy link
Author

Tonayya commented Mar 15, 2024

Hi all, just wondering if any updates on this? :/

@stisti
Copy link

stisti commented Mar 15, 2024

We also see this sometimes. Here's the log from today morning run in Jenkins:

[2024-03-15T07:04:03.810Z] + docker run --rm -u 1000 -v /tmp/workspace/_lake-etls-dbt-fast_cycle_master:/etl -w /etl -v /tmp/tmp.Si51YcFO73:/profiles -e DBT_PROFILES_DIR=/profiles -e DBT_TARGET_PATH -e DBT_LOG_PATH -e USER -e AWS_DEFAULT_REGION -e AWS_METADATA_SERVICE_NUM_ATTEMPTS xxxxx.dkr.ecr.eu-west-1.amazonaws.com/ds/dbt:1.7.8 source freshness --target prd --select +tag:fast_cycle
[2024-03-15T07:04:06.447Z] 07:04:06  Running with dbt=1.7.8
[2024-03-15T07:04:06.721Z] 07:04:06  Registered adapter: redshift=1.7.3
[2024-03-15T07:04:06.721Z] 07:04:06  Unable to do partial parsing because saved manifest not found. Starting full parse.
[2024-03-15T07:04:11.035Z] 07:04:10  Found 64 models, 11 seeds, 111 tests, 22 sources, 0 exposures, 0 metrics, 836 macros, 0 groups, 0 semantic models
[2024-03-15T07:04:11.035Z] 07:04:10  
[2024-03-15T07:04:14.441Z] 07:04:13  Encountered an error:
[2024-03-15T07:04:14.441Z] Runtime Error
[2024-03-15T07:04:14.441Z]   'Lexer' object has no attribute '_SQL_REGEX'
script returned exit code 2

@jliu0812
Copy link

Hello.

I just faced this issue.

[2024-04-12, 14:41:12 CDT] {{subprocess.py:93}} INFO - �[0m19:41:10.666850 [error] [MainThread]: Encountered an error:
[2024-04-12, 14:41:12 CDT] {{subprocess.py:93}} INFO - Runtime Error
[2024-04-12, 14:41:12 CDT] {{subprocess.py:93}} INFO -   'Lexer' object has no attribute '_SQL_REGEX'

Version info:

[2024-04-12, 14:44:07 CDT] {{subprocess.py:93}} INFO - 19:44:07  Running with dbt=1.7.11
[2024-04-12, 14:44:08 CDT] {{subprocess.py:93}} INFO - 19:44:08  Registered adapter: redshift=1.7.4
[2024-04-12, 14:44:08 CDT] {{subprocess.py:93}} INFO - 19:44:08  Unable to do partial parsing because saved manifest not found. Starting full parse.
...

@jliu0812
Copy link

I reran the same process again, it works fine after that. Seems to be an intermittent issue.

@mikealfare
Copy link
Contributor

This is either done or no longer an issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

6 participants