-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New Source Connector: DuckDB 🦆 #31
Comments
Assigning myself in order to reserve/hold for @ombhardwajj, who has the related #30. |
Hey @aaronsteers ,you can assign it to me now! |
@ombhardwajj what is the status of this issue? |
@marcosmarxm Its already been a week since I am working on the Hugging Face Datasets connector. Given the time constraint of this hackathon, I don't think i'll be able to build this DuckDB connector. Hence I am un-assigning myself. |
@marcosmarxm @aaronsteers can you please let me know which connector development method i should follow - python cdk or lowcode cdk |
@bala-ceg Probably you'll need to use Python CDK as the stream are going to be dynamically created |
@marcosmarxm is there any DB based python CDK that is written previously? I would like to see that as reference |
Overview
We do not yet have a DuckDB source connector. Normally, DuckDB database are local files and not very useful as sources, but now they can also be remote (e.g. MotherDuck) and they can be a pass-through for other datasource (e.g. #30 and the Hugging Face Datasets).
Technical spec
You would write a new source connector which can connect to a (remote) DuckDB dataset or database, and emit records from DuckDB, allowing Airbyte users to send these to any Airbyte destination.
Notes:
Cache
andSQLProcessor
.Definition of Done
The text was updated successfully, but these errors were encountered: