Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )
Notebook | Description | Lang |
---|---|---|
File Operations Sample | Various file operations sample such as Azure Blob Storage mount & umount, ls/rm/cp/mv, read CSV file, etc | Python |
ELT Sample: Azure Blob Stroage - Databricks - CosmosDB | In this notebook, you extract data from Azure Blob Storage into Databricks cluster, run transformations on the data in Databricks cluster, and then load the transformed data into Azure Cosmos DB | Python |
ELT Sample: Azure Blob Stroage - Databricks - SQLDW | In this notebook, you extract data from Azure Blob Storage into Databricks cluster, run transformations on the data in Databricks cluster, and then load the transformed data into Azure SQL Data Warehouse | Python |
Streaming Sample: Cosmos DB ChangeFeed - Databricks | In this notebook, you read a live stream of tweets that stored in Cosmos DB by leveraging Apache Spart to read the Cosmos DB's Change Feed, and run transformations on the data in Databricks cluster | Python |
Streaming Sample: Azure Event Hub - Databricks | In this notebook, you connect Azure Event Hub (hyper-scale data ingestion service) with Azure Databricks to stream data into an Apache Spark cluster in near real-time | Python |
Bug reports and pull requests are welcome on GitHub at https://github.com/yokawasa/databricks-notebook