Skip to content
Change the repository type filter

All

    Repositories list

    • valentine

      Public
      A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
      Python
      Apache License 2.0
      238443Updated Nov 28, 2024Nov 28, 2024
    • autofeat

      Public
      Source code for augmenting relational datasets through join paths
      Jupyter Notebook
      Apache License 2.0
      2401Updated Nov 19, 2024Nov 19, 2024
    • Human-in-the-loop Feature Discovery with AutoFeat
      Jupyter Notebook
      1240Updated Oct 22, 2024Oct 22, 2024
    • styx

      Public
      Styx: Transactional Stateful Functions on Streaming Dataflows
      Python
      GNU General Public License v3.0
      21600Updated Oct 2, 2024Oct 2, 2024
    • Benchmarking suite for the Web-Scale Data Management course using Locust
      Python
      151201Updated Aug 9, 2024Aug 9, 2024
    • Key-value store with a choice of 3 backend engines all built from scratch, specifically designed for dataflow systems.
      Python
      0000Updated Aug 1, 2024Aug 1, 2024
    • Code repository for Adaptive Distributed Streaming Similarity Joins published in DEBS 2023.
      Java
      0100Updated Jun 14, 2024Jun 14, 2024
    • Python
      Apache License 2.0
      1200Updated Jun 14, 2024Jun 14, 2024
    • checkmate

      Public
      CheckMate: Evaluating Checkpointing Protocols for Streaming Dataflows
      Python
      2600Updated Jun 14, 2024Jun 14, 2024
    • wdm-project-template

      Public template
      Template project for TU Delft's Web-scale Data Management course
      Python
      12400Updated Jun 2, 2024Jun 2, 2024
    • Java
      3010Updated Feb 20, 2024Feb 20, 2024
    • Code base for BSc Research Project Q4/2023 - Group 19
      Python
      Apache License 2.0
      0200Updated Aug 9, 2023Aug 9, 2023
    • Github pages repository for the Delft FinTech hackathon landing page
      HTML
      0000Updated Jul 3, 2023Jul 3, 2023
    • SiMa

      Public
      Jupyter Notebook
      1000Updated Jun 2, 2023Jun 2, 2023
    • Modified code and experiments from the "Feature augmentation with reinforcement learning" paper
      Python
      Apache License 2.0
      1400Updated May 16, 2023May 16, 2023
    • 0000Updated Mar 14, 2023Mar 14, 2023
    • Site repo for modelsearch
      HTML
      0000Updated Feb 27, 2023Feb 27, 2023
    • Transactions for Stateful Functions as a Service. This repository implements and API and associated underpinnings for two-phase Commit and SAGAs on Apache Flink's Statefun.
      Java
      Apache License 2.0
      22500Updated Dec 15, 2022Dec 15, 2022
    • Beldi

      Public
      Go
      MIT License
      11000Updated Dec 15, 2022Dec 15, 2022
    • repro-di

      Public
      Jupyter Notebook
      0009Updated Nov 22, 2022Nov 22, 2022
    • FERDiS

      Public
      C#
      1521Updated Oct 21, 2022Oct 21, 2022
    • Valentine scalable deployment for VLDB demo
      Python
      Apache License 2.0
      1810Updated Sep 26, 2022Sep 26, 2022
    • stateflow

      Public
      Prototype which extracts stateful dataflows by analysing Python code.
      Python
      42020Updated Sep 8, 2022Sep 8, 2022
    • Python
      0000Updated Sep 8, 2022Sep 8, 2022
    • The output produced by the Valentine Experiment Suite included in the paper "The Valentine Experiment Suite for Schema Matching"
      Jupyter Notebook
      Apache License 2.0
      1208Updated Aug 23, 2022Aug 23, 2022
    • Source for the Modelsearch project, a search engine for finding models with specific properties across hubs that host them.
      Vue
      0000Updated Jul 18, 2022Jul 18, 2022
    • This repository provides data and scripts to use Sherlock, a neural-network based model to detect semantic data types. https://sherlock.media.mit.edu
      Jupyter Notebook
      70000Updated Feb 5, 2022Feb 5, 2022
    • Website for DBML workshop in conjunction with ICDE 2022
      HTML
      Apache License 2.0
      3000Updated Dec 15, 2021Dec 15, 2021
    • clonos

      Public
      Clonos is a novel approach on fault-recovery & high availability for stream processing, based on causal logging.
      Java
      Apache License 2.0
      13k600Updated Dec 10, 2021Dec 10, 2021
    • beam

      Public
      Apache Beam is a unified programming model for Batch and Streaming
      Java
      Apache License 2.0
      4.3k100Updated Dec 3, 2021Dec 3, 2021