Change Data-ID to chunking algorithm compatible with other implementations. #88
Labels
Affects: Code
Affects: Compat
Breaks compatibility with last version.
Affects: Spec
Priority: High
Scope: Medium
Milestone
Recently some implementations of FastCDC in different programming languages have shown up that are compatible to each other:
https://github.com/ronomon/deduplication
https://github.com/nlfiedler/fastcdc-rs
https://github.com/titusz/fastcdc-py
We should join this development and update the Data-ID chunking algorithm to be in line with those implementations. We also should implement an optional accelerated version as the 5 MB/s of a pure python version is a major performance bottleneck.
The text was updated successfully, but these errors were encountered: