Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(pipeline): first trials at duplicates identification #295

Closed
wants to merge 1 commit into from

Conversation

YannickPassa
Copy link
Contributor

Notebook avec les premières explorations autour de la déduplication.
Dans le premier commit j'essaie de créer un jeu de données test à partir des données FT.

Dans un 2nd temps je vais ajouter des structures de l'IAE afin d'avoir un fichier de test + générique.

@vmttn
Copy link
Contributor

vmttn commented Sep 18, 2024

(j'ai converti en draft, car les PRs en ready for review sont déployées en staging)

@YannickPassa YannickPassa changed the title Deduplication feat(pipeline): first trials at duplicates identification Sep 25, 2024
@vperron
Copy link
Contributor

vperron commented Oct 8, 2024

Je clôture cette PR pour l'instant, je pense qu'on la rouvrira si l'on souhaite récupérer et retravailler le Notebook correspondant.

Trop de PR ouvertes ^^

@vperron vperron closed this Oct 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants