Pull requests: modelscope/data-juicer

Pull requests list

[WIP] docs for distributed processing
Labels: dj:dist (issues/PRs about distributed data processing); documentation (Improvements or additions to documentation)
#523 opened Dec 26, 2024 by HYLcool Draft
Dev/manage meta
#518 opened Dec 24, 2024 by BeachWang Draft
Add minhash deduplicator based on RAY.
Labels: dj:dist (issues/PRs about distributed data processing); dj:efficiency (regarding to efficiency issues and enhancements); dj:op (issues/PRs about some specific OPs)
#502 opened Nov 28, 2024 by chenyushuo
Add minhash deduplicator based on RAY and Redis
Labels: dj:dist (issues/PRs about distributed data processing); dj:efficiency (regarding to efficiency issues and enhancements); dj:op (issues/PRs about some specific OPs)
#489 opened Nov 15, 2024 by pan-x-c
Automatically split input dataset in ray mode
#415 opened Sep 4, 2024 by pan-x-c
[WIP] Add text tagging by prompt mapper op
Labels: dj:op (issues/PRs about some specific OPs)
#408 opened Aug 30, 2024 by garyzhang99
1 task
Add text_pair_similarity_filter
Labels: dj:multimodal (issues/PRs about multimodal data processing); dj:op (issues/PRs about some specific OPs); enhancement (New feature or request)
#405 opened Aug 28, 2024 by Qirui-jiao Draft
Add sentence_augmentation_mapper
Labels: dj:multimodal (issues/PRs about multimodal data processing); dj:op (issues/PRs about some specific OPs); enhancement (New feature or request)
#401 opened Aug 22, 2024 by Qirui-jiao Draft
Add mllm_mapper
Labels: dj:multimodal (issues/PRs about multimodal data processing); dj:op (issues/PRs about some specific OPs); enhancement (New feature or request)
#400 opened Aug 22, 2024 by Qirui-jiao Draft
Add sdxl_prompt2prompt_mapper
Labels: dj:multimodal (issues/PRs about multimodal data processing); dj:op (issues/PRs about some specific OPs); enhancement (New feature or request)
#395 opened Aug 21, 2024 by Qirui-jiao Draft
[Ready] Add image_segment_mapper
Labels: dj:multimodal (issues/PRs about multimodal data processing); dj:op (issues/PRs about some specific OPs); enhancement (New feature or request)
#394 opened Aug 21, 2024 by Qirui-jiao
Add GPT-4V as evaluator
Labels: dj:multimodal (issues/PRs about multimodal data processing); enhancement (New feature or request); stale-pr
#276 opened Mar 22, 2024 by drcege Draft DJ-SORA