[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
-
Updated
Nov 2, 2023 - Python
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
AnnotationTool to create multi-view annotations for the JARVIS 3D Markerless Pose Estimation Toolbox
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
A Python library designed for scraping data from the SCP wiki.
Generates training data for training ML models. This may seem useless but it can sometimes be helpful if you don't feel like finding a ton of data yourself. Plus, because you can customize how data is generated, you can create some nasty datasets to test your algorithms.
Some simple python scripts for generating data for remote sensing purpose.
Codes for machine learning exercises on DL models and training data generation pipelines
UI to prepare training data for SmartTool inside Supervisely platform
This repository has research paper implementation which reconstructs training data.
This is a MATLAB source code of the enhanced equidistribution, which guarantees that the generated random sequence follows the theoretical uniform distribution.
Add a description, image, and links to the training-data-generation topic page so that developers can more easily learn about it.
To associate your repository with the training-data-generation topic, visit your repo's landing page and select "manage topics."