Skip to content

An Audio Dataset Manager for Audio Machine Learning Tasks

License

Notifications You must be signed in to change notification settings

THasthika/AudioMosaic

Repository files navigation

AudioMosaic

codecov

AudioMosaic is an Audio Dataset Manager designed specifically for Machine Learning projects. It provides a simple and intuitive interface to organize, manage, and preprocess audio datasets, making it easier to work with audio data in your machine learning pipeline.

Features (WIP)

  • Dataset Organization: AudioMosaic allows you to organize your audio datasets into projects, making it easier to manage multiple datasets for different machine learning tasks.
  • Data Import: Import audio files from various sources, such as local directories, remote URLs, or cloud storage.
  • Metadata Management: Annotate and manage metadata associated with your audio files, including labels, tags, and any other relevant information.
  • Preprocessing: Apply various preprocessing techniques to your audio data, such as audio augmentation, feature extraction, and signal processing.
  • Audio Augmentation: Perform data augmentation on your audio files to increase the diversity and robustness of your dataset. Apply techniques like time stretching, pitch shifting, background noise addition, and more.
  • Visualization: Visualize audio waveforms, spectrograms, and other audio representations to gain insights into your dataset.
  • Data Splitting: Split your dataset into training, validation, and testing subsets, ensuring proper evaluation of your machine learning models.
  • Export: Export your processed dataset in a format suitable for your machine learning framework or library.

Installation (WIP)

  1. Clone the AudioMosaic repository
# 1. Clone the repo
git clone https://github.com/THasthika/AudioMosaic.git

# 2. Navigate to the directory
cd AudioMosaic

# 3. Install the deps (Poetry)
poetry install

# ... (still pending)

Contributing

Contributions are welcome! If you have any ideas, suggestions, or bug reports, please open an issue in the AudioMosaic repository. See our Contributing Guidelines for more information.

License

AudioMosaic is licensed under the MIT License.

Acknowledgements

We would like to thank the open-source community for their valuable contributions and the developers of the libraries and tools that AudioMosaic depends on.

Contact

If you have any questions, comments, or feedback, please feel free to reach out to us by opening an issue in the AudioMosaic repository.

Happy audio dataset management!

About

An Audio Dataset Manager for Audio Machine Learning Tasks

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published