Skip to content

v0.7.1 - Code fixes, splits specification, and metadata validation and handling

Compare
Choose a tag to compare
@ascourtas ascourtas released this 13 Nov 20:17
· 857 commits to main since this release
199395a

This release addresses some previous bugs with loading datasets on init(); that functionality has been removed for the time being, in favor of a more robust refactor in future releases.

In addition to code fixes, we've added the following functionality and improvements:

Users can now specify to download specific splits when they load_data(); this reduces the time and RAM required for people to use datasets when they may only need part of it.

Ex: tr = f.load_data(split="train")

Also, dataset metadata are now validated with appropriate error handling, so a user publishing a new dataset is instantly notified if any part of their specification is incompatible with the metadata schema. This will improve the user experience for both dataset publishers and consumers.

Additionally, this release includes code cleanup, docs improvements, and new applied AI examples.

What's Changed

Full Changelog: v0.7.0...v0.7.1