Fundamentals
Overview
The Platform includes a Dataset module designed for comprehensive management and organization of machine learning datasets.
You interact with this module through the Dataset Client, a dynamic API client.
You can also work with Dataframes within the Dataset module. A Dataframe is always a single CSV file, whereas a Dataset can be a set of files.
Key Benefits
- Accessibility: Provides access to data from any machine or location.
- Storage Optimization: Uses a hierarchical structure in which a dataset inherits files from its parent datasets, reducing redundancy and storage use.
- Versioning: Supports creating and tracking multiple dataset versions, so files can be updated, added, or removed between versions.
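To make the inheritance and versioning ideas concrete, here is a minimal conceptual sketch in Python. It is not the Dataset Client API: the `DatasetVersion` class, its fields, and the file names and hashes are all hypothetical, chosen only to illustrate how a child version might store just its own changes and resolve the rest from its parent.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class DatasetVersion:
    """Hypothetical model of one dataset version (not the Platform's API)."""
    name: str
    parent: Optional["DatasetVersion"] = None
    # Files introduced or overwritten in this version: path -> content hash.
    added: Dict[str, str] = field(default_factory=dict)
    # Paths inherited from the parent that this version drops.
    removed: Set[str] = field(default_factory=set)

    def files(self) -> Dict[str, str]:
        """Resolve the full file listing by walking up the parent chain."""
        inherited = self.parent.files() if self.parent else {}
        resolved = {p: h for p, h in inherited.items() if p not in self.removed}
        resolved.update(self.added)  # this version's own files take precedence
        return resolved


# The child stores only its changes; unchanged files are inherited, not copied.
v1 = DatasetVersion("v1", added={"train.csv": "hash-a", "test.csv": "hash-b"})
v2 = DatasetVersion("v2", parent=v1, added={"extra.csv": "hash-c"}, removed={"test.csv"})

print(sorted(v2.files()))  # ['extra.csv', 'train.csv']
```

In this sketch `v2` records one added file and one removal, yet still resolves `train.csv` from `v1`, which is the kind of redundancy reduction the Storage Optimization point describes.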