Skip to content

Fundamentals

Overview

The Platform includes a Dataset module designed for comprehensive management and organization of machine learning datasets.

Interaction with this module is facilitated by the Dataset Client, a dynamic API client.

You can also work with Dataframes within the Datasets module. Dataframe is always one CSV file and Dataset can be a set of files.

Key Benefits

  • Accessibility: Provides easy access to data from any machine or location, enhancing data availability.
  • Storage Optimization: Implements a hierarchical structure to inherit files from parent datasets, reducing redundancy and optimizing storage use.
  • Versioning: Allows for efficient creation and tracking of multiple dataset versions, facilitating updates, additions, or removals of files.