Skip to main content

Datasets

This article explains how to create and manage datasets in Hyperstack AI Studio using the API or UI. Datasets help you organize logs for fine-tuning, training analysis, and efficient model iteration.

In this article


Create Datasets Using the UI

You can create datasets directly through the AI Studio UI using the following steps:

  1. Open the Logs & Datasets Page

    Navigate to the Logs & Datasets page.

  2. Filter Logs by Tags or Models

    Use the filtering options to select logs associated with specific tags or models.

    Filter requirements

    At least one tag or model must be selected to create a dataset.

  3. Review and Select Logs

    Once filters are applied, review the matching logs displayed in the list.

  4. Create Dataset

    Click the Create Dataset button. Provide a unique name for your dataset and optionally add a description.

  5. Save Dataset

    Click Save to finalize dataset creation. The new dataset will appear under the Datasets tab.

Once created, you can:

  • View all logs associated with the dataset.
  • Add or remove metadata tags.
  • Use the dataset for fine-tuning or evaluation tasks within Hyperstack AI Studio.

Delete Datasets Using the UI

Datasets can then be viewed or deleted from the Dataset tab within the Data & Datasets section of the UI.

Dataset vs Data Deletion

When deleting a dataset the data still persist, only the grouping is deleted. To delete the data follow the steps above in the "Delete Data" Section


Dataset APIs

Create Datasets API

https://api.ai.hyperstack.cloud/api/v1/datasets

To create a dataset via API, use the command below and replace the following variables:

  • API_KEY: Your API key.
  • name: A unique name to identify your dataset.
  • At least one of tags or models is required to define the log selection criteria.
  • Optional fields such as description, tags, models, from_date, and to_date can be included to filter logs or add metadata. See the full list in the Optional Parameters section.
curl -X POST "https://api.ai.hyperstack.cloud/api/v1/datasets" \
-H "X-API-KEY: API_KEY" \
-H "Content-Type: application/json" \
-d '{
"name": "YOUR_DATASET_NAME",
"tags": ["tag1", "tag2"],
"models": ["model1", "model2"],
"description": "A brief description of your dataset",
"from_date": "2024-01-01T00:00:00Z",
"to_date": "2024-12-31T23:59:59Z"
}'

Response

If successful, the API will return the following confirmation response:

201 Created
{
"status": "success",
"message": "Successfully created dataset {name}"
}

Error Responses

  • 422 Unprocessable Entity: Request is missing required parameters. The name field is mandatory, and you must provide at least one of tags or models to filter the data to include in the dataset.

  • 400 Bad Request: A dataset with the specified name already exists. Provide a unique name for your dataset.


List Datasets API

GET https://api.ai.hyperstack.cloud/api/v1/datasets

To retrieve a list of your datasets via API, provide your API_KEY and execute the command below. The dataset ID returned by this API can be used in the path of the Delete Datasets API to delete the specified dataset.

curl -X GET "https://api.ai.hyperstack.cloud/api/v1/datasets" \
-H "X-API-KEY: API_KEY" \

Response

If successful, the API will return the following:

200 OK
{
"message": [
{
"created_at": "Thu, 26 Jun 2025 20:28:39 GMT",
"description": "",
"filters": {
"from_date": null,
"models": [
"mistralai/mistral-7b-instruct-v0.3"
],
"tags": null,
"to_date": null
},
"id": 10,
"name": "example",
"total_logs": 0,
"updated_at": null,
"user_id": 14
}
],
"status": "success"
}
Click to view descriptions of response fields
message array

An array of dataset objects, each representing a dataset created in Hyperstack AI Studio.

Show child attributes
created_at string

Timestamp indicating when the dataset was created (RFC 1123 format).


description string

Optional description provided by the user for the dataset.


filters object

Filtering criteria used when creating the dataset.

  • from_date: Start date of the data range, if specified.
  • to_date: End date of the data range, if specified.
  • models: List of model identifiers the dataset was filtered by.
  • tags: Tag filters applied to the dataset (if any).

id integer

Unique identifier for the dataset.


name string

User-defined name of the dataset.


total_logs integer

Total number of log records included in the dataset.


updated_at string|null

Timestamp of the last update to the dataset, or null if never updated.


user_id integer

Identifier of the user who created the dataset.


status string

Indicates the result of the API call. "success" confirms that the list of datasets was retrieved successfully.


Delete Datasets API

DELETE https://api.ai.hyperstack.cloud/api/v1/datasets/{dataset_id}

To delete a dataset via API, use the command below and provide the ID of the dataset to be deleted in the path. Obtain the ID of the dataset by calling the List Datasets API.

  • API_KEY: Your API key.
  • dataset_id: The ID of the dataset to be deleted.
curl -X DELETE "https://api.ai.hyperstack.cloud/api/v1/datasets/{dataset_id}" \
-H "X-API-KEY: API_KEY" \

Response

If successful, the API will return the following confirmation response:

200 OK
{
"message": "Successfully deleted dataset {name}",
"status": "success"
}

Error Responses

  • 404 Not Found: Providing a nonexistent dataset ID will return this status. Use the List Datasets API to obtain the correct ID.

Back to top