Datasets
This article explains how to create and manage datasets in Hyperstack AI Studio using the API or UI. Datasets help you organize logs for fine-tuning, training analysis, and efficient model iteration.
In this article
Create Datasets Using the UI
You can create datasets directly through the AI Studio UI using the following steps:
-
Open the Logs & Datasets Page
Navigate to the Logs & Datasets page.
-
Filter Logs by Tags or Models
Use the filtering options to select logs associated with specific tags or models.
Filter requirementsAt least one tag or model must be selected to create a dataset.
-
Review and Select Logs
Once filters are applied, review the matching logs displayed in the list.
-
Create Dataset
Click the Create Dataset button. Provide a unique name for your dataset and optionally add a description.
-
Save Dataset
Click Save to finalize dataset creation. The new dataset will appear under the Datasets tab.
Once created, you can:
- View all logs associated with the dataset.
- Add or remove metadata tags.
- Use the dataset for fine-tuning or evaluation tasks within Hyperstack AI Studio.
Delete Datasets Using the UI
Datasets can then be viewed or deleted from the Dataset tab within the Data & Datasets section of the UI.
When deleting a dataset the data still persist, only the grouping is deleted. To delete the data follow the steps above in the "Delete Data" Section
Dataset APIs
Create Datasets API
POST https://api.ai.hyperstack.cloud/api/v1/datasets
To create a dataset via API, use the command below and replace the following variables:
API_KEY
: Your API key.name
: A unique name to identify your dataset.- At least one of
tags
ormodels
is required to define the log selection criteria. - Optional fields such as
description
,tags
,models
,from_date
, andto_date
can be included to filter logs or add metadata. See the full list in the Optional Parameters section.
curl -X POST "https://api.ai.hyperstack.cloud/api/v1/datasets" \
-H "X-API-KEY: API_KEY" \
-H "Content-Type: application/json" \
-d '{
"name": "YOUR_DATASET_NAME",
"tags": ["tag1", "tag2"],
"models": ["model1", "model2"],
"description": "A brief description of your dataset",
"from_date": "2024-01-01T00:00:00Z",
"to_date": "2024-12-31T23:59:59Z"
}'
Required Parameters
name
(string)
– The unique name of the dataset.
At least one of:
tags
(array of strings)
– Array of tags to filter the dataset by.models
(array of strings)
– Array of model names to filter the dataset by.
Optional Parameters
description
(string)
– A brief description of the dataset.from_date
(string)
– Start date for filtering data (ISO 8601 format).to_date
(string)
– End date for filtering data (ISO 8601 format).
Response
If successful, the API will return the following confirmation response:
{
"status": "success",
"message": "Successfully created dataset {name}"
}
Error Responses
-
422 Unprocessable Entity: Request is missing required parameters. The
name
field is mandatory, and you must provide at least one oftags
ormodels
to filter the data to include in the dataset. -
400 Bad Request: A dataset with the specified
name
already exists. Provide a unique name for your dataset.
List Datasets API
GET https://api.ai.hyperstack.cloud/api/v1/datasets
To retrieve a list of your datasets via API, provide your API_KEY
and execute the command below. The dataset ID returned by this API can be used in the path of the Delete Datasets API to delete the specified dataset.
curl -X GET "https://api.ai.hyperstack.cloud/api/v1/datasets" \
-H "X-API-KEY: API_KEY" \
Response
If successful, the API will return the following:
{
"message": [
{
"created_at": "Thu, 26 Jun 2025 20:28:39 GMT",
"description": "",
"filters": {
"from_date": null,
"models": [
"mistralai/mistral-7b-instruct-v0.3"
],
"tags": null,
"to_date": null
},
"id": 10,
"name": "example",
"total_logs": 0,
"updated_at": null,
"user_id": 14
}
],
"status": "success"
}
Click to view descriptions of response fields
message array
An array of dataset objects, each representing a dataset created in Hyperstack AI Studio.
Show child attributes
created_at string
Timestamp indicating when the dataset was created (RFC 1123 format).
description string
Optional description provided by the user for the dataset.
filters object
Filtering criteria used when creating the dataset.
- from_date: Start date of the data range, if specified.
- to_date: End date of the data range, if specified.
- models: List of model identifiers the dataset was filtered by.
- tags: Tag filters applied to the dataset (if any).
id integer
Unique identifier for the dataset.
name string
User-defined name of the dataset.
total_logs integer
Total number of log records included in the dataset.
updated_at string|null
Timestamp of the last update to the dataset, or null
if never updated.
user_id integer
Identifier of the user who created the dataset.
status string
Indicates the result of the API call. "success"
confirms that the list of datasets was retrieved successfully.
Delete Datasets API
DELETE https://api.ai.hyperstack.cloud/api/v1/datasets/{dataset_id}
To delete a dataset via API, use the command below and provide the ID of the dataset to be deleted in the path. Obtain the ID of the dataset by calling the List Datasets API.
API_KEY
: Your API key.dataset_id
: The ID of the dataset to be deleted.
curl -X DELETE "https://api.ai.hyperstack.cloud/api/v1/datasets/{dataset_id}" \
-H "X-API-KEY: API_KEY" \
Required Parameters
dataset_id
(integer)
– The ID of the dataset to be deleted.
Response
If successful, the API will return the following confirmation response:
{
"message": "Successfully deleted dataset {name}",
"status": "success"
}
Error Responses
- 404 Not Found: Providing a nonexistent dataset ID will return this status. Use the List Datasets API to obtain the correct ID.