Data Synthesis
This article explains how to use Hyperstack AI Studio’s synthesis tools to generate high-quality variations of your training data. It covers when and why to synthesize data, how to start synthesis from an existing dataset, and which filters you can apply. All of these features are currently available through the UI.
Why Synthesize Logs?
Data synthesis is necessary when you have model outputs that cannot be used directly to train other models. This is particularly relevant when:
- You have saved outputs from a model that you want to use for training
- You need to create variations of your existing data while maintaining its characteristics
Synthesize Logs Using the UI
You can generate high-quality synthetic training data directly in the AI Studio UI.
- Go to the Logs & Datasets Page
  Navigate to the Logs & Datasets page and open the Datasets tab.
- Select Logs to Synthesize
  By default, all logs in the selected dataset are included. To narrow the selection, apply filters:
  - Tags
  - Models
- Click the “Synthesize Logs” button and confirm the action.
- Review Results
  After synthesis:
  - You’ll receive a success notification.
  - The logs table lets you toggle between the Original and Synthetic versions.
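The Tags and Models filters in the steps above can be pictured as a simple selection over log records. The following is a minimal local sketch only, not an AI Studio API: the `LogRecord` type, the `select_logs` helper, the sample IDs, and the rule that a log matches when it carries at least one of the requested tags are all assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class LogRecord:
    """Hypothetical local representation of one log row (not an AI Studio type)."""
    log_id: str
    model: str
    tags: set[str] = field(default_factory=set)


def select_logs(logs, tags=None, models=None):
    """Return the logs that pass the optional tag and model filters.

    With no filters, every log is kept, mirroring the default of
    including all logs in the dataset.
    Assumption: a log passes the tag filter if it carries at least one
    requested tag; the UI may combine tags differently.
    """
    wanted_tags = set(tags or [])
    wanted_models = set(models or [])
    selected = []
    for log in logs:
        if wanted_tags and not (log.tags & wanted_tags):
            continue  # none of the requested tags is present
        if wanted_models and log.model not in wanted_models:
            continue  # produced by a model that was not requested
        selected.append(log)
    return selected


# Hypothetical sample data for illustration.
logs = [
    LogRecord("log-1", "model-a", {"support"}),
    LogRecord("log-2", "model-b", {"support", "billing"}),
    LogRecord("log-3", "model-a"),
]

for log in select_logs(logs, tags=["support"], models=["model-a"]):
    print(log.log_id)
```

In this sketch, applying both filters narrows the selection to logs that satisfy each of them, while omitting a filter leaves that dimension unrestricted.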
Synthesis is currently only available through the UI. API support is coming soon.