This transform returns the specified percentage out of the full set of input records using random selection.
The Percentage Sampling transform requires one input transform that has at least one column.
2. Add the transform
Click to select the connector link between two transforms.
In the toolbar, choose Insert Other, then Percentage Sampling.
To edit/configure the transform, select it and choose Configure in the toolbar.
In the Percentage Sampling transform configuration dialog:
- Set the Sampling Percentage - The percentage of the input records that you want included in the output.
- Optionally, set the Random Seed - An arbitrary number no greater than 2,147,483,647, used to used to initialize a pseudorandom number generator. If this is set to zero, the current time is used to randomly determine which records will be sampled; otherwise, the same records will be sampled given the same input data and sampling percentage.
- Select the columns to be included in the output.
As with other transforms, these settings can be turned into a parameter to be changed outside of the transform or the data cube (e.g., with a filter on a dashboard) by clicking Define parameters.
Submit the configuration dialog, and with the transform selected, open the Data Preview to view the output.