Package edu.cmu.tetrad.data
Class DataSampling
java.lang.Object
edu.cmu.tetrad.data.DataSampling
A utility for resampling datasets.
Feb 20, 2024 6:57:28 PM
- Author:
- Kevin V. Bui (kvb2univpitt@gmail.com)
-
Method Summary
Modifier and Type, Method, Description:
- static DataSet
  createDataSample(DataSet dataSet, org.apache.commons.math3.random.RandomGenerator randomGenerator, int[] selectedColumns, Parameters parameters, double percentResamplingSize)
  Creates a resampled dataset from the given dataset based on the specified parameters.
- static List<DataSet>
  createDataSamples(DataSet dataSet, org.apache.commons.math3.random.RandomGenerator randomGenerator, Parameters parameters)
  Creates a list of datasets resampled from the given dataset.
- static List<DataSet>
  createDataSamples(org.apache.commons.math3.random.RandomGenerator randomGenerator, DataSet dataSet, Parameters parameters)
  Creates a list of datasets resampled from the given dataset.
-
Method Details
-
createDataSamples
public static List<DataSet> createDataSamples(org.apache.commons.math3.random.RandomGenerator randomGenerator, DataSet dataSet, Parameters parameters)
Creates a list of datasets resampled from the given dataset.
- Parameters:
randomGenerator - the random number generator to use
dataSet - the dataset to resample
parameters - bootstrap-related parameters
- Returns:
- a list of resampled datasets
-
createDataSamples
public static List<DataSet> createDataSamples(DataSet dataSet, org.apache.commons.math3.random.RandomGenerator randomGenerator, Parameters parameters)
Creates a list of datasets resampled from the given dataset.
- Parameters:
dataSet - the dataset to resample
randomGenerator - the random number generator (optional)
parameters - bootstrap-related parameters
- Returns:
- a list of resampled datasets
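The idea of producing a list of resamples from one random number generator can be illustrated without the Tetrad classes. The following is a minimal sketch, not the library's implementation: it uses java.util.Random as a stand-in for the Apache Commons RandomGenerator, works on row indices rather than a DataSet, and assumes that the bootstrap-related parameters supply a number of resamples, a sample size expressed as a percentage on a 0 to 100 scale, and a with-replacement flag. The class and method names (BootstrapIndexSketch, resampleIndices, and so on) are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Random;

// Hypothetical illustration only; not part of edu.cmu.tetrad.data.
final class BootstrapIndexSketch {

    /** Draws sampleSize row indices from [0, numRows), allowing repeats. */
    static int[] drawWithReplacement(int numRows, int sampleSize, Random rng) {
        int[] rows = new int[sampleSize];
        for (int i = 0; i < sampleSize; i++) {
            rows[i] = rng.nextInt(numRows);
        }
        return rows;
    }

    /** Draws sampleSize distinct row indices using a partial Fisher-Yates shuffle. */
    static int[] drawWithoutReplacement(int numRows, int sampleSize, Random rng) {
        if (sampleSize > numRows) {
            throw new IllegalArgumentException("sampleSize exceeds numRows");
        }
        int[] pool = new int[numRows];
        for (int i = 0; i < numRows; i++) {
            pool[i] = i;
        }
        for (int i = 0; i < sampleSize; i++) {
            int j = i + rng.nextInt(numRows - i); // pick from the unshuffled tail
            int tmp = pool[i];
            pool[i] = pool[j];
            pool[j] = tmp;
        }
        return Arrays.copyOf(pool, sampleSize);
    }

    /**
     * Builds numResamples index sets. The percent argument is assumed to be
     * on a 0 to 100 scale; that scale is an assumption of this sketch.
     */
    static List<int[]> resampleIndices(int numRows, int numResamples, double percent,
                                       boolean withReplacement, Random rng) {
        int sampleSize = (int) Math.round(numRows * percent / 100.0);
        List<int[]> resamples = new ArrayList<>(numResamples);
        for (int i = 0; i < numResamples; i++) {
            resamples.add(withReplacement
                    ? drawWithReplacement(numRows, sampleSize, rng)
                    : drawWithoutReplacement(numRows, sampleSize, rng));
        }
        return resamples;
    }

    public static void main(String[] args) {
        // A fixed seed makes the whole list of resamples reproducible.
        for (int[] rows : resampleIndices(10, 3, 50.0, false, new Random(42L))) {
            System.out.println(Arrays.toString(rows));
        }
    }
}
```

Passing the generator in, rather than creating one inside the method, lets a caller reproduce the same list of resamples by reusing a seed.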
-
createDataSample
public static DataSet createDataSample(DataSet dataSet, org.apache.commons.math3.random.RandomGenerator randomGenerator, int[] selectedColumns, Parameters parameters, double percentResamplingSize)
Creates a resampled dataset from the given dataset based on the specified parameters.
- Parameters:
dataSet - the input dataset from which the sample will be created
randomGenerator - the random number generator used for sampling
selectedColumns - an array of column indices to include in the sampled dataset
parameters - the parameters for sampling, including sampling fraction and resampling method
percentResamplingSize - the percentage of rows to select at random for each resample
- Returns:
- a new dataset containing the selected rows and columns
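What "a new dataset containing the selected rows and columns" means can be shown on a plain matrix. This is a minimal sketch under stated assumptions, not the Tetrad implementation: a double[][] stands in for DataSet, java.util.Random stands in for RandomGenerator, rows are drawn with replacement, and percentResamplingSize is read on a 0 to 100 scale. The class DataSampleSketch and its method are hypothetical names.

```java
import java.util.Arrays;
import java.util.Random;

// Hypothetical illustration only; not part of edu.cmu.tetrad.data.
final class DataSampleSketch {

    /**
     * Returns a new matrix whose rows are drawn at random (with replacement,
     * by assumption) from data and whose columns are those listed in
     * selectedColumns, in the order given.
     */
    static double[][] createSample(double[][] data, Random rng,
                                   int[] selectedColumns, double percentResamplingSize) {
        int numRows = data.length;
        // Assumed 0 to 100 scale: 50.0 keeps half as many rows as the input.
        int sampleSize = (int) Math.round(numRows * percentResamplingSize / 100.0);
        double[][] sample = new double[sampleSize][selectedColumns.length];
        for (int r = 0; r < sampleSize; r++) {
            double[] source = data[rng.nextInt(numRows)];
            for (int c = 0; c < selectedColumns.length; c++) {
                sample[r][c] = source[selectedColumns[c]];
            }
        }
        return sample;
    }

    public static void main(String[] args) {
        double[][] data = {
            {0, 10, 20},
            {1, 11, 21},
            {2, 12, 22},
            {3, 13, 23}
        };
        // Keep columns 2 and 0 and draw 50% of the rows.
        double[][] sample = createSample(data, new Random(42L), new int[] {2, 0}, 50.0);
        System.out.println(Arrays.deepToString(sample));
    }
}
```

The input matrix is never modified; the sample is a fresh copy, which matches the documented return value of a new dataset.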
-