Class DataSampling

java.lang.Object
edu.cmu.tetrad.data.DataSampling

public final class DataSampling extends Object
A utility for resampling dataset.

Feb 20, 2024 6:57:28 PM

Author:
Kevin V. Bui (kvb2univpitt@gmail.com)
  • Method Details

    • createDataSamples

      public static List<DataSet> createDataSamples(org.apache.commons.math3.random.RandomGenerator randomGenerator, DataSet dataSet, Parameters parameters)
      Create a list of dataset resampled from the given dataset.
      Parameters:
      randomGenerator - the random number generate to use.
      dataSet - dataset to resample
      parameters - bootstrap-related parameters
      Returns:
      a list of resampled dataset
    • createDataSamples

      public static List<DataSet> createDataSamples(DataSet dataSet, org.apache.commons.math3.random.RandomGenerator randomGenerator, Parameters parameters)
      Create a list of dataset resampled from the given dataset.
      Parameters:
      dataSet - dataset to resample
      randomGenerator - random number generator (optional)
      parameters - bootstrap-related parameters
      Returns:
      a list of resampled dataset
    • createDataSample

      public static DataSet createDataSample(DataSet dataSet, org.apache.commons.math3.random.RandomGenerator randomGenerator, int[] selectedColumns, Parameters parameters, double percentResamplingSize)
      Creates a resampled dataset from the given dataset based on the specified parameters.
      Parameters:
      dataSet - the input dataset from which the sample will be created
      randomGenerator - the random number generator used for sampling
      selectedColumns - an array of column indices to include in the sampled dataset
      parameters - the parameters for sampling, including sampling fraction and resampling method
      percentResamplingSize - The percentage of random rows to use for each resampling.
      Returns:
      a new dataset containing the selected rows and columns