public class CompressedSizeEstimatorSample extends CompressedSizeEstimator
Constructor and Description |
---|
CompressedSizeEstimatorSample(MatrixBlock data,
CompressionSettings cs,
int sampleSize)
CompressedSizeEstimatorSample, samples from the input data and estimates the size of the compressed matrix.
|
Modifier and Type | Method and Description |
---|---|
CompressedSizeInfoColGroup |
estimateCompressedColGroupSize(int[] colIndexes,
int nrUniqueUpperBound)
A method to extract the Compressed Size Info for a given list of columns, This method further limits the
estimated number of unique values, since in some cases the estimated number of uniques is estimated higher than
the number estimated in sub groups of the given colIndexes.
|
MatrixBlock |
getSample() |
MatrixBlock |
sampleData(int sampleSize) |
String |
toString() |
computeCompressedSizeInfos, computeCompressedSizeInfos, computeCompressedSizeInfos, estimateCompressedColGroupSize, estimateCompressedColGroupSize, estimateCompressedColGroupSize, estimateCompressedColGroupSize, getNumColumns, getNumRows
public CompressedSizeEstimatorSample(MatrixBlock data, CompressionSettings cs, int sampleSize)
data
- The input data toSample fromcs
- The Settings used for the sampling, and compression, contains information such as seed.sampleSize
- The size to sample from the data.public MatrixBlock getSample()
public MatrixBlock sampleData(int sampleSize)
public CompressedSizeInfoColGroup estimateCompressedColGroupSize(int[] colIndexes, int nrUniqueUpperBound)
CompressedSizeEstimator
estimateCompressedColGroupSize
in class CompressedSizeEstimator
colIndexes
- The columns to extract compression information fromnrUniqueUpperBound
- The upper bound of unique elements allowed in the estimate, can be calculated from the
number of unique elements estimated in sub columns multiplied together. This is
flexible in the sense that if the sample is small then this unique can be manually
edited like in CoCodeCostMatrixMult.Copyright © 2021 The Apache Software Foundation. All rights reserved.