public class CompressedSizeEstimatorSample extends CompressedSizeEstimator
Modifier and Type | Method and Description |
---|---|
CompressedSizeInfoColGroup |
estimateCompressedColGroupSize(int[] colIndexes,
int estimate,
int nrUniqueUpperBound)
A method to extract the Compressed Size Info for a given list of columns, This method further limits the estimated
number of unique values, since in some cases the estimated number of uniques is estimated higher than the number
estimated in sub groups of the given colIndexes.
|
MatrixBlock |
getSample() |
int |
getSampleSize() |
String |
toString() |
computeCompressedSizeInfos, computeCompressedSizeInfos, computeCompressedSizeInfos, estimateCompressedColGroupSize, estimateCompressedColGroupSize, estimateCompressedColGroupSize, estimateCompressedColGroupSize, estimateJoinCompressedSize, estimateJoinCompressedSize, getData, getNumColumns, getNumRows
public MatrixBlock getSample()
public final int getSampleSize()
public CompressedSizeInfoColGroup estimateCompressedColGroupSize(int[] colIndexes, int estimate, int nrUniqueUpperBound)
CompressedSizeEstimator
estimateCompressedColGroupSize
in class CompressedSizeEstimator
colIndexes
- The columns to extract compression information fromestimate
- An estimate of number of unique elements in these columnsnrUniqueUpperBound
- The upper bound of unique elements allowed in the estimate, can be calculated from the
number of unique elements estimated in sub columns multiplied together. This is flexible
in the sense that if the sample is small then this unique can be manually edited like in
CoCodeCostMatrixMult.Copyright © 2022 The Apache Software Foundation. All rights reserved.