Package org.apache.datasketches.frequencies

This package provides streaming algorithms that estimate how often each item occurs in a weighted multiset stream.

Package org.apache.datasketches.frequencies Description

This package provides streaming algorithms that estimate how often each item occurs in a weighted multiset stream, that is, a stream in which each item may arrive with an associated weight. When the frequency distribution of items is sufficiently skewed, these algorithms are well suited to identifying the "Heavy Hitters", the items that occur most frequently in the stream. Each frequency estimate has well-understood error bounds, which the sketch can return.

These algorithms are sometimes referred to as "TopN" algorithms.
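
To illustrate how a bounded-memory sketch can report both an estimate and its error bounds, here is a minimal, self-contained sketch of a weighted Misra-Gries counter scheme. It is an illustration of the general idea only, not this package's implementation; the class name WeightedMisraGries and its methods (update, lowerBound, upperBound, maximumError) are hypothetical names chosen for this example.

```java
import java.util.HashMap;
import java.util.Iterator;
import java.util.Map;

// Illustrative only: a weighted Misra-Gries summary, NOT the library's code.
final class WeightedMisraGries {
  private final int maxCounters;                 // memory budget: number of tracked items
  private final Map<String, Long> counters = new HashMap<>();
  private long offset = 0;                       // total amount subtracted so far

  WeightedMisraGries(int maxCounters) {
    if (maxCounters < 1) {
      throw new IllegalArgumentException("maxCounters must be at least 1");
    }
    this.maxCounters = maxCounters;
  }

  // Adds an item with a positive weight to the summary.
  void update(String item, long weight) {
    if (weight <= 0) {
      throw new IllegalArgumentException("weight must be positive");
    }
    counters.merge(item, weight, Long::sum);
    if (counters.size() > maxCounters) {
      purge();
    }
  }

  // Subtracts the smallest counter from every counter and drops the zeros.
  private void purge() {
    long min = Long.MAX_VALUE;
    for (long c : counters.values()) {
      min = Math.min(min, c);
    }
    offset += min;
    Iterator<Map.Entry<String, Long>> it = counters.entrySet().iterator();
    while (it.hasNext()) {
      Map.Entry<String, Long> e = it.next();
      long reduced = e.getValue() - min;
      if (reduced == 0) {
        it.remove();
      } else {
        e.setValue(reduced);
      }
    }
  }

  // The true weight of the item is never below this value.
  long lowerBound(String item) {
    return counters.getOrDefault(item, 0L);
  }

  // The true weight of the item is never above this value.
  long upperBound(String item) {
    return lowerBound(item) + offset;
  }

  // Width of the interval [lowerBound, upperBound] for every item.
  long maximumError() {
    return offset;
  }

  public static void main(String[] args) {
    WeightedMisraGries sketch = new WeightedMisraGries(2);
    sketch.update("a", 5);
    sketch.update("b", 3);
    sketch.update("c", 1);   // exceeds the budget and triggers a purge
    System.out.println("a in [" + sketch.lowerBound("a") + ", "
        + sketch.upperBound("a") + "]");
    System.out.println("c in [" + sketch.lowerBound("c") + ", "
        + sketch.upperBound("c") + "]");
  }
}
```

In this scheme the interval width grows only when a purge occurs, so on a heavily skewed stream the heavy hitters keep large lower bounds while rare items are dropped, which is why skew makes the estimates more useful.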

Copyright © 2015–2024 The Apache Software Foundation. All rights reserved.