Package cz.o2.proxima.beam.core
Interface DataAccessor
-
- All Superinterfaces:
AbstractDataAccessor,java.io.Serializable
- All Known Implementing Classes:
AttributeFamilyProxyDataAccessor,DirectDataAccessorWrapper,PubSubDataAccessor
@Internal public interface DataAccessor extends AbstractDataAccessor
A data accessor for attribute families.
-
-
Method Summary
All Methods Instance Methods Abstract Methods Modifier and Type Method Description org.apache.beam.sdk.values.PCollection<StreamElement>createBatch(org.apache.beam.sdk.Pipeline pipeline, java.util.List<AttributeDescriptor<?>> attrs, long startStamp, long endStamp)CreatePCollectionfor given attribute family's batch updates storage.org.apache.beam.sdk.values.PCollection<StreamElement>createStream(java.lang.String name, org.apache.beam.sdk.Pipeline pipeline, Position position, boolean stopAtCurrent, boolean eventTime, long limit)CreatePCollectionfor given attribute family's commit log.org.apache.beam.sdk.values.PCollection<StreamElement>createStreamFromUpdates(org.apache.beam.sdk.Pipeline pipeline, java.util.List<AttributeDescriptor<?>> attrs, long startStamp, long endStamp, long limit)CreatePCollectionfor given attribute family's batchUpdates.-
Methods inherited from interface cz.o2.proxima.core.storage.internal.AbstractDataAccessor
getUri
-
-
-
-
Method Detail
-
createStream
org.apache.beam.sdk.values.PCollection<StreamElement> createStream(java.lang.String name, org.apache.beam.sdk.Pipeline pipeline, Position position, boolean stopAtCurrent, boolean eventTime, long limit)
CreatePCollectionfor given attribute family's commit log.- Parameters:
name- name of the consumerpipeline- pipeline to createPCollectioninposition- to read fromstopAtCurrent- stop reading at current dataeventTime-trueto use event timelimit- limit number of elements read. Note that the number of elements might be actually lower, because it is divided by number of partitions It is useful mostly for testing purposes- Returns:
PCollectionrepresenting the commit log
-
createStreamFromUpdates
org.apache.beam.sdk.values.PCollection<StreamElement> createStreamFromUpdates(org.apache.beam.sdk.Pipeline pipeline, java.util.List<AttributeDescriptor<?>> attrs, long startStamp, long endStamp, long limit)
CreatePCollectionfor given attribute family's batchUpdates. The created PCollection is purposefully treated as unbounded (although it is bounded, in fact), which gives better performance in cases when it is united with another unboundedPCollection.- Parameters:
pipeline- pipeline to createPCollectioninattrs- attributes to read updates forstartStamp- minimal update timestamp (inclusive)endStamp- maximal update timestamp (exclusive)limit- limit number of elements read. Note that the number of elements might be actually lower, because it is divided by number of partitions It is useful mostly for testing purposes- Returns:
PCollectionrepresenting the commit log
-
createBatch
org.apache.beam.sdk.values.PCollection<StreamElement> createBatch(org.apache.beam.sdk.Pipeline pipeline, java.util.List<AttributeDescriptor<?>> attrs, long startStamp, long endStamp)
CreatePCollectionfor given attribute family's batch updates storage.- Parameters:
pipeline- pipeline to createPCollectioninattrs- attributes to readstartStamp- minimal update timestamp (inclusive)endStamp- maximal update timestamp (exclusive)- Returns:
PCollectionrepresenting the batch updates
-
-