Package cz.o2.proxima.beam.io.pubsub
Class PubSubDataAccessor
- java.lang.Object
-
- cz.o2.proxima.beam.io.pubsub.PubSubDataAccessor
-
- All Implemented Interfaces:
DataAccessor
,AbstractDataAccessor
,java.io.Serializable
public class PubSubDataAccessor extends java.lang.Object implements DataAccessor
ADataAccessor
for PubSub.- See Also:
- Serialized Form
-
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description org.apache.beam.sdk.values.PCollection<StreamElement>
createBatch(org.apache.beam.sdk.Pipeline pipeline, java.util.List<AttributeDescriptor<?>> attrs, long startStamp, long endStamp)
CreatePCollection
for given attribute family's batch updates storage.org.apache.beam.sdk.values.PCollection<StreamElement>
createStream(java.lang.String name, org.apache.beam.sdk.Pipeline pipeline, Position position, boolean stopAtCurrent, boolean eventTime, long limit)
CreatePCollection
for given attribute family's commit log.org.apache.beam.sdk.values.PCollection<StreamElement>
createStreamFromUpdates(org.apache.beam.sdk.Pipeline pipeline, java.util.List<AttributeDescriptor<?>> attributes, long startStamp, long endStamp, long limit)
CreatePCollection
for given attribute family's batchUpdates.-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface cz.o2.proxima.core.storage.internal.AbstractDataAccessor
getUri
-
-
-
-
Method Detail
-
createStream
public org.apache.beam.sdk.values.PCollection<StreamElement> createStream(java.lang.String name, org.apache.beam.sdk.Pipeline pipeline, Position position, boolean stopAtCurrent, boolean eventTime, long limit)
Description copied from interface:DataAccessor
CreatePCollection
for given attribute family's commit log.- Specified by:
createStream
in interfaceDataAccessor
- Parameters:
name
- name of the consumerpipeline
- pipeline to createPCollection
inposition
- to read fromstopAtCurrent
- stop reading at current dataeventTime
-true
to use event timelimit
- limit number of elements read. Note that the number of elements might be actually lower, because it is divided by number of partitions It is useful mostly for testing purposes- Returns:
PCollection
representing the commit log
-
createBatch
public org.apache.beam.sdk.values.PCollection<StreamElement> createBatch(org.apache.beam.sdk.Pipeline pipeline, java.util.List<AttributeDescriptor<?>> attrs, long startStamp, long endStamp)
Description copied from interface:DataAccessor
CreatePCollection
for given attribute family's batch updates storage.- Specified by:
createBatch
in interfaceDataAccessor
- Parameters:
pipeline
- pipeline to createPCollection
inattrs
- attributes to readstartStamp
- minimal update timestamp (inclusive)endStamp
- maximal update timestamp (exclusive)- Returns:
PCollection
representing the batch updates
-
createStreamFromUpdates
public org.apache.beam.sdk.values.PCollection<StreamElement> createStreamFromUpdates(org.apache.beam.sdk.Pipeline pipeline, java.util.List<AttributeDescriptor<?>> attributes, long startStamp, long endStamp, long limit)
Description copied from interface:DataAccessor
CreatePCollection
for given attribute family's batchUpdates. The created PCollection is purposefully treated as unbounded (although it is bounded, in fact), which gives better performance in cases when it is united with another unboundedPCollection
.- Specified by:
createStreamFromUpdates
in interfaceDataAccessor
- Parameters:
pipeline
- pipeline to createPCollection
inattributes
- attributes to read updates forstartStamp
- minimal update timestamp (inclusive)endStamp
- maximal update timestamp (exclusive)limit
- limit number of elements read. Note that the number of elements might be actually lower, because it is divided by number of partitions It is useful mostly for testing purposes- Returns:
PCollection
representing the commit log
-
-