Big data pipelines are notoriously hard to build. Mixing real-time data with batch data makes things even messier. Spark 2.0 offers new take on all-inclusive approach of solving this problem using Structured Streams. In this session we explore new features of Spark 2.0 and evaluate elegance of the new approach to streaming and how it achieves exactly once semantics of event processing.
Code:
BRK3184
Room:
A402 - A403