Course Overview
During this two-day instructor-led training course, participants will learn development and operations for Cloudera Streaming Analytics, a framework for low-latency processing and analytics powered by Apache Flink and Cloudera's innovative SQL Stream Builder.
Through extensive hands-on exercises, students will gain experience deploying and managing a Flink cluster, developing and running Flink applications, and using SQL Stream Builder's continuous SQL to perform analytics on streaming data.
Course Objectives
During this course, you learn how to:
- Deploy a Flink cluster using Cloudera Manager
- Develop Flink batch and streaming applications
- Run and view Flink jobs
- Transform data streams
- Use watermarks and windows to analyze streaming data
- Analyze data with Cloudera SQL Stream Builder
- Monitor Flink application metrics
Course Content
Overview
- Introduction to Apache Flink and Stream Processing
- Typical Use Cases
- Related Products
Basic Architecture
- Logical
- Physical
- Parallelism
- Fault Tolerance
- Data Storage
Service Deployment
- Planning Requirements
- Installation
- Flink Dashboard
- Exercise: Running a Flink Program
Flink Basics
- Execution Environment
- Flink Application Structure
- Create a Flink Project
- Build a Flink Program
- Exercise: Building a Simple Flink Program
DataStream API
- Data Types and Serialization
- Sources and Sinks
- Data Pipelines and ETL
- Transformations
- Exercise: Batch Processing Using Flink
- Exercise: Creating a Flink Streaming Application
- Using Kafka as a Source and Sink
- Exercise: Creating a Streaming Application Using a Kafka Source
Flink SQL and Table API
- Streaming Concepts
- Programming Options
- Integrations
- Exercise: Using Flink SQL and Kafka
Stateful Stream Processing
- Connected Streams
- Streaming Analytics
- Event Time Processing
- Watermarks
- Windows
- Exercise: Tumbling Windows with Event Time
Cloudera SQL Stream Builder
- Overview
- SQL Stream Builder Console
- Analytics and Stream Processing
- Exercise: Creating SQL Stream Jobs
Monitoring
- Flink Metrics
- Checkpointing
- Backpressure
- Log Files
- Exercise: Monitoring and Checkpointing