A comprehensive collection of Apache Flink examples using PyFlink for stream and batch processing. This repository provides practical, working examples for learning and building real-time data processing applications.
Features
- Stream Processing: Real-time data processing examples
- Batch Processing: Batch ETL and analytics examples
- Table API: SQL and Table API usage patterns
- Connectors: Kafka, filesystem, and database integrations
- State Management: Stateful processing patterns
Examples Included
- Word count (streaming and batch)
- Kafka consumer/producer
- Window aggregations
- State backend configuration
- Custom serialization
- CEP (Complex Event Processing)
Getting Started
# Clone the repository
git clone https://github.com/wdm0006/flink-python-examples
# Install PyFlink
pip install apache-flink
# Run an example
python examples/word_count_streaming.py