Flink Mobile Data Usage

This is the code for a talk @joemoe and I gave at Flink Forward Global 2021: Introduction to Flink in 30 minutes.

Getting Started

Import this repo into an IDE, such as IntelliJ, as a Maven project.

KafkaProducerJob

Before running any of these application, first use docker-compose to start up a Kafka cluster:

docker-compose up -d

Running KafkaProducerJob in your IDE will populate an input topic with UsageRecords.

To run the jobs in this project in IntelliJ, choose the option "Include dependencies with Provided scope" in your Run Configuration.

After having run the KafkaProducerJob you can run any of the other applications in this project (all of which expect to read from Kafka).

To watch what's happening in Kafka (using a local installation of Kafka, or from inside the container):

./kafka-console-consumer.sh --bootstrap-server localhost:9092 --topic input --property print.timestamp=true

UsageAlertingProcessFunctionJob

While this job is running in the IDE you can use Flink's web UI at http://localhost:8081.

Setting up a Flink cluster

curl -O https://dlcdn.apache.org/flink/flink-1.14.3/flink-1.14.3-bin-scala_2.12.tgz
tar -xzf flink-*.tgz
cd flink-1.14.3
ls -la bin/
./bin/start-cluster.sh

Then browse to http://localhost:8081.

To see the cluster run the streaming WordCount example:

./bin/flink run examples/streaming/WordCount.jar
tail log/flink-*-taskexecutor-*.out

Running this project in a cluster

If you have already started a cluster, stop it and reconfigure Flink to have more resources by modifing flink-1.14.3/conf/flink-conf.yaml so that

taskmanager.memory.process.size: 3072m
taskmanager.numberOfTaskSlots: 8

Then with flink-1.14.3/bin in your PATH:

mvn clean package
start-cluster.sh
flink run -d target/flink-mobile-data-usage-1.0.jar
flink run -d -c com.ververica.flink.example.datausage.UsageAlertingProcessFunctionJob \
  target/flink-mobile-data-usage-1.0.jar --webui false

How to create a new maven project for your own Flink application

curl https://flink.apache.org/q/quickstart.sh | bash -s 1.14.3

See Project Configuration in the Flink docs for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
src/main		src/main
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yaml		docker-compose.yaml
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Flink Mobile Data Usage

Getting Started

KafkaProducerJob

UsageAlertingProcessFunctionJob

Setting up a Flink cluster

Running this project in a cluster

How to create a new maven project for your own Flink application

About

Releases

Packages

Contributors 2

Languages

License

alpinegizmo/flink-mobile-data-usage

Folders and files

Latest commit

History

Repository files navigation

Flink Mobile Data Usage

Getting Started

KafkaProducerJob

UsageAlertingProcessFunctionJob

Setting up a Flink cluster

Running this project in a cluster

How to create a new maven project for your own Flink application

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages