Monday, October 26, 2020

Docker and Apache Flink available in Dataproc’s Component Exchange

 

Google Cloud announced that Docker and Apache Flink are now available as optional components in Dataproc. The Docker availability means you will now be able to run daemons on your Dataproc cluster nodes allowing you to interact with Hadoop clusters via your containerized applications.

The Docker component also uses Google Container Registry, in addition to the default Docker registry. 

Apache Flink

Apache Beam and Apache Flink are two of the best streaming technologies today. Apache Flink is a distributed processing engine using stateful computation, while Apache Beam is a unified model for defining batch and steaming processing pipelines. Using Apache Flink as an execution engine, you can also run Apache Beam jobs on Dataproc, in addition to Google’s Cloud Dataflow service.

References

 See Also

Featured Article

Detect, diagnose, and prevent performance problems using Cloud SQL Insights

Cloud SQL Insights, a database observability tool for your Cloud SQL database has just arrived in GCP ...

Popular this Week