Google brings Cloud Dataproc to Kubernetes
https://techcrunch.com/2019/09/10/google-brings-cloud-dataproc-to-kubernetes/
Cloud Dataproc is probably one of the lesser-known products in Google Cloud’s portfolio, but it’s a powerful tool for data wranglers who are looking for a fully managed cloud service that lets them run Apache Spark and Hadoop clusters without having to worry about managing the underlying infrastructure. Today. Google announced that it is launching the alpha of Cloud Dataproc to Kubernetes — and while that, too, may not sound all that interesting at first, it’s an important step for Google Cloud as it works to adapt more of its products to a hybrid cloud model.
The general idea here is to give enterprise customers (and make no mistake, enterprise customers are the main focus of Google Cloud these days) the ability to run Apache Spark jobs on Google Kubernetes Engine (GKE) clusters. With products like Anthos now making GKE available virtually anywhere, this means customers can now also take Cloud Dataproc to their own data centers. Right now, the service only supports Apache Spark, but Google plans to support other open-source projects, too.