Blog posts tagged
"big_data"

Rob Gibbon
15 October 2024

Apache Spark 4.0 beta release – try it now

Data Platform Ubuntu tech blog

Apache Spark is a popular framework for developing distributed, parallel data processing applications. Our solution for Apache Spark on Kubernetes has made significant progress in the past year since we launched, adding support for Apache Iceberg, a new GPU accelerated image using the NVIDIA Spark-RAPIDS plugin, and support for the Volcan ...

Rob Gibbon
15 July 2024

Deploying and scaling Apache Spark on Amazon AWS EKS

Data Platform Ubuntu tech blog

Move over Hadoop, it’s time for Spark on Kubernetes Apache Spark, a framework for parallel distributed data processing, has become a popular choice for building streaming applications, data lake houses and big data extract-transform-load data processing (ETL). It is horizontally scalable, fault-tolerant, and performs well at high scale. H ...

Quick links

Quick links

Quick links

Quick links

Quick links

Quick links

Quick links

Quick links

Quick links

Categories

Industries

Partner programs

Quick links

Roles by department

Working here

Explore Canonical

Latest updates

Company highlights ›

Blog posts tagged
"big_data"

Apache Spark 4.0 beta release – try it now

Deploying and scaling Apache Spark on Amazon AWS EKS

Blog posts tagged "big_data"

Apache Spark 4.0 beta release – try it now

Deploying and scaling Apache Spark on Amazon AWS EKS

Blog posts tagged
"big_data"