One blog about data engineering
-
Architecting Compliance: Cost-Effective Data Strategies for GDPR
Implementing GDPR Compliance in Data Lakes: A Case Study from Adevinta Spain Abstract Compliance with the General Data Protection Regulation (GDPR) is not only a technical challenge but also carries significant financial implications if breached. At Adevinta Spain, we have structured our data lake architecture in a way that not only meets GDPR requirements but also reduces costs associated with data protection…
-
Running Spark on a Multi-node Local Kubernetes Cluster: A Step-by-Step Guide for Data Engineers
Introduction Embarking on a journey to run Apache Spark on a local Kubernetes cluster? This comprehensive guide is crafted for data engineers who seek to leverage the power of Spark within a Kubernetes environment. Whether you’re a beginner or an adept engineer, these steps will ensure a smooth and efficient setup. To complement this article,…