One blog about data engineering

Architecting Compliance: Cost-Effective Data Strategies for GDPR

Implementing GDPR Compliance in Data Lakes: A Case Study from Adevinta Spain Abstract Compliance with the General Data Protection Regulation (GDPR) is not only a technical challenge but also carries significant financial implications if breached. At Adevinta Spain, we have structured our data lake architecture in a way that not only meets GDPR requirements but also reduces costs associated with data protection…

2 de February de 2024
Running Spark on a Multi-node Local Kubernetes Cluster: A Step-by-Step Guide for Data Engineers

Introduction Embarking on a journey to run Apache Spark on a local Kubernetes cluster? This comprehensive guide is crafted for data engineers who seek to leverage the power of Spark within a Kubernetes environment. Whether you’re a beginner or an adept engineer, these steps will ensure a smooth and efficient setup. To complement this article,…

28 de January de 2024

Architecting Compliance: Cost-Effective Data Strategies for GDPR