Data Processing¶
Docs¶
- Quick start and training
Videos¶
- Running a Data Replication Pipeline on Kubernetes with Argo and Singer.io
- Scaling Kubernetes: Best Practices for Managing Large-Scale Batch Jobs with Spark and Argo Workflow
Books¶
- Distributed Machine Learning Patterns (see Chapter 2 on data processing/ingestion patterns)