UCSD Data Science offers this course. This is where I collect resources that I can either refer to or I read and studied from. Most resources are pulled from WI24 offering.
OS
Cloud Computing
cloud computing stack: SaaS, PaaS, and IaaS
Network
Collective Communication
Parallelism
Batch and Stream Processing
- Resilient Distributed Datasets
- Discretized Streams: An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters
- Ray
- Spark SQL