Kubernetes with hardware device topology awareness at node level
- Mentors
- Lei Zhang, Jian He, Kai Zhang
- Organization
- Cloud Native Computing Foundation (CNCF)
We propose an improvement to the current Kubernetes Topology Manager that makes it aware of generic hardware device topology at the node level. With this awareness, Deep Learning training can be improved significantly by taking advantage of the data interconnects between NVIDIA GPU devices on the same node.
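To illustrate what device topology awareness could mean in practice, here is a minimal sketch of choosing GPUs by interconnect quality. It is not the Topology Manager's API or the proposed design: the `LINK_SCORE` table, its values, and the `link_score` and `pick_gpus` helpers are all hypothetical and exist only for illustration.

```python
from itertools import combinations

# Hypothetical pairwise link scores between four GPUs on one node.
# Illustrative values only: 2 = direct high-bandwidth link,
# 1 = shared PCIe switch, 0 = traffic crosses CPU sockets.
LINK_SCORE = {
    (0, 1): 2, (0, 2): 1, (0, 3): 0,
    (1, 2): 0, (1, 3): 1,
    (2, 3): 2,
}


def link_score(a, b):
    """Look up the score for an unordered GPU pair."""
    return LINK_SCORE[(min(a, b), max(a, b))]


def pick_gpus(free_gpus, count):
    """Return the `count` free GPUs with the highest total pairwise score.

    Returns None when fewer than `count` GPUs are free.
    """
    best, best_score = None, -1
    for combo in combinations(sorted(free_gpus), count):
        # Sum the link scores over every pair inside the candidate set.
        score = sum(link_score(a, b) for a, b in combinations(combo, 2))
        if score > best_score:
            best, best_score = combo, score
    return best
```

With these illustrative scores, `pick_gpus([0, 1, 2, 3], 2)` returns `(0, 1)`, the first pair with a direct link, whereas a topology-unaware allocator could just as easily hand out GPUs 0 and 3, which have the slowest path between them. The exhaustive search is fine for the handful of devices on a single node; a real allocator would also need to account for CPU and NIC locality.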