Deep Learning for Network Engineers

After checking these links (p1, p2), I decided to read the book.

The first part is mainly maths about deep learning. I coudn’t follow much but at least try to get the big picture.

The second one is more about networking concepts and I had big expectations. There was nothing really special, a refresh of ECN (egress port, inform receiver to notify sender to slow down vs PFC (ingress port, pause traffic flow). ECN needs to kick in before PFC: xON < WRED Min < WRED Max < xOFF. The use of both is called as DCQCN. This is a reference from the book for Cisco. There are references to the alternatives of ECMP as it is a main problem for elephant flows generated by GPUs. But it is mainly vendor features than standards (advance routing and packet spraying). It mentions NCCL and NVLink that is something I wasn’t really clear and at least I can see a bit the point.

I guess, I will need to give a second read at some point to refresh the theoretical/maths part of AI.