OpenAI Releases New Network Protocol for AI Training Clusters
OpenAI releases MRC (Multipath Reliable Connection), an open networking protocol for AI training supercomputers, improving GPU cluster resilience via multipath routing during long training runs. #AI #OpenAI