For session 6 of the EleutherAI ML Scalability & Performance reading group, I gave a presentation covering Zero Bubble Pipeline Parallelism. I also covered two key pieces of prior work to explain the limitations of earlier approaches and put the innovations of Zero Bubble PP in context.

My annotated versions of these papers can be found on my GitHub here.

Papers:

  1. GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism
  2. PipeDream: Fast and Efficient Pipeline Parallel DNN Training
  3. Zero Bubble Pipeline Parallelism

Recording:

ML Scalability & Performance Reading Group Session 6: Zero Bubble Pipeline Parallelism