Skip to content

Keep max_seqlen and cu_seqlens_argmin for later micro-batches when PP>1 #28005

Keep max_seqlen and cu_seqlens_argmin for later micro-batches when PP>1

Keep max_seqlen and cu_seqlens_argmin for later micro-batches when PP>1 #28005