Actions: NVIDIA/TransformerEngine
Actions
2,500+ workflow runs
2,500+ workflow runs
CrossEntropyFunction.forward() modifies input in-place without ctx.mark_dirty(), causing GPU memory leak
TE-CI Trigger
#9438:
Issue comment #2899 (comment)
created
by
shen-jiabin