mirror of
https://github.com/zebrajr/pytorch.git
synced 2026-01-15 12:15:51 +00:00
Fixes #100701 It seems like we don't call `multi_tensor_apply_kernel` at all if the input tensor lists are small and their last tensors are zero-size as per e.g.ca9f55f79d/aten/src/ATen/native/cuda/MultiTensorApply.cuh (L100-L102)which was introduced in05943712a4. This PR special cases the last zero-size tensors so that we won't be negligent. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100811 Approved by: https://github.com/ngimel