**Summary:** Currently, whenever we subtract two partial DTensors, we redistribute because the linearity of `aten.sub.Tensor` is -1. This redistribution is unnecessary and can be avoided in the same way as for its `add` counterpart. This PR moves the op to `linear_ops` and ensures that subtracting a scalar from a partial DTensor still redistributes.

**Test Cases:**
1. `pytest test/distributed/tensor/test_pointwise_ops.py -k test_add_sub_scalar_norm_partial`
2. `pytest test/distributed/tensor/test_pointwise_ops.py -k test_add_sub_scalar_partial`

Pull Request resolved: https://github.com/pytorch/pytorch/pull/170040
Approved by: https://github.com/wconstab
ghstack dependencies: #170030, #170035
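A minimal sketch of the linearity argument, in plain Python with hypothetical per-rank values (no `torch.distributed`, not the DTensor implementation itself): reducing the per-rank differences of two Partial(sum) placements gives the same result as reducing each operand first, so `a - b` can stay partial, whereas subtracting a scalar on every rank would subtract it `world_size` times, which is why the scalar case must still redistribute.

```python
# Hypothetical per-rank shards of two Partial(sum) DTensors `a` and `b`.
rank_a = [1.0, 2.0, 3.0]
rank_b = [4.0, 5.0, 6.0]

# Linear: reduce(a_i - b_i) == reduce(a_i) - reduce(b_i),
# so tensor-tensor sub can keep the Partial placement.
assert sum(a - b for a, b in zip(rank_a, rank_b)) == sum(rank_a) - sum(rank_b)

# Not linear for a scalar: reducing (a_i - c) subtracts c once per rank,
# over-subtracting by (world_size - 1) * c, hence the redistribution.
c = 2.0
assert sum(a - c for a in rank_a) != sum(rank_a) - c
```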