pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2026-01-15 12:15:51 +00:00

Files

Sherlock Huang 28ee8be5bf [NativeRT] Apply Device placement once when loading the graph (#158996)

Summary:
Placement is leaked to too many classes!

In this diff, we consolidate all placement lookup into one place: Graph::ApplyDevicePlacement.

After placement is applied, the in-memory graph, tensorMeta, and weightMeta already hold the re-mapped device.
Subsequent weight loading, sample input loading, and target device inference look up the re-mapped device from the graph's tensorMeta.

graph's tensorMeta becomes the only ground truth!
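
A minimal sketch of the idea described above, not the actual NativeRT code. Only the name Graph::ApplyDevicePlacement comes from this commit message. The Device alias, the Placement map, the TensorMeta struct, the add/lookup helpers, and targetDeviceForWeight are hypothetical stand-ins used for illustration. The sketch assumes placement is a simple "original device to target device" map that is applied once at load time, after which every consumer reads the device from the graph's metadata only.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical stand-ins; the real NativeRT types are not shown in this commit message.
using Device = std::string;                            // e.g. "cpu", "cuda:0"
using Placement = std::unordered_map<Device, Device>;  // original device -> target device

struct TensorMeta {
  Device device;
};

class Graph {
 public:
  void addTensorMeta(const std::string& name, Device device) {
    tensorMeta_[name] = TensorMeta{std::move(device)};
  }

  void addWeightMeta(const std::string& name, Device device) {
    weightMeta_[name] = TensorMeta{std::move(device)};
  }

  // Remap every recorded device exactly once, right after the graph is loaded.
  // Devices that do not appear in the placement map are left unchanged.
  void ApplyDevicePlacement(const Placement& placement) {
    assert(!placementApplied_ && "placement is expected to be applied once");
    for (auto& entry : tensorMeta_) {
      remap(entry.second, placement);
    }
    for (auto& entry : weightMeta_) {
      remap(entry.second, placement);
    }
    placementApplied_ = true;
  }

  const Device& tensorDevice(const std::string& name) const {
    return tensorMeta_.at(name).device;
  }

  const Device& weightDevice(const std::string& name) const {
    return weightMeta_.at(name).device;
  }

 private:
  static void remap(TensorMeta& meta, const Placement& placement) {
    auto it = placement.find(meta.device);
    if (it != placement.end()) {
      meta.device = it->second;
    }
  }

  std::unordered_map<std::string, TensorMeta> tensorMeta_;
  std::unordered_map<std::string, TensorMeta> weightMeta_;
  bool placementApplied_ = false;
};

// A downstream consumer (here: weight loading) no longer takes a placement
// argument; it only asks the graph which device the weight lives on.
Device targetDeviceForWeight(const Graph& graph, const std::string& weightName) {
  return graph.weightDevice(weightName);
}
```

In this sketch the downstream helper has no access to the placement map at all, which is the point of the consolidation: a caller cannot accidentally re-apply or skip the mapping.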

Test Plan:
Need to add some tests before landing.
This is a big change.

Rollback Plan:

Differential Revision: D78841818

Pull Request resolved: https://github.com/pytorch/pytorch/pull/158996
Approved by: https://github.com/henryoier

2025-07-25 20:11:35 +00:00

aoti_abi_check

Revert "Move some of vec into headeronly in preparation for Half.h (#158976)"

2025-07-24 22:31:49 +00:00

aoti_inference

[AOTI] Convert C-struct zip handling to RAII container (#158687)

2025-07-22 16:01:51 +00:00

api

[BE][3/6] fix typos in test/ (#157637)

2025-07-17 12:08:33 +00:00

c10d

[cca] [c10d] Refactor CUDAEventCache into separate files (#158616)