pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2026-01-15 12:15:51 +00:00

Files

Aapo Kyrola 0af0cba2b7 Refactor data_parallel_model initial sync and checkpointing

Summary:
Major improvements. Before we only synced "params" and "computed params" of model after initialization and after loading a checkpoint. But actually we want to sync all blobs that are generated in the param_init_net. For example the _momentum blobs were missed by the previous implementation and had to be manually included in checkpoint finalization.

I also added GetCheckpointParams() to data_parallel_model because it is now fully general. Also added a unit test.

Reviewed By: andrewwdye

Differential Revision: D5093689

fbshipit-source-id: 8154ded0c73cd6a0f54ee024dc5f2c6826ed7e42

2017-05-19 12:48:06 -07:00

char_rnn.py

rnn with brew

2017-05-16 13:33:44 -07:00

lmdb_create_example.py

doxygen python block added

2017-03-29 06:46:16 -07:00

resnet50_trainer.py

Refactor data_parallel_model initial sync and checkpointing

2017-05-19 12:48:06 -07:00