Files
pytorch/caffe2/python/models
Aapo Kyrola 63297e1a1f RunNetOnce->RunNet (removes rnn_executor overhead)
Summary:
seq2seq/translate.py was running much slower on RNNExecutor. This was because RNNExecutor has significant init overhead (I have another diff to reduce, but not completely eliminate it), and translate was calling the decoder with RunNetOnce -- thus always recreating the net and its ops. Changing this to RunNet() makes translate run faster than without the executor. RunNet takes the net name and reuses the already created net, while RunNetOnce passes the whole protobuf each call.

Noticed a similar bug in the seq2seq ensemble beam model, which also calls CreateNet() but then uses RunNetOnce() instead of RunNet().

Reviewed By: jhcross

Differential Revision: D6156566

fbshipit-source-id: a933453e36a0d8fd163d0584186fda427a680687
2017-10-25 22:06:02 -07:00