Summary: For some reason I had been disabling the cuDNN exhaustive-search heuristic for the xray/resnet trainers. Re-enabling it gives a ~10% perf boost on BigBasin, and maybe 5% on BigSur.
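For context, cuDNN's exhaustive search benchmarks every available convolution algorithm on the actual input shapes and caches the fastest one, rather than guessing from a heuristic. The sketch below illustrates that idea in pure Python with hypothetical candidate functions; it is not cuDNN's API, just the benchmark-and-pick pattern.

```python
import time

def pick_fastest(candidates, *args, trials=3):
    """Benchmark each candidate and return the fastest one.

    Mirrors the spirit of cuDNN exhaustive search: run every
    implementation a few times, time it, keep the winner.
    """
    best_algo, best_time = None, float("inf")
    for algo in candidates:
        start = time.perf_counter()
        for _ in range(trials):
            algo(*args)
        elapsed = (time.perf_counter() - start) / trials
        if elapsed < best_time:
            best_algo, best_time = algo, elapsed
    return best_algo

# Hypothetical "algorithms": two ways to sum a list.
algos = [sum, lambda xs: sum(x for x in xs)]
fastest = pick_fastest(algos, list(range(1000)))
```

The one-time benchmarking cost is amortized over many training iterations, which is why enabling it pays off for long-running trainers.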
Reviewed By: prigoyal
Differential Revision: D4338654
fbshipit-source-id: 3974dd612f5d4f4dc8b2febccb59664d3f276c3e
Summary:
Doing the parameter update inside MomentumSGD, instead of with a separate WeightedSum op, gives a significant perf boost.
To preserve backwards compatibility, the fused version is a separate op.
Also added a unit test.
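A minimal pure-Python sketch of the difference between the two paths (function names are illustrative, not the exact Caffe2 op signatures): the unfused path computes the adjusted gradient in one op and applies it with a separate WeightedSum pass, while the fused path computes and applies it in a single sweep over the parameters.

```python
def momentum_sgd(grad, m, lr, momentum=0.9):
    # Compute only the adjusted gradient (the adjusted gradient also
    # becomes the new momentum buffer; buffer bookkeeping elided here).
    return [momentum * mi + lr * gi for mi, gi in zip(m, grad)]

def weighted_sum(param, adj):
    # Separate op: param <- 1.0 * param + (-1.0) * adj
    return [p - a for p, a in zip(param, adj)]

def momentum_sgd_update(param, grad, m, lr, momentum=0.9):
    # Fused op: compute the adjusted gradient AND update the parameter
    # in one pass, avoiding a second sweep over parameter memory.
    adj = [momentum * mi + lr * gi for mi, gi in zip(m, grad)]
    return [p - a for p, a in zip(param, adj)], adj

param, grad, m, lr = [1.0, 2.0], [0.1, 0.2], [0.0, 0.0], 0.5

# Unfused: two ops.
p_unfused = weighted_sum(param, momentum_sgd(grad, m, lr))

# Fused: one op, same math.
p_fused, _ = momentum_sgd_update(param, grad, m, lr)
assert p_unfused == p_fused
```

The math is identical either way; the win is purely from touching each parameter once instead of twice.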
Reviewed By: prigoyal
Differential Revision: D4262446
fbshipit-source-id: 38e7ee6d7677b398658ac7fe9b7a59b569e033f4
Summary:
This example writes an LMDB database of (random) image data and labels, then reads it back with Caffe2's TensorProtosDBInput and validates that the checksums match. It shows how to coerce image data into TensorProtos and be happy.
Before this, there was no clear example of how to create databases for Caffe2.
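The write-then-validate pattern can be sketched without the LMDB and Caffe2 dependencies: below, a plain dict stands in for the LMDB database and pickle stands in for TensorProtos serialization (the real example uses lmdb and caffe2.proto.caffe2_pb2), but the checksum round trip is the same idea.

```python
import hashlib
import os
import pickle

db = {}  # stand-in for the LMDB environment

def write_entry(db, key, image_bytes, label):
    # Serialize the (image, label) pair, as the example packs TensorProtos.
    db[key] = pickle.dumps({"image": image_bytes, "label": label})

def checksum(data: bytes) -> str:
    return hashlib.md5(data).hexdigest()

# Write random "images" with labels, remembering their checksums.
expected = {}
for i in range(4):
    key = f"img_{i:08d}"
    img = os.urandom(64)  # fake random image data
    write_entry(db, key, img, label=i % 2)
    expected[key] = checksum(img)

# Read back and validate, mirroring the TensorProtosDBInput round trip.
for key, blob in db.items():
    entry = pickle.loads(blob)
    assert checksum(entry["image"]) == expected[key]
```

Checksumming both sides of the round trip is what gives confidence that the serialization into the database was lossless.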
Differential Revision: D4263614
fbshipit-source-id: 21e08066899095b4efcc2d23dbc3ede81e75914a
Summary:
When refactoring data parallel model, the division of LR by the number of devices was dropped, so we ended up effectively multiplying gradients by the number of devices. We therefore need to scale the LR by 1/numgpus.
Created a test to confirm that data_parallel_model produces exactly the same results on different numbers of GPUs, given the same total batch size.
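The fix can be checked with a toy example: when each device averages its own shard and the all-reduce SUMS the per-device gradients (with no division), scaling the LR by 1/num_gpus reproduces the single-device update for the same total batch. A pure-Python sketch (names are illustrative, not the data_parallel_model API):

```python
def sgd_step(param, grads, lr):
    # Single device: average the gradient over the whole (total) batch.
    return param - lr * sum(grads) / len(grads)

def data_parallel_step(param, grads, lr, num_gpus):
    # Split the batch across devices; each device averages its shard,
    # then the all-reduce SUMS the per-device averages (no division).
    shard = len(grads) // num_gpus
    per_device = [
        sum(grads[i * shard:(i + 1) * shard]) / shard for i in range(num_gpus)
    ]
    summed = sum(per_device)                 # = num_gpus * mean(grads)
    return param - (lr / num_gpus) * summed  # hence LR scaled by 1/num_gpus

grads = [0.1, 0.3, 0.2, 0.4, 0.5, 0.1, 0.2, 0.2]
single = sgd_step(1.0, grads, lr=0.1)
parallel = data_parallel_step(1.0, grads, lr=0.1, num_gpus=4)
assert abs(single - parallel) < 1e-12
```

Without the 1/num_gpus factor, the parallel update would be num_gpus times larger than the single-device one, which is exactly the bug the refactoring introduced.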
Reviewed By: prigoyal
Differential Revision: D4248907
fbshipit-source-id: af21ede113e6ac25f12c556de298cb18974548be
Summary: Just noticed that I had duplicate code in the example imagenet trainer. Removed the function.
Differential Revision: D4223070
fbshipit-source-id: 443a9401bf7e425f7a3a13a44c9d0f7e21e72303