Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35763
Adds a shape/type inference function and a test for ScatterAssign
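A minimal sketch of how the new inference could be exercised from Python, assuming the standard `workspace.InferShapesAndTypes` helper; blob names and shapes are illustrative:
```
# Minimal sketch: ScatterAssign updates rows of DATA in place, so the
# inferred output should match DATA's shape and type.
import numpy as np
from caffe2.python import core, workspace

workspace.FeedBlob("data", np.zeros((10, 4), dtype=np.float32))
workspace.FeedBlob("indices", np.array([0, 3], dtype=np.int64))
workspace.FeedBlob("slices", np.ones((2, 4), dtype=np.float32))

net = core.Net("scatter_assign_example")
net.ScatterAssign(["data", "indices", "slices"], ["data"])  # in-place op

shapes, types = workspace.InferShapesAndTypes([net])
print(shapes["data"])  # expected: [10, 4]
```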
Test Plan: Updated unit test
Reviewed By: yyetim, shunting1986
Differential Revision: D20501079
fbshipit-source-id: 7ec6ef0127a151250dd699c90c2b80c35cfb1fe4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35857
This fixes shape and type inference in InferBlobShapesAndTypes for a number of common ops, and adds support for testing the inferred shapes and types of gradient ops (a test sketch follows the ops list below).
Ops:
* Concat
* Split
* LeakyReLU
* Relu
* Prelu
* Gelu
* Elu
* Sinh, Tanh, Cosh
* Abs
* ... and a number of other simple element-wise ops
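A hedged sketch of what such a gradient shape/type check can look like, assuming the `ensure_outputs_are_inferred` flag in `hypothesis_test_util` (LeakyRelu chosen as an example):
```
import numpy as np
from caffe2.python import core
import caffe2.python.hypothesis_test_util as hu

class TestLeakyReluShapeInference(hu.HypothesisTestCase):
    def test_leaky_relu_grad_inference(self):
        X = np.random.randn(4, 8).astype(np.float32)
        op = core.CreateOperator("LeakyRelu", ["X"], ["Y"], alpha=0.1)
        # Also checks that the gradient op's output shapes/types are inferred.
        self.assertGradientChecks(
            hu.cpu_do, op, [X], 0, [0],
            ensure_outputs_are_inferred=True,
        )
```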
Test Plan:
Added support to hypothesis test to check the shape and type of gradient ops.
Enabled it for all the ops I fixed the shape and type inference for.
buck test caffe2/caffe2/python/operator_test:
Reviewed By: pradeepd24
Differential Revision: D20806284
fbshipit-source-id: 77f796d9ff208e09e871bdbadf9a0a7c196b77f2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35507
We want to split the SparseLengthsSumSparse op into an indirection op and the SparseLengthsSum op so that we can lower the latter part. The indirection part is a plain implementation for now.
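A conceptual numpy sketch of the split, under assumed semantics (names are illustrative): the indirection step remaps indices through the compression mapping, after which a plain SparseLengthsSum runs on the compressed table.
```
import numpy as np

def sls_rowwise_sparse(compressed_table, compressed_mapping, indices, lengths):
    # Stage 1 (indirection): map original row ids to compressed row ids;
    # a negative mapping entry marks a pruned row.
    remapped = compressed_mapping[indices]
    # Stage 2: plain SparseLengthsSum over the compressed table.
    out = np.zeros((len(lengths), compressed_table.shape[1]),
                   dtype=compressed_table.dtype)
    offset = 0
    for i, n in enumerate(lengths):
        for idx in remapped[offset:offset + n]:
            if idx >= 0:
                out[i] += compressed_table[idx]
        offset += n
    return out
```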
Test Plan:
```
for i in `seq 10`; do buck test caffe2/caffe2/python/operator_test:lengths_reducer_fused_nbit_rowwise_ops_test -- test_sparse_lengths_sum_rowwise_sparse; done
```
Reviewed By: jspark1105
Differential Revision: D20683478
fbshipit-source-id: 509effe88719d20aa0c4783bbe0ce1f183ee473c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35430
This fixes and adds tests for several commonly used operators.
There are some formatting differences due to running clang-format on one of the files.
Test Plan: buck test //caffe2/caffe2/fb/operators:hypothesis_test //caffe2/caffe2/python/operator_test:utility_ops_test //caffe2/caffe2/python/operator_test:concat_split_op_test
Reviewed By: yyetim
Differential Revision: D20657405
fbshipit-source-id: 51d86d0834003b8ac8d6acb5149ae13d7bbfc6ab
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35346
The weight scale op doesn't have a GPU implementation, which is breaking OSS CI after D20506032. Make the op CPU-only.
Test Plan: OSS CI
Reviewed By: ustctf
Differential Revision: D20637440
fbshipit-source-id: 9aa6cce63ce637ab7856788e5d02f527decb2a26
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34903
Reattempt of D20461609
Moving 2/4-bit SLS and row-wise 2/4-bit conversion operator to open source to be used by DLRM
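A hedged usage sketch of the open-sourced ops, assuming the operator names as they appear in the OSS tree (`FloatToFused4BitRowwiseQuantized`, `SparseLengthsSumFused4BitRowwise`):
```
import numpy as np
from caffe2.python import core, workspace

# Quantize a float embedding table to fused 4-bit rowwise format.
workspace.FeedBlob("W", np.random.rand(16, 32).astype(np.float32))
workspace.RunOperatorOnce(core.CreateOperator(
    "FloatToFused4BitRowwiseQuantized", ["W"], ["W_q"]))

# Run SLS directly on the quantized table.
workspace.FeedBlob("indices", np.array([0, 2, 5], dtype=np.int64))
workspace.FeedBlob("lengths", np.array([2, 1], dtype=np.int32))
workspace.RunOperatorOnce(core.CreateOperator(
    "SparseLengthsSumFused4BitRowwise", ["W_q", "indices", "lengths"], ["Y"]))
print(workspace.FetchBlob("Y").shape)  # expected: (2, 32)
```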
Test Plan: CI
Reviewed By: jianyuh
Differential Revision: D20495304
fbshipit-source-id: 66a99677583f50fd40e29c514710c7b1a8cdbc29
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34783
Moving 2/4-bit SLS and row-wise 2/4-bit conversion operator to open source to be used by DLRM
Test Plan: CI
Reviewed By: yinghai
Differential Revision: D20461609
fbshipit-source-id: b3ef73ff10f2433afe06ffa73fe1145282d9ec4c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33431
Some elementwise operators don't have shape and type inference specified for the output tensor: `BitwiseOr`, `BitwiseAnd`, `BitwiseXor`, `Not`, `Sign`.
This change fixes this issue:
- For the `Not` and `Sign` operators, the output has the same type and shape as the input, so the `IdenticalTypeAndShapeOfInput` function is used to specify that.
- For the bitwise operators created by the `CAFFE2_SCHEMA_FOR_BINARY_BITWISE_OP` macro, the type and shape inference rules should be the same as for other binary element-wise operators, so `TensorInferenceFunction(ElementwiseOpShapeInference)` is used to specify that.
Also, some tests were modified to ensure that the shape and type are inferred (via the `ensure_outputs_are_inferred` parameter).
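A hedged sketch of that test pattern, using the `ensure_outputs_are_inferred` parameter mentioned above (`Sign` as the example):
```
import numpy as np
from caffe2.python import core
import caffe2.python.hypothesis_test_util as hu

class TestSignShapeInference(hu.HypothesisTestCase):
    def test_sign_inference(self):
        X = np.random.randn(3, 5).astype(np.float32)
        op = core.CreateOperator("Sign", ["X"], ["Y"])
        self.assertReferenceChecks(
            hu.cpu_do, op, [X], lambda X: [np.sign(X)],
            ensure_outputs_are_inferred=True,  # fail if shape/type not inferred
        )
```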
Test Plan:
```
CAFFE2_ASSERT_SHAPEINFERENCE=1 buck test caffe2/caffe2/python/operator_test:elementwise_ops_test
CAFFE2_ASSERT_SHAPEINFERENCE=1 buck test caffe2/caffe2/python/operator_test:math_ops_test
```
Note that the tests have to be executed with `CAFFE2_ASSERT_SHAPEINFERENCE=1` in order to fail upon shape inference failure.
Reviewed By: idning
Differential Revision: D19880164
fbshipit-source-id: 5d7902e045d79e5669e5e98dfb13a39711294939
Summary:
For both the Caffe2 and PyTorch backends, enable 3D convolutions through MIOpen.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33067
Reviewed By: BIT-silence
Differential Revision: D19880495
Pulled By: bddppq
fbshipit-source-id: 8f6f970910654c1c5aa871b48a04c1054875691c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32271
Use the 2-stage EmbeddingSpMDM interface in D19425982 to reduce the overhead of code cache lookup and lock contention.
Also fix an issue in sparse_lengths_sum_benchmarks that generated empty indices when the average length is small (e.g., 1).
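A conceptual Python sketch of the 2-stage pattern (not the actual fbgemm C++ API): the kernel is looked up or generated once outside the hot loop, so per-call code-cache lookups and lock contention disappear.
```
import numpy as np

_kernel_cache = {}

def _generate_kernel(block_size):
    # Hypothetical stand-in for JIT codegen: a kernel specialized for one
    # embedding block size.
    def kernel(weights, indices, lengths, out):
        offset = 0
        for i, n in enumerate(lengths):
            out[i] = weights[indices[offset:offset + n]].sum(axis=0)
            offset += n
    return kernel

def get_embedding_kernel(block_size):
    # Stage 1: one-time cache lookup (the only place that needs a lock).
    if block_size not in _kernel_cache:
        _kernel_cache[block_size] = _generate_kernel(block_size)
    return _kernel_cache[block_size]

# Stage 2: the returned kernel is invoked directly in the hot loop.
```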
Test Plan: CI
Reviewed By: dskhudia
Differential Revision: D19425987
fbshipit-source-id: d5c5f0d46e0072403901809c31d516fa0f4b9b31
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32448
Use binary search to compute the value at the given quantile across the input tensors.
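A hedged numpy sketch of the binary-search idea, under the assumption that the op returns the smallest value v with fraction(elements <= v) >= quantile:
```
import numpy as np

def quantile_by_binary_search(tensors, quantile, tol=1e-6):
    flat = np.concatenate([t.ravel() for t in tensors])
    lo, hi = float(flat.min()), float(flat.max())
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if (flat <= mid).mean() >= quantile:
            hi = mid  # mid already covers enough mass; move down
        else:
            lo = mid  # not enough mass below mid; move up
    return hi
```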
Test Plan: Newly added unit tests.
Reviewed By: jspark1105
Differential Revision: D19487604
fbshipit-source-id: 0dc6627b78d1310ac35b3f1d53b89cc89a697ece
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32475
As title
Test Plan: CI
Reviewed By: houseroad
Differential Revision: D19508778
fbshipit-source-id: fd9ad63607535980505d155f3e3c3b7c6b95daf7
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31612
Count the number of recent updates on rows. Exponential decay is applied to the counter with decay rate r, such that
r^{counter_halflife} = 0.5.
If counter_halflife is nonpositive, this operator is turned off.
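A hedged sketch of the counter semantics as described above (function and argument names hypothetical):
```
def update_row_counters(counters, last_seen, rows, step, counter_halflife):
    """Decay-and-increment counters for the rows touched at `step`."""
    if counter_halflife <= 0:
        return  # operator turned off
    r = 0.5 ** (1.0 / counter_halflife)  # so r ** counter_halflife == 0.5
    for row in rows:
        # Apply the decay accumulated since this row was last updated,
        # then count the new update.
        counters[row] = counters[row] * r ** (step - last_seen[row]) + 1.0
        last_seen[row] = step
```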
Test Plan: Added unit test
Reviewed By: chocjy
Differential Revision: D19217921
fbshipit-source-id: 96d850123e339212cc0e0ef352ea8a1b1bf61dfa
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24341
ConvTransposeOp doesn't crash for zero-batch input, but it doesn't modify the output blob either. This leads to buggy behavior, especially when running the same network twice with different inputs, or when backpropagating during training.
Since `ConvTransposeUnpoolBase<Context>::GetOutputSize` works for zero-batch, remove the check for `input.numel() > 0` and reshape the output blob before returning.
For CudnnConvTransposeGradientOp, setting `dfilter` and `dbias` explicitly is a bit verbose, and cuDNN seems able to handle this case, so simply remove the `X.numel() == 0` branch.
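A hedged repro sketch: run ConvTranspose on a zero-batch input and check that the output blob is reshaped rather than left stale (shapes illustrative):
```
import numpy as np
from caffe2.python import core, workspace

workspace.FeedBlob("X", np.zeros((0, 3, 4, 4), dtype=np.float32))  # zero batch
workspace.FeedBlob("w", np.random.rand(3, 2, 2, 2).astype(np.float32))
workspace.FeedBlob("b", np.random.rand(2).astype(np.float32))
workspace.RunOperatorOnce(core.CreateOperator(
    "ConvTranspose", ["X", "w", "b"], ["Y"], kernel=2, stride=2))
print(workspace.FetchBlob("Y").shape)  # expected: (0, 2, 8, 8)
```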
Test Plan: buck test mode/dev-nosan caffe2/caffe2/python/operator_test:conv_transpose_test -- --run-disabled
Reviewed By: BIT-silence
Differential Revision: D16807606
fbshipit-source-id: 0d72c5bd8f2e03c34465e7b530cca548d9bdd5e1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19705
Optimize the case where a run of consecutive non-broadcast dims is followed by a run of consecutive broadcast dims.
For example, MulGradient(["dC", "A", "B"], ["dA", "dB"], broadcast=True, axis=0), where A.shape == dC.shape == [9508, 80] and B.shape == [80].
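A numpy sketch of the gradients in that example (reference semantics, not the optimized kernel): dB reduces dC over the leading non-broadcast dims, which the optimization turns into one contiguous reduction.
```
import numpy as np

dC = np.random.rand(9508, 80).astype(np.float32)
A = np.random.rand(9508, 80).astype(np.float32)
B = np.random.rand(80).astype(np.float32)

dA = dC * B                 # broadcast multiply over the trailing dim
dB = (dC * A).sum(axis=0)   # single reduction over the non-broadcast dims
```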
Test Plan:
On an SKL T6, running mul_gradient_benchmark without this optimization:
Operator #0 (dA, MulGradient) 11.9119 ms/iter
After this optimization:
Operator #0 (dA, MulGradient) 0.672759 ms/iter
Need to land D15291800 first to fix the unit test error.
Reviewed By: dmudiger
Differential Revision: D15075415
fbshipit-source-id: 0f97be17cf8f1dacbafa34cd637fb8bc1c5e5387
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29167
As titled.
This fix is crucial: multi_channel splitting can create a history with no items (i.e., D == 0), which leads to flow failures.
Test Plan:
Unit test.
Flow test:
before fix: f148783160
after fix: f149082299
buck test mode/dev-nosan caffe2/caffe2/python/operator_test:softmax_ops_test
Reviewed By: xianjiec
Differential Revision: D18296081
fbshipit-source-id: e0bb2dc2c4e5b465e213f31e5c5ced3a7e1fd574
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26227
In the previous implementation of the composite LR policy, the lr_scale for each sub-policy was overwritten by the last lr_scale.
Due to another bug in the unit test (policy_lr_scale was the same for all sub-policies), this bug was not detected.
Fix: add an additional field to CompositeLearningRateItem so that we store lr_scale values for all sub-policies.
With the unit test fixed, the error in the previous implementation:
https://fburl.com/testinfra/ikdbnmey
With the fix:
https://fburl.com/testinfra/m694ehl1
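A hedged illustration of the bug pattern (hypothetical names, not the actual Caffe2 code):
```
sub_policies = [("step", 0.1), ("exp", 0.5)]

# Buggy: one shared field is rewritten on every iteration, so every
# sub-policy ends up seeing the last lr_scale (0.5 here).
shared_scale = None
for _policy, scale in sub_policies:
    shared_scale = scale

# Fixed: store one lr_scale per CompositeLearningRateItem.
per_policy_scale = [scale for _policy, scale in sub_policies]
```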
Test Plan:
Unit test:
buck test caffe2/caffe2/python/operator_test:learning_rate_op_test -- test_composite_learning_rate_op
Reviewed By: chocjy, alex1o1o7cloud
Differential Revision: D17380363
fbshipit-source-id: 161e9cb71bb2ea7f0734a3361e270616057a08e4