pytorch/caffe2/python at 290acab2c7609cbc48afcd3c457e7e7e6836b1d0 - pytorch - Carlos Sousa's Git

OSSForks/pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2026-01-15 12:15:51 +00:00

Files

History

Honghao Wei 290acab2c7 implement drelu and unittest

Summary:
In this revision, I mainly implemented the DRelu activation. See https://arxiv.org/pdf/1706.06978v1.pdf for details.
To sum up, different from standard relu and purely, which divide the scope into two parts with boundary at zero, DRelu calculate another value p to divide the activation into two part. P is the softmax value of the output of Batch Normalization. For f(x)=x part in relu, you can find similar patten in f(x)=px, and for f(x)=0 part in rely, you can find similar pattern in f(x)=a(1-p)x, in which a is a parameter to tune. Drelu activation result is the sum of these two parts, f(x) = a(1-p)x + px.

To implement DRelu, I take BatchNormalization as super class and then use the above formula for computation. In order to allow users to choose activation methods, which usually takes place when calling add_mlp function in processor_util.py, I pass the parameter transfer in model_option from UI to the details, just as what dropout do. Currently, I place it in extra_option, but can modify it if AML team needs to redesign the UI.

I also add units test for DRelu. We check the shape of output and also do the numeric unit tests.
For Unit test, I first check the numeric value of BatchNormalization, since there is no similar test before. I then compute the value of DRelu outputs and compare the results with current DRelu layer.

Reviewed By: chocjy

Differential Revision: D5341464

fbshipit-source-id: 896b4dcc49cfd5493d97a8b448401b19e9c80630

2017-07-20 11:50:08 -07:00

..

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

Fixed typo

2017-06-23 14:02:40 -07:00

Adding tanh to brew

2017-07-11 18:17:52 -07:00

implement drelu and unittest

2017-07-20 11:50:08 -07:00

…

Deprecate CNNModelHelper - Inception()

2017-06-15 14:03:27 -07:00

allow param_info to set optimizer

2017-07-12 08:49:48 -07:00

Fix broken seq2seq example

2017-07-13 23:31:54 -07:00

RNN Workspace Blob Extraction

2017-07-17 10:24:18 -07:00

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

Moved sigmoid, tanh, and _prepare_lstm (renamed) to a util file.

2017-07-10 17:52:22 -07:00

_import_c_extension.py

…

allcompare_test.py

Adding AllCompare-like function to data_parallel_model

2017-07-13 13:03:57 -07:00

attention.py

Unrolled test for AttentionCell

2017-06-25 17:21:24 -07:00

binarysize.py

binary size util

2017-07-14 17:49:24 -07:00

brew_test.py

quick fix for model_helper __init__

2017-07-12 08:49:48 -07:00

brew.py

Adding tanh to brew

2017-07-11 18:17:52 -07:00

caffe_translator_test.py

…

caffe_translator.py

Read pretrained weights using binary mode in caffe_translator.py

2017-07-08 10:17:57 -07:00

checkpoint_test.py

Allow tasks/execution_steps to be cloned at runtime

2017-06-20 22:32:07 -07:00

checkpoint.py

…

CMakeLists.txt

…

cnn.py

…

context_test.py

…

context.py

…

control_test.py

…

control.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

convnet_benchmarks_test.py

…

convnet_benchmarks.py

brew API in convnet benchmark

2017-07-05 10:34:48 -07:00

core_gradients_test.py

add debug information when there is blob version mismatch

2017-06-30 16:22:46 -07:00

core_test.py

single trainer hybrid device

2017-06-27 22:06:30 -07:00

core.py

handle RecurrentNetwork operator when clone net

2017-07-17 17:33:21 -07:00

crf.py

Deprecate CNNModelHelper in python/crf.py

2017-06-14 08:49:27 -07:00

data_parallel_model_test.py

Allow CPU device scope in data_parallel_model and data_parallel_rendevous device scope checks

2017-07-18 15:47:41 -07:00

data_parallel_model.py

Allow CPU device scope in data_parallel_model and data_parallel_rendevous device scope checks

2017-07-18 15:47:41 -07:00

data_workers_test.py

fix a rare race condition by initializing scratch blobs beforehand

2017-06-26 10:18:18 -07:00

data_workers.py

add timeout argument to DequeueBlobs; use 10 min timeout for data workers

2017-07-13 18:52:03 -07:00

dataio_test.py

Allow tasks/execution_steps to be cloned at runtime

2017-06-20 22:32:07 -07:00

dataio.py

Fix a few typos and grammars in comment

2017-06-14 18:22:39 -07:00

dataset.py

Use scope name for dataset cursor

2017-07-15 19:22:32 -07:00

db_test.py

…

device_checker.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

dyndep.py

…

empty.so

…

experiment_util.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

extension_loader.py

…

gradient_check_test.py

Cos, Sin, and Abs operators

2017-07-03 22:18:32 -07:00

gradient_checker.py

Fix a few typos and grammars in comment

2017-06-14 18:22:39 -07:00

gru_cell.py

Implemented GRUCell

2017-07-10 17:52:25 -07:00

hsm_util.py

…

hypothesis_test_util.py

hyposesis_test grad_reference bug fixes

2017-07-14 14:41:23 -07:00

hypothesis_test.py

dot product using matmul

2017-07-17 23:20:37 -07:00

layer_model_helper.py

Fixing error message for layer model helper

2017-07-18 09:52:45 -07:00

layer_model_instantiator.py

…

layer_test_util.py

Core unit test fixes for Python 3

2017-06-23 13:22:16 -07:00

layers_test.py

implement drelu and unittest

2017-07-20 11:50:08 -07:00

load_save_test.py

…

lstm_benchmark.py

Added flags to lstm, convnet and sparse_nn_benchmarks to print out operators

2017-06-30 23:47:04 -07:00

memonger_test.py

fix for duplicate input case

2017-07-13 01:51:30 -07:00

memonger.py

add code comments to memonger

2017-07-17 13:07:33 -07:00

mkl_test_util.py

…

model_device_test.py

Deprecate CNNModelHelper in caffe2/python/model_device_test.py

2017-06-22 15:37:17 -07:00

model_helper.py

quick fix for model_helper __init__

2017-07-12 08:49:48 -07:00

mpi_python.cc

…

muji_test.py

…

muji.py

…

net_builder_test.py

Allow tasks/execution_steps to be cloned at runtime

2017-06-20 22:32:07 -07:00

net_builder.py

Allow tasks/execution_steps to be cloned at runtime

2017-06-20 22:32:07 -07:00

net_drawer.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

net_printer_test.py

Allow tasks/execution_steps to be cloned at runtime

2017-06-20 22:32:07 -07:00

net_printer.py

Fix net_printer.py

2017-07-11 15:26:52 -07:00

optimizer_context.py

allow param_info to set optimizer

2017-07-12 08:49:48 -07:00

optimizer_test_util.py

…

optimizer_test.py

Set device to the default device(CPU) when DeviceContext is None.

2017-07-13 17:54:36 -07:00

optimizer.py

Set device to the default device(CPU) when DeviceContext is None.

2017-07-13 17:54:36 -07:00

parallelize_gpu_bmuf_distributed_test.py

Add distributed BMUF implementation.

2017-06-21 16:18:11 -07:00

pipeline.py

Enable runtime cloning of tasks.

2017-06-21 03:18:20 -07:00

predictor_constants.py

…

pybind_state_gpu.cc

…

pybind_state_mkl.cc

…

pybind_state.cc

comment out unused parameter in pybind_state.cc

2017-07-17 15:57:49 -07:00

pybind_state.h

fast simple-net memonger for C++

2017-07-06 15:17:07 -07:00

python_op_test.py

Fix some typos

2017-06-28 13:50:48 -07:00

queue_util.py

…

record_queue.py

Fix a few typos and grammars in comment

2017-06-14 18:22:39 -07:00

recurrent.py

RNN Workspace Blob Extraction

2017-07-17 10:24:18 -07:00

rnn_cell.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

schema_test.py

Add __sub__ function for schema.Struct

2017-06-28 11:24:01 -07:00

schema.py

IndexHash

2017-07-07 23:06:11 -07:00

scope_test.py

…

scope.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

session_test.py

…

session.py

Allow tasks/execution_steps to be cloned at runtime

2017-06-20 22:32:07 -07:00

sparse_to_dense_mask_test.py

…

task.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

test_util.py

…

text_file_reader.py

…

timeout_guard.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00

toy_regression_test.py

…

tt_core_test.py

…

tt_core.py

Fix a few typos and grammars in comment

2017-06-14 18:22:39 -07:00

utils.py

Fast path for serializing large floating-point tensors to protobuf

2017-07-10 17:52:22 -07:00

visualize.py

Python 3 compatible integer division

2017-07-06 11:47:12 -07:00

workspace_test.py

Core unit test fixes for Python 3

2017-06-23 13:22:16 -07:00

workspace.py

Dict fixes/improvements and unittest targets for Python 3 in caffe2 core

2017-06-29 17:05:41 -07:00