pytorch/test/cpp/api/dataloader.cpp at 9bc8fb8dfdcec945bae7b23dff6c3742f02e648f

mirror of https://github.com/zebrajr/pytorch.git synced 2026-01-15 12:15:51 +00:00

Files

xzhu1900 31f1928096 add sorting policy to ChunkDataset (#23053 )

Summary:
Add a sorting policy to ChunkDataset.

This is considered an advanced parameter for developers who want to apply a 'sorting policy' to the chunk data before sampling into minibatch.

Different than the collate method, this policy is applied on the chunk level instead of minibatch level. When a chunk of data is loaded (multiple chunks if cross_chunk_shuffle_count_ is greater than 1), this policy is targeting to the full loaded data. It will be useful if developers want to perform some pre-processing (like bucketing) to the chunk data before example sampler samples the data.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23053

Differential Revision: D16537692

Pulled By: colesbury

fbshipit-source-id: cd21ed40ab787a18b8c6dd304e5b806a7a45e6ba

2019-07-29 12:34:02 -07:00

74 KiB

Raw Blame History

View Raw

74 KiB Raw Blame History

74 KiB

Raw Blame History