tensorflow/tensorflow
Byungchul Kim fec780d7fe Set FC's keep_num_dims to false when the output dims differ from the input dims after quantization.
On gemma3n with decode batch > 1, this happens when the embedding is coupled with PLE via einsum.
The export steps are:
1) Initial: BMM([b,2048]x[2048,7680] -> [b,7680])
2) FuseInputReshape_BatchMatMulWithFlattenedRhsDims: BMM([b,2048]x[2048,7680] -> [b,7680])
3) ConvertBatchMatMulOp2FullyConnectedOp_Rank2ConstantRhs: FC([b,2048]x[2048,7680] -> [b,7680])
4) StrictQuantizationPattern(by IsDrqTensor): FC([b,1,2048]x[2048,7680] -> [b,7680])
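The shape mismatch in step 4 can be sketched with NumPy (a minimal sketch; the batch size and the flatten-then-matmul modeling of TFLite FC semantics are assumptions, not taken from the actual kernel code):

```python
import numpy as np

# Hypothetical shapes from step 4 above: the quantization pattern expands the
# FC input to rank 3 while the FC output stays rank 2.
b = 2  # decode batch > 1, as on gemma3n
x = np.ones((b, 1, 2048), dtype=np.float32)   # FC input after quantization
w = np.ones((2048, 7680), dtype=np.float32)   # rank-2 constant RHS

# keep_num_dims == false: FC flattens all leading dims into one batch dim,
# so the output rank (2) differs from the input rank (3).
flat = x.reshape(-1, x.shape[-1])             # [b*1, 2048]
y = flat @ w                                  # [b, 7680]
assert y.shape == (b, 7680)

# keep_num_dims == true would instead preserve the input rank.
y_keep = x @ w                                # [b, 1, 7680]
assert y_keep.shape == (b, 1, 7680)
```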

When FC's keep_num_dims is false and the FC is followed by a reshape op (as in gemma3n), keep_num_dims will later be set to true with the correct shapes by EnableFullyConnectedKeepNumDimsBeforeReshape.
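The later cleanup can be sketched as follows (shapes and the fold itself are assumptions modeled in NumPy, not the actual MLIR pass logic):

```python
import numpy as np

b = 2
x = np.ones((b, 1, 2048), dtype=np.float32)
w = np.ones((2048, 7680), dtype=np.float32)

# Before the pass: FC with keep_num_dims == false flattens to rank 2, then an
# explicit reshape op restores the higher rank.
fc_out = x.reshape(-1, 2048) @ w              # [b, 7680]
reshaped = fc_out.reshape(b, 1, 7680)         # trailing reshape op

# After the pass: keep_num_dims == true makes FC produce the reshaped result
# directly, so the trailing reshape becomes redundant and can be removed.
fused = x @ w                                 # [b, 1, 7680]
assert np.array_equal(reshaped, fused)
```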

PiperOrigin-RevId: 847813526
2025-12-22 10:45:22 -08:00