Empty compression messages #14261

andre4i · 2023-02-21T21:38:27Z

Description

We've identified an issue with compression: the trailing empty ops, which are added to the batch to preserve its shape and reserve sequence numbers, are not really empty, because their content is still present in the deserializedContent property. As a result, compression produces a large payload which does not end up getting chunked.

This is not actually an issue. When the loader does not support compression, the outbox sends the deserializedContent property. When it does, the message is crafted to contain only contents, metadata, compression and referenceSequenceNumber. So if compression is enabled and supported, the message stays small because the deserializedContent value is discarded. This is still a good change: it reduces memory consumption when compression is enabled and supported.
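The behavior described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the container-runtime code: the `BatchMessage` and `WireMessage` shapes and both function names are hypothetical, and only the field names (contents, metadata, compression, referenceSequenceNumber, deserializedContent) come from the description. Keeping the metadata on trailing placeholders is also an assumption.

```typescript
// Hypothetical in-memory shape of a batched message (illustration only).
interface BatchMessage {
    contents?: string; // serialized payload
    deserializedContent?: unknown; // parsed copy of the payload kept in memory
    metadata?: Record<string, unknown>;
    compression?: string;
    referenceSequenceNumber: number;
}

// Hypothetical shape of what is sent when the loader supports compression.
interface WireMessage {
    contents?: string;
    metadata?: Record<string, unknown>;
    compression?: string;
    referenceSequenceNumber: number;
}

// Copies only the four wire fields, so deserializedContent never leaves memory.
function toWireMessage(message: BatchMessage): WireMessage {
    return {
        contents: message.contents,
        metadata: message.metadata,
        compression: message.compression,
        referenceSequenceNumber: message.referenceSequenceNumber,
    };
}

// Builds a placeholder for a trailing slot in a compressed batch. It keeps the
// slot (and, by assumption, the metadata) but carries no payload in either form.
function emptyTrailingMessage(message: BatchMessage): BatchMessage {
    return {
        metadata: message.metadata,
        referenceSequenceNumber: message.referenceSequenceNumber,
    };
}
```

With placeholders built this way, the trailing ops hold no payload in memory, which is the memory saving the description refers to; the wire size is unaffected because `toWireMessage` already drops deserializedContent.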

Example of a compressed batch:

with actual content in the messages which are supposed to be empty:

This made the batch exceed 2.5 MB

packages/runtime/container-runtime/src/opLifecycle/opCompressor.ts

vladsud · 2023-02-21T21:51:15Z

It would be great to also have an end-to-end test that validates that we send substantially fewer bytes over the wire (compared to the initial DDS payload) when dealing with large non-random payloads, including the compression and compression + chunking cases.
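A size check of the kind requested above could look roughly like the sketch below. Everything here is hypothetical: the `WireMessage` shape, the helper names, and the ratio threshold are illustrative choices, and a real end-to-end test would capture the messages actually submitted by the runtime rather than build them by hand.

```typescript
// Hypothetical wire shape (illustration only, not the Fluid Framework type).
interface WireMessage {
    contents?: string;
    metadata?: Record<string, unknown>;
    compression?: string;
    referenceSequenceNumber: number;
}

// UTF-8 size of a string, computed without relying on platform APIs.
function utf8ByteLength(text: string): number {
    let bytes = 0;
    for (const ch of text) {
        const codePoint = ch.codePointAt(0) as number;
        bytes += codePoint < 0x80 ? 1 : codePoint < 0x800 ? 2 : codePoint < 0x10000 ? 3 : 4;
    }
    return bytes;
}

// Total bytes the batch occupies once every message is serialized to JSON.
function batchWireSize(batch: WireMessage[]): number {
    return batch.reduce(
        (total, message) => total + utf8ByteLength(JSON.stringify(message)),
        0,
    );
}

// Throws unless the batch is at most `maxRatio` times the original payload size.
function expectSmallerOnWire(
    batch: WireMessage[],
    originalPayloadBytes: number,
    maxRatio: number,
): void {
    const actual = batchWireSize(batch);
    const limit = originalPayloadBytes * maxRatio;
    if (actual > limit) {
        throw new Error(`Batch is ${actual} bytes on the wire, expected at most ${limit}`);
    }
}
```

A check like this fails as soon as a trailing op carries a copy of the original payload, since the batch then exceeds the payload size instead of staying well below it.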

andre4i · 2023-02-21T22:49:53Z

Ok, we need to keep this for back-compat reasons (the old loader's submit function requires an object rather than string contents). ~~We could work around it, but it would add a performance hit, as we'd be doing JSON.parse twice for the same message when compression happens.~~ This is fine to do for this scenario.

I'll add a note to enhance some of the end-to-end tests to verify the resulting payload size and add more confidence (ADO:3499).
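The back-compat path mentioned above can be sketched as follows. This is an assumption-laden illustration, not the merged code: `PendingMessage`, `LegacySubmitFn` and `submitToLegacyLoader` are hypothetical names, and the only behavior taken from the discussion is that an old loader expects a parsed object, so the string contents are run through JSON.parse at send time.

```typescript
// Hypothetical message that keeps only its serialized contents in memory.
interface PendingMessage {
    contents?: string;
    referenceSequenceNumber: number;
}

// Stand-in for an old loader's submit function, which expects a parsed object
// rather than string contents.
type LegacySubmitFn = (contents: unknown, referenceSequenceNumber: number) => void;

// Reconstructs the object from the serialized string only on the legacy path,
// so the message does not need to carry a second, deserialized copy.
function submitToLegacyLoader(message: PendingMessage, submit: LegacySubmitFn): void {
    const parsed: unknown =
        message.contents === undefined ? undefined : JSON.parse(message.contents);
    submit(parsed, message.referenceSequenceNumber);
}
```

The extra JSON.parse is paid only when talking to an old loader, which is the trade-off the comment above accepts.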

msfluid-bot · 2023-03-01T18:11:19Z

⯅ @fluid-example/bundle-size-tests: +252 Bytes

Metric Name	Baseline Size	Compare Size	Size Diff
aqueduct.js	433.72 KB	433.84 KB	⯅ +126 Bytes
connectionState.js	680 Bytes	680 Bytes	■ No change
containerRuntime.js	228.83 KB	228.95 KB	⯅ +126 Bytes
loader.js	152.93 KB	152.93 KB	■ No change
map.js	43.78 KB	43.78 KB	■ No change
matrix.js	135.95 KB	135.95 KB	■ No change
odspDriver.js	91.79 KB	91.79 KB	■ No change
odspPrefetchSnapshot.js	43.51 KB	43.51 KB	■ No change
sharedString.js	156.5 KB	156.5 KB	■ No change
Total Size	1.36 MB	1.36 MB	⯅ +252 Bytes

Baseline commit: 81b5ef9

Generated by 🚫 dangerJS against 34f4181

## Description

This resubmits #14261, which was reverted by #14370 because it was stripping all metadata from the batch. The issue was not caught in testing as the change removed the metadata only for the trailing ops, which broke the stress tests. The end-to-end tests have been adjusted to catch similar issues in the future.

…re. (#14587)

## Description

ADO:3499

There have been instances in which _we thought_ we were sending the wrong things over the wire (for example messages with duplicated content) like in #14261. However, there is in-flight work related to the message logistics, so it is very important to pin down expectations about the size of the messages we are sending.

andre4i added 2 commits February 21, 2023 13:29

Compression should not add any content in the trailing ops in a compressed batch

6ddb0e2

Don't clone the message

1dd5e1e

andre4i requested a review from a team as a code owner February 21, 2023 21:38

github-actions bot added area: runtime Runtime related issues base: main PRs targeted against main branch labels Feb 21, 2023

vladsud reviewed Feb 21, 2023

packages/runtime/container-runtime/src/opLifecycle/opCompressor.ts

vladsud approved these changes Feb 21, 2023

Lint

a9b32ce

andre4i closed this Feb 21, 2023

andre4i reopened this Feb 21, 2023

andre4i added 5 commits February 21, 2023 15:19

In case of a really old loader, reconstruct the JSON object when sending it

70e91b6

Merge branch 'main' into empty-compression-messages

2c57315

Add assert

68e8bf1

Don't assert for all messages

e8a8dce

Linter

34f4181

andre4i merged commit 30a2205 into microsoft:main Mar 1, 2023

andre4i added a commit that referenced this pull request Mar 1, 2023

Revert "Empty compression messages (#14261)"

458e32c

This reverts commit 30a2205.

andre4i mentioned this pull request Mar 1, 2023

Revert "Empty compression messages" #14370

Merged

andre4i added a commit that referenced this pull request Mar 1, 2023

Revert "Empty compression messages" (#14370)

a51cda1

Reverts #14261 The change surfaced a bug related to the compressed message metadata and fired an incident in our stress tests.

andre4i mentioned this pull request Mar 1, 2023

Trailing messages in compressed batches should be empty. #14372

Merged

andre4i mentioned this pull request Mar 15, 2023

Add a test to assert the actual message size we will send over the wire. #14587

Merged
