storage: optionally ignore timestamps in the future for retention by jcsp · Pull Request #10028 · redpanda-data/redpanda

jcsp · 2023-04-13T10:05:37Z

This adds an off-by-default behavior for systems where a user has sent a dramatically wrong timestamp: operators can ask Redpanda to ignore timestamps in the future and do its best to infer an alternative timestamp for use in retention.

Fixes #9820

Backports Required

Release Notes

Improvements

Added storage_ignore_timestamps_in_future_sec cluster configuration property (default null). If set to non-null, then timestamps more than this many seconds in the future will be ignored by Redpanda when considering whether a segment is old enough to garbage collect.
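The rule in the release note can be sketched roughly as follows, in Python rather than Redpanda's C++. The function name, its parameters, and the fallback timestamp are assumptions made for illustration; how Redpanda actually infers the alternative timestamp is not described here.

```python
from typing import Optional


def effective_retention_timestamp_ms(
    max_timestamp_ms: int,
    now_ms: int,
    ignore_in_future_sec: Optional[int],
    fallback_timestamp_ms: int,
) -> int:
    """Pick the timestamp used to decide whether a segment is old enough to collect.

    Hypothetical helper: the names and the fallback value are illustrative
    assumptions, not Redpanda's implementation.
    """
    # Property unset (null): default behavior, always trust the max timestamp.
    if ignore_in_future_sec is None:
        return max_timestamp_ms
    # Timestamps more than the threshold ahead of "now" are ignored.
    if max_timestamp_ms > now_ms + ignore_in_future_sec * 1000:
        return fallback_timestamp_ms
    return max_timestamp_ms
```

With the property left at null the function always returns the segment's own max timestamp, which matches the "no change by default" intent of the PR.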

jcsp · 2023-04-13T13:34:38Z

Failure is:

CI Failure (BadLogLines) in NodesDecommissioningTest.test_decommissioning_finishes_after_manual_cancellation #9839

andrwng

LGTM overall, just a question about toggling the escape hatch

andrwng · 2023-04-17T18:41:53Z

src/v/storage/segment_index.h

+        _retention_timestamp = t;
+    }
+    model::timestamp retention_timestamp() const {
+        return _retention_timestamp.value_or(_state.max_timestamp);


Should we also gate this on storage_ignore_timestamps_in_future_sec, in case we switch back from ignoring timestamps and no longer want to rely on _retention_timestamp?

jcsp · 2023-04-19

I had mainly been thinking of this as a one shot thing, but you're right that it's straightforward to make it toggleable by adding a check here, so I've gone ahead and done that.
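A minimal sketch of what gating the accessor on the configuration could look like, written in Python for brevity. The class name and the choice to pass the property value in as an argument are assumptions; the actual C++ change is not shown in this thread.

```python
from typing import Optional


class SegmentIndexSketch:
    """Illustrative stand-in for segment_index; not the real class."""

    def __init__(self, max_timestamp: int) -> None:
        self.max_timestamp = max_timestamp
        self._retention_timestamp: Optional[int] = None

    def set_retention_timestamp(self, t: int) -> None:
        self._retention_timestamp = t

    def retention_timestamp(self, ignore_in_future_sec: Optional[int]) -> int:
        # Use the override only while the property is set, so unsetting the
        # property reverts to max_timestamp even if an override was stored.
        if ignore_in_future_sec is not None and self._retention_timestamp is not None:
            return self._retention_timestamp
        return self.max_timestamp
```

The stored override is kept rather than cleared, so turning the property back on resumes using it without recomputation; whether the real code behaves this way is an assumption of the sketch.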

VladLazar

Makes sense to me. We should probably write down some instructions on how to detect scenarios where bogus timestamps are blocking retention and how to use this new config to resolve the situation.

Probably something along these lines:

1. Look for "segment with bogus retention timestamps" logs
2. From the log lines above, determine how far into the future those timestamps are
3. Set storage_ignore_timestamps_in_future_sec to something that will allow retention to proceed
4. Unset storage_ignore_timestamps_in_future_sec if the bogus timestamps were due to a transient client issue (perhaps the test should be extended to do this)
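For the step about determining how far into the future a timestamp is, the arithmetic can be sketched as below. Both helper names are hypothetical, and the timestamp is taken as an input in milliseconds because the exact log line format is not given in this thread.

```python
import time
from typing import Optional


def seconds_in_future(timestamp_ms: int, now_ms: Optional[int] = None) -> float:
    """Return how many seconds timestamp_ms lies ahead of now (negative if past)."""
    if now_ms is None:
        now_ms = int(time.time() * 1000)
    return (timestamp_ms - now_ms) / 1000.0


def threshold_ignores(
    timestamp_ms: int, threshold_sec: int, now_ms: Optional[int] = None
) -> bool:
    """True if a candidate threshold would cause this timestamp to be ignored."""
    return seconds_in_future(timestamp_ms, now_ms) > threshold_sec
```

A candidate value for the property only helps if it is smaller than the offset of the bogus timestamps, while staying large enough to tolerate ordinary clock skew between clients and brokers.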

VladLazar · 2023-04-18T09:12:07Z

tests/rptest/tests/retention_policy_test.py

+        # remove anything because the timestamps are too far in the future.
+        with self.redpanda.monitor_log(self.redpanda.nodes[0]) as mon:
+            # Time period much larger than what we set log_compaction_interval_ms to
+            sleep(10)


Is this sleep actually necessary? The wait until below implies that retention has kicked in.

jcsp · 2023-04-18

Mostly not, but I wanted to have a better chance of the housekeeping having entirely completed, rather than proceeding as soon as it has seen one segment that emits the bogus timestamp message.

`segment_index` now has a retention_timestamp overlap that overrides use of the index's max_timestamp as the timestamp for retention purposes. This override is set in disk_log_impl::retention_adjust_timestamps if the storage_ignore_timestamps_in_future_sec configuration is enabled and the segment's max timestamp is out of bounds. There is no change to behavior by default: this only kicks in if the new configuration property has been set. Fixes redpanda-data#9820

jcsp · 2023-04-19T18:54:25Z

/backport v23.1.x

jcsp · 2023-04-19T18:54:31Z

/backport v22.3.x

vbotbuildovich · 2023-04-19T18:55:17Z

Failed to run cherry-pick command. I executed the below command:

git cherry-pick -x 1f893a914033e8b5951e3a100d1f086f2203d155 a75c2208d3a9c11d966abe871a71685aeee80846 ba1b0ad59c72ea98d6e61dc3ead60c0bccdd12a1

Workflow run logs.

vbotbuildovich · 2023-04-19T18:55:36Z

Failed to run cherry-pick command. I executed the below command:

git cherry-pick -x 1f893a914033e8b5951e3a100d1f086f2203d155 a75c2208d3a9c11d966abe871a71685aeee80846 ba1b0ad59c72ea98d6e61dc3ead60c0bccdd12a1

Workflow run logs.

jcsp added the kind/enhance and area/storage labels Apr 13, 2023

github-actions bot added the area/redpanda label Apr 13, 2023

jcsp marked this pull request as ready for review April 13, 2023 13:34

jcsp requested review from VladLazar and andrwng April 13, 2023 13:35

andrwng reviewed Apr 17, 2023

VladLazar previously approved these changes Apr 18, 2023

jcsp added 3 commits April 19, 2023 11:51

config: add storage_ignore_timestamps_in_future_sec

1f893a9

tests: add BogusTimestampTest

ba1b0ad

jcsp dismissed VladLazar’s stale review via ba1b0ad April 19, 2023 10:51

jcsp force-pushed the issue-9820-timestamp-filter branch from b5eb0ea to ba1b0ad Compare April 19, 2023 10:51

andrwng approved these changes Apr 19, 2023

jcsp merged commit 177cbe4 into redpanda-data:dev Apr 19, 2023

jcsp deleted the issue-9820-timestamp-filter branch April 19, 2023 18:54

This was referenced Apr 20, 2023

[v23.1.x] storage: optionally ignore timestamps in the future for retention #10228

Merged

[22.3.x] storage: optionally ignore timestamps in the future for retention #10236

Merged

storage: refine retention_adjust_timestamps #10258

Merged

jcsp merged 3 commits into redpanda-data:dev from jcsp:issue-9820-timestamp-filter
