-
Notifications
You must be signed in to change notification settings - Fork 885
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generic hooks for testing #6938
Conversation
common/errorinjector/noop_impl.go
Outdated
@@ -0,0 +1,18 @@ | |||
//go:build !errorinjector |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd like to consider flipping this, actually (ie the default being the real impl).
A while ago I experimented with build tags and assertions; and I feel confident that we can add a step for the actual binary build that verifies there's no trace of ErrorInjector
to be found, as it was optimized away. That way we don't need to tell every single developer to run their tests with this flag (in their IDE etc.).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I like that, but I'm still a little concerned since we have so many different binary builds (docker images, goreleaser, internal ones), and then users that build their own binaries...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FWIW, docker-builds seems to already invoke the server Makefile.
} | ||
return lb | ||
} | ||
|
||
func (lb *defaultLoadBalancer) PickWritePartition( | ||
taskQueue *tqid.TaskQueue, | ||
) *tqid.NormalPartition { | ||
if n := lb.forceWritePartition(); n >= 0 { | ||
if n, ok := testhooks.Get[int](lb.testHooks, testhooks.MatchingLBForceWritePartition); ok { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
did we discuss it?
I really dislike the idea of having this kind of dependency, and having this kind of code in the main code path.
I would suggest to extract functionality into something like "PartitionPicker", and provide different implementations in functional tests.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We're discussing it now...
-
If you look at what it was doing before, it was abusing dynamic config to hook in here, so this is already a strict improvement (reduces runtime overhead, and makes it clearer that this is a hook for testing).
-
We only want this hook in some tests, only some of the time. So even in tests, most of the time we want the standard behavior. So we'd need a test LoadBalancer that can be set/unset to a mode with fixed behavior, otherwise falls back to the default. I think that's worse:
- First it's just a lot more code.
- Second, that means the mechanism to poke the test implementation is specific to each object, and tests will have to do their own cleanup. This generic mechanism is simpler for test writers, you just
s.InjectHook
and it's automatically cleaned up.
-
How do we do that for the other two examples here, forcing async match, and injecting a racing call in the middle of an update-with-start sequence? The alternative implementation method doesn't work there, as far as I can see.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Another good use will be the case when test needs to continue only after specific code line is reached somewhere deep inside server (like in almost all Update tests, I need to make sure that Update actually reached the server and added to the registry before moving forward). In this case, hook can unblock the channel which is awaited in test code.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
Approval pending on team discussion.
const ( | ||
MatchingDisableSyncMatch = "matching.disableSyncMatch" | ||
MatchingLBForceReadPartition = "matching.lbForceReadPartition" | ||
MatchingLBForceWritePartition = "matching.lbForceWritePartition" | ||
|
||
UpdateWithStartInBetweenLockAndStart = "history.updateWithStartInBetweenLockAndStart" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we use iota
instead? Since the identifiers don't need to be stable; it would save us some typing and effort to keep things consistent.
Counter point could be that we lose the ability to print them. Which we don't do right now; but might in the future.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah, good idea!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Print them as int
should be ok too. Anyway, adding Stringer
implementation later is not a big deal.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I really like the idea! Yes, we should use it with caution when there is no better way, but it really opens new horizons in testing.
@@ -410,6 +414,7 @@ func (c *TemporalImpl) startFrontend() { | |||
fx.Provide(func() persistenceClient.AbstractDataStoreFactory { return c.abstractDataStoreFactory }), | |||
fx.Provide(func() visibility.VisibilityStoreFactory { return c.visibilityStoreFactory }), | |||
fx.Provide(func() dynamicconfig.Client { return c.dcClient }), | |||
fx.Decorate(func() testhooks.TestHooks { return c.testHooks }), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Side note: most of these fx.Provide
should be fx.Decorate
.
} | ||
return lb | ||
} | ||
|
||
func (lb *defaultLoadBalancer) PickWritePartition( | ||
taskQueue *tqid.TaskQueue, | ||
) *tqid.NormalPartition { | ||
if n := lb.forceWritePartition(); n >= 0 { | ||
if n, ok := testhooks.Get[int](lb.testHooks, testhooks.MatchingLBForceWritePartition); ok { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Another good use will be the case when test needs to continue only after specific code line is reached somewhere deep inside server (like in almost all Update tests, I need to make sure that Update actually reached the server and added to the registry before moving forward). In this case, hook can unblock the channel which is awaited in test code.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would name this file key.go
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I really like the idea! Yes, we should use it with caution when there is no better way, but it really opens new horizons in testing.
…7097) ## What changed? <!-- Describe what has changed in this PR --> Make WorkflowIdConflictPolicy TerminateExisting follow the WorkflowIdReusePolicy after an _unsuccessful_ termination. NOTE: The frontend change was made in #7099 ## Why? <!-- Tell your future self why have you made these changes --> When the termination from TerminateExisting fails, the user would expect the WorkflowIdReusePolicy to be applied as the Workflow is not running. ## How did you test it? <!-- How have you verified this change? Tested locally? Added a unit test? Checked in staging env? --> Well ... there's no way to test this well right now. There are no unit tests; and the functional test cannot be written without the use of [testhooks](#6938) since there is no other way to simulate the race condition here. I manually added a `sync.Once` into the code that terminates the workflow and can confirm the expected behavior. ## Potential risks <!-- Assuming the worst case, what can be broken when deploying this change to production? --> ## Documentation <!-- Have you made sure this change doesn't falsify anything currently stated in `docs/`? If significant new behavior is added, have you described that in `docs/`? --> ## Is hotfix candidate? <!-- Is this PR a hotfix candidate or does it require a notification to be sent to the broader community? (Yes/No) -->
What changed?
Why?
To write integration/functional tests that require tweaking behavior of code under test, without affecting non-test builds.
Potential risks
Hooks are disabled by default, so there should be zero risk to production code, and zero overhead (assuming the Go compiler can do very basic inlining and dead code elimination).
The downside is that functional tests now have to be run with
-tags test_dep
everywhere.