
Conversation

@bdach (Collaborator) commented Oct 28, 2025

RFC. Probably maybe closes #52.

This is the laziest possible version of a change that could possibly maybe come close to the parameters of "checking client versions" outlined in the above issue.

Why is it lazy? Well, there's a bunch of problems here and I don't know how to solve any of them, so this is supposed to be the start of a conversation. The problems are listed below.

  1. This change prevents execution of any method of any hub if the client version does not match. In particular, this includes the spectator hub, which is now responsible for recording replays. Therefore, if this is deployed as is, old clients will potentially be able to submit scores that don't have replays.

     This part is possibly irrelevant if it is ensured that all builds have consistent values of `allow_ranking` and `allow_bancho`, i.e. both should be consistently `true` or `false` and not mixed.

  2. Retrieving the client hash depends on connecting to the metadata hub. If the client can somehow connect to all hubs except metadata, they will have online functionality blocked.

     This can maybe be resolved by having a redundant copy of the hash information *inside* the filter implemented here. Implemented in dcfd65f.

  3. This can only really throw on attempting to execute any hub operation. Throwing on connect is not possible because the filter executes independently for each hub that we maintain. Therefore:

     - In the spectator and multiplayer hubs, because of point (2) (relying on the metadata hub to populate user state), it is not guaranteed that we can *read* the user state to get the user's client hash.

     - In the metadata hub, the user client hash *can* be read reliably if it is checked *after* the hub's `OnConnectedAsync()` has run, *but* throwing inside `OnConnectedAsync()` causes the client to disconnect from the metadata hub due to the error, and then get stuck in a loop of trying to re-connect every 3 seconds, which seems... let's call it 'suboptimal'?

  4. Because of how simple this is (throw on every operation), this could get pretty spammy client-side. In testing, the client handles this spam *okay* by limiting the count of notifications emitted... as long as it actually handles the errors. More on this later; see the client-side PR (Ensure all invocations of spectator server hub methods have their errors observed, osu#35488).

  5. The user is not forcibly disconnected from the API, and is instead left in a weird half-alive state where they can use API-dependent functions but not the realtime stuff. Adding a forcible logout would require client changes, but clients that are *right now* considered old won't abide by those changes, for obvious reasons (we can't ship extra code to already-deployed builds).

I think that's all of the caveats but I might be forgetting some at this point.

Test coverage can be added, but (a) I'm not sure how much of this is going to end up in the trash, and (b) the code is so dead simple that you may as well go and test full stack (and that's arguably the only *useful* sort of testing here), so I'm not bothering until I'm sure it's worth the admission price.
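For the sake of discussion, here is a minimal sketch of the general shape such a filter could take using SignalR's `IHubFilter`. This is simplified and illustrative, not the literal diff; in particular `IClientVersionChecker` is an invented stand-in for whatever resolves the caller's build and checks `allow_bancho`:

```csharp
using System;
using System.Threading.Tasks;
using Microsoft.AspNetCore.SignalR;

// Illustrative sketch only: a hub filter that rejects every hub method
// invocation from a client whose build is not allowed online.
public class ClientVersionHubFilter : IHubFilter
{
    private readonly IClientVersionChecker checker;

    public ClientVersionHubFilter(IClientVersionChecker checker)
    {
        this.checker = checker;
    }

    public async ValueTask<object?> InvokeMethodAsync(
        HubInvocationContext invocationContext,
        Func<HubInvocationContext, ValueTask<object?>> next)
    {
        // The filter runs independently for every method of every hub (point 3):
        // there is no single "on connect" choke point shared by all hubs.
        if (!await checker.IsClientVersionAllowedAsync(invocationContext.Context))
            throw new HubException("This client version is no longer allowed online. Please update.");

        return await next(invocationContext);
    }
}

// Hypothetical abstraction: resolves the caller's build (e.g. by client hash)
// and checks its allow_bancho flag.
public interface IClientVersionChecker
{
    Task<bool> IsClientVersionAllowedAsync(HubCallerContext context);
}
```

A filter like this would be registered globally (e.g. `services.AddSignalR(options => options.AddFilter<ClientVersionHubFilter>())`), which is exactly why it fires per invocation rather than once per connection.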

@bdach requested a review from peppy October 28, 2025 12:44
@bdach self-assigned this Oct 28, 2025
@bdach moved this from Next up to Pending Review in osu! untitled project Oct 28, 2025
bdach added a commit to bdach/osu that referenced this pull request Oct 28, 2025
Ensure all invocations of spectator server hub methods have their errors observed

Fell out when attempting
ppy/osu-server-spectator#346.

Functionally, if a true non-`HubException` is produced via an invocation
of a spectator server hub method, this doesn't really do much - the
error will still log as 'unobserved' due to the default handler, it will
still show up on sentry, etc. The only difference is that it'll get
handled via the continuation installed in `FireAndForget()` rather than
the `TaskScheduler.UnobservedTaskException` event.

The only real case where this is relevant is when the server throws
`HubException`s, which will now instead bubble up in a more
human-readable form. That is relevant to the aforementioned PR because
that one makes any hub method potentially throw a `HubException` if the
client version is too old.

Obviously this does nothing for the existing old clients.
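For reference, the pattern the commit message describes boils down to something like the following. This is a sketch only; the exact signature of the client's `FireAndForget()` helper is an assumption here:

```csharp
using System;
using System.Threading.Tasks;
using Microsoft.AspNetCore.SignalR; // for HubException

public static class TaskExtensions
{
    // Sketch of a FireAndForget() helper: attaching a continuation that runs
    // only on fault *observes* the task's exception, so it is handled here
    // instead of surfacing via the TaskScheduler.UnobservedTaskException event.
    public static void FireAndForget(this Task task, Action<Exception>? onError = null)
    {
        task.ContinueWith(t =>
        {
            Exception e = t.Exception!.GetBaseException();

            // HubExceptions carry a server-provided, human-readable message
            // (such as "your client is too old") worth surfacing to the user;
            // anything else keeps flowing to logging/reporting as before.
            onError?.Invoke(e);
        }, TaskContinuationOptions.OnlyOnFaulted);
    }
}
```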
smoogipoo pushed a commit to ppy/osu that referenced this pull request Oct 29, 2025
Ensure all invocations of spectator server hub methods have their errors observed (#35488)

Comment on lines 52 to 58
```csharp
var build = await memoryCache.GetOrCreateAsync(hash, async _ =>
{
    using (var db = databaseFactory.GetInstance())
        return await db.GetBuildByHashAsync(hash);
});

return build?.allow_bancho == true;
```
Contributor:
What's the lifetime on these objects? Do we care about toggling `allow_bancho` without a new spectator startup?

If so then you probably want an absolute expiry window here.

Member:
I think we probably do, yeah. I'd say a 10-30 minute refresh is fine.

Collaborator (author):
Set to 30 minutes in 8c96ae4
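Presumably the fix looks something along these lines, setting an absolute expiry on the cache entry. This is a sketch only; the actual code in 8c96ae4 may differ:

```csharp
var build = await memoryCache.GetOrCreateAsync(hash, async entry =>
{
    // Expire cached build info after 30 minutes, so that allow_bancho can be
    // toggled without requiring a new spectator server startup.
    entry.AbsoluteExpirationRelativeToNow = TimeSpan.FromMinutes(30);

    using (var db = databaseFactory.GetInstance())
        return await db.GetBuildByHashAsync(hash);
});

return build?.allow_bancho == true;
```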

Comment on lines 45 to 47
```csharp
string? hash;
using (var item = await metadataStore.GetForUse(callerContext.GetUserId()))
    hash = item.Item?.VersionHash;
```
Contributor:
I think point (2) in the OP is a big one, and one that can easily occur. Agree with duplicating it in here.

Collaborator (author):
Addressed in dcfd65f
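For illustration, the fallback might take roughly this shape. This is a sketch only; dcfd65f's actual implementation may differ, and `connectionHashes` is an invented name for the filter's redundant copy:

```csharp
// Prefer the hash from the metadata hub's user state, but fall back to a copy
// tracked by the filter itself, covering clients that reached other hubs
// without ever connecting to the metadata hub (point 2 in the OP).
string? hash;

using (var item = await metadataStore.GetForUse(callerContext.GetUserId()))
    hash = item.Item?.VersionHash;

hash ??= connectionHashes.GetValueOrDefault(callerContext.ConnectionId);
```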

@bdach marked this pull request as ready for review October 29, 2025 08:32
@peppy (Member) commented Dec 11, 2025

Revisiting this, one caveat is that we devs will no longer be able to connect to the live environment from locally built releases. Bancho gets around this by adding admin overrides for client hash checks.

@bdach thoughts on whether we want to do that here? Or just be like, "we shouldn't be doing that in the first place and should be using staging instead"?

@bdach (Collaborator, author) commented Dec 11, 2025

I'd be fine with adding some allowlist type facility for our own use in times of need if you are.

@peppy (Member) commented Dec 11, 2025

Let's go in that direction then. Either an env var list of groups to include, or just one group should be enough (11 for developers on production).
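A minimal sketch of that direction, following the `databaseFactory` pattern from the snippets above; the env var name and the `IsUserInGroupAsync` query helper are both invented for illustration:

```csharp
// Hypothetical single-group bypass: members of the configured group
// (e.g. 11, developers on production) skip the client version check.
private static readonly string? bypass_group_id =
    Environment.GetEnvironmentVariable("CLIENT_CHECK_BYPASS_GROUP_ID");

private async Task<bool> canBypassVersionCheckAsync(int userId)
{
    if (!int.TryParse(bypass_group_id, out int groupId))
        return false;

    using (var db = databaseFactory.GetInstance())
        return await db.IsUserInGroupAsync(userId, groupId);
}
```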

@peppy merged commit 82900d2 into ppy:master Dec 12, 2025
2 checks passed
github-project-automation bot moved this from Pending Review to Done in osu! untitled project Dec 12, 2025
@bdach deleted the client-version-check-2 branch December 12, 2025 08:21