[ENG-7873] CLONE - SPAM - When Hamming a Spammed user, preprints and registrations remain private #11125

antkryt · 2025-05-02T13:32:10Z

Purpose

Fix the was_public state recorded when spam is flagged.

Changes

Correct the check for whether a node was public when flag_spam is called
Use the earliest confirm/flag spam log, rather than the latest one, to check whether the node was public
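To illustrate why the earliest log matters, here is a minimal sketch using a toy model rather than the OSF code. `ToyNode`, `spam_logs`, `was_public_latest` and `was_public_earliest` are hypothetical names, and the assumption that every spam action records the node's visibility at call time is a simplification.

```python
from dataclasses import dataclass, field


@dataclass
class ToyNode:
    """Hypothetical stand-in for a node; not the OSF model."""
    is_public: bool
    # was_public values written by spam actions, oldest first
    spam_logs: list = field(default_factory=list)

    def confirm_spam(self):
        self.spam_logs.append(self.is_public)  # record visibility first
        self.is_public = False                 # then hide the node

    def flag_spam(self):
        # Toy assumption: flagging records visibility the same way.
        self.spam_logs.append(self.is_public)


def was_public_latest(node):
    # Old behaviour in this sketch: trust the most recent spam log.
    return node.spam_logs[-1]


def was_public_earliest(node):
    # New behaviour in this sketch: trust the first spam log of the run.
    return node.spam_logs[0]


node = ToyNode(is_public=True)
node.confirm_spam()  # logs was_public=True, node becomes private
node.flag_spam()     # logs was_public=False, the node is already private
```

In this toy run the latest log reports that the node was private, so a later ham would leave it private, while the earliest log still reflects the original public state.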

Fix a TypeError when checking the archiving status of stuck registrations (unrelated to ticket ENG-7873, but it is a one-line change, permissible_addons = set(permissible_addons), so no additional testing is required)
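The surrounding code is not shown in this PR, so the following is only a guess at the kind of failure such a one-line conversion avoids: Python's set operators reject a list operand. The addon names and the `configured_addons` variable are hypothetical.

```python
# Hypothetical data; the real values in the archiving check are not shown here.
permissible_addons = ['osfstorage', 'dropbox']  # a list, not a set
configured_addons = {'osfstorage', 'github'}

try:
    configured_addons - permissible_addons      # set - list is not supported
    raised = False
except TypeError:
    raised = True

# Converting to a set first makes the same expression valid.
permissible_addons = set(permissible_addons)
extra = configured_addons - permissible_addons
```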

QA Notes

I couldn't reproduce this issue via the UI, but the combination confirm_spam() -> flag_spam() -> ... breaks this feature. I'm not sure this is exactly what's happening in our case, but since flag_spam() is used by the automatic spam checks during node/preprint updates, it's quite possible.

Documentation

Side Effects

Ticket

https://openscience.atlassian.net/browse/ENG-7873

brianjgeiger

Okay, this is a tricky one. Could you add a test that:

Spams a public project
Hams the project to make it public
Makes the project private
Spams the private project
Hams the project

The project should be private when that's all done.

antkryt · 2025-05-06T18:29:54Z

Note that logging the privacy change is crucial in this context.
For example, the following test will fail:

    def test_multiple_privacy_changing(self, project):
        project.set_privacy('public')
        assert project.is_public

        project.confirm_spam()
        assert not project.is_public

        project.confirm_ham()
        assert project.is_public

        project.set_privacy('private', log=False)  # log=True is crucial!!!
        assert not project.is_public

        project.confirm_spam()
        assert not project.is_public

        project.confirm_ham()
        assert not project.is_public
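A rough way to see why the missing log matters is the toy model below. It is not the OSF implementation: `ToyProject`, the action names, and the rule "use the earliest spam log written after the most recent logged privacy change" are assumptions inferred from this discussion.

```python
from dataclasses import dataclass, field

SPAM_ACTIONS = {'confirm_spam', 'flag_spam'}
PRIVACY_ACTIONS = {'made_public', 'made_private'}


@dataclass
class ToyProject:
    """Hypothetical stand-in for a project; not the OSF model."""
    is_public: bool = False
    logs: list = field(default_factory=list)  # (action, params), oldest first

    def set_privacy(self, permissions, log=True):
        self.is_public = permissions == 'public'
        if log:
            action = 'made_public' if self.is_public else 'made_private'
            self.logs.append((action, {}))

    def confirm_spam(self):
        self.logs.append(('confirm_spam', {'was_public': self.is_public}))
        self.set_privacy('private', log=False)

    def confirm_ham(self):
        # Assumed rule: walk back to the last logged privacy change and
        # use the earliest spam log written after it.
        window = []
        for action, params in reversed(self.logs):
            if action in PRIVACY_ACTIONS:
                break
            if action in SPAM_ACTIONS:
                window.append(params)
        if window and window[-1]['was_public']:
            self.set_privacy('public', log=False)


def run_scenario(log_privacy_change):
    project = ToyProject()
    project.set_privacy('public')
    project.confirm_spam()
    project.confirm_ham()
    project.set_privacy('private', log=log_privacy_change)
    project.confirm_spam()
    project.confirm_ham()
    return project.is_public
```

Under these assumptions, `run_scenario(True)` leaves the project private, while `run_scenario(False)` makes it public again because the stale spam log from the first round is still inside the window.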

There are some suspicious places where the privacy log is not created:

osf.io/api/preprints/serializers.py, line 453 in f7737c5:

    preprint.set_privacy('public', log=False, save=True, ignore_permission=ignore_permission)

osf.io/osf/models/preprint.py, line 886 in f7737c5:

    self.set_privacy('public', log=False, save=False, **kwargs)

osf.io/osf/models/registrations.py, line 587 in f7737c5:

    node.set_privacy(

osf.io/osf/models/registrations.py, line 840 in f7737c5:

    node.set_privacy('public', auth=None, log=False)

osf.io/osf/models/sanctions.py, line 888 in f7737c5:

    registration.set_privacy('public', auth=None, log=False)

These seem like edge cases, so I left them untouched, since I don't know whether this behavior is intended.

brianjgeiger · 2025-05-07T13:00:09Z

@antkryt So the spam system doesn't set a flag when the object is spammed to say whether it was public before the spamming happened or not? It's all just relying on logs?

antkryt · 2025-05-08T14:07:32Z

@brianjgeiger Correct. Relying on logs was proposed and implemented in this ticket to fix the multiple-spam scenario. Some alternatives:

add a was_public_at_spam (or similarly named) field to the AbstractNode and Preprint models (easy to implement, but hard to extend or modify in the future)
add something like a SpamContext table with a OneToOne relationship (harder to implement, but easy to extend)
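As a rough sketch of the first alternative, the snapshot could be stored on the object itself. `ToyResource`, `mark_spam` and `mark_ham` are hypothetical names, and a plain dataclass stands in for the Django models.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyResource:
    """Hypothetical stand-in for AbstractNode or Preprint."""
    is_public: bool
    # None means "not currently marked as spam"
    was_public_at_spam: Optional[bool] = None

    def mark_spam(self):
        # Snapshot only on the first spam event of a run, so repeated
        # flags cannot overwrite the original visibility.
        if self.was_public_at_spam is None:
            self.was_public_at_spam = self.is_public
        self.is_public = False

    def mark_ham(self):
        if self.was_public_at_spam:
            self.is_public = True
        self.was_public_at_spam = None  # reset for the next run


# The scenario requested in the review, replayed on the toy model.
resource = ToyResource(is_public=True)
resource.mark_spam()
resource.mark_ham()
public_after_first_ham = resource.is_public

resource.is_public = False  # the owner makes it private
resource.mark_spam()
resource.mark_spam()        # a repeated flag does not change the snapshot
resource.mark_ham()
public_after_second_ham = resource.is_public
```

This variant does not depend on privacy logs at all, which is the property that motivated the alternatives in this comment.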

We can implement was_public_at_spam separately for AbstractNode and Preprint, and just check something like "if the status isn’t initial, pending, or rejected, then it was public" as you suggested in the ticket comment section. That technically works, but honestly feels a bit unreliable and messy.
A much better approach (and not just for spam) would be to have a dynamic is_public property. That way we can centralize the logic "if it’s not initial, pending, or rejected, then it’s public" and reuse it anywhere we need to know whether something was or can be public.

brianjgeiger · 2025-05-08T14:25:17Z

@antkryt Okay, I've been chatting with Product on the Jira ticket, and we're going to make this so that, regardless of logs or whatever, if a preprint is not in initial, pending, or rejected moderation state, then it should be public. We'll want to make sure that preprints that were never made public don't suddenly become public, but I think they'll be in the initial state, even if they aren't moderated. But please verify that.
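The rule described above could look roughly like the sketch below. `ToyPreprint`, `machine_state` as the field name, the `'accepted'` example state, and `restore_visibility_after_ham` are assumptions made for illustration, not the code merged in this PR.

```python
from dataclasses import dataclass

# States named in the discussion as "should not be public".
NON_PUBLIC_STATES = frozenset({'initial', 'pending', 'rejected'})


@dataclass
class ToyPreprint:
    """Hypothetical stand-in for a preprint."""
    machine_state: str
    is_public: bool = False


def restore_visibility_after_ham(preprint):
    # Visibility is derived from the moderation state alone,
    # independent of any spam or privacy logs.
    preprint.is_public = preprint.machine_state not in NON_PUBLIC_STATES
    return preprint.is_public


accepted = ToyPreprint(machine_state='accepted')
never_published = ToyPreprint(machine_state='initial')
```

The second object covers the concern raised above: a preprint still in the initial state stays private after ham, provided that never-published preprints really do remain in that state.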

antkryt · 2025-05-08T14:31:53Z

@brianjgeiger what about registrations and projects?

brianjgeiger · 2025-05-08T14:49:50Z

@antkryt Projects we'll continue to do the way we are. We might do registrations similarly to preprints, but there are more states and it's not as urgent, so let's leave registrations for the moment and revisit that if necessary later.


antkryt added 2 commits May 1, 2025 20:15

correct was_public state when flag_spam; minor fixes

a3ea4d4

check if node was public before changing privacy

525fbd5

antkryt changed the base branch from feature/pbs-25-08 to feature/pbs-25-09 May 2, 2025 13:38

brianjgeiger requested changes May 6, 2025


add test

0d85515

brianjgeiger approved these changes May 12, 2025


brianjgeiger changed the base branch from feature/pbs-25-09 to feature/pbs-25-10 May 16, 2025 13:05

brianjgeiger merged commit cbff71f into CenterForOpenScience:feature/pbs-25-10 May 16, 2025
6 checks passed
