Skip to content

feat: Add Bytespider (Bytedance) to web crawler filter#2747

Merged
AbhiPrasad merged 3 commits intomasterfrom
abhi-web-crawler-bytedance
Nov 21, 2023
Merged

feat: Add Bytespider (Bytedance) to web crawler filter#2747
AbhiPrasad merged 3 commits intomasterfrom
abhi-web-crawler-bytedance

Conversation

@AbhiPrasad
Copy link
Copy Markdown
Contributor

@AbhiPrasad AbhiPrasad commented Nov 20, 2023

In a recent tweet we were pointed to Bytespider, the web crawler by Bytedance (TikTok) causing noise for Sentry issues on peoples sites: https://twitter.com/ThisIsSingleton/status/1726387476317773878

Upon doing some research I found that this has been going on for the last couple months, and all the errors generated were noisy and unactionable

Given this seems to be a problematic web crawler (that also doesn't respect robots.txt) and that the problem has existed for a couple months now, let's add Bytespider to the default web crawler filter.

@AbhiPrasad AbhiPrasad marked this pull request as ready for review November 20, 2023 22:47
@AbhiPrasad AbhiPrasad requested a review from a team November 20, 2023 22:47
Copy link
Copy Markdown
Contributor

@olksdr olksdr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

@AbhiPrasad could you, please, add proper PR description before merging. Thanks!

@AbhiPrasad
Copy link
Copy Markdown
Contributor Author

Updated PR description, thanks for the review!

@AbhiPrasad AbhiPrasad enabled auto-merge (squash) November 21, 2023 17:33
@AbhiPrasad AbhiPrasad merged commit d7cbfc2 into master Nov 21, 2023
@AbhiPrasad AbhiPrasad deleted the abhi-web-crawler-bytedance branch November 21, 2023 17:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants