2025-03-02 19:06:17 lesigh
2025-03-02 19:54:03 what's happening? very colorful here
2025-03-02 19:55:26 Lots of scraping, but I'm not sure what exactly is causing the load issues
2025-03-02 19:56:03 Tried increasing the number of workers, but that just increases the number of requests that can be handled
2025-03-02 19:56:25 Not sure if it's just plain crawling that's the issue, or that there are specific requests which are heavy
2025-03-03 09:37:04 oh, git.a.o is 502
2025-03-03 09:38:41 ikke: any particular ip ranges that are guilty?
2025-03-03 10:07:13 pj: I see some ranges making the most requests, but I'm not entirely sure those requests cause most of the load
2025-03-03 10:07:28 I need to quantify that somehow
2025-03-03 10:14:39 there's been an ongoing problem for gitea/forgejo instances where bots keep scraping git archive tarballs, which blows up their git archive cache
2025-03-03 10:16:14 an example from the admin of my fedi instance: https://donotsta.re/notice/AreSNZlRlJv73AW7tI
2025-03-03 10:23:42 I may start sending the access logs to loki, which may make it easier to analyse the requests
2025-03-03 11:10:40 not sure if we could do something like this? https://fosstodon.org/@dalias@hachyderm.io/114055514402836782
2025-03-03 11:16:55 I'm already doing something like that with nginx
2025-03-03 11:17:10 well, not a download speed limit, but more of a rate limit delay
2025-03-03 11:17:57 But understanding where the slowness comes from is step 1
2025-03-03 15:17:10 if you find out it's from crawlers, it's a bit of whack-a-mole listing ips, but even where i see site operators doing that, they drop requests instead of returning something, which probably just gets the crawl job moved to a different queue/ip/asn. i personally favor returning 402.
2025-03-03 15:17:31 in any case site operators need to get better about working together because this is war
2025-03-03 15:27:54 invoked: agree
2025-03-04 00:33:51 From Trusted and Vouched Dealers...
(full message at )
2025-03-04 00:34:26 pj
2025-03-04 00:34:34 :3
2025-03-04 00:34:44 i can't do anything, no op here
2025-03-04 00:34:58 oh damn
2025-03-04 00:38:23 maybe I could reactivate my mjolnir instance that I used for lapce
2025-03-04 00:38:39 would make it easier to ban across all channels
2025-03-04 00:39:31 they are all coming from :matrix.org, if they could just spend their money right and ban them immediately
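
Note on the 2025-03-03 question of quantifying which ranges actually cause the load (as opposed to just making the most requests): a minimal sketch, assuming the access log can be exported with one request per line as `<client_ip> <request_time_seconds> <path>`. That line layout, the function name `load_by_range`, and the /24 and /64 grouping are all assumptions for illustration, not the setup discussed in the channel.

```python
import ipaddress
from collections import defaultdict


def load_by_range(lines, prefix_len=24):
    """Sum request count and total request time per client network.

    Assumes each line looks like '<client_ip> <request_time_seconds> <path>'.
    This layout is hypothetical; adapt the parsing to the real log format.
    """
    totals = defaultdict(lambda: [0, 0.0])  # network -> [requests, seconds]
    for line in lines:
        parts = line.split(maxsplit=2)
        if len(parts) < 2:
            continue  # skip malformed lines
        try:
            addr = ipaddress.ip_address(parts[0])
            seconds = float(parts[1])
        except ValueError:
            continue  # skip lines with an unparsable IP or duration
        # IPv6 clients are grouped by /64 instead of the IPv4 prefix length.
        plen = prefix_len if addr.version == 4 else 64
        net = ipaddress.ip_network(f"{addr}/{plen}", strict=False)
        totals[net][0] += 1
        totals[net][1] += seconds
    # Sort by total time spent serving the range, not by request count.
    return sorted(
        ((str(net), count, secs) for net, (count, secs) in totals.items()),
        key=lambda row: row[2],
        reverse=True,
    )
```

Sorting by summed request time rather than by request count is the point: a range with few but heavy requests (for example archive downloads) would rank above a range with many cheap ones.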