Hi, I've been having trouble with bots scraping my site REALLY hard, ignoring robots.txt, etc.
Here's one from this morning that locked up my VPS:
157.55.39.206 - - [24/Aug/2023:08:18:20 -0400] "GET /directory/page-7?start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150&start=150 HTTP/1.1" 500 - "-" "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm) Chrome/103.0.5060.134 Safari/537.36"
Is there a way using Akeeba Admin Tools that I can strip this type of thing back to the base URL:
/directory/page-7
Without affecting and breaking parts of the site that I don't want. This seems to only apply to K2 component, Author and Category views/layouts.
I have a forum discussion going with the K2 developers where they share a URL normalizer plugin and a code snippet:
https://www.joomlaworks.net/forum/k2-en/63421-duplicate-k2-category-listing
But it didn't fix the issue.
Any tips or suggestions are much appreciated.
Thanks!
-Nate