Block traffic from bots and crawlers #137
Replies: 5 comments
-
fwiw, you could look into how https://github.com/ankane/ahoy does it... I think they defer to JS bot detection a lot
-
Some more, maybe better suggestions for where to get IP blocklists:
The second one aggregates many smaller lists. We would have to go through each one to figure out exactly what it blocks and what sort of quality we can expect. If they're not kept up to date well enough, we risk blocking legit traffic instead.
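
To make the blocklist idea concrete, here is a rough sketch of matching a visitor IP against a list of CIDR ranges. It is not how Plausible does it. The names `BLOCKED_RANGES`, `parse_ranges` and `is_blocked` are made up for illustration, and the ranges are reserved documentation addresses, not entries from any real list.

```python
import ipaddress

# Hypothetical blocklist entries. These are reserved documentation ranges
# (RFC 5737 / RFC 3849), standing in for ranges a real list would supply.
BLOCKED_RANGES = ["192.0.2.0/24", "198.51.100.0/24", "2001:db8::/32"]


def parse_ranges(cidrs):
    """Parse CIDR strings into network objects once, at load time."""
    return [ipaddress.ip_network(cidr, strict=False) for cidr in cidrs]


def is_blocked(ip, networks):
    """Return True if the address falls inside any blocked network."""
    addr = ipaddress.ip_address(ip)
    # Only compare addresses and networks of the same IP version.
    return any(addr.version == net.version and addr in net for net in networks)


NETWORKS = parse_ranges(BLOCKED_RANGES)

print(is_blocked("192.0.2.17", NETWORKS))
print(is_blocked("203.0.113.5", NETWORKS))
```

A linear scan like this is fine for a few hundred ranges. An aggregated list with many thousands of entries would probably want a prefix tree or sorted intervals, and the stale-list concern above still applies no matter how the lookup is done.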
-
Some implementation notes: https://paraxial.io/blog/cloud-ips
-
Update: We're now filtering traffic from data centers too. An overview of our bot filtering is here: https://plausible.io/docs/dashboard-faq#does-plausible-exclude-known-bots-and-spam-traffic
-
"I see many Crawler and Bots in my page views. Is it possible to exclude those?
Yes, bot detection can be improved significantly. Currently I use an open source solution that only looks at the User-Agent header. I’m planning to add detection based on known bot/crawler IP addresses and referrer spammers."
(imported from the old roadmap)
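
For reference, the User-Agent check and the planned referrer-spam check from the roadmap quote could look roughly like the sketch below. This is an illustration under assumptions, not the open source solution mentioned above: the pattern, the `spam.example` domain, and the function names are all hypothetical, and treating a missing User-Agent as a bot is a choice made here, not something the quote states.

```python
import re
from urllib.parse import urlparse

# Hypothetical pattern; real detection libraries ship far longer lists.
BOT_UA_PATTERN = re.compile(r"bot|crawl|spider|slurp|headless", re.IGNORECASE)

# Hypothetical referrer-spam domains (placeholder, not a real list).
SPAM_REFERRER_DOMAINS = {"spam.example"}


def is_bot_user_agent(user_agent):
    """Flag requests whose User-Agent is missing or matches a bot keyword."""
    if not user_agent:
        return True  # assumption: no User-Agent is treated as automated
    return bool(BOT_UA_PATTERN.search(user_agent))


def is_spam_referrer(referrer):
    """Flag referrers whose host is a listed domain or one of its subdomains."""
    if not referrer:
        return False
    host = urlparse(referrer).hostname or ""
    return any(
        host == domain or host.endswith("." + domain)
        for domain in SPAM_REFERRER_DOMAINS
    )


print(is_bot_user_agent("Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"))
print(is_spam_referrer("https://sub.spam.example/landing"))
```

Keyword matching on the User-Agent only catches bots that identify themselves, which is why the quote plans to add IP- and referrer-based checks on top.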