Wikipedia is giving AI developers its data to fend off bot scrapers (theverge.com)
5 points by ambigious7777 67 days ago | 1 comment



Kaggle certainly seems like a good route for this: it makes things easy for the many people who merely want Wikipedia data, and they will now follow the path of least resistance to get it.

I doubt it will discourage the true large-scale bad actors, for whom Wikipedia is only a tiny subset of what they are trying to download and who are sufficiently well-resourced that they can't be bothered to special-case it.

It'll be interesting to see how this plays out.



