Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Nice! Thank you!

I just wonder if browsers will limit the amount of characters in URLs.

If memory serves me, there was a limit. But it might be high enough to work fro most pages.



It's around 8KB now – so text bigger than 8 thousand characters will return: "414 Request-URI Too Large".

Anyway the document.body.innerText contains all things on the site, including links, menus, buttons etc just 1 per newline. LLM will only recognise if it previously scanned the same website and it did not change much since the last training set. Some arbitrary websites it will not recognise this way and start hallucinating one because innerText removes all the structure from it.


Modern browsers are not an issue here, e.g. chromium allows 2MB; the issue is with web server's limits.


Indeed, I'm getting Cloudflare error "414 Request-URI Too Large" for this HN post which isn't large.

But the URL bar was not the problem.




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: