More

edoloughlin · 2025-04-25T18:16:12 1745604972

> I also don't have any concrete examples jumping to mind

I do (and I may get publicly shamed and shunned for admitting I do such a thing): figuring out how to fix parenthesis matching errors in Clojure code that it's generated.

One coding agent I've used is so bad at this that it falls back to rewriting entire functions and will not recognise that it is probably never going to fix the problem. It just keeps burning rainforest trying one stupid approach after another.

Yes, I realise that this is not a philosophical question, even though it is philosophically repugnant (and objectively so). I am being facetious and trying to work through the PTSD I acquired from the above exercise.

edoloughlin · 2025-04-25T18:04:16 1745604256

Given that Gemini seems to have frequent availability issues, I wonder if this is a strategy to offload low-hanging fruit (from a human-effort pov) to the user. If it is, I think that's still kinda impressive.

edoloughlin · 2025-03-25T22:24:22 1742941462

I'm being trite, but if you can detect an AI bot, why not just serve them random data? At least they'll be sharing some of the pain they inflict.

nosianu · 2025-03-25T22:35:41 1742942141

You mean like this?

[2025-03-19] https://blog.cloudflare.com/ai-labyrinth/

> Trapping misbehaving bots in an AI Labyrinth

> Today, we’re excited to announce AI Labyrinth, a new mitigation approach that uses AI-generated content to slow down, confuse, and waste the resources of AI Crawlers and other bots that don’t respect “no crawl” directives.

barbazoo · 2025-03-26T05:27:25 1742966845

What a colossal waste of energy

fc417fc802 · 2025-03-26T06:09:40 1742969380

> No real human would go four links deep into a maze of AI-generated nonsense.

... I would. Out of curiosity and amusement I would most definitely do that. Not every time, and not many times, but I would definitely do that one or a few times.

Guess I'm getting added to (yet another) Cloudflare naughty list.

> It is important to us that we don’t generate inaccurate content that contributes to the spread of misinformation on the Internet, so the content we generate is real and related to scientific facts, just not relevant or proprietary to the site being crawled.

In that case wouldn't it be faster and easier to restyle the CSS of wikipedia pages?

mbesto · 2025-03-26T02:12:58 1742955178

Wait, what happens when a Cloudflare Worker AI meets an AI Labyrinth?!

ronsor · 2025-03-26T03:02:17 1742958137

Cloudflare deletes itself.

GoblinSlayer · 2025-03-26T21:46:04 1743025564

Rise of the machines.

noirscape · 2025-03-25T22:28:56 1742941736

Bandwidth isn't free, not at the volume these crawlers scrape at; serving them random data (for example by leading them down an endless tarpit of links that no human would end up visiting) would still incur bandwidth fees.

Also it's not identifiable AI bot traffic that's detected (they mask themselves as regular browsers and hop between domestic IP addresses when blocked), it's just really obviously AI scraper traffic in aggregate: other mass crawlers have no benefit from bringing down their host sites, except for AI.

A search engine has nothing if it brings down the site they're scraping (and has everything to gain from identifying itself as a search engine to try and get favorable request speeds - the only thing they'd need to check is if the site in question isn't serving different data, but that's much cheaper), same with an archive scraper and those two are pretty much the main examples I can think of for most scraping traffic.

BlarfMcFlarf · 2025-03-25T23:46:22 1742946382

Hmm, maybe you could zipbomb the data? Aka, you send a few kilobytes of compressed data that expands to many gigabytes on client side?

gus_massa · 2025-03-26T00:20:52 1742948452

Reverse Slowloris?

https://en.wikipedia.org/wiki/Slowloris_(cyber_attack)

miohtama · 2025-03-26T06:59:04 1742972344

For Cloudflare, bandwidth is practically free.

cyanydeez · 2025-03-26T00:43:21 1742949801

arnt a lot of these bots now actively loading javascript? you could just load a simple script that does the job .

edoloughlin · 2025-03-29T12:19:13 1743250753

If they agree to mine crypto for you then you send valid data. Is this a win-win?

(I feel I need to preemptively state that I am being sarcastic.)

charcircuit · 2025-03-25T22:58:19 1742943499

>Bandwidth isn't free

Via peering agreements it is.

rcxdude · 2025-03-25T23:45:40 1742946340

Not something available to smaller sites

charcircuit · 2025-03-26T01:15:14 1742951714

Yes, it is. They transitively get it via the agreements the smaller site's host's host makes. Or via services like Cloudflare.

xena · 2025-03-26T00:49:20 1742950160

What button do I click in the AWS panel for that?

charcircuit · 2025-03-26T01:13:49 1742951629

There is no button. AWS is where you go to light money on fire.

xena · 2025-03-25T22:26:31 1742941591

You can detect the patterns in aggregate. You can't detect it easily at an individual request level.

bluGill · 2025-03-26T00:20:45 1742948445

In short if you get several million requests and expect to only get 100 you won't know which are the real requests and which are the AI ones - but it is obvious that the vast majority are AI.

jmpeax · 2025-03-26T00:29:28 1742948968

You skipped the last section "Tarpits and labyrinths: The growing resistance" of the article.

DecentShoes · 2025-03-26T01:56:39 1742954199

Random data? Why not "recipes" that just say "Bezos is a pedo" over and over ?

edoloughlin · 2025-01-30T14:15:22 1738246522

Isn't CUDA a pretty good moat for Nvidia?

edoloughlin · 2024-11-17T11:51:57 1731844317

I think finding a meaningful way recognise a significant contributor in a way that doesn’t really impact anyone is something to be encouraged. I imagine that most people would use the hex code anyway and only devs/designers would see the name in their tooling.

I can’t see how to apply logic to naming a colour. It’s fundamentally a perceptual and, dare I say it, emotional process.

I also think your comment is uncharitable and tone deaf.

edoloughlin · 2024-09-12T17:20:58 1726161658

Looks really nice on an initial view. Will explore in a bit more depth and check back later

edoloughlin · 2024-06-21T20:15:46 1719000946

> But I don't see how it's not authoritarian.

If this thread continues for a few more levels, I think you’ll end up justifying hiring your own private police force.

Ownership requires that a state exists to enforce your rights. There are tradeoffs with this arrangement, one of which is that the state gets to set boundaries/limits on how you can use the thing you own. Ideally, acting with the best interests of the population. This sometimes includes ensuring areas are off limits to transient inhabitants so that a society can develop.

edoloughlin · on Sept 9, 2023

The universe could also spontaneously stop existing[1] but I still put my seatbelt on when I get into my car. It’s about probability.

1. https://en.m.wikipedia.org/wiki/False_vacuum_decay

Xaiph_Rahci · on Sept 9, 2023

People who get existential crisis out of this can find relief in the fact that the Universe can expand faster than the decay. So, it is possible that decay never reaches your region of Space.

delecti · on Sept 9, 2023

They should also get relief in the fact that there doesn't appear to be any reason to think our universe is even subject to that "risk".

appplication · on Sept 10, 2023

> The effects could range from complete cessation of existing fundamental forces, elementary particles and structures comprising them, to subtle change in some cosmological parameters, mostly depending on the potential difference between true and false vacuum. Some false vacuum decay scenarios are compatible with survival of structures like galaxies and stars or even biological life while others involve the full destruction of baryonic matter or even immediate gravitational collapse of the universe

What an interesting wiki article. It’s incredible this is proposed as a sliding scale effect, where the tunable parameters on that scale are the fundamental physical constants of the universe itself.

roenxi · on Sept 9, 2023

Yes please lets focus on probability. And cost-benefit analysis. We'd be a lot closer to solving all sorts of problems, including the asteroid one and any climate-related ones, if people had thought about probabilities when dealing with nuclear power in the 80s and 90s.

Humans don't seem to be very good at symbolic reasoning though.

edoloughlin · on Sept 6, 2023

the EU's vision of tech is more restrictive

That's an interesting perspective, given that the legislation is aimed at opening up the big players' platforms/products to more competition by forcing them to eliminate restrictions.

seanmcdirmid · on Sept 6, 2023

I don’t think legislation can force them to just open up, it has to be something like “if you offer an X in the EU, it must be open”, and they can always decide not to offer the X in the long EU. In other words, a country (or group of countries) can set the rules of play, but they can’t make play mandatory. And this is probably partial; eg they don’t have to offer news linking while still offering other features.

kmeisthax · on Sept 6, 2023

No step on snek that's stepping on other snek.

Any freedom can be viewed in one of two ways:

- A set of boundaries that sovereign states are required to respect (Congress shall make no law), or...

- A set of competing interests that have to be balanced in order to maximize the desired freedom for the maximum number of people (your right to swing your fist ends where my face begins).

The US codified the former in its constitution, because the formation of the current US government was a struggle between the people who found the prior Articles of Confederation to be unworkable[0] and the people who didn't want to build British Empire 2: Electric Boogaloo. The compromise was that we'd codify very obvious things the government was not allowed to do based on what horribly unpopular things the British did to the colonies. This is why we have a strong constitutional prohibition against being forced to quarter soldiers in one's home, even though you rarely see it get used in court. This as applied to freedom of speech is the 1st Amendment, which covers basically every way that the right of freedom of speech had been curtailed in Britain up to that point[1].

However, because we have this very strong prohibition on the government restricting speech, people have misinterpreted it to mean that only the government can restrict speech. That is, if Comcast refuses to let you watch Netflix unless they get paid more for the traffic, that's Comcast's protected speech, and any government regulation to unblock Netflix is censorship. Or, alternatively, if someone decides to publish your personal information in response to a video you don't like, the resulting mob justice against you isn't censorship because it's not the right person holding the gun, and taking down your personal information would be censoring the person who published it.

This is, of course, absurd. But the US has exported and normalized this thinking beyond the narrow realm of constitutional law scholars. Right-wingers were able to successfully appropriate 'free speech' to mean 'turn every website into 4chan', and in response, liberals made the mistake of adopting the Comcast Argument I listed above. This is how we got arguments like "Twitter and Facebook can't censor because it's a private platform", even though it's a monopoly. Now that Musk owns Twitter and boosts the far-right in exchange for them protecting Musk's fragile ego, I fully expect this calculus to flip and you'll see right-wingers go back to the Comcast Argument while liberals call for regulation.

The EU's tech regulations would be condemned by the same logic. i.e. the EU should not be allowed to regulate big tech, because only sovereigns can censor, and censorship is so bad that we should not even censor the people engaged in censorship[2]. The boundaries argument ends up with a nominal right to free speech that is defacto privatized censorship, which is a problem the balancing argument does not have.

[0] for reasons largely similar to the problems the EU faces being an international union rather than a federal state

[1] I would also argue that it is a de-facto ban on "culture war issues", given that religion and wars over minor church schisms were the European culture war of the 1700s.

[2] https://alphahistory.com/nazigermany/goebbels-on-the-reichst...

edoloughlin · on Aug 11, 2023

The post doesn't mention RMS, GPL or the entire internet. Is everything ok?

foooorsyth · on Aug 11, 2023

Yawn. It's classic FSF attitude, and it's a perfect microcosm of GPLv3's goals of preventing things like locked-down hardware in the scope of a software license. In this case, the commenter I replied to thinks it wrong that users of OSM's data product, which has copyleft clauses in the scope of the data only, should be able to sell a closed-source application using that data. Which is of course not the will of the data product creators (the only people that have a real say in this), and is just annoying rms cult dogma cut and paste. For the record, Magic Earth is far and away the best OSM-based non-big-tech mapping product. It absolutely blows away GPLv3 products like OSMAnd. So yeah, it's annoying hearing people clamoring for its source to be opened, or soft-shaming the authors for trying to earn a real living with their great product that's perfectly inline with the data provider's license.