I use zip bombs to protect my server

seanhunter · 2025-04-30T04:10:19 1745986219

Once upon a time around 2001 or so I used to have a static line at home and host some stuff on my home linux box. A windows NT update had meant a lot of them had enabled this optimistic encryption thing where windows boxes would try to connect to a certain port and negotiate an s/wan before doing TCP traffic. I was used to seeing this traffic a lot on my firewall so no big deal. However there was one machine in particular that was really obnoxious. It would try to connect every few seconds and would just not quit.

I tried to contact the admin of the box (yeah that’s what people used to do) and got nowhere. Eventually I sent a message saying “hey I see your machine trying to connect every few seconds on port <whatever it is>. I’m just sending a heads up that we’re starting a new service on that port and I want to make sure it doesn’t cause you any problems.”

Of course I didn’t hear back. Then I set up a server on that port that basically read from /dev/urandom, set TCP_NODELAY and a few other flags and pushed out random gibberish as fast as possible. I figured the clients of this service might not want their strings of randomness to be null-terminated so I thoughtfully removed any nulls that might otherwise naturally occur. The misconfigured NT box connected, drank 5 seconds or so worth of randomness, then disappeared. Then 5 minutes later, reappeared, connected, took its buffer overflow medicine and disappeared again. And this pattern then continued for a few weeks until the box disappeared from the internet completely.

I like to imagine that some admin was just sitting there scratching his head wondering why his NT box kept rebooting.

kqr · 2025-04-30T05:06:14 1745989574

The lesson for any programmers reading this is to always set an upper limit for how much data you accept from someone else. Every request should have both a timeout and a limit on the amounts of data it will consume.

keitmo · 2025-04-30T13:45:51 1746020751

As a former boss used to say: "Unlimited is a bad idea."

eru · 2025-04-30T05:45:32 1745991932

That doesn't necessarily need to be in the request itself.

You can also limit the wider process or system your request is part of.

kqr · 2025-04-30T06:37:53 1745995073

While that is true, I recommend on the request anyway, because it makes it abundantly clear to the programmer that requests can fail, and failure needs to be handled somehow – even if it's by killing and restarting the process.

GTP · 2025-04-30T09:32:23 1746005543

I second this: depending on the context, there might be a more graceful way of handling a response that's too long then crashing the process.

lazide · 2025-04-30T10:45:36 1746009936

Though the issue with ‘too many byte’ limits is that this tends to cause outages later then time has passed and now whatever the common size was is now ‘tiny’, like if you’re dealing with images, etc.

Time limits tend to also defacto limit size, if bandwidth is somewhat constrained.

kqr · 2025-04-30T13:44:39 1746020679

Deliberately denying service in one user flow because technology has evolved is much better than accidentally denying service to everyone because some part of the system misbehaved.

Timeouts and size limits are trivial to update as legitimate need is discovered.

lazide · 2025-04-30T14:25:58 1746023158

Oh man, I wish I could share some outage postmortems with you.

Practically speaking, putting an arbitrary size limit somewhere is like putting yet-another-ssl-cert-that-needs-to-be-renewed in some critical system. It will eventually cause an outage you aren’t expecting.

Will there be a plausible someone to blame? Of course. Realistically, it was also inevitable someone would forget and run right into it.

Time limits tend to not have this issue, for various reasons.

GTP · 2025-04-30T14:45:57 1746024357

But not putting the limits, leaves the door open to a different class of outages in the form of buffer overflows, that additionally can also pose a security risk as could be exploitable by an attacker. maybe this issue would be better solved at the protocol level, but in the meantime size limit it is.

lazide · 2025-04-30T14:56:21 1746024981

Nah, just OOM. Yes, there does need to be a limit somewhere - it just doesn’t need to be arbitrary, but based on some processing limit, and ideally will adapt as say memory footprint gets larger.

guappa · 2025-04-30T09:22:58 1746004978

Then you kill your service which might also be serving legitimate users.

mkwarman · 2025-04-30T04:26:23 1745987183

I enjoyed reading this, thank you for sharing. When you say you tried to contact the admin of the box and that this was common back then, how would you typically find the contact info for an arbitrary client's admin?

cobbaut · 2025-04-30T06:50:58 1745995858

Back then things like postmaster@theirdomain and webmaster@theirdomain were read by actual people. Also the whois command often worked.

dspearson · 2025-04-30T07:43:30 1745999010

I work for one of the largest Swiss ISPs, and these mailboxes are still to this day read by actual people (me included), so it's sometimes worthwhile even today.

NetOpWibby · 2025-04-30T07:48:24 1745999304

I setup a new mail server with Stalwart and have been getting automated mails to my postmaster address (security treat results mostly).

Pretty neat.

kqr · 2025-04-30T05:08:20 1745989700

You can also find out who owns a general group of IP addresses, and at the time they would often assist you in further pinpointing who is responsible for a particular address.

DocTomoe · 2025-04-30T04:49:14 1745988554

tech-c / abuse addresses were commonly available on whois.

ge96 · 2025-04-30T14:48:37 1746024517

tangent

I had a lazy fix for a down detection on my RPi server at home, it was pinging a domain I owned and if it couldn't hit that assumed it wasn't connected to a network/rebooted itself. I let the domain lapse and this RPi kept going down around 5 minutes... thought it was a power fault, then I remembered about that CRON job.

danillonunes · 2025-04-30T16:52:51 1746031971

That's why everyone else is lazy and just ping google.com

mjmsmith · 2025-04-30T16:24:36 1746030276

Around the same time, or maybe even earlier, some random company sent me a junk fax every Friday. Multiple polite voicemails to their office number were ignored, so I made a 100-page PDF where every page was a large black rectangle, and used one of the new-fangled email-to-fax gateways to send it to them. Within the hour, I got an irate call. The faxes stopped.

zerr · 2025-04-30T12:20:06 1746015606

Didn't get why that WinNT box was connecting to your box. Due to some misconfigured Windows update procedure?

gigatexal · 2025-04-30T05:35:57 1745991357

That’s awesome! Thank you for sharing.

layer8 · 2025-04-29T20:36:06 1745958966

Back when I was a stupid kid, I once did

    ln -s /dev/zero index.html

on my home page as a joke. Browsers at the time didn’t like that, they basically froze, sometimes taking the client system down with them.

Later on, browsers started to check for actual content I think, and would abort such requests.

bobmcnamara · 2025-04-29T23:59:04 1745971144

I made a 64kx64k JPEG once by feeding the encoder the same line of macro blocks until it produce the entire image.

Years later I was finally able to open it.

opan · 2025-04-30T00:09:47 1745971787

I had a ton of trouble opening a 10MB or so png a few weeks back. It was stitched together screenshots forming a map of some areas in a game, so it was quite large. Some stuff refused to open it at all as if the file was invalid, some would hang for minutes, some opened blurry. My first semi-success was Fossify Gallery on my phone from F-Droid. If I let it chug a bit, it'd show a blurry image, a while longer it'd focus. Then I'd try to zoom or pan and it'd blur for ages again. I guess it was aggressively lazy-loading. What worked in the end was GIMP. I had the thought that the image was probably made in an editor, so surely an editor could open it. The catch is that it took like 8GB of RAM, but then I could see clearly, zoom, and pan all I wanted. It made me wonder why there's not an image viewer that's just the viewer part of GIMP or something.

Among things that didn't work were qutebrowser, icecat, nsxiv, feh, imv, mpv. I did worry at first the file was corrupt, I was redownloading it, comparing hashes with a friend, etc. Makes for an interesting benchmark, I guess.

For others curious, here's the file: https://0x0.st/82Ap.png

I'd say just curl/wget it, don't expect it to load in a browser.

Scaevolus · 2025-04-30T00:32:06 1745973126

That's a 36,000x20,000 PNG, 720 megapixels. Many decoders explicitly limit the maximum image area they'll handle, under the reasonable assumption that it will exceed available RAM and take too long, and assume the file was crafted maliciously or by mistake.

lgeek · 2025-04-30T02:39:02 1745980742

On Firefox on Android on my pretty old phone, a blurry preview rendered in about 10 seconds, and it was fully rendered in 20 something seconds. Smooth panning and zooming the entire time

connicpu · 2025-04-30T05:57:02 1745992622

Firefox on a Samsung S23 Ultra did it a few seconds faster but otherwise the same experience

virtue3 · 2025-04-30T01:34:05 1745976845

I use honey view for reading comics etc. It can handle this.

Old school acdsee would have been fine too.

I think it's all the pixel processing on the modern image viewers (or they're just using system web views that isn't 100% just a straight render).

I suspect that the more native renderers are doing some extra magic here. Or just being significantly more OK with using up all your ram.

swiftcoder · 2025-04-30T15:54:33 1746028473

> don't expect it to load in a browser

Takes a few seconds, but otherwise seems pretty ok in desktop Safari. Preview.app also handles it fine (albeit does allocate an extra ~1-2GB of RAM)

Moosdijk · 2025-04-30T03:21:43 1745983303

It loads in about 5 seconds on an iPhone 12 using safari.

It also pans and zooms swiftly

avianlyric · 2025-04-30T10:43:08 1746009788

Same, right up until I zoomed in and waited for Safari to produce a higher resolution render.

Partially zoomed in was fine, but zooming to maximum fidelity resulted in the tab crashing (it was completely responsive until the crash). Looks like Safari does some pretty smart progressive rendering, but forcing it to render the image at full resolution (by zooming in) causes the render to get OOMed or similar.

mikaraento · 2025-04-30T17:19:45 1746033585

I remember that years ago (mobile) Safari would aggressively use GPU layers and crash if you ran out of GPU memory. Maybe that's still happening?

Preview on a mac handles the file fine.

close04 · 2025-04-30T08:24:12 1746001452

How strange, took at least 30s to load on my iPhone 12 Pro Max with Safari but it was smooth to pan and zoom after. Which is way better than my 16 core 64GB RAM Windows machine where both Chrome and Edge gave up very quickly, with a "broken thumbnail" icon.

GTP · 2025-04-30T10:44:00 1746009840

Probably because they're based on the same engine.

close04 · 2025-04-30T14:06:54 1746022014

The strangeness was that 2 iPhones from the same generation would exhibit such different performance behaviors, and in parallel the irony that a desktop browser (engine irrelevant) on a device with cutting edge performance can't do what a phone does.

Meneth · 2025-04-30T12:39:41 1746016781

On my Waterfox 6.5.6, it opened but remained blurry when zoomed in. MS Paint refused to open it. The GIMP v2.99.18 crashed and took my display driver with it. Windows 10 Photo Viewer surprisingly managed to open it and keep it sharp when zoomed in. The GIMP v3.0.2 (latest version at the time of writing) crashed.

promiseofbeans · 2025-04-30T04:22:20 1745986940

Firefox on a mid-tier Samsung and a cheapo data connection (4G) took avout 30s to load. I could pan, but it limited me from zooming much, and the little I could zoom in looked quite blury.

bugfix · 2025-04-30T00:54:09 1745974449

IrfanView was able to load it in about 8 seconds (Ryzen 7 5800x) using 2.8GB of RAM, but zooming/panning is quite slow (~500ms per action)

hdjrudni · 2025-04-30T03:21:32 1745983292

IrfanView on my PC is very fast. Zoomed to 100% I can pan around no problem. Is it using CPU or GPU? I've got an 11900K CPU and RTX 3090.

ChoGGi · 2025-04-30T13:38:00 1746020280

There's fast and slow resample viewing options in Irfanview, he may have slow turned on for higher quality.

beeslol · 2025-04-30T01:05:02 1745975102

For what it's worth, this loaded (slowly) in Firefox on Windows for me (but zooming was blurry), and the default Photos viewer opened it no problem with smooth zooming and panning.

jsnider3 · 2025-04-30T16:13:28 1746029608

I get a Your connection was interrupted on Chrome.

quickaccount · 2025-04-30T00:44:10 1745973850

Safari on my MacBook Air opened it fine, though it took about four seconds. Zooming works fine as well. It does take ~3GB of memory according to Activity Monitor.

jaeckel · 2025-04-30T05:20:36 1745990436

ImgurViewer from fdroid on an FP5 opened it blurry after around 5s and 5s later it was rendered completely.

Pan&zoom works instantly with a blurry preview and then takes another 5-10s to render completely.

spockz · 2025-04-30T06:46:39 1745995599

Loading this on my iPhone on 1gbit took about 5s and I can easily pan and zoom. A desktop should handle it beautifully.

sixtyj · 2025-04-30T05:54:40 1745992480

PDF files with included vector-based layers, e.g. plans or maps of large area, are also quite difficult to render/open.

jve · 2025-04-30T13:01:31 1746018091

Just today collegue was looking at some air traffic permit map within PDF that was like 12MB or something around that. Complained about Adobe Reader changing something so he cannot pan/zoom no more.

I suggested to try the HN beloved Sumatra PDF. Ugh, it couldn't cope with it normally. Chrome did it better coped better.

radeeyate · 2025-04-30T02:37:38 1745980658

Interestingly enough, it loads in about 5 seconds on my Pixel 6a.

arc-in-space · 2025-04-30T07:33:22 1745998402

Oh hey it's the thing that ruins an otherwise okay rhythm game.

MaysonL · 2025-04-30T02:14:23 1745979263

It loaded after 10-15 seconds on myiPad Pro M1, although it did start reloading after I looked around in it.

IamDaedalus · 2025-04-30T07:51:30 1745999490

on mobile Brave just displayed it as the placeholder broken link image but in Firefox it loaded in about 10s

glial · 2025-04-30T01:37:52 1745977072

It loads in about 10 seconds in Safari on an M1 Air. I think I am spoiled.

ninalanyon · 2025-04-30T07:36:38 1745998598

Opens fine in Firefox 138.

DiggyJohnson · 2025-04-30T06:18:24 1745993904

Safari on iPhone did a good job with it actually lol

ack_complete · 2025-04-30T01:26:57 1745976417

I once encoded an entire TV OP into a multi-megabyte animated cursor (.ani) file.

Surprisingly, Windows 95 didn't die trying to load it, but quite a lot of operations in the system took noticeably longer than they normally did.

M95D · 2025-04-30T07:37:22 1745998642

I wonder if I could create a 500TB html file with proper headers on a squashfs, an endless <div><div><div>... with no closing tags, and if I could instruct the server to not report file size before download.

Any ideeas?

Ugohcet · 2025-04-30T09:38:00 1746005880

Why use squashfs when you can do the same OP did and serve a compressed version, so that the client is overwhelmed by both the uncompression and the DOM depth:

yes "<div>"|dd bs=1M count=10240 iflag=fullblock|gzip | pv > zipdiv.gz

Resulting file is about 15 mib long and uncompresses into a 10 gib monstrosity containing 1789569706 unclosed nested divs

sroussey · 2025-04-30T15:45:19 1746027919

You can also just use code to endlessly serve up something.

Also you can reverse many DoD vectors depending on how you are setup and costs. For example reverse Slowloris attack and use up their connections.

M95D · 2025-04-30T12:04:10 1746014650

I like it. :)

CobrastanJorji · 2025-04-30T07:44:50 1745999090

Yes, servers can respond without specifying the size by using chunked encoding. And you can do the rest with a custom web server that just handles request by returning "<div>" in a loop. I have no idea if browsers are vulnerable to such a thing.

konata390 · 2025-04-30T08:25:48 1746001548

I just tested it via a small python script sending divs at a rate of ~900mb (as measured by curl) and firefox just kills the request after 1-2 gb received (~2 seconds) with an "out of memory" error, while chrome seems to only receive around 1mb/s, uses 1 cpu core 100%, and grows infinitely in memory use. I killed it after 3 mins and consuming ca. 6GB (additionally, on top of the memory it used at startup)

M95D · 2025-04-30T09:09:38 1746004178

What did the bots do?

M95D · 2025-04-30T08:05:56 1746000356

I would make it an invisible link from the main page (hidden behind a logo or something). Users won't click it, but bots will.

stefs · 2025-04-30T09:05:36 1746003936

the problem with this is that for a tarpit, you just don't want to make it expensive for bots, you also want to make it cheap for yourself. this isn't cheap for you. a zip bomb is.

m463 · 2025-04-29T22:58:01 1745967481

Sounds like the favicon.ico that would crash the browser.

I think this was it:

https://freedomhacker.net/annoying-favicon-crash-bug-firefox...

dolmen · 2025-04-30T05:05:00 1745989500

Looks like something I should add for my web APIs which are to be queried only by clients aware of the API specification.

koolba · 2025-04-29T21:21:08 1745961668

I hope you weren’t paying for bandwidth by the KiB.

santoshalper · 2025-04-29T22:48:08 1745966888

Nah, back then we paid for bandwidth by the kb.

slicktux · 2025-04-29T23:59:10 1745971150

That’s even worse! :)

amelius · 2025-04-30T08:35:11 1746002111

Maybe it's time for a /dev/zipbomb device.

GTP · 2025-04-30T10:52:13 1746010333

ln -s /dev/urandom /dev/zipbomb && echo 'Boom!'

Ok, not a real zip bomb, for that we would need a kernel module.

Dwedit · 2025-04-30T15:29:41 1746026981

That costs you a lot of bandwidth, defeating the whole point of a zip bomb.

M95D · 2025-04-30T08:25:20 1746001520

Could server-side includes be used for a html bomb?

Write an ordinary static html page and fill a <p> with infinite random data using .

or would that crash the server?

GTP · 2025-04-30T10:50:00 1746010200

I guess it depends on the server's implementation. but, since you need some logic to decide when to serve the html bomb anyway, I don't see why you would prefer this solution. Just use whatever script you're using to detect the bots to serve the bomb.

M95D · 2025-04-30T12:02:42 1746014562

No other scripts. Hide the link to the bomb behind an image so humans can't click it.

AStonesThrow · 2025-04-30T08:08:52 1746000532

Wait, you set up a symlink?

I am not sure how that could’ve worked. Unless the real /dev tree was exposed to your webserver’s chroot environment, this would’ve given nothing special except “file not found”.

The whole point of chroot for a webserver was to shield clients from accessing special files like that!

vidarh · 2025-04-30T09:08:48 1746004128

You yourself explain how it could've worked: Plenty of webservers are or were not chroot'ed.

pandemic_region · 2025-04-30T11:03:59 1746011039

Which means that if your bot is getting slammed by this, you can assume it's not chrooted and hence a more likely target for attack.

vidarh · 2025-04-30T16:34:45 1746030885

This does not logically follow. If your bot is getting slammed by a page returning all zeros (what the person I replied to reacted to), all you know is something on the server is returning a neverending stream of zeros. A symlink to /dev/zero is an easy way of doing that, but knowing the server is serving up a neverending stream of zeros by no means tells you whether the server is running in a decently isolated environment or not.

Even if you knew it was done with a symlink you don't know that - these days odds are it'd run in a container or vm, and so having access to /dev/zero means very little.

sandworm101 · 2025-04-29T21:45:38 1745963138

Devide by zero happens to everyone eventually.

https://medium.com/@bishr_tabbaa/when-smart-ships-divide-by-...

"On 21 September 1997, the USS Yorktown halted for almost three hours during training maneuvers off the coast of Cape Charles, Virginia due to a divide-by-zero error in a database application that propagated throughout the ship’s control systems."

" technician tried to digitally calibrate and reset the fuel valve by entering a 0 value for one of the valve’s component properties into the SMCS Remote Database Manager (RDM)"

astolarz · 2025-04-29T23:29:57 1745969397

Bad bot

fuzztester · 2025-04-29T23:59:42 1745971182

I remember reading about that some years ago. It involved Windows NT.

https://www.google.com/search?q=windows+nt+bug+affects+ship

jeroenhd · 2025-04-29T22:16:43 1745965003

These days, almost all browsers accept zstd and brotli, so these bombs can be even more effective today! [This](https://news.ycombinator.com/item?id=23496794) old comment showed an impressive 1.2M:1 compression ratio and [zstd seems to be doing even better](https://github.com/netty/netty/issues/14004).

Though, bots may not support modern compression standards. Then again, that may be a good way to block bots: every modern browser supports zstd, so just force that on non-whitelisted browser agents and you automatically confuse scrapers.

andersmurphy · 2025-04-30T09:00:24 1746003624

So I actually do this (use compression to filter out bots) for my one million checkboxes Datastar demo[1]. It relies heavily on streaming the whole user view on every interaction. With brotli over SSE you can easily hit 200:1 compression ratios[2]. The problem is a malicious actor could request the stream uncompressed. As brotli is supported by 98% of browsers I don't push data to clients that don't support brotli compression. I've also found a lot of scrapers and bots don't support it so it works quite well.

[1] checkboxes demo https://checkboxes.andersmurphy.com

[2] article on brotli SSE https://andersmurphy.com/2025/04/15/why-you-should-use-brotl...

kevin_thibedeau · 2025-04-29T23:16:43 1745968603

If you nest the gzip inside another gzip it gets even smaller since the blocks of compressed '0' data are themselves low entropy in the first generation gzip. Nested zst reduces the 10G file to 99 bytes.

galangalalgol · 2025-04-30T03:24:04 1745983444

Can you hand edit to create recursive file structures to make it infinite? I used to use debug in dos to make what appeared to be gigantic floppy discs by editing the fat

hidroto · 2025-04-30T06:39:03 1745995143

https://research.swtch.com/zip

it is basically a quine.

necovek · 2025-04-30T03:49:04 1745984944

That's what I was hoping for with the original article.

Thorrez · 2025-04-30T11:31:51 1746012711

But the bot likely only automatically unpacks the outer layer. So nesting doesn't help with bot deterrence.

Cloudef · 2025-04-30T03:35:02 1745984102

Wouldnt that defeat the attack though as you arent serving the large content anymore

kevin_thibedeau · 2025-04-30T03:41:49 1745984509

It would need a bot that is accessing files via hyperlink with an aim to decompress them and riffle through their contents. The compressed file can be delivered over a compressed response to achieve the two layers, cutting down significantly on the outbound traffic. passwd.zst, secrets.docx, etc. would look pretty juicy. Throw some bait in honeypot directories (exposed for file access) listed in robots.txt and see who takes it.

xiaoyu2006 · 2025-04-30T03:50:28 1745985028

How will my browser react on receiving such bombs? I’d rather not to test it myself…

jeroenhd · 2025-04-30T08:29:54 1746001794

Last time I checked, the tab keeps loading, freezes, and the process that's assigned to rendering the tab gets killed when it eats too much RAM. Might cause a "this tab is slowing down your browser" popup or general browser slowness, but nothing too catastrophic.

How bad the tab process dying is, depends per browser. If your browser does site isolation well, it'll only crash that one website and you'll barely notice. If that process is shared between other tabs, you might lose state there. Chrome should be fine, Firefox might not be depending on your settings and how many tabs you have open, with Safari it kind of depends on how the tabs were opened and how the browser is configured. Safari doesn't support zstd though, so brotli bombs are the best you can do with that.

anthk · 2025-04-30T11:15:28 1746011728

gzip it's everywhere and it will mess with every crawler.

bilekas · 2025-04-29T20:45:57 1745959557

> At my old employer, a bot discovered a wordpress vulnerability and inserted a malicious script into our server

I know it's slightly off topic, but it's just so amusing (edit: reassuring) to know I'm not the only one who, after 1 hour of setting up Wordpress there's a PHP shell magically deployed on my server.

protocolture · 2025-04-29T23:28:10 1745969290

>Take over a wordpress site for a customer

>Oh look 3 separate php shells with random strings as a name

Never less than 3, but always guaranteed.

ianlevesque · 2025-04-29T21:05:52 1745960752

Yes, never self host Wordpress if you value your sanity. Even if it’s not the first hour it will eventually happen when you forget a patch.

sunaookami · 2025-04-29T21:09:32 1745960972

Hosting WordPress myself for 13 years now and have no problem :) Just follow standard security practices and don't install gazillion plugins.

ozim · 2025-04-30T11:06:52 1746011212

I have better things to do with my time so I happily pay someone else to host it for me.

carlosjobim · 2025-04-29T21:18:15 1745961495

There's a lot of essential functionality missing from WordPress, meaning you have to install plugins. Depending on what you need to do.

But it's such a bad platform that there really isn't any reason for anybody to use WordPress for anything. No matter your use case, there will be a better alternative to WordPress.

aaronbaugher · 2025-04-29T21:37:01 1745962621

Can you recommend an alternative for a non-technical organization, where there's someone who needs to be able to edit pages and upload documents on a regular basis, so they need as user-friendly an interface as possible for that? Especially when they don't have a budget for it, and you're helping them out as a favor? It's so easy to spin up Wordpress for them, but I'm not a fan either.

I've tried Drupal in the past for such situations, but it was too complicated for them. That was years ago, so maybe it's better now.

ufmace · 2025-04-30T03:37:33 1745984253

I find it very telling that there's no 2 responses to this post recommending the same thing. Confirms my belief that there is no real alternative to Wordpress for a free and open-source CMS that is straightforward to install and usable to build and edit pages by non-tech-experts.

eru · 2025-04-30T05:52:08 1745992328

Perhaps people who wanted to recommend the same thing as was already written, just upvoted instead of writing their own comment?

realityloop · 2025-04-29T22:04:00 1745964240

DrupalCMS is a new project that aims to radically simplify for end users https://new.drupal.org/drupal-cms

arczyx · 2025-04-30T09:36:26 1746005786

> Drupal

> new

Pretty sure Drupal has been around for like, 20 years or so. Or is this a different Drupal?

nulbyte · 2025-04-30T11:36:32 1746012992

Drupal has been around for a while, but I've never heard of "Drupal CMS" as a separate product until now.

It appears Drupal CMS is a customized version of Drupal that is easier for less tech-savvy folks to get up and running. At least, that's the impression I got reading through the marketing hype that "explains" it with nothing but buzzwords.

vinceguidry · 2025-04-30T16:46:59 1746031619

Wiki software is the way to go here.

donnachangstein · 2025-04-29T21:44:26 1745963066

> Can you recommend an alternative for a non-technical organization, where there's someone who needs to be able to edit pages and upload documents on a regular basis, so they need as user-friendly an interface as possible for that

25 years ago we used Microsoft Frontpage for that, with the web root mapped to a file share that the non-technical secretary could write to and edit it as if it were a word processor.

Somehow I feel we have regressed from that simplicity, with nothing but hand waving to make up for it. This method was declared "obsolete" and ... Wordpress kludges took its place as somehow "better". Someone prove me wrong.

shakna · 2025-04-30T07:41:15 1745998875

Part of that is Frontpage needing a Windows server, and all that entails.

The other part is clients freaking out after Frontpage had a series of dangerous CVEs all in a row.

And then finally every time a part of Frontpage got popular, MS would deprecate the API and replace it with a new one.

Wordpress was in the right place at the right time.

aaronbaugher · 2025-04-30T13:30:47 1746019847

Yeah, getting Frontpage working on a Linux/Apache system and supporting it back then wasn't exactly a treat. Good idea, maybe, but bad implementation.

MrDOS · 2025-04-30T10:48:51 1746010131

For those on macOS, RapidWeaver still exists: https://www.realmacsoftware.com/rapidweaver/. (Shame that it's now subscriptionware, though – could've sworn it used to be an outright purchase per major version.)

bigfatkitten · 2025-04-29T23:41:00 1745970060

A previous workplace of mine did the same with Netscape (and later, Mozilla) Composer. Users could modify content via WebDAV.

blipvert · 2025-04-30T06:22:54 1745994174

We have a (internally accessible only) WP instance where the content is exported using a plugin as a ZIP file and then deployed to NGINX servers with a bit of scripting/Ansible.

Could be automated better (drop ZIP to a share somewhere where it gets processed and deployed) but best of both worlds.

jillyboel · 2025-04-30T08:08:15 1746000495

Which plugin?

djxfade · 2025-04-29T23:12:05 1745968325

Statamic https://statamic.com/

chilldsgn · 2025-04-30T06:24:45 1745994285

YES! I have switched to it for professional and personal CMS work and it's great. Incredibly flexible and simplistic in my opinion. I use it both as headful and headless.

rpmisms · 2025-04-30T01:33:35 1745976815

Seconded. It's absolutely phenomenal as a headful or headless CMS.

1oooqooq · 2025-04-30T12:23:33 1746015813

weird "license" on that project. pretty much blocks any self host usage besides a personal blog.

And only hosted option for the copyrighted code starts at 300/y

these don't cover any use case people use WordPress for.

bluocms · 2025-04-30T02:05:35 1745978735

We’re developing https://bluocms.com/

- very hard to hack because we pre render all assets to a Cloudflare kv store

- public website and CMS editor are on different domains

Basically very hard to hack. Also as a bonus is much more reliable as it will only go down when Cloudflare does.

shakna · 2025-04-29T21:44:23 1745963063

I've had some luck using Decap for that. An initial dev setup, followed by almost never needing support from the PR team running it.

[0] https://decapcms.org/

willyt · 2025-04-29T21:44:42 1745963082

Static site with Jekyll?

socalgal2 · 2025-04-29T22:12:19 1745964739

Jekyll and other static site generators do not repo Wordpress any more than notepad repos MSWord

In one, multiple users can login, edit WYSIWYG, preview, add images, etc, all from one UI. You can access it from any browser including smart phones and tablets.

In the other, you get to instruct users on git, how to deal with merge conflicts, code review (two people can't easily work on a post like they can in wordpress), previews require a manual build, you need a local checkout and local build installation to do the build. There no WYSIWYG, adding images is a manual process of copying a file, figuring out the URL, etc... No smartphone/tablet support. etc....

I switched by blog from wordpress install to a static site geneator because I got tired of having to keep it up to date but my posting dropped because of friction of posting went way up. I could no longer post from a phone. I couldn't easily add images. I had to build to preview. And had to submit via git commits and pushes. All of that meant what was easy became tedious.

koiueo · 2025-04-30T05:58:51 1745992731

Have you checked static site CMSes?

For example (not affiliated with them) https://www.siteleaf.com/

pettycashstash2 · 2025-04-29T23:13:21 1745968401

what are your favorite static site generators? I googled it and cloudflare article came up with Jekyll,Gatsby,Hugo,Next.js, Eleventy. But would like to avoid doing research if can be helped on pros/cons of each.

socalgal2 · 2025-04-30T02:52:51 1745981571

I looked recently when thinking of starting some new shared blog. My criteria was "based on tech I know". I don't know Ruby so Jekyll was out. I tried Eleventy and Hexo. I chose Hexo but then ultimately decided I wasn't going to do this new blog.

IIRC, Eleventy printed lots of out-of-date warnings when I installed it and/or the default style was broken in various ways which didn't give me much confidence.

My younger sister asked me to help her start a blog. I just pointed her to substack. Zero effort, easy for her.

pmontra · 2025-04-30T05:47:32 1745992052

I work with Ruby but I never had to use Ruby to use Jekyll. I downloaded the docker image and run it. It checks a host directory for updates and generates the HTML files. It could be written in any other language I don't know.

justusthane · 2025-04-30T00:12:20 1745971940

I don’t have much experience with other SSGs, but I’ve been using Eleventy for my personal site for a few years and I’m a big fan. It’s very simple to get started with, it’s fast to build, it’s powerful and flexible.

I build mine with GitHub Actions and host it free on Pages.

Tistron · 2025-04-30T07:18:49 1745997529

I've come to really appreciate Astro.js It's quite simple to get started, fairly intuitive for me, and very powerful.

beeburrt · 2025-04-29T23:29:33 1745969373

Jekyll and GitHub pages go together pretty well.

msh · 2025-04-30T10:37:08 1746009428

Its sad software like citydesk died and did not evolve into multiuser applications.

carlosjobim · 2025-04-29T21:57:03 1745963823

Yes I can. There's an excellent and stable solution called SurrealCMS, made by an indie developer. You connect it by FTP to any traditional web design (HTML+CSS+JS), and the users get a WYSIWYG editor where the published output looks exactly as it looked when editing. It's dirt cheap at $9 per month.

Edit: I actually feel a bit sorry for the SurrealCMS developer. He has a fantastic product that should be an industry standard, but it's fairly unknown.

hombre_fatal · 2025-04-30T11:52:37 1746013957

You can use WordPress as a static site generator: https://simplystatic.com/

Then WordPress is just your private CMS/UI for making changes, and it generates static files that are uploaded to a webhost like CloudFlare Pages, GitHub Pages, etc.

sureIy · 2025-04-30T15:45:43 1746027943

It has been a long time since I tried that, but it was never as simple as they claimed it to be.

Now that plugin became a service, at which point you might just use a WP host and let them do their thing.

dmje · 2025-04-30T07:32:49 1745998369

Just not true, although entirely aligned with HN users who often believe that the levels of nerdery on HN are common in the real world. WP isn’t bad, you’ve just done it wrong, and there really isn’t a better alternative for hundreds and hundreds of use cases..

carlosjobim · 2025-04-30T11:07:37 1746011257

My perspective is that WordPress is too complicated and too nerdy for most real world users. They are usually better off with a solution that is tailor made for their use case. And there's plenty of such solutions. Even for blogging, there are much better solutions than WordPress for non-technical users.

wincy · 2025-04-29T21:38:48 1745962728

I do custom web dev so am way out of the website hosting game. What are good frameworks now if I want to say, light touch help someone who is slightly technical set up a website? Not full react SPA with an API.

carlosjobim · 2025-04-29T22:10:20 1745964620

By the sound of your question I will guess you want to make a website for a small or medium sized organization? jQuery is probably the only "framework" you should need.

If they are selling anything on their website, it's probably going to be through a cloud hosted third party service and then it's just an embedded iframe on their website.

If you're making an entire web shop for a very large enterprise or something of similar magnitude, then you have to ask somebody else than me.

felbane · 2025-04-29T22:38:27 1745966307

Does anyone actually still use jQuery?

Everything I've built in the past like 5 years has been almost entirely pure ES6 with some helpers like jsviews.

karaterobot · 2025-04-29T22:50:08 1745967008

jQuery's still the third most used web framework, behind React and before NextJS. If you use jQuery to build Wordpress websites, you'd be specializing in popular web technologies in the year 2025.

https://survey.stackoverflow.co/2024/technology#1-web-framew...

carlosjobim · 2025-04-29T23:05:16 1745967916

Sure, why not? It's lightweight and works well, and there's a lot of good solutions that you can find already made for you online.

nophunphil · 2025-04-30T01:39:48 1745977188

jQuery hasn’t been necessary for many years. Vanilla JS equivalents of jQuery code are well-supported.

https://youmightnotneedjquery.com/

carlosjobim · 2025-04-30T11:16:30 1746011790

I've seen this site linked for many years among web devs, but I just don't understand the purpose? jQuery code is much cleaner and easier to understand, and there's a great amount of solutions written for jQuery available online for almost any need you have.

j16sdiz · 2025-04-30T02:26:10 1745979970

The vanilla one is so much longer.

arcfour · 2025-04-29T22:49:48 1745966988

Never use that junk if you value your sanity, I think you mean.

ufmace · 2025-04-30T03:32:16 1745983936

Ditto to self-hosting wordpress works fine with standard hosting practices and not installing a bazillion random plugins.

maeln · 2025-04-30T07:45:16 1745999116

I never hosted WP, but as soon as you have a HTTP server expose to the internet you will get request to /wp-login and such. It as become a good way to find bots also. If I see an IP requesting anything from a popular CMS, hop it goes in the iptables holes

Perz1val · 2025-04-30T08:28:39 1746001719

Hey, I check /wp-admin sometimes when I see a website and it has a certain feel to it

victorbjorklund · 2025-04-30T08:45:13 1746002713

I do the same. Great way to filter our security scanners.

Aransentin · 2025-04-30T08:14:21 1746000861

Wordpress is indeed a nice backdoor, it even has CMS functionality built in.

dx4100 · 2025-04-29T23:57:25 1745971045

There's ways that prevent it - - Freeze all code after an update through permissions - Don't make most directories writeable - Don't allow file uploads, or limit file uploads to media

There's a few plugins that do this, but vanilla WP is dangerous.

colechristensen · 2025-04-29T23:45:13 1745970313

>after 1 hour

I've used this teaching folks devops, here deploy your first hello world nginx server... huh what are those strange requests in the log?

ChuckMcM · 2025-04-29T20:05:06 1745957106

I sort of did this with ssh where I figured out how to crash an ssh client that was trying to guess the root password. What I got for my trouble was a number of script kiddies ddosing my poor little server. I switched to just identifying 'bad actors' who are clearly trying to do bad things and just banning their IP with firewall rules. That's becoming more challenging with IPV6 though.

Edit: And for folks who write their own web pages, you can always create zip bombs that are links on a web page that don't show up for humans (white text on white background with no highlight on hover/click anchors). Bots download those things to have a look (so do crawlers and AI scrapers)

grishka · 2025-04-30T01:27:00 1745976420

> you can always create zip bombs that are links on a web page that don't show up for humans

I did a version of this with my form for requesting an account on my fediverse server. The problem I was having is that there exist these very unsophisticated bots that crawl the web and submit their very unsophisticated spam into every form they see that looks like it might publish it somewhere.

First I added a simple captcha with distorted characters. This did stop many of the bots, but not all of them. Then, after reading the server log, I noticed that they only make three requests in a rapid succession: the page that contains the form, the captcha image, and then the POST request with the form data. They don't load neither the CSS nor the JS.

So I added several more fields to the form and hid them with CSS. Submitting anything in these fields will fail the request and ban your session. I also modified the captcha, I made the image itself a CSS background, and made the src point to a transparent image instead.

And just like that, spam has completely stopped, while real users noticed nothing.

anamexis · 2025-04-30T04:30:14 1745987414

I did essentially the same thing. I have this input in a form:

    <label for="gb-email" class="nah" aria-hidden="true">Email:</label>
    <input id="gb-email"
           name="email"
           size="40"
           class="nah"
           tabindex="-1"
           aria-hidden="true"
           autocomplete="off"
    >

With this CSS:

    .nah {
      opacity: 0;
      position: absolute;
      top: 0;
      left: 0;
      height: 0;
      width: 0;
      z-index: -1;
    }

And any form submission with a value set for the email is blocked. It stopped 100% of the spam I was getting.

DuncanCoffee · 2025-04-30T09:10:23 1746004223

Would this also stop users with automatic form filling enabled?

grishka · 2025-04-30T09:28:49 1746005329

No, `autocomplete="off"` takes care of that

BarryMilo · 2025-04-30T04:44:09 1745988249

We use to just call those honeypot fields. Works like a charm.

a_gopher · 2025-04-30T14:25:36 1746023136

apart from blind users, who are also now completely unable to use their screenreaders with your site

ChuckMcM · 2025-04-30T02:29:16 1745980156

Oh that is great.

dsp_person · 2025-04-30T03:12:51 1745982771

> you can always create zip bombs that are links on a web page that don't show up for humans (white text on white background with no highlight on hover/click anchors)

RIP screen reader users?

some-guy · 2025-04-30T03:25:44 1745983544

“aria-hidden” would spare those users, and possibly be ignored by the bots unless they are sophisticated.

j_walter · 2025-04-29T20:26:36 1745958396

Check this out if you want to stop this behavior...

https://github.com/skeeto/endlessh

gwd · 2025-04-30T12:02:18 1746014538

> I sort of did this with ssh where I figured out how to crash an ssh client that was trying to guess the root password. What I got for my trouble was a number of script kiddies ddosing my poor little server.

This is the main reason I haven't installed zip bombs on my website already -- on the off chance I'd make someone angry and end up having to fend off a DDoS.

Currently I have some URL patterns to which I'll return 418 with no content, just to save network / processing time (since if a real user encounters a 404 legitimately, I want it to have a nice webpage for them to look at).

Should probably figure out how to wire that into fail2ban or something, but not a priority at the moment.

1970-01-01 · 2025-04-29T20:23:22 1745958202

Why is it harder to firewall them with IPv6? I seems this would be the easier of the two to firewall.

carlhjerpe · 2025-04-29T22:00:16 1745964016

Manual banning is about the same since you just book /56 or bigger, entire providers or countries.

Automated banning is harder, you'd probably want a heuristic system and look up info on IPs.

IPv4 with NAT means you can "overban" too.

malfist · 2025-04-30T00:00:23 1745971223

Why wouldn't something like fail2ban not work here? That's what it's built for and has been around for eons.

ozim · 2025-04-30T13:36:12 1746020172

Fun part was that fail2ban had RCE vulnerability. So you were more secure not running it now it should be fixed but can you be sure?

carlhjerpe · 2025-04-30T10:27:20 1746008840

You don't always firewall 80/443 in Linux :(

firesteelrain · 2025-04-29T20:54:58 1745960098

I think they are suggesting the range of IPs to block is too high?

CBLT · 2025-04-29T21:56:06 1745963766

Allow -> Tarpit -> Block should be done by ASN

carlhjerpe · 2025-04-29T22:02:52 1745964172

You probably want to check how many ips/blocks a provider announces before blocking the entire thing.

It's also not a common metric you can filter on in open firewalls since you must lookup and maintain a cache of IP to ASN, which has to be evicted and updated as blocks still move around.

echoangle · 2025-04-29T20:54:22 1745960062

Maybe it’s easier to circumvent because getting a new IPv6 address is easier than with IPv4?

flexagoon · 2025-04-30T02:48:56 1745981336

Automated systems like Cloudflare and stuff also have a list of bot IPs. I was recently setting up a selfhosted VPN and I had to change the IPv4 of the server like 20 times before I got an IP that wasn't banned on half the websites.

bjoli · 2025-04-30T04:59:07 1745989147

I am just banning large swaths of IPs. Banning most of Asia and the middle east reduced the amount of bad traffic by something like 98%.

leephillips · 2025-04-29T22:24:12 1745965452

These links do show up for humans who might be using text browsers, (perhaps) screen readers, bookmarklets that list the links on a page, etc.

alpaca128 · 2025-04-30T10:03:04 1746007384

Weird that text browsers just ignore all the attributes that hide elements. I get that they don't care about styling, but even a plain hidden attribute or aria-hidden are ignored.

ChuckMcM · 2025-04-29T22:44:29 1745966669

true, but you can make the link text 'do not click this' or 'not a real link' to let them know. I'm not sure if crawlers have started using LLMs to check pages or not which would be a problem.

marcusb · 2025-04-29T21:29:25 1745962165

Zip bombs are fun. I discovered a vulnerability in a security product once where it wouldn’t properly scan a file for malware if the file was or contained a zip archive greater than a certain size.

The practical effect of this was you could place a zip bomb in an office xml document and this product would pass the ooxml file through even if it contained easily identifiable malware.

secfirstmd · 2025-04-29T21:58:11 1745963891

Eh I got news for ya.

The file size problem is still an issue for many big name EDRs.

marcusb · 2025-04-29T22:21:45 1745965305

Undoubtedly. If you go poking around most any security product (the product I was referring to was not in the EDR space,) you'll see these sorts of issues all over the place.

j16sdiz · 2025-04-30T02:32:38 1745980358

It have to be the way it is.

Scanning them are resources intensive. The choice are (1) skip scanning them; (2) treat them as malware; (3) scan them and be DoS'ed.

(deferring the decision to human iss effectively DoS'ing your IT support team)

avidiax · 2025-04-30T02:50:43 1745981443

Option #4, detect the zip bomb in its compressed form, and skip over that section of the file. Just like the malware ignores the zip bomb.

im3w1l · 2025-04-30T03:14:44 1745982884

Just the fact that it contains a zip bomb makes it malware by itself.

marcusb · 2025-04-30T10:49:37 1746010177

It does not have to be the way it is. Security vendors could do a much better job testing and red teaming their products to avoid bypasses, and have more sensible defaults.

LordGrignard · 2025-04-30T01:30:04 1745976604

is that endpoint detection and response?

marcusb · 2025-04-30T10:41:06 1746009666

kazinator · 2025-04-29T20:49:47 1745959787

I deployed this, instead of my usual honeypot script.

It's not working very well.

In the web server log, I can see that the bots are not downloading the whole ten megabyte poison pill.

They are cutting off at various lengths. I haven't seen anything fetch more than around 1.5 Mb of it so far.

Or is it working? Are they decoding it on the fly as a stream, and then crashing? E.g. if something is recorded as having read 1.5 Mb, could it have decoded it to 1.5 Gb in RAM, on the fly, and crashed?

There is no way to tell.

MoonGhost · 2025-04-29T21:19:25 1745961565

Try content labyrinth. I.e. infinitely generated content with a bunch of references to other generated pages. It may help against simple wget and till bots adapt.

PS: I'm on the bots side, but don't mind helping.

palijer · 2025-04-30T00:27:27 1745972847

This doesn't work if you pay bandwidth and CPU usage for your servers though.

Twirrim · 2025-04-30T05:10:19 1745989819

The labyrinth doesn't have to be fast, and things like iocaine (https://iocaine.madhouse-project.org/) don't use much CPU if you don't go and give them something like the Complete Works of Ahakespeare as input (Mine is using Moby Dick), and can easily be constrained with cgroups if you're concerned about resource usage.

I've noticed that LLM scrapers tend to be incredibly patient. They'll wait for minutes for even small amounts of text.

MoonGhost · 2025-04-30T01:57:34 1745978254

That will be your contribution. If others join scrapping will become very pricey. Till bots become smarter. But then they will not download much of generated crap. Which makes it cheaper for you.

Anyway, from bots perspective labyrinths aren't the main problem. Internet is being flooded with quality LLM-generated content.

bugfix · 2025-04-30T01:00:21 1745974821

Wouldn't this just waste your own bandwidth/resources?

gwd · 2025-04-30T12:06:20 1746014780

Kinda wonder if a "content labyrinth" could be used to influence the ideas / attitudes of bots -- fill it with content pro/anti Communism, or Capitalism, or whatever your thing is, hope it tips the resulting LLM towards your ideas.

arctek · 2025-04-30T00:57:33 1745974653

Perhaps need to semi-randomize the file size? I'm guessing some of the bots have a hard limit to the size of the resource they will download.

Many of these are annoying LLM training/scraping bots (in my case anyway). So while it might not crash them if you spit out a 800KB zipbomb, at least it will waste computing resources on their end.

unnouinceput · 2025-04-29T21:30:32 1745962232

Do they comeback? If so then they detect it and avoid it. If not then they crashed and mission accomplished.

kazinator · 2025-04-29T22:32:00 1745965920

I currently cannot tell without making a little configuration change, because as soon as an IP address is logged as having visited the trap URL (honeypot, or zipbomb or whatever), a log monitoring script bans that client.

Secondly, I know that most of these bots do not come back. The attacks do not reuse addresses against the same server in order to evade almost any conceivable filter rule that is predicated on a prior visit.

jpsouth · 2025-04-30T09:46:46 1746006406

I may be asking a really silly question here, but

> as soon as an IP address is logged as having visited the trap URL (honeypot, or zipbomb or whatever), a log monitoring script bans that client.

Is this not why they aren’t getting the full file?

kazinator · 2025-04-30T13:37:27 1746020247

I believe Apache is logging complete requests. For instance, in the case of clients sent to a honeypot, I see a log entry appear when I pick a honeypot script from the process listing and kill it. That could be hours after the client connected. The timestamps logged are connection time not completion time. E.g. here is a pair of consecutive logs:

  124.243.178.242 - - [29/Apr/2025:00:16:52 -0700] "GET /cgit/[...]
  94.74.94.113 - - [29/Apr/2025:00:07:01 -0700] "GET /honeypot/[...]

Notice the second timestamp is almost ten minutes earlier.