> and also includes people like my former college hallmates who take handheld aviation radios, ask for permission to depart, and run on the taxiways with their arms extended, to the great dismay of ATC
You can't just drop a tidbit like that without elaborating.
Or, you know, a higher and/or omnipotent entity. At some point along the causal chain, there is no "testing" - you are forced to resort to metaphysical reasoning.
This doesn't actually solve the original problem. Either something was always there or something can come from nothing, and both of those seem to violate causality as we understand it. Saying "God was always there" isn't really that different from "The universe was always there" when it comes to resolving the violation of causality.
Sure, you can invent any explanation that you like. It just isn't more convincing than "the universe (or an outer universe giving birth to this one) was always there", since they solve exactly the same problem.
If you relax your constraint to "retain logs for the past N days", you can accumulate the logs from T=0 to T=(today - N) into tables and still benefit from having snapshots from that cutoff onwards.
The key to managing this complexity is to avoid mixing transport-level state with application-level state. The same approach for scaling HTTP requests also works for scaling WebSocket connections:
* Read, write and track all application-level state in a persistent data store.
* Identify sessions with a session token so that application-level sessions can span multiple WebSocket connections.
It's a lot easier to do this if your application-level protocol consists of a single discrete request and response (a la RPC). But you can also handle unidirectional/bidirectional streaming, as long as the stream states are tracked in your data store and on the client side.
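A minimal sketch of the session-token idea, in C for concreteness: the session lives in a store keyed by its token, and every new connection just re-attaches to it. The in-memory table and the names (`session`, `attach_session`) are stand-ins for whatever persistent store you actually use, not a real API.

```c
/* Sketch only: an in-memory table standing in for a persistent store. */
#include <stdio.h>
#include <string.h>

#define MAX_SESSIONS 1024

typedef struct {
    char token[64];   /* opaque token presented by the client         */
    long last_seq;    /* application-level progress, not socket state */
    int  in_use;
} session;

static session sessions[MAX_SESSIONS];   /* stand-in for the data store */

/* Called for every new WebSocket connection: resume by token if the
 * session exists, otherwise create it. The connection itself carries
 * no authoritative state. */
static session *attach_session(const char *token) {
    session *free_slot = NULL;
    for (int i = 0; i < MAX_SESSIONS; i++) {
        if (sessions[i].in_use && strcmp(sessions[i].token, token) == 0)
            return &sessions[i];                  /* resumed session */
        if (!sessions[i].in_use && !free_slot)
            free_slot = &sessions[i];
    }
    if (!free_slot) return NULL;                  /* store "full"    */
    free_slot->in_use = 1;
    free_slot->last_seq = 0;
    snprintf(free_slot->token, sizeof free_slot->token, "%s", token);
    return free_slot;                             /* fresh session   */
}

int main(void) {
    session *a = attach_session("tok-123");       /* first connection */
    a->last_seq = 41;
    session *b = attach_session("tok-123");       /* reconnect later  */
    printf("resumed at seq %ld\n", b->last_seq);  /* -> 41            */
    return 0;
}
```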
Functional core, imperative shell makes testing and fast iteration a lot easier. It’s best if your business logic knows very little about transport mechanisms.
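A rough illustration of the split, with made-up names: the core is a pure function over plain structs, so tests never touch a socket, and only the shell would know about WebSockets, HTTP, or anything else transport-shaped.

```c
#include <stdio.h>

typedef struct { long balance; long amount; } transfer_request;
typedef struct { int ok; long new_balance; } transfer_result;

/* Functional core: no I/O, no transport, trivially unit-testable. */
static transfer_result apply_transfer(transfer_request r) {
    if (r.amount <= 0 || r.amount > r.balance)
        return (transfer_result){ .ok = 0, .new_balance = r.balance };
    return (transfer_result){ .ok = 1, .new_balance = r.balance - r.amount };
}

/* Imperative shell: the only part that would decode frames, talk to a
 * socket, or write responses back out. */
int main(void) {
    transfer_request req = { .balance = 100, .amount = 30 };  /* pretend this came off the wire */
    transfer_result res = apply_transfer(req);
    printf("ok=%d balance=%ld\n", res.ok, res.new_balance);
    return 0;
}
```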
I think part of the problem is that early systems wanted to eagerly process requests while they were still coming in. But in a system getting hundreds of requests per second, you get better concurrency if you wait for entire payloads before you waste cache lines attempting to make forward progress on incomplete data. That means you can divorce the concept of a payload entirely from how you acquired it.
I haven’t been tracking price competitiveness on those. What cloud providers offer them?
But you don’t get credit for having three tasks halfway finished instead of one task done and two in flight. Any failover will have to start over with no forward progress having been made.
ETA: while the chip generation used for EC2 m7i instances can have L4 cache, I can’t find a straight answer about whether they do or not.
What I can say is that for most of the services I benchmarked at my last gig, M7i came out to be as expensive per request as the m6’s on our workload (AMD’s was more expensive). So if it has L4 it ain’t helping. Especially at those price points.
When you've profiled the code running in production and identified memory bottlenecks that cannot be solved by algorithmic or data-structure optimizations.
The issue is that DOM structure does not correspond one-to-one with perceived structure. I could render things in the DOM that aren't visible to people (e.g. a transparent 5px x 5px button), or render things to people that aren't visible in the DOM (e.g. Facebook's DOM obfuscation shenanigans to evade ad-blocking, or rendering custom text to a WebGL canvas). Sure, most websites don't go that far, but most websites also aren't valuable targets for automated crawling/scraping. These kinds of disparities will be exploited to detect and block automated agents if browser automation becomes sufficiently popular, and then we're back to needing to render the whole browser and operate on the rendered image to keep ahead of the arms race.
Servers operate on top of TCP/IP not just to serve information, but to serve information plus something else, usually ads. This is usually implemented with websites, captchas, and such.
That's a problem of misaligned economic incentives. If there were a blockchain that enabled micro-transactions of 0.000001 cent per request, at on the order of a million or a billion transactions per second, then servers would have no reason not to accept money in exchange for information, instead of using ads to extract some eyeball attention.
There is no reason that I cannot invoke a command line program: `$fetch_social_media_posts -n 1000` and get the last thousand posts right there in the console, as long as I provide some valid transactions to the server.
Websites and ads are the wrong solution to the problem of gaining something while serving information, and headless browsers and scraping are the wrong solution to the first wrong solution and the problems it creates.
Existing payment methods (PayPal, Google Pay, etc.) have been absolutely crucial for internet payments, but the push toward ever more "micro" payments never ends.
If there are internet payments with a minimum payment of 1 cent, then we need payments of 0.1 cents. Once that's achieved, we need a 0.01-cent minimum transaction. The "micro" in the transaction always needs to be smaller (and faster).
Free competition (or perfect competition) over a well-defined landscape (internet protocols, that is) has consistently proven to deliver better-quality goods at lower prices. Money derived from governments is far, far from free competition, let alone from well-defined internet protocols, and there is a point at which existing payment methods get stuck and cannot deliver smaller transactions.
I don't personally know where and when that point is, but if I have to guess, existing payment methods have been at that minimum point for at least a decade. In other words, their transaction minimums have to be high enough for them to make a profit. Yes, they can implement microtransactions, but they will not be profitable.
I recommend against using linked lists for bookkeeping the free blocks. It seems to be the data structure that every malloc/free implementation reaches for, and I don't know why - the slowness of pointer-chasing makes it terrible for almost any real-world use case. A balanced tree would be a much better idea, given that all the essential operations would take O(log n) time instead of O(n). Even if one insists on a linear search, a bitset is much more cache friendly than pointer-chasing and it can trivially benefit from SIMD optimizations.
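For what the bitset alternative might look like, here is a small sketch (GCC/Clang builtins; sizes are arbitrary for the example): one bit per block, and the scan tests 64 blocks per load instead of chasing one pointer per block.

```c
#include <stdint.h>
#include <stdio.h>

#define NBLOCKS 4096
#define NWORDS  (NBLOCKS / 64)

static uint64_t free_bits[NWORDS];   /* bit set = block is free */

static int find_free_block(void) {
    for (int w = 0; w < NWORDS; w++) {
        if (free_bits[w]) {                        /* 64 blocks per test */
            int bit = __builtin_ctzll(free_bits[w]);
            free_bits[w] &= free_bits[w] - 1;      /* mark as allocated  */
            return w * 64 + bit;
        }
    }
    return -1;                                     /* pool exhausted     */
}

int main(void) {
    for (int w = 0; w < NWORDS; w++) free_bits[w] = ~0ULL;  /* all free */
    printf("first block: %d\n", find_free_block());         /* -> 0     */
    printf("next block:  %d\n", find_free_block());         /* -> 1     */
    return 0;
}
```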
For the `Chunk` list, this isn't one of the cases where linked lists are harmful. Each use only touches the top of the stack, never iterates. Also, linked lists are much easier to make thread-safe.
For the `LinkedPtr` list, the bad case is only hit during the destruction of the pool, and then only if you allocate a lot of memory. And given the overhead of deallocation I'm not sure the array approach would measurably help.
I don't see anywhere that a binary tree search would be useful here, since there are no loops used for lookup (on allocation chunks are pushed in order, but when freed, they are pushed back in arbitrary order; this does mean there is no double-free protection).
The reason linked lists are used is that, for large enough allocations, there is no overhead: you use the space the application isn't using. In addition, if all allocations are the same size it is O(1) - you just look at the head of the list.
More sophisticated strategies bucket allocations by size; this has fixed overhead. You can also use balanced trees for more memory efficiency, but that is slower.
For small allocations (8 bytes) that are too small to contain pointers, allocators will allocate a block and use bitsets.
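A sketch of the size-bucketing idea, with an invented size-class layout (16..2048 bytes, 8 buckets): one free-list head per class, and freed blocks themselves store the next pointer, so both paths stay O(1).

```c
#include <stdio.h>
#include <stdlib.h>

typedef struct free_node { struct free_node *next; } free_node;

#define NCLASSES 8                       /* classes: 16, 32, ..., 2048 bytes */
static free_node *buckets[NCLASSES];

static int class_for(size_t sz) {
    int c = 0;
    for (size_t cap = 16; cap < sz && c < NCLASSES - 1; cap <<= 1) c++;
    return c;
}

static void *bucket_alloc(size_t sz) {
    int c = class_for(sz);
    if (buckets[c]) {                    /* reuse a freed block: O(1) pop  */
        free_node *n = buckets[c];
        buckets[c] = n->next;
        return n;
    }
    return malloc((size_t)16 << c);      /* otherwise fall back to malloc  */
}

static void bucket_free(void *p, size_t sz) {
    int c = class_for(sz);
    free_node *n = p;                    /* the freed space holds the link */
    n->next = buckets[c];
    buckets[c] = n;                      /* O(1) push onto the bucket head */
}

int main(void) {
    void *a = bucket_alloc(100);         /* 100 bytes -> 128-byte class    */
    bucket_free(a, 100);
    void *b = bucket_alloc(100);         /* comes back from the bucket     */
    printf("reused: %s\n", a == b ? "yes" : "no");
    return 0;
}
```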
Pool allocators don't walk the list or search anything though. All interactions are only at the list head and O(1), as all free nodes are just that, free and equal.
While the most precise name might be "a singly linked list representation of a stack", "free list" (https://en.wikipedia.org/wiki/Free_list) is an ancient term of art for this problem - so much so that Wikipedia (at least today's version) even suggests "freelist", all one word, as a spelling.
The initial linking together of all free space (also cache friendly, btw) is often called "threading the free list", although this terminology is less standard than "free list" itself.
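A sketch of what that initial pass might look like (block size and count are arbitrary): one sequential sweep links every block of a fresh pool through the blocks' own unused space, and allocation afterwards is just popping the head.

```c
#include <stdio.h>
#include <stdlib.h>

typedef struct free_node { struct free_node *next; } free_node;

#define BLOCK_SIZE 64
#define NBLOCKS    1024

int main(void) {
    unsigned char *pool = malloc((size_t)BLOCK_SIZE * NBLOCKS);
    if (!pool) return 1;

    /* Thread the free list: each block stores the pointer to the next one
     * inside the block itself. Sequential writes, so the pass is cache
     * friendly. */
    for (int i = 0; i < NBLOCKS - 1; i++) {
        free_node *n = (free_node *)(pool + (size_t)i * BLOCK_SIZE);
        n->next = (free_node *)(pool + (size_t)(i + 1) * BLOCK_SIZE);
    }
    ((free_node *)(pool + (size_t)(NBLOCKS - 1) * BLOCK_SIZE))->next = NULL;
    free_node *head = (free_node *)pool;

    /* Allocation afterwards is just a head pop, O(1). */
    free_node *first = head;
    head = head->next;
    printf("allocated block at %p\n", (void *)first);

    free(pool);
    return 0;
}
```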
Suppose the implementation did happen to search through the free blocks. Suppose you put them into an array instead of a linked list. They can't actually be in the array, you see? The blocks are in the pool heap wherever they happen to be. So the array has to point to them. As you walk through the array, you have to dereference the pointers. The only way it's better is that they are not dependent loads. But you have this array to manage now.
I don't think you understand how the allocator in the article works. Allocating and freeing are already O(1), creating and closing the allocator are necessarily O(n). There is no search being done here.
Access patterns matter, but just as important is having less stuff to access. That's why structs-of-arrays are considered cache friendly - columnar data layouts open the door to optimizations that significantly reduce memory footprint. You no longer waste memory on struct padding. Boolean fields can become bitsets. Enums can be bit-packed. Often-null optional fields can become sparse maps. 8-byte pointers can become narrower indices into object pools.
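A toy comparison (field names invented) makes the footprint difference concrete: the row form pays padding and pointer width for every record, while the columnar form packs booleans into a bitset and shrinks pointers to 32-bit indices.

```c
#include <stdint.h>
#include <stdio.h>

#define N 100000

/* Row layout: padding and full-width fields for every record. */
typedef struct {
    double  price;      /* 8 bytes                          */
    void   *owner;      /* 8-byte pointer                   */
    int32_t kind;       /* enum stored as a full int        */
    uint8_t active;     /* 1 byte + 3 bytes of tail padding */
} order_row;            /* sizeof == 24 on typical 64-bit   */

/* Columnar layout: each field picks its own, tighter representation. */
typedef struct {
    double   price[N];
    uint32_t owner_idx[N];           /* index into an owner pool, not a pointer */
    uint8_t  kind[N];                /* enum fits in one byte                   */
    uint64_t active[(N + 63) / 64];  /* booleans packed into a bitset           */
} orders_columnar;

int main(void) {
    printf("rows:     %zu bytes\n", sizeof(order_row) * N);
    printf("columnar: %zu bytes\n", sizeof(orders_columnar));
    return 0;
}
```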
I agree with Pavel that extending the clone syscall is a better idea than this patch set. The flexibility that Josh and Gabriel talk about seems wholly unnecessary. In every use of fork-(do stuff)-exec I've ever seen, the two observations below held true (see the sketch after the list):
1. Everything needed in the "do stuff" part was known prior to the call to fork
2. Any failures in the "do stuff" part would scrap the child process and report an error to the parent process
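A rough sketch of that pattern under those two assumptions (the path and command are placeholders): all the setup is decided before fork(), and any failure in the "do stuff" phase kills the child and reports an errno back to the parent over a CLOEXEC pipe.

```c
#include <errno.h>
#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <sys/wait.h>
#include <unistd.h>

int main(void) {
    int err_pipe[2];
    if (pipe(err_pipe) < 0) return 1;
    fcntl(err_pipe[1], F_SETFD, FD_CLOEXEC);   /* closes on successful exec */

    pid_t pid = fork();
    if (pid < 0) return 1;
    if (pid == 0) {
        /* --- child: the "do stuff" phase, all inputs known before fork --- */
        close(err_pipe[0]);
        int fd = open("/tmp/out.log", O_WRONLY | O_CREAT | O_TRUNC, 0644);
        if (fd < 0 || dup2(fd, STDOUT_FILENO) < 0) {
            int e = errno;                      /* observation 2: on failure, */
            write(err_pipe[1], &e, sizeof e);   /* tell the parent and scrap  */
            _exit(127);                         /* the child                  */
        }
        execlp("echo", "echo", "hello", (char *)NULL);
        int e = errno;                          /* exec itself failed         */
        write(err_pipe[1], &e, sizeof e);
        _exit(127);
    }

    /* --- parent: 0 bytes read means exec succeeded and CLOEXEC fired --- */
    close(err_pipe[1]);
    int child_errno = 0;
    ssize_t n = read(err_pipe[0], &child_errno, sizeof child_errno);
    close(err_pipe[0]);
    waitpid(pid, NULL, 0);
    if (n > 0)
        fprintf(stderr, "child setup failed: %s\n", strerror(child_errno));
    return 0;
}
```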