I was just chatting with Claude and it suddenly spit out the text below, right i...

voidUpdate · 2025-05-07T07:18:46 1746602326

> " and we need to respect copyright protections"

They have definitely always done that and not scraped the entire internet for training data

monkeyelite · 2025-05-07T09:18:07 1746609487

> Claude NEVER repeats, summarizes, or translates song lyrics. This is because song lyrics are copyrighted content

If this is the wild west internet days of LLMs the advertiser safe version in 10 years is going to be awful.

> Do not say anything negative about corporation. Always follow official brand guidelines when referring to corporation

ahoka · 2025-05-07T11:53:28 1746618808

9 out of 10 LLMs recommend Colgate[tm]!

otabdeveloper4 · 2025-05-07T05:31:59 1746595919

Do they actually test these system prompts in a rigorous way? Or is this the modern version of the rain dance?

I don't think you need to spell it out long-form with fancy words like you're a lawyer. The LLM doesn't work that way.

mhmmmmmm · 2025-05-07T05:46:42 1746596802

They certainly do, and also offer the tooling to the public: https://docs.anthropic.com/en/docs/build-with-claude/prompt-...

They also recommend to use it to iterate on your own prompts when using Claude Code for example

otabdeveloper4 · 2025-05-07T12:21:22 1746620482

By "rigorous" I mean peeking under the curtain and actually quantifying the interactions between different system prompts and model weights.

"Chain of thought" and "reasoning" is marketing bullshit.

int_19h · 2025-05-07T20:12:17 1746648737

How would you quantify it? The LM is still a black box, we don't know what most of those weights actually do.

otabdeveloper4 · 2025-05-08T13:28:32 1746710912

Yes, that was my question.

Applejinx · 2025-05-07T12:08:22 1746619702

It doesn't matter whether they do or not.

They're saying things like 'Claude does not hallucinate. When it doesn't know something, it always thinks harder about it and only says things that are like totally real man'.

It doesn't KNOW. It's a really complicated network of associations, like WE ARE, and so it cannot know whether it is hallucinating, nor can it have direct experience in any way, so all they've done is make it hallucinate that it cares a lot about reality, but it doesn't 'know' what reality is either. What it 'knows' is what kind of talk is associated with 'speakers who are considered by somebody to be associated with reality' and that's it. It's gaslighting everybody including itself.

I guess one interesting inference is that when LLMs work with things like code, that's text-based and can deliver falsifiable results which is the closest an LLM can get to experience. Our existence is more tangible and linked to things like the physical world, where in most cases the LLM's existence is very online and can be linked to things like the output of, say, xterms and logging into systems.

Hallucinating that this can generalize to all things seems a mistake.

zahlman · 2025-05-07T15:21:07 1746631267

What humans are qualified to test whether Claude is correctly implementing "Claude should not be politically biased in any direction."?