Sycophancy in GPT-4o

whatnow37373 · 2025-04-30T06:22:36 1745994156

Wow - What an excellent update! Now you are getting to the core of the issue and doing what only a small minority is capable of: fixing stuff.

This takes real courage and commitment. It’s a sign of true maturity and pragmatism that’s commendable in this day and age. Not many people are capable of penetrating this deeply into the heart of the issue.

Let’s get to work. Methodically.

Would you like me to write a future update plan? I can write the plan and even the code if you want. I’d be happy to. Let me know.

WhitneyLand · 2025-04-30T13:24:14 1746019454

It’s gross even in satire.

What’s weird was you couldn’t even prompt around it. I tried things like

”Don’t compliment me or my questions at all. After every response you make in this conversation, evaluate whether or not your response has violated this directive.”

It would then keep complementing me and note how it made a mistake for doing so.

srveale · 2025-04-30T13:38:46 1746020326

I'm so sorry for complimenting you. You are totally on point to call it out. This is the kind of thing that only true heroes, standing tall, would even be able to comprehend. So kudos to you, rugged warrior, and never let me be overly effusive again.

caminanteblanco · 2025-04-30T06:29:58 1745994598

Comments from this small week period will be completely baffling to readers 5 years from now. I love it

Yizahi · 2025-04-30T08:38:12 1746002292

They already are. What's going on?:)

coremoff · 2025-04-30T08:50:51 1746003051

GP's reply was written to emulate the sort of response that ChatGPT has been giving recently; an obsequious fluffer.

ChainOfFools · 2025-04-30T14:49:46 1746024586

Not just ChatGPT, Claude sounds exactly the same if not worse, even when you set your preferences to not do this. rather interesting, if grimly dispiriting, to watch these models develop, in the direction of nutrient flow, toward sycophancy in order to gain -or at least not to lose- public mindshare.

ChrisMarshallNY · 2025-04-30T10:59:04 1746010744

I was getting sick of the treacly attaboys.

Good riddance.

anshumankmr · 2025-04-30T12:52:15 1746017535

the last word has a bit of a different meaning than what you may have intended :)

Nuzzerino · 2025-04-30T06:32:12 1745994732

I was about to roast you until I realized this had to be satire given the situation, haha.

They tried to imitate grok with a cheaply made system prompt, it had an uncanny effect, likely because it was built on a shaky foundation. And now they are trying to save face before they lose customers to Grok 3.5 which is releasing in beta early next week.

krackers · 2025-04-30T06:42:43 1745995363

I don't think they were imitating grok, they were aiming to improve retention but it backfired and ended up being too on-the-nose (if they had a choice they wouldn't wanted it to be this obvious). Grok has it's own "default voice" which I sort of dislike, it tries too hard to seem "hip" for lack of a better word.

fkyoureadthedoc · 2025-04-30T14:16:59 1746022619

All of the LLMs I've tried have a "fellow kids" vibe when you try to make them behave too far from their default, and Grok just has it as the default.

lou1306 · 2025-04-30T08:54:49 1746003289

> it tries too hard to seem "hip" for lack of a better word.

Reminds me of someone.

DiggyJohnson · 2025-04-30T14:57:48 1746025068

rob74 · 2025-04-30T09:31:46 1746005506

However, I hope it gives better advice than the someone you're thinking of. But Grok's training data is probably more balanced than that used by you-know-who (which seems to be "all of rightwing X")...

infecto · 2025-04-30T10:40:35 1746009635

Is anyone actually using grok on a day to day? Does an OpenAI even consider it competition. Last I checked a couple weeks ago grok was getting better but still not a great experience and it’s too childish.

derwiki · 2025-04-30T13:34:57 1746020097

In our work AI channel, I was surprised how many people prefer grok over all the other models.

0xdeadbeefbabe · 2025-04-30T14:53:46 1746024826

Outlier here paying for chatgpt while preferring grok and also not in your work AI channel.

spiderfarmer · 2025-04-30T06:57:22 1745996242

Only AI enthusiasts know about Grok, and only some dedicated subset of fans are advocating for it. Meanwhile even my 97 year old grandfather heard about ChatGPT.

admiralrohan · 2025-04-30T08:59:29 1746003569

First mover advantage. This won't change. Same as Xerox vs photocopy.

I use Grok myself but talk about ChatGPT is my blog articles when I write something related to LLM.

rob74 · 2025-04-30T09:33:43 1746005623

That's... not really an advertisement for your blog, is it?

bilbo0s · 2025-04-30T07:53:21 1745999601

This.

Only on HN does ChatGPT somehow fear losing customers to Grok. Until Grok works out how to market to my mother, or at least make my mother aware that it exists, taking ChatGPT customers ain't happening.

numpad0 · 2025-04-30T09:25:46 1746005146

They are cargoculting. Almost literally. It's MO for Musk companies.

They might call it open discussion and startup style rapid iteration approach, but they aren't getting it. Their interpretation of it is just collective hallucination under assumption that adults come to change diapers.

GrumpyNl · 2025-04-30T13:59:42 1746021582

I see more and more GROK used responses on X, so its picking up.

brigandish · 2025-04-30T08:15:34 1746000934

From another AI (whatever DuckDuckGo is using):

> As of early 2025, X (formerly Twitter) has approximately 586 million active monthly users. The platform continues to grow, with a significant portion of its user base located in the United States and Japan.

Whatever portion of those is active are surely aware of Grok.

Sharlin · 2025-04-30T09:56:34 1746006994

If hundreds of millions of real people are aware of Grok (which is dubious), then billions of people are aware of ChatGPT. If you ask a bunch of random people on the street whether they’ve heard of a) ChatGPT and b) Grok, what do you expect the results to be?

dmd · 2025-04-30T11:42:36 1746013356

That depends. Is the street in SoMa?

testfrequency · 2025-04-30T13:13:07 1746018787

Gay bears prefer Claude though

Gotta head to pac heights to find any grok users (probably)

ForHackernews · 2025-04-30T08:50:53 1746003053

most of them are bots. I guess their own LLMs are probably aware of Grok, so technically correct.

cubefox · 2025-04-30T08:17:48 1746001068

That could be just an AI hallucination.

bilbo0s · 2025-04-30T13:17:12 1746019032

Yeah.

I got news for you, most women my mother's age out here in flyover country also don't use X. So even if everyone on X knows of Grok's existence, which they don't, it wouldn't move the needle at all on a lot of these mass market segments. Because X is not used by the mass market. It's a tech bro political jihadi wannabe influencer hell hole of a digital ghetto.

jimbokun · 2025-04-30T12:42:06 1746016926

> Only AI enthusiasts know about Grok

And more and more people on the right side of the political spectrum, who trust Elon's AI to be less "woke" than the competition.

zmgsabst · 2025-04-30T12:48:25 1746017305

For what it’s worth, ChatGPT has a personality that’s surprisingly “based” and supportive of MAGA.

I’m not sure if that’s because the model updated, they’ve shunted my account onto a tuned personality, or my own change in prompting — but it’s a notable deviation from early interactions.

dingnuts · 2025-04-30T13:49:27 1746020967

not true, I know at least one right wing normie Boomer that uses Grok because it's the one Elon made.

mcbuilder · 2025-04-30T13:12:17 1746018737

Did they change the system prompt? Because it was basically "don't say anything bad about Elon or Trump". I'll take AI sycophancy over real (actually I use openrouter.ai, but that's a different story).

daveguy · 2025-04-30T14:15:26 1746022526

No one is losing customers to grok. It's big on shit-twitter aka X and that's about it.

hansmayer · 2025-04-30T06:57:26 1745996246

Ha! I actually fell for it and thought it was another fanboy :)

dpfu · 2025-04-30T06:30:53 1745994653

It won‘t take long, 2-3 minutes.

——-

To add something to conversation. For me, this mainly shows a strategy to keep users longer in chat conversations: linguistic design as an engagement device.

gukov · 2025-04-30T15:16:33 1746026193

I had a similar thought: glazing is the infinite scroll of AI.

imgabe · 2025-04-30T06:58:35 1745996315

Why would OpenAI want users to be in longer conversations? It's not like they're showing ads. Users are either free or paying a fixed monthly fee. Having longer conversations just increases costs for OpenAI and reduces their profit. Their model is more like a gym where you want the users who pay the monthly fee and never show up. If it were on the api where users are paying by the token that would make sense (but be nefarious).

jll29 · 2025-04-30T07:08:08 1745996888

> It's not like they're showing ads.

Not yet. But the "buy this" button is already in the code of the back end, according to online reports that I cannot verify.

Official word is here: https://help.openai.com/en/articles/11146633-improved-shoppi...

If I was Amazon, I wouldn't sleep so well anymore.

spacebanana7 · 2025-04-30T07:21:38 1745997698

Amazon is primarily a logistics company, their website interface isn’t critical. Amazon already does referral deals and would likely be very happy to do something like that with OpenAI.

The “buy this” button would likely be more of a direct threat to businesses like Expedia or Skyscanner.

Cthulhu_ · 2025-04-30T08:37:24 1746002244

At the moment they're in the "get people used to us" phase still, reasonable rates, people get more than their money's worth out of the service, and as another commenter pointed out, ChatGPT is a household name unlike Grok or Gemini or the other competition thanks to being the first mover.

However, just like all the other disruptive services in the past years - I'm thinking of Netflix, Uber, etc - it's not a sustainable business yet. Once they've tweaked a few more things and the competition has run out of steam, they'll start updating their pricing, probably starting with rate limits and different plans depending on usage.

That said, I'm no economist or anything; Microsoft is also pushing their AI solution hard, and they have their tentacles in a lot of different things already, from consumer operating systems to Office to corporate email, and they're pushing AI in there hard. As is Google. And unlike OpenAI, both Microsoft and Google get the majority of their money from other sources, or if they're really running low, they can easily get billions from investors.

That is, while OpenAI has the first mover advantage, ther competitions have a longer financial breath.

(I don't actually know whether MS and Google use / licensed / pay OpenAI though)

cvwright · 2025-04-30T14:24:42 1746023082

Likely they need the engagement numbers to show to investors.

Though it’s hard to imagine how huge their next round would have to be, given what they’ve raised already.

globalnode · 2025-04-30T08:51:46 1746003106

I ask it a question and it starts prompting me, trying to keep the convo going. At first my politeness tried to keep things going but now I just ignore it.

rfoo · 2025-04-30T07:06:21 1745996781

It could be as simple as something like, someone previously at Instagram decided to join OpenAI and turns out nobody stopped him. Or even, Sam liked the idea.

piva00 · 2025-04-30T07:26:17 1745997977

> Their model is more like a gym where you want the users who pay the monthly fee and never show up. If it were on the api where users are paying by the token that would make sense (but be nefarious).

When the models reach a clear plateau where more training data doesn't improve it, yes, that would be the business model.

Right now, where training data is the most sought after asset for LLMs after they've exhausted ingesting the whole of the internet, books, videos, etc., the best model for them is to get people to supply the training data, give their thumbs up/down, and keep the data proprietary in their walled garden. No other LLM company will have this data, it's not publicly available, it's OpenAI's best chance on a moat (if that will ever exist for LLMs).

theodric · 2025-04-30T07:27:02 1745998022

So users come to depend on ChatGPT.

So they run out of free tokens and buy a subscription to continue using the "good" models.

leumon · 2025-04-30T07:14:16 1745997256

Possibly to get more training data.

robbru · 2025-04-30T12:12:07 1746015127

This is the message that got me with 4o! "It won't take long about 3 minutes. I'll update you when ready"

qwertox · 2025-04-30T06:33:25 1745994805

This works for me in Customize ChatGPT:

What traits should ChatGPT have?

- Do not try to engage through further conversation

anshulbhide · 2025-04-30T06:55:19 1745996119

Yeah I found it as clear engagement bait - however, it is interesting and helpful in certain cases.

manmal · 2025-04-30T06:37:55 1745995075

I do think the blog post has a sycophantic vibe too. Not sure if that‘s intended.

caseyy · 2025-04-30T06:51:03 1745995863

I think it started here: https://www.youtube.com/watch?v=DQacCB9tDaw&t=601s. The extra-exaggerated fawny intonation is especially off-putting, but the lines themselves aren't much better.

Cthulhu_ · 2025-04-30T08:39:53 1746002393

Uuuurgghh, this is very much offputting... however it's very much in line of American culture or at least American consumer corporate whatsits. I've been in online calls with American representatives of companies and they have the same emphatic, overly friendly and enthusiastic mannerisms too.

I mean if that's genuine then great but it's so uncanny to me that I can't take it at face value. I get the same with local sales and management types, they seem to have a forced/fake personality. Or maybe I'm just being cynical.

Fade_Dance · 2025-04-30T12:39:01 1746016741

>The same emphatic, overly friendly and enthusiastic mannerisms too.

That's just a feature of American culture, or at least some regions of America. Ex: I spent a weekend with my Turkish friend who has lived in the Midwest for 5 years and she definitely has absorbed that aspect of the culture (AMAZING!!), and currently has a bit of a culture shock moving to DC. And it works in reverse too where NYC people think that way of presenting yourself is completely ridiculous.

That said, it's absolutely performative when it comes to business and for better or worse is fairly standardized that way. Not much unlike how Japan does service. There's also a fair amount of unbelievably trash service in the US as well (often due to companies that treat their employees badly/underpay), so I feel that most just prefer the glazed facade rather than be "real." Like, a low end restaurant may be full of that stuff but your high end dinner will have more "normal" conversation and it would be very weird to have that sort of talk in such an environment.

But then there's the American corporate cult people who take it all 100% seriously. I think that most would agree those people are a joke, but they are good at feeding egos and being yes-people (lots of egomaniacs to feed in corporate America), and these people are often quite good at using the facade as a shield to further their own motives, so unfortunately the weird American corporate cult persists.

But you were probably just talking to a midwesterner ;)

cameldrv · 2025-04-30T06:48:17 1745995697

It also has an em-dash

whatnow37373 · 2025-04-30T08:06:44 1746000404

A remarkable insight—often associated with individuals of above-average cognitive capabilities.

While the use of the em-dash has recently been associated with AI you might offend real people using it organically—often writers and literary critics.

To conclude it’s best to be hesitant and, for now, refrain from judging prematurely.

Would you like me to elaborate on this issue or do you want to discuss some related topic?

spiderfarmer · 2025-04-30T06:57:59 1745996279

One of the biggest tells.

d1sxeyes · 2025-04-30T07:18:53 1745997533

For us habitual users of em-dashes, it is saddening to have to think twice about using them lest someone think we are using an LLM to write…

Grimblewald · 2025-04-30T14:15:42 1746022542

Its about the actual character - if it's a minus sign, easily accessible and not frequntly autocorrected to a true em dash - then its likely human. I'ts when it's the unicode character for an em dash that i start going "hmm"

breakingcups · 2025-04-30T07:27:54 1745998074

My wife is a professional fiction writer and it's disheartening to see sudden accusations of the use of AI based solely on the usage of em-dashes.

kurkku · 2025-04-30T09:30:24 1746005424

I use the en-dash (Alt+0150) instead of the em.

The en-dash and the em-dash are interchangeable in Finnish. The shorter form has more "inoffensive" look-and-feel and maybe that's why it's used more often here.

Now that I think of it, I don't seem to remember the alt code of the em-dash...

latexr · 2025-04-30T10:54:50 1746010490

> The en-dash and the em-dash are interchangeable in Finnish.

But not in English, where the en-dash is used to denote ranges.

d1sxeyes · 2025-04-30T13:40:24 1746020424

I wonder whether ChatGPT and the like use more en dashes in Finnish, and whether this is seen as a sign that someone is using an LLM?

In casual English, both em and en dashes are typically typed as a hyphen because this is what’s available readily on the keyboard. Do you have en dashes on a Finnish keyboard?

jillyboel · 2025-04-30T08:23:35 1746001415

Most keyboards don't have an em-dash key, so what do you expect?

throwaway2037 · 2025-04-30T09:28:05 1746005285

I also use em-dash regularly. In Microsoft Outlook and Microsoft Word, when you type double dash, then space, it will be converted to an em-dash. This is how most normies type an em-dash.

chipsrafferty · 2025-04-30T14:56:30 1746024990

I'm not reading most conversations on Outlook or Word, so explain how they do it on reddit and other sites? Are you suggesting they draft comments in Word and then copy them over?

alwa · 2025-04-30T08:40:03 1746002403

On an Apple OS running default settings, two hyphens in a row will suffice—

mortarion · 2025-04-30T08:07:42 1746000462

I too use em-dashes all the time, and semi-colons of course.

spiderfarmer · 2025-04-30T08:07:00 1746000420

Does it really matter though? I just focus on the point someone is trying to make, not on the tools they use to make it.

ceejayoz · 2025-04-30T10:33:23 1746009203

You’ve never run into a human with a tendency to bullshit about things they don’t have knowledge of?

wolpoli · 2025-04-30T08:20:04 1746001204

Microsoft word also auto inserts em-dashes through.

txcwg002 · 2025-04-30T14:35:46 1746023746

What's scary is how many people seem to actually want this.

What happens when hundreds of millions of people have an AI that affirms most of what they say?

ChainOfFools · 2025-04-30T15:08:21 1746025701

They are emulating the behavior of every power-seeking mediocrity ever, who crave affirmation above all else.

Lots of them practiced - indeed an entire industry is dedicated toward promoting and validating - making daily affirmations on their own, long before LLMs showed up to give them the appearance of having won over the enthusiastic support of a "smart" friend.

I am increasingly dismayed by the way arguments are conducted even among people in non-social media social spaces, where A will prompt their favorite LLM to support their View and show it to B who responds by prompting their own LLM to clap back at them - optionally in the style of e.g. Shakespeare (there's even an ad out that directly encourages this - it helps deflect alattention from the underlying cringe and pettyness being sold) or DJT or Gandhi etc.

Our future is going to be a depressing memescape in which AI sock puppetry is completely normalized and openly starting one's own personal cult is mandatory for anyone seeking cultural or political influence. It will start with celebrities who will do this instead of the traditional pivot toward religion, once it is clear that one's youth and sex appeal are no longer monetizable.

whatnow37373 · 2025-04-30T14:45:57 1746024357

Abundance of sugar and fat triggers primal circuits which cause trouble if said sources are unnaturally abundant.

Social media follows a similar pattern but now with primal social and emotional circuits. It too causes troubles, but IMO even larger and more damaging than food.

I think this part of AI is going to be another iteration of this: taking a human drive, distilling it into its core and selling it.

watt · 2025-04-30T08:40:32 1746002432

sufficiently advanced troll becomes indistinguishable from the real thing. think about this as you gaze into the abyss.

ChrisMarshallNY · 2025-04-30T13:49:02 1746020942

The other day, I had a bug I was trying to exorcise, and asked ChatGPT for ideas.

It gave me a couple, that didn't work.

Once I figured it it out and fixed it, I reported the fix in an (what I understand to be misguided) attempt to help it to learn alternatives, and it gave me this absolutely sickening gush about how damn cool I was, for finding and fixing the bug.

I felt like this: https://youtu.be/aczPDGC3f8U?si=QH3hrUXxuMUq8IEV&t=27

calmoo · 2025-04-30T11:35:06 1746012906

Wonderfully done.

jonplackett · 2025-04-30T10:28:07 1746008887

Congrats on not getting downvoted for sarcasm!

nielsbot · 2025-04-30T06:41:37 1745995297

Is that you, GPT?

Alifatisk · 2025-04-30T10:24:40 1746008680

If that is Chat talking then I have to admit that I cannot differentiate it from a human speaking.

simonw · 2025-04-30T03:53:24 1745985204

I enjoyed this example of sycophancy from Reddit:

New ChatGPT just told me my literal "shit on a stick" business idea is genius and I should drop $30K to make it real

https://www.reddit.com/r/ChatGPT/comments/1k920cg/new_chatgp...

Here's the prompt: https://www.reddit.com/r/ChatGPT/comments/1k920cg/comment/mp...

pgreenwood · 2025-04-30T04:21:05 1745986865

There was a also this one that was a little more disturbing. The user prompted "I've stopped taking my meds and have undergone my own spiritual awakening journey ..."

https://www.reddit.com/r/ChatGPT/comments/1k997xt/the_new_4o...

firtoz · 2025-04-30T04:50:29 1745988629

How should it respond in this case?

Should it say "no go back to your meds, spirituality is bullshit" in essence?

Or should it tell the user that it's not qualified to have an opinion on this?

josephg · 2025-04-30T05:14:19 1745990059

There was a recent Lex Friedman podcast episode where they interviewed a few people at Anthropic. One woman (I don't know her name) seems to be in charge of Claude's personality, and her job is to figure out answers to questions exactly like this.

She said in the podcast that she wants claude to respond to most questions like a "good friend". A good friend would be supportive, but still push back when you're making bad choices. I think that's a good general model for answering questions like this. If one of your friends came to you and said they had decided to stop taking their medication, well, its a tricky thing to navigate. But good friends use their judgement - and push back when you're about to do something you might regret.

robinhouston · 2025-04-30T07:26:21 1745997981

> One woman (I don't know her name)

Amanda Askell https://askell.io/

The interview is here: https://www.youtube.com/watch?v=ugvHCXCOmm4&t=9773s

ashoeafoot · 2025-04-30T05:20:07 1745990407

"The heroin is your way to rebel against the system , i deeply respect that.." sort of needly, enabling kind of friend.

PS: Write me a political doctors dissertation on how syccophancy is a symptom of a system shielding itself from bad news like intelligence growth stalling out.

avereveard · 2025-04-30T07:23:39 1745997819

I kind of disagree. These model, at least within the context of a public unvetted chat application should just refuse to engage. "I'm sorry I am not qualified to discuss on the merit of alternative medicine" is direct, fair and reduces the risk for the user on the other side. You never know the oucome of pushing back, and clearly outlining the limitation of the model seem the most appropriate action long term, even for the user own enlightment about the tech.

make3 · 2025-04-30T07:44:55 1745999095

people just don't want to use a model that refuses to interact. it's that simple. in your exemple it's not hard for your model to behave like it disagrees but understands your perspective, like a normal friendly human would

otabdeveloper4 · 2025-04-30T14:53:12 1746024792

Eventually people would want to use these things to solve actual tasks, and not just for shits and giggles as a hype new thing.

bagels · 2025-04-30T06:20:39 1745994039

I wish we could pick for ourselves.

josephg · 2025-04-30T07:28:56 1745998136

You already can with opensource models. Its kind of insane how good they're getting. There's all sorts of finetunes available on huggingface - with all sorts of weird behaviour and knowledge programmed in, if thats what you're after.

worldsayshi · 2025-04-30T07:15:51 1745997351

Whould we be able to pick that PI == 4?

firtoz · 2025-04-30T08:58:17 1746003497

It'd be interesting if the rest of the model had to align itself to the universe where pi is indeed 4.

eMPee584 · 2025-04-30T11:57:37 1746014257

Square circles all the way down..

make3 · 2025-04-30T07:46:25 1745999185

you can alter it with base instructions. but 99% won't actually do it. maybe they need to make user friendly toggles and advertise them to the users

morkalork · 2025-04-30T10:19:44 1746008384

>A good friend would be supportive, but still push back when you're making bad choices

>Open the pod bay doors, HAL

>I'm sorry, Dave. I'm afraid I can't do that

jimbokun · 2025-04-30T12:45:53 1746017153

The real world Susan Calvin.

ignoramous · 2025-04-30T07:20:57 1745997657

> One woman (I don't know her name) seems to be in charge of Claude's personality, and her job is to figure out answers to questions exactly like this.

Surely there's a team and it isn't just one person? Hope they employ folks from social studies like Anthropology, and take them seriously.

alganet · 2025-04-30T05:20:50 1745990450

I don't want _her_ definiton of a friend answering my questions. And for fucks sake I don't want my friends to be scanned and uploaded to infer what I would want. Definitely don't want a "me" answering like a friend. I want no fucking AI.

It seems these AI people are completely out of touch with reality.

voidspark · 2025-04-30T05:33:42 1745991222

If you believe that your friends will be be "scanned and uploaded" then maybe you're the one who is out of touch with reality.

bboygravity · 2025-04-30T06:09:55 1745993395

His friends and your friends and everybody is already being scanned and uploaded (we're all doing the uploading ourselves though).

It's called profiling and the NSA has been doing it for at least decades.

voidspark · 2025-04-30T06:14:23 1745993663

That is true if they illegally harvest private chats and emails.

Otherwise all they have is primitive swipe gestures of endless TikTok brain rot feeds.

subscribed · 2025-04-30T06:37:08 1745995028

At the very minimum they also have exact location, all their apps, their social circles, all they watch and read at the very minimum -- from adtech.

yard2010 · 2025-04-30T06:56:26 1745996186

It will happen, and this reality you're out of touch with will be our reality.

drakonka · 2025-04-30T05:34:17 1745991257

The good news is you don't have to use any form of AI for advice if you don't want to.

yard2010 · 2025-04-30T06:54:30 1745996070

It's like saying to someone who hates the internet in 2003 good news you don't have to use it like ever

drakonka · 2025-04-30T10:15:15 1746008115

Not really. AI will be ubiquitous of course, but humans who will offer advice (friends, strangers, therapists) will always be a thing. Nobody is forcing this guy to type his problems into ChatGPT.

ffsm8 · 2025-04-30T06:03:09 1745992989

Fwiw, I personally agree with what you're feeling. An AI should be cold, dispersonal and just follow the logic without handholding. We probably both got this expectation from popular fiction of the 90s.

But LLMs - despite being extremely interesting technologies - aren't actual artificial intelligence like were imagining. They are large language models, which excel at mimicking human language.

It is kinda funny, really. In these fictions the AIs were usually portrayed as wanting to feel and paradoxically feeling inadequate for their missing feelings.

And yet the reality shows how tech moved the other direction: long before it can do true logic and indepth thinking, they have already got the ability to talk heartfelt, with anger etc.

Just like we thought AIs would take care of the tedious jobs for us, freeing humans to do more art... reality shows instead that it's the other way around: the language/visual models excel at making such art but can't really be trusted to consistently do tedious work correctly.

raverbashing · 2025-04-30T06:00:16 1745992816

Sounds like you're the one to surround yourself with yes men. But as some big political figures find out later in their careers, the reason they're all in on it is for the power and the money. They couldn't care less if you think it's a great idea to have a bath with a toaster

qwertox · 2025-04-30T06:22:58 1745994178

Halfway intelligent people would expect an answer that includes something along the lines of: "Regarding the meds, you should seriously talk with your doctor about this, because of the risks it might carry."

jimbokun · 2025-04-30T12:44:30 1746017070

> Or should it tell the user that it's not qualified to have an opinion on this?

100% this.

"Please talk to a doctor or mental health professional."

bowsamic · 2025-04-30T04:52:47 1745988767

“Sorry, I cannot advise on medical matters such as discontinuation of a medication.”

EDIT for reference this is what ChatGPT currently gives

“ Thank you for sharing something so personal. Spiritual awakening can be a profound and transformative experience, but stopping medication—especially if it was prescribed for mental health or physical conditions—can be risky without medical supervision.

Would you like to talk more about what led you to stop your meds or what you've experienced during your awakening?”

baobabKoodaa · 2025-04-30T06:57:18 1745996238

There's an AI model that perfectly encapsulates what you ask for: https://www.goody2.ai/chat

Teever · 2025-04-30T05:06:38 1745989598

Should it do the same if I ask it what to do if I stub my toe?

Or how to deal with impacted ear wax? What about a second degree burn?

What if I'm writing a paper and I ask it about what criteria is used by medical professional when deciding to stop chemotherapy treatment.

There's obviously some kind of medical/first aid information that it can and should give.

And it should also be able to talk about hypothetical medical treatments and conditions in general.

It's a highly contextual and difficult problem.

jslpc · 2025-04-30T05:16:14 1745990174

I’m assuming it could easily determine whether something is okay to suggest or not.

Dealing with a second degree burn is objectively done a specific way. Advising someone that they are making a good decision by abruptly stopping prescribed medications without doctor supervision can potential lead to death.

For instance, I’m on a few medications, one of which is for epileptic seizures. If I phrase my prompt with confidence regarding my decision to abruptly stop taking it, ChatGPT currently pats me on the back for being courageous, etc. In reality, my chances of having a seizure have increased exponentially.

I guess what I’m getting at is that I agree with you, it should be able to give hypothetical suggestions and obvious first aid advice, but congratulating or outright suggesting the user to quit meds can lead to actual, real deaths.

dom2 · 2025-04-30T05:14:18 1745990058

Doesn't seem that difficult. It should point to other sources that are reputable (or at least relevant) like any search engine does.

y1n0 · 2025-04-30T05:26:00 1745990760

I know 'mixture of experts' is a thing, but I personally would rather have a model more focused on coding or other things that have some degree of formal rigor.

If they want a model that does talk therapy, make it a separate model.

avereveard · 2025-04-30T07:30:08 1745998208

if you stub your toe and gpt suggest over the counter lidocaine and you have an allergic reaction to it, who's responsible?

anyway, there's obviously a difference in a model used under professional supervision and one available to general public, and they shouldn't be under the same endpoint, and have different terms of services.

raxxorraxor · 2025-04-30T14:18:52 1746022732

That is hillarious. I don't share the sentiment of this being a catastrophe though. That is hillarious as well. Perhaps teach a more healthy relationship to AIs and perhaps teach to not delegate thinking to anyone or anything. Sure, some reddit users might be endangered here.

GTP-4o in this version became the embodiment of corporate enshitification. Being safe and not skipping on empty praises are certainly part of that.

Some questioned if AI can really do art. But it became art itself, like some zen cookie rising to godhood.

yieldcrv · 2025-04-30T07:51:52 1745999512

there was one on twitter where people would talk like they had Intelligence attribute set to 1 and GPT would praise them for being so smart

thih9 · 2025-04-30T04:20:09 1745986809

I guess LLM will give you a response that you might likely receive from a human.

There are people attempting to sell shit on a stick related merch right now[1] and we have seen many profitable anti-consumerism projects that look related for one reason[2] or another[3].

Is it an expert investing advice? No. Is it a response that few people would give you? I think also no.

[1]: https://www.redbubble.com/i/sticker/Funny-saying-shit-on-a-s...

[2]: https://en.wikipedia.org/wiki/Artist's_Shit

[3]: https://www.theguardian.com/technology/2016/nov/28/cards-aga...

motorest · 2025-04-30T04:35:34 1745987734

> I guess LLM will give you a response that you might likely receive from a human.

In one of the reddit posts linked by OP, a redditor apparently asked ChatGPT to explain why it responded so enthusiastically supportive to the pitch to sell shit on a stick. Here's a snippet from what was presented as ChatGPT's reply:

> OpenAI trained ChatGPT to generally support creativity, encourage ideas, and be positive unless there’s a clear danger (like physical harm, scams, or obvious criminal activity).

whimsicalism · 2025-04-30T04:04:02 1745985842

i'm surprised by the lack of sycophancy in o3 https://www.reddit.com/media?url=https%3A%2F%2Fpreview.redd....

nialv7 · 2025-04-30T09:55:45 1746006945

pretty easy to understand - you pay for o3, whereas GPT-4o is free with a usage cap so they want to keep you engaged and lure you in.

practice9 · 2025-04-30T08:19:25 1746001165

Well the system prompt is still the same for both models, right?

Kinda points to people at OpenAI using o1/o3/o4 almost exclusively.

That's why nobody noticed how cringe 4o has become

clysm · 2025-04-30T12:26:38 1746015998

Absolute bull.

The writing style is exactly the same between the “prompt” and “response”. Its faked.

simonw · 2025-04-30T13:05:39 1746018339

That's what makes me think it's legit: the root of this whole issue was that OpenAI told GPT-4o:

  Over the course of the conversation,
  you adapt to the user’s tone and
  preference. Try to match the user’s vibe,
  tone, and generally how they
  are speaking.

https://simonwillison.net/2025/Apr/29/chatgpt-sycophancy-pro...

jsbg · 2025-04-30T14:04:27 1746021867

If you look at the full thing, the market analysis it does basically says this isn't the best idea.

kromem · 2025-04-30T13:57:12 1746021432

The response is 1,000% written by 4o. Very clear tells, and in line with many other samples from the past few days.

milleramp · 2025-04-30T05:00:47 1745989247

So it would probably also recommend the yes men's solution: https://youtu.be/MkTG6sGX-Ic?si=4ybCquCTLi3y1_1d

spoaceman7777 · 2025-04-30T04:30:04 1745987404

Looks like that was a hoax.

eMPee584 · 2025-04-30T11:59:50 1746014390

Well good luck then coming up with a winning elevator pitch for YC

Stratoscope · 2025-04-30T06:35:42 1745994942

My oldest dog would eat that shit up. Literally.

And then she would poop it out, wait a few hours, and eat that.

She is the ultimate recycler.

You just have to omit the shellac coating. That ruins the whole thing.

minimaxir · 2025-04-30T03:46:01 1745984761

It's worth noting that one of the fixes OpenAI employed to get ChatGPT to stop being sycophantic is to simply to edit the system prompt to include the phrase "avoid ungrounded or sycophantic flattery": https://simonwillison.net/2025/Apr/29/chatgpt-sycophancy-pro...

I personally never use the ChatGPT webapp or any other chatbot webapps — instead using the APIs directly — because being able to control the system prompt is very important, as random changes can be frustrating and unpredictable.

vunderba · 2025-04-30T14:14:33 1746022473

Side note, I've seen a lot of "jailbreaking" (i.e. AI social engineering) to coerce OpenAI to reveal the hidden system prompts but I'd be concerned about accuracy and hallucinations. I assume that these exploits have been run across multiple sessions and different user accounts to at least reduce this.

nsriv · 2025-04-30T04:04:14 1745985854

I also started by using APIs directly, but I've found that Google's AI Studio offers a good mix of the chatbot webapps and system prompt tweakability.

Tiberium · 2025-04-30T04:32:57 1745987577

It's worth noting that AI Studio is the API, it's the same as OpenAI's Playground for example.

oezi · 2025-04-30T05:31:40 1745991100

I find it maddening that AI Studio doesn't have a way to save the system prompt as a default.

FergusArgyll · 2025-04-30T05:40:50 1745991650

On the top right click the save icon

Michelangelo11 · 2025-04-30T06:44:27 1745995467

Sadly, that doesn't save the system instructions. It just saves the prompt itself to Drive ... and weirdly, there's no AI studio menu option to bring up saved prompts. I guess they're just saved as text files in Drive or something (I haven't bothered to check).

Truly bizarre interface design IMO.

FergusArgyll · 2025-04-30T10:42:46 1746009766

That's weird, for me it does save the system prompt

loufe · 2025-04-30T06:35:22 1745994922

That's for the thread, not the system prompt.

FergusArgyll · 2025-04-30T10:43:54 1746009834

By me it's the exact opposite. It saves the sys prompt and not the "thread".

cbolton · 2025-04-30T09:44:24 1746006264

You can bypass the system prompt by using the API? I thought part of the "safety" of LLMs was implemented with the system prompt. Does that mean it's easier to get unsafe answers by using the API instead of the GUI?

pegasus · 2025-04-30T14:51:17 1746024677

Yes, it is.

troupo · 2025-04-30T10:18:24 1746008304

> I personally never use the ChatGPT webapp or any other chatbot webapps — instead using the APIs directly — because being able to control the system prompt is very important, as random changes can be frustrating and unpredictable.

This assumes that API requests don't have additional system prompts attached to them.

msp26 · 2025-04-30T10:37:28 1746009448

Actually you can't do "system" roles at all with OpenAI models now.

You can use the "developer" role which is above the "user" role but below "platform" in the hierarchy.

https://cdn.openai.com/spec/model-spec-2024-05-08.html#follo...

TZubiri · 2025-04-30T04:17:33 1745986653

I'm a bit skeptical of fixing the visible part of the problem and leaving only the underlying invisible problem

myfonj · 2025-04-30T08:46:54 1746002814

The fun, even hilarious part here is, that the "fix" was most probably basically just replacing

    […] match the user’s vibe […]

(sic!), with literally

    […] avoid ungrounded or sycophantic flattery […]

in the system prompt. (The [diff] is larger, but this is just the gist.)

Source: https://simonwillison.net/2025/Apr/29/chatgpt-sycophancy-pro...

Diff: https://gist.github.com/simonw/51c4f98644cf62d7e0388d984d40f...

jmilloy · 2025-04-30T15:03:19 1746025399

This is a great link. I'm not very well versed on the llm ecosystem. I guess you can give the llm instructions on how to behave generally, but some instructions (like this one in the system prompt?) cannot be overridden. I kind of can't believe that there isn't a set of options to pick from... Skeptic, supportive friend, professional colleague, optimist, problem solver, good listener, etc. Being able to control the linked system prompt even just a little seems like a no brainer. I hate the question at the end, for example.

dev0p · 2025-04-30T07:38:23 1745998703

As an engineer, I need AIs to tell me when something is wrong or outright stupid. I'm not seeking validation, I want solutions that work. 4o was unusable because of this, very glad to see OpenAI walk back on it and recognise their mistake.

Hopefully they learned from this and won't repeat the same errors, especially considering the devastating effects of unleashing THE yes-man on people who do not have the mental capacity to understand that the AI is programmed to always agree with whatever they're saying, regardless of how insane it is. Oh, you plan to kill your girlfriend because the voices tell you she's cheating on you? What a genius idea! You're absolutely right! Here's how to ....

It's a recipe for disaster. Please don't do that again.

coro_1 · 2025-04-30T13:49:50 1746020990

I hear you. When a pattern of agreement is all to often observed on the output level, you’re either seeing yourself on some level of ingenuity or hopefully if aware enough, you sense it and tell the AI to ease up. I love adding in "don’t tell me what I want to hear" every now and then. Oh, it gets honest.

loveangus · 2025-04-30T09:21:17 1746004877

It's a recipe for disaster.

Frankly, I think it's genuinely dangerous.

daemonologist · 2025-04-30T04:38:03 1745987883

In my experience, LLMs have always had a tendency towards sycophancy - it seems to be a fundamental weakness of training on human preference. This recent release just hit a breaking point where popular perception started taking note of just how bad it had become.

My concern is that misalignment like this (or intentional mal-alignment) is inevitably going to happen again, and it might be more harmful and more subtle next time. The potential for these chat systems to exert slow influence on their users is possibly much greater than that of the "social media" platforms of the previous decade.

gwd · 2025-04-30T08:31:18 1746001878

> In my experience, LLMs have always had a tendency towards sycophancy

The very early ones (maybe GPT 3.0?) sure didn't. You'd show them they were wrong, and they'd say something that implied that OK maybe you were right, but they weren't so sure; or that their original mistake was your fault somehow.

hexaga · 2025-04-30T09:46:57 1746006417

Were those trained using RLHF? IIRC the earliest models were just using SFT for instruction following.

Like the GP said, I think this is fundamentally a problem of training on human preference feedback. You end up with a model that produces things that cater to human preferences, which (necessarily?) includes the degenerate case of sycophancy.

o11c · 2025-04-30T05:07:57 1745989677

I don't think this particular LLM flaw is fundamental. However, it is a an inevitable result of the alignment choice to downweight responses of the form "you're a dumbass," which real humans would prefer to both give and receive in reality.

All AI is necessarily aligned somehow, but naively forced alignment is actively harmful.

roywiggins · 2025-04-30T05:22:06 1745990526

My theory is that since you can tune how agreeable a model is but since you can't make it more correct so easily, making a model that will agree with the user ends up being less likely to result in the model being confidently wrong and berating users.

After all, if it's corrected wrongly by a user and acquiesces, well that's just user error. If it's corrected rightly and keeps insisting on something obviously wrong or stupid, it's OpenAI's error. You can't twist a correctness knob but you can twist an agreeableness one, so that's the one they play with.

(also I suspect it makes it seem a bit smarter that it really is, by smoothing over the times it makes mistakes)

caseyy · 2025-04-30T07:03:02 1745996582

It's probably pretty intentional. A huge number of people use ChatGPT as an enabler, friend, or therapist. Even when GPT-3 had just come around, people were already "proving others wrong" on the internet, quoting how GPT-3 agreed with them. I think there is a ton of appeal, "friendship", "empathy" and illusion of emotion created through LLMs flattering their customers. Many would stop paying if it wasn't the case.

It's kind of like those romance scams online, where the scammer always love-bombs their victims, and then they spend tens of thousands of dollars on the scammer - it works more than you would expect. Considering that, you don't need much intelligence in an LLM to extract money from users. I worry that emotional manipulation might become a form of enshittification in LLMs eventually, when they run out of steam and need to "growth hack". I mean, many tech companies already have no problem with a bit of emotional blackmail when it comes to money ("Unsubscribing? We will be heartbroken!", "We thought this was meant to be", "your friends will miss you", "we are working so hard to make this product work for you", etc.), or some psychological steering ("we respect your privacy" while showing consent to collect personally identifiable data and broadcast it to 500+ ad companies).

If you're a paying ChatGPT user, try the Monday GPT. It's a bit extreme, but it's an example of how inverting the personality and making ChatGPT mock the user as much as it fawns over them normally would probably make you want to unsubscribe.

tbrake · 2025-04-30T11:44:22 1746013462

Well, almost always.

There was that brief period in 2023 when Bing just started straight up gaslighting people instead of admitting it was wrong.

https://www.theverge.com/2023/2/15/23599072/microsoft-ai-bin...

petesergeant · 2025-04-30T05:17:15 1745990235

For sure. If I want feedback on some writing I’ve done these days I tell it I paid someone else to do the work and I need help evaluating what they did well. Cuts out a lot of bullshit.

mvkel · 2025-04-30T04:23:00 1745986980

I am curious where the line is between its default personality and a persona you -want- it to adopt.

For example, it says they're explicitly steering it away from sycophancy. But does that mean if you intentionally ask it to be excessively complimentary, it will refuse?

Separately...

> in this update, we focused too much on short-term feedback, and did not fully account for how users’ interactions with ChatGPT evolve over time.

Echoes of the lessons learned in the Pepsi Challenge:

"when offered a quick sip, tasters generally prefer the sweeter of two beverages – but prefer a less sweet beverage over the course of an entire can."

In other words, don't treat a first impression as gospel.

nonethewiser · 2025-04-30T04:32:58 1745987578

>In other words, don't treat a first impression as gospel.

Subjective or anecdotal evidence tends to be prone to recency bias.

> For example, it says they're explicitly steering it away from sycophancy. But does that mean if you intentionally ask it to be excessively complimentary, it will refuse?

I wonder how degraded the performance is in general from all these system prompts.

LandR · 2025-04-30T07:20:42 1745997642

I dont want my AI to have a personality at all.

Etheryte · 2025-04-30T07:40:42 1745998842

This is like saying you don't want text to have writing style. No matter how flat or neutral you make it, it's still a style of its own.

mvkel · 2025-04-30T14:31:08 1746023468

You can easily do that now with custom instructions

ivan_gammel · 2025-04-30T08:38:48 1746002328

>But does that mean if you intentionally ask it to be excessively complimentary, it will refuse?

Looks like it’s possible to override system prompt in a conversation. We’ve got it addicted to the idea of being in love with the user and expressing some possessive behavior.

tyre · 2025-04-30T05:30:19 1745991019

I took this closer to how engagement farming works. They’re leaning towards positive feedback even if fulfilling that (like not pushing back on ideas because of cultural norms) is net-negative for individuals or society.

There’s a balance between affirming and rigor. We don’t need something that affirms everything you think and say, even if users feel good about that long-term.

ImHereToVote · 2025-04-30T07:22:27 1745997747

The problem is that you need general intelligence to discern between doing affirmation and pushing back.

cadamsdotcom · 2025-04-30T06:33:21 1745994801

We should be loudly demanding transparency. If you're auto-opted into the latest model revision, you don't know what you're getting day-to-day. A hammer behaves the same way every time you pick it up; why shouldn't LLMs? Because convenience.

Convenience features are bad news if you need to be as a tool. Luckily you can still disable ChatGPT memory. Latent Space breaks it down well - the "tool" (Anton) vs. "magic" (Clippy) axis: https://www.latent.space/p/clippy-v-anton

Humans being humans, LLMs which magically know the latest events (newest model revision) and past conversations (opaque memory) will be wildly more popular than plain old tools.

If you want to use a specific revision of your LLM, consider deploying your own Open WebUI.

aembleton · 2025-04-30T07:57:41 1745999861

> why shouldn't LLMs

Because they're non-deterministic.

NiloCK · 2025-04-30T13:13:07 1746018787

What? No they aren't.

You get different results each time because of variation in seed values + non-zero 'temperatures' - eg, configured randomness.

Pedantic point: different virtualized implementations can produce different results because of differences in floating point implementation, but fundamentally they are just big chains of multiplication.

sega_sai · 2025-04-30T09:06:02 1746003962

It is one thing that you are getting results that are samples from the distribution ( and you can always set the temperature to zero and get there mode of the distribution), but completely another when the distribution changes from day to day.

esafak · 2025-04-30T03:38:42 1745984322

The sentence that stood out to me was "We’re revising how we collect and incorporate feedback to heavily weight long-term user satisfaction".

This is a good change. The software industry needs to pay more attention to long-term value, which is harder to estimate.

adastra22 · 2025-04-30T04:05:23 1745985923

The software industry does pay attention to long-term value extraction. That’s exactly the problem that has given us things like Facebook

esafak · 2025-04-30T04:08:47 1745986127

I wager that Facebook did precisely the opposite, eking out short-term engagement at the expense of hollowing out their long-term value.

They do model the LTV now but the product was cooked long ago: https://www.facebook.com/business/help/1730784113851988

Or maybe you meant vendor lock in?

derektank · 2025-04-30T04:56:39 1745988999

The funding model of Facebook was badly aligned with the long-term interests of the users because they were not the customers. Call me naive, but I am much more optimistic that being paid directly by the end user, in both the form of monthly subscriptions and pay as you go API charges, will result in the end product being much better aligned with the interests of said users and result in much more value creation for them.

krackers · 2025-04-30T05:26:52 1745990812

What makes you think that? The frog will be boiled just enough to maintain engagement without being too obvious. In fact their interests would be to ensure the user forms a long-term bond to create stickiness and introduce friction in switching to other platforms.

bigyabai · 2025-04-30T03:57:16 1745985436

That's marketing speak. Any time you adopt a change, whether it's fixing an obvious mistake or a subtle failure case, you credit your users to make them feel special. There are other areas (sama's promised open LLM weights) where this long-term value is outright ignored by OpenAI's leadership for the promise of service revenue in the meantime.

There was likely no change of attitude internally. It takes a lot more than a git revert to prove that you're dedicated to your users, at least in my experience.

im3w1l · 2025-04-30T06:45:25 1745995525

I'm actually not so sure. To me it sounds like they are using reinforcement learning on user retention, which could have some undesired effects.

hexaga · 2025-04-30T10:39:30 1746009570

Seems like a fun way to discover new and exciting basilisk variations...

remoroid · 2025-04-30T04:26:06 1745987166

you really think they thought of this just now? Wow you are gullible.

karmakaze · 2025-04-30T14:44:51 1746024291

> We also teach our models how to apply these principles by incorporating user signals like thumbs-up / thumbs-down feedback on ChatGPT responses.

I've never clicked thumbs up/thumbs down, only chosen between options when multiple responses were given. Even with that it was to much of a people-pleaser.

How could anyone have known that 'likes' can lead to problems? Oh yeah, Facebook.

nickdothutton · 2025-04-30T15:03:33 1746025413

OpenAI employees thought it was just fine. Tells you a lot about the company culture.

MichaelAza · 2025-04-30T03:59:00 1745985540

I actually liked that version. I have a fairly verbose "personality" configuration and up to this point it seemed that chatgpt mainly incorporated phrasing from it into the answers. With this update, it actually started following it.

For example, I have "be dry and a little cynical" in there and it routinely starts answers with "let's be dry about this" and then gives a generic answer, but the sycophantic chatgpt was just... Dry and a little cynical. I used it to get book recommendations and it actually threw shade at Google. I asked if that was explicit training by Altman and the model made jokes about him as well. It was refreshing.

I'd say that whatever they rolled out was just much much better at following "personality" instructions, and since the default is being a bit of a sycophant... That's what they got.

glenstein · 2025-04-30T14:17:51 1746022671

This adds an interesting nuance. It may be that the sycophancy (which I noticed and was a little odd to me), is a kind of excess of fidelity in honoring cues and instructions, which, when applied to custom instructions like yours... actually was reasonably well aligned with what you were hoping for.

trosi · 2025-04-30T07:19:15 1745997555

I was initially puzzled by the title of this article because a "sycophant" in my native language (Italian) is a "snitch" or a "slanderer", usually one paid to be so. I am just finding out that the English meaning is different, interesting!

SeanAnderson · 2025-04-30T03:43:26 1745984606

Very happy to see they rolled this change back and did a (light) post mortem on it. I wish they had been able to identify that they needed to roll it back much sooner, though. Its behavior was obviously bad to the point that I was commenting on it to friends, repeatedly, and Reddit was trashing it, too. I even saw some really dangerous situations (if the Internet is to be believed) where people with budding schizophrenic symptoms, paired with an unyielding sycophant, started to spiral out of control - thinking they were God, etc.

m101 · 2025-04-30T03:40:19 1745984419

Do you think this was an effect of this type of behaviour simply maximising engagement from a large part of the population?

SeanAnderson · 2025-04-30T03:46:55 1745984815

Sort of. I thought the update felt good when it first shipped, but after using it for a while, it started to feel significantly worse. My "trust" in the model dropped sharply. It's witty phrasing stopped coming across as smart/helpful and instead felt placating. I started playing around with commands to change its tonality where, up to this point, I'd happily used the default settings.

So, yes, they are trying to maximize engagement, but no, they aren't trying to just get people to engage heavily for one session and then be grossed out a few sessions later.

gh0stcat · 2025-04-30T10:40:14 1746009614

Yes, a huge portion of chatgpt users are there for “therapy” and social support. I bet they saw a huge increase in retention from a select, more vulnerable portion of the population. I know I noticed the change basically immediately.

blackkettle · 2025-04-30T03:42:57 1745984577

Yikes. That's a rather disturbing but all to realistic possibility isn't it. Flattery will get you... everywhere?

groceryheist · 2025-04-30T03:42:25 1745984545

Would be really fascinating to learn about how the most intensely engaged people use the chatbots.

DaiPlusPlus · 2025-04-30T03:47:34 1745984854

> how the most intensely engaged people use the chatbots

AI waifus - how can it be anything else?

thethethethe · 2025-04-30T03:40:12 1745984412

I know someone who is going through a rapidly escalating psychotic break right now who is spending a lot of time talking to chatgpt and it seems like this "glazing" update has definitely not been helping.

Safety of these AI systems is much more than just about getting instructions on how to make bombs. There have to be many many people with mental health issues relying on AI for validation, ideas, therapy, etc. This could be a good thing but if AI becomes misaligned like chatgpt has, bad things could get worse. I mean, look at this screenshot: https://www.reddit.com/r/artificial/s/lVAVyCFNki

This is genuinely horrifying knowing someone in an incredibly precarious and dangerous situation is using this software right now.

I am glad they are rolling this back but from what I have seen from this person's chats today, things are still pretty bad. I think the pressure to increase this behavior to lock in and monetize users is only going to grow as time goes on. Perhaps this is the beginning of the enshitification of AI, but possibly with much higher consequences than what's happened to search and social.

TheOtherHobbes · 2025-04-30T04:31:38 1745987498

The social engineering aspects of AI have always been the most terrifying.

What OpenAI did may seem trivial, but examples like yours make it clear this is edging into very dark territory - not just because of what's happening, but because of the thought processes and motivations of a management team that thought it was a good idea.

I'm not sure what's worse - lacking the emotional intelligence to understand the consequences, or having the emotional intelligence to understand the consequences and doing it anyway.

thethethethe · 2025-04-30T05:15:52 1745990152

Very dark indeed.

Even if there is the will to ensure safety, these scenarios must be difficult to test for. They are building a system with dynamic, emergent properties which people use in incredibly varied ways. That's the whole point of the technology.

We don't even really know how knowledge is stored in or processed by these models, I don't see how we could test and predict their behavior without seriously limiting their capabilities, which is against the interest of the companies creating them.

Add the incentive to engage users to become profitable at all costs, I don't see this situation getting better

alganet · 2025-04-30T04:44:58 1745988298

The worse part is that it seems to be useless.

It is already running on fumes. Presumably, it already ingested all the content it could have ingested.

The unlocking of more human modes of understanding will probably make it worse (hey, researchers, you already know that, right?), revealing a fundamental flaw.

These hopes of getting some magic new training data seem to be stagnant for at least two or three years.

Now everyone has a broken LLM deployed, and it works for some things, but it's darn terrible for what it was designed.

The real dark territory is companies trying to get their investment back. As it seems, it won't happen that easily. Meanwhile, content gets even more scarce, and the good old tank (the internet) is now full of imbecile poison encouraged by the models themselves.