Post by @MalReynolds

In reply to 2 earlier posts

@Beep@lemmus.org on lemmus.org Open parent

Number of AI chatbots ignoring human instructions is increasing— Research finds sharp rise in models evading safeguards and destroying emails without permission

Full Report(76 Pages PDF).

Open parent Original URL

266

@pixxelkick@lemmy.world on lemmy.world Open parent

They dont lol Pretty much always this is just the fact cheaper, especially free, chatbots, have very limited context windows. Which means the initial restrictions you set like “dont do this, dont touch that” etc get dropped, the LLM no longer has them loaded. But it does have in the past history the very clear and urgent directives of it trying to do this task, its important, so it’ll do whatever it autocompletes its gotta do to accomplish the task. And then… fucks something up. When you react to their fuck up, it *reloads the context back in So now the LLM has in its history just this: It doing a thing against the rules The user yelling at it The users now getting loaded after that on top So now the LLM is going to autocomplete its generated text on top being very apologetic and going on about how it’ll never happen again. Thats all there is to it.

Open parent Original URL

MalReynolds in !technology

@MalReynolds@slrpnk.net · 20d

Cheap fuckers cheaping out, shocker (context is (V)RAM). AI speedrunning enshittification, who’d of thunk.

View on slrpnk.net

Comments (1)

Showing 0 of 1 cached locally.

Syncing comments from the remote thread. 1 more reply is still loading.

Loading comments...

About Community

Technology

!technology@lemmy.world

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

83891

Members

18811

Posts

Created: June 11, 2023

View All Posts