you can like... enforce this rule programatically? you don't have to say "pretty please" to ai? basically, when AI requests some potentially unwanted thing (like deleting an email), this request goes through a proxy that asks the human for confirmation. Also you can have a safe word set up in the chat interface to act as a killswitch. I thought these are ABCs of ai safety but apparently these are foreign concepts to this "safety director"
Technology
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
Program? Like a fucking farmer?
The people who internalize this would never engage with a chatbot in this way in the first place. To them this is another intelligence they're conversing with, where you get what you want by following social decorum and enforcing your will amounts to abuse.
Run? Like physically run? You install a server on your hardware without setting up remote access? Even plug and play one-click solutions like tailscale??
I hate how Apple users feel the need to call their computer by the brand. It really makes me cringe.
It is called "a computer"
Maybe "PC"
"box" if you really have to flex that UNIX
They should treat their computers less like a sports car and more like a van
I mean, isnt that the entire point of Apple? Brand recognition and percieved status attributed to said brand. Its like rappers and gucci belts or country artists and ford pickups
yes the point of apple prodcuts is to waste money and shove it at everyone's faces
yeah I sat there for a few seconds trying to figure out the relevance
turns out, it wasn't relevant
instant loss of attention and judging of their character
Ehhhh as an owner of five or six windows computers, four Linux machines, and a couple Apple computers, I always specify which machine I’m referring to if I’m talking about something I did/something that happened on one of them in case it could be pertinent.
The funniest part is this person job is AI safety.
Yeah, I personally wouldn't be announcing this failure to the world if I were in her position. I don't think you could torture it out of me lmao
Maybe they want to get this out there as cover if/when some regulator somewhere decides to subpoena records from the AI safety director.
Maybe they are meant to protect the AI
Maybe they'll take their job more seriously now?
Can someone explain to mr why these people are buying Mac Minis to run this in a "safe" environment and then they go on and connect it to the internet and give the AI credentials to all their cloud accounts? This seems excessively moronic to me? Am I missing something?
They are buying the Mac Minis since they are a cheap way to run a server where this would work. They aren't create a safe environment for AI, but an access point on local hardware.
I don't think you're missing anything. I'm pretty sure this is the trend. People buy Mac Minis, probably don't even download a local model, FA, and FO.
They are slaves to trends and haven't thought about it even a little bit?
Arm power efficiency, and unified ram at a fairly low price (at least compared to current ram pricing).
I love so much that there are real, hilarious consequences for overzealous early adoption. You can't make this shit up.
AI: I'm so sorry. You're correct I violated protocol. I'll make a note of this so it won't happen again.
Nurse: You gave my 5 year old patient 5000cc of morphine!
it won’t happen again.
Not to him, no.
Now, that's on the Nurse if they didn't notice they were injecting someone with 5-liters of morphine.
Isn't that why we're adopting AI? So the nurse can focus on more important things? 🤫
What could possibly be more important than the patient?
Why, the shareholders of course, silly!
I love how these models apologize like they mean it. It doesn't mean it. It doesn't feel bad, and it will do it again.
Apologies mean "I made a mistake and I learned from it so it won't repeat."
Sure it claims it added more notes to it's config, but if it ignored the rules before, what makes you think that new rules are going to change anything?
Like an abusive relationship
They behave exactly a child does when a parent forces an apology.
They have the words they're expect to say so they do say them but they don't undersranr why, they definitely don't mean it and they lack the restrain to not doing whatever they apologized for over and over.
Apologies mean "I made a mistake and I learned from it so it won't repeat."
If only some people meant it that way too!
But it’s adding it to a text file that eats up a ton of tokens and routinely gets ignored!
Apologies mean “I made a mistake and I learned from it so it won’t repeat.”
At best it might not make the same mistake again if that memory is in the current context. But more likely: It will not remember.
Although latest Gemini in particular has much more room for "remembering" things, still.
But "I made a mistake"? It is not self-aware in any way shape or form to the degree where "I made a mistake" carries any real meaning.
That MEMORY. md file won't do shit if the AI doesn't read it.
I give it 2 hours before it stops reading it until prompted again.
If I was the director of AI safety, and I used AI to own and delete my inbox, I sure as shit would never tell a soul.
This is pure unbridled incompetence.
If I was the director of AI safety, [...] would never tell a soul.
As a director of something, you are kinda public person. No way to just not tell.
Okay but this is like the armoury master person shooting their own foot with a loaded gun when they were juggling guns.
Then the public wants to know where that hole in the director's foot comes from.
The whole "AI safety" field is this incompetent. These people that will tell you AI is on the verge of creating a bioweapon, and then run random code in a command line. Completely and totally unserious.
I don’t know what the hell has happened, but some of these people are basically human jellyfish. Big tech is full of them now.
No thought enters their mind, but they dodge the layoffs and the PIPs and get promoted like this.
I don’t fucking get it.
They wanted to “eat their own dog food” but it’s closer to “eating their own dog shit”
How come some 25yo person is a director at Facebook?
I mean, even if she is a child prodigy genius, which she obviously is not as she is face first fist deep into AI, how the frack do you have even enough life experience to become a director of any large organization at that age unless you somehow cheated your way in?
Then reading the hat she's doing and how she resolved it tells me she doesn't know shit about computers, she just know how to type commands into AI systems
Is this the future? Am I going to end up being one of those long bearded magicians that still know the old technology, that still can still save the day by using shell commands?
Don't American companies give a loooot of people director or executive director titles just because it sounds impressive? In roles where you gotta talk to corporate customers at least
How come some 25yo person is a director at Facebook?
Maybe she has met the Suckerberg at some time when she was ... younger?
If all the qualifications I need to be a security engineer for Facebook are
- buy a Mac Mini
- don't configure remote access
- install untrusted software
- leave
Then Facebook should hire me. I'll buy so many Mac Minis on their dime. I will run so many crazy things.
“The bot ate my homework” is quickly becoming more plausible than the customary canine culprit.

