this post was submitted on 25 Jul 2024
1005 points (97.5% liked)
Technology
59963 readers
3481 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I have the opposite problem. Gen A.I. has tripled my productivity, but the C-suite here is barely catching up to 2005.
Have you tripled your billing/salary? Stop being a scab lol
The opposite, actually.
Cool too
What do you do, just out of interest?
Soup to nuts video production.
Cool, enjoy your entire industry going under thanks to cheap and free software and executives telling their middle managers to just shoot and cut it on their phone.
Sincerely,
A former video editor.
If something can be effectively automated, why would I want to continue to invest energy into doing it manually? That's literal busy work.
So you can continue to be employed? What an odd question.
We should be employed to do busy work? Is that just UBI with extra steps?
Video editing is not busy work. You're excusing executives telling middle managers to put out inferior videos to save money.
You seem to think what I used to do was just cutting and pasting and had nothing to do with things like understanding film making techniques, the psychology of choosing and arranging certain shots, along with making do what you have when you don't have enough to work with.
But they don't care about that anymore because it costs money. Good luck getting an AI to do that as well as a human any time soon. They don't care because they save money this way.
I've been editing video for 30 years, 25 professionally - narrative, advertising, live, etc. I know exactly what it entails. Rough cuts can be automated right now. They still need a fair amount of work to take them to the finish line, though who knows how long that'll remain true. I'm more interested in training an AI editor on my particular editing style and choices than lamenting the death of a job description. I've already seen newscasts go from needing 9 people behind the camera to only 3 and the analog film industry transition to digital, putting LOTS of people out of a career. It's been a long time since I was under the illusion that this wouldn't happen to my occupation.
And I'm telling you that's not what is happening anymore. They are just having middle managers do rough cuts and saying "good enough." Have you seen the quality of advertising video these days?
Sounds like a very specific fetish
I don't know what that is. What is it?
"Soup to nuts" just means I am responsible for the entirety of the process, from pre-production to post-production. Sometimes that's like a dozen roles. Sometimes it's me.
OK. Where on earth does that phrase come from? Makes no logical sense!
It comes from when a full course dinner would always begin with soup and end with nuts.
Same, I've automated alot of my tasks with AI. No way 77% is "hampered" by it.
I dunno, mishandling of AI can be worse than avoiding it entirely. There's a middle manager here that runs everything her direct-report copywriter sends through ChatGPT, then sends the response back as a revision. She doesn't add any context to the prompt, say who the audience is, or use the custom GPT that I made and shared. That copywriter is definitely hampered, but it's not by AI, really, just run-of-the-mill manager PEBKAC.
I'm infuriated on their behalf.
E-fucking-xactly. I hate reading long winded bullshit AI stories with a passion. Drivel all of it.
What have you actually replaced/automated with AI?
Voiceover recording, noise reduction, rotoscoping, motion tracking, matte painting, transcription - and there's a clear path forward to automate rough cuts and integrate all that with digital asset management. I used to do all of those things manually/practically.
e: I imagine the downvotes coming from the same people that 20 years ago told me digital video would never match the artistry of film.
They're right IMO. Practical effects still look and age better than (IMO very obvious) digital effects. Oh and digital deaging IMO looks like crap.
But, this will always remain an opinion battle anyway, because quantifying "artistry" is in and of itself a fool's errand.
Digital video, not digital effects - I mean the guys I went to film school with that refused to touch digital videography.
All the models I've used that do TTS/RVC and rotoscoping have definitely not produced professional results.
What are you using? Cause if you're a professional, and this is your experience, I'd think you'd want to ask me what I'm using.
Coqui for TTS, RVC UI for matching the TTS to the actor's intonation, and DWPose -> controlnet applied to SDXL for rotoscoping
Full open source, nice! I respect the effort that went into that implementation. I pretty much exclusively use 11 Labs for TTS/RVC, turn up the style, turn down the stability, generate a few, and pick the best. I do find that longer generations tend to lose the thread, so it's better to batch smaller script segments.
Unless I misunderstand ya, your controlnet setup is for what would be rigging and animation rather than roto. I do agree that while I enjoy the outputs of pretty much all the automated animators, they're not ready for prime time yet. Although I'm about to dive into KREA's new key framing feature and see if that's any better for that use case.
I was never able to get appreciably better results from 11 labs than using some (minorly) trained RVC model :/ The long scripts problem is something pretty much any text-to-something model suffers from. The longer the context the lower the cohesion ends up.
I do rotoscoping with SDXL i2i and controlnet posing together. Without I found it tends to smear. Do you just do image2image?
The voice library 11labs added includes some really reliable and expressive models. I've only trained a few voice clones, but I find them totally usable for swapping out short lines to avoid having to bring a subject back in to record. I'll fabricate a sentence or two, but for longer form stuff, I only use AI for the rough cuts. Then I'll practically record as a last step, once everything's gone through revision cycles. The "generate a few and chop em together" method is fine for short clips, but becomes tedious for longer stuff.
Funnily enough, when I say roto, I really just mean tracing the subject to remove it from the background. Background removal's so baked in to things now, I dunno if people even think of it as roto. But I mostly still prefer the Adobe solutions on this - roto brush in After Effects, for the AI/manual collaboration. As for roto in the A Scanner Darkly sense, I've played with a few of the video to video models, but mostly as a lark for fluff B-roll.
A lot of people are keen to hear that AI is bad, though, so the clicks go through on articles like this anyway.
This may come as a shock to you, but the vast majority of the world does not work in tech.
I'm not working in tech either. Everyone relying on a computer can use this.
Also, medicin and radiology are two areas that will benefit from this - especially the patients.