Technology

75069 readers

3214 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

788

Spotify just changed their TOS, giving them unprecedented rights to create "derivative works" from audiobooks (storyfair.net)

submitted 2 years ago* (last edited 2 years ago) by gedaliyah@lemmy.world to c/technology@lemmy.world

88 comments fedilink hide all child comments

They frame it as though it's for user content, more likely it's to train AI, but in fact it gives them the right to do almost anything they want - up to (but not including) stealing the content outright.

you are viewing a single comment's thread
view the rest of the comments

[–] theneverfox@pawb.social 4 points 2 years ago

This is a much better take.

Intonation is huge, and something general models tend to have trouble with - especially with something like an audiobook, which is narration - it's very contextual in a way not found in almost any other form of communication. It even encapsulates every other form of context through dialogue.

And not only that - a lot of audiobooks have versions by multiple voice actors. And they might change a word here or there, but it's highly structured data - it's truly a treasure trove

I'd go a step further and say they really want access to the dataset - not just for audiobooks, but because this is a fantastic dataset to train very context aware (and silky smooth) text to voice.

Spotify probably doesn't have the chops to do this, but they might be trying to leverage the dataset - I'm not sure if they could sell it wholesale or not, but if nothing else they could "partner" with Microsoft or Google to train VTT capabilities into multi-modal LLMs (a pitch with all the buzzwords to make investors need to change their underwear)