Oh yeah those are problematic, but I’m pretty sure a court has ruled in a customer’s favor when the AI fucked up, which is good at least.
bamboo
If it ends up being ruled that training an LLM is fair use so long as the LLM doesn’t reproduce the works it is trained on verbatim, then licensing becomes irrelevant.
Chatbots are fine as long as it’s clearly disclosed to the user that anything they generate could be wrong. They’re super useful just as an idea generating machine for example, or even as a starting point for technical questions when you don’t know what the right vocabulary is to describe a problem.
How would an LLM answering questions about a git repo be legally different from a person answering those same questions (think stackoverflow)? Specific to this case, US law does not consider “APIs” to be copyrightable (Oracle v Google, Google reimplemented Java using the same APIs but their own implementation code, court ruled that Oracle couldn’t copyright the APIs).
Regarding “replace”, the primary use of the git repo is the code itself, not the Q&A about how to use it. The LLM doesn’t generate code that fully replaces that library or program, or if it does, it is distinct enough to be a different work.
I mean, this is how courts work. Someone will sue because a work they hold copyright to was used in a training set without their authorization, the defendant will claim it was fair use, the judge will pick a side. To the best of my knowledge this hasn’t happened just yet, and since I’m not a judge, I use “probably”. Fair use is both vague and broad, and this is important to ensure copyright holders don’t have complete control over their work. It was recognized a long time ago that you can make works that utilize another copyrighted work, but don’t functionally replace the original work, and are therefore fair use. The whole point was to try and foster innovation, not to allow copyright holders to dictate how their works are used, and fair use is an essential part of that.
Training an LLM with a work doesn’t functionally replace that work. If there is a filter that prevents 1:1 reproduction, then it literally cannot. It also provides significant benefit to have these LLMs, they are a unique and valuable work themselves. That’s why it’s fair use.
Why do you think they need your permission to use information you posted publicly to train their models? Copyright isn’t unlimited, and model training is probably fair use.
Yeah this is super sensible. Out of curiosity, do you have any decent examples bad usage? I think chatbots, GitHub copilot type stuff to be fine. I find the rewording applications to be fine. I haven’t used it but Duolingo has an AI mode now and it is questionable sounding, but maybe it is elementary enough and fine tuned well enough for the content in the supported courses that errors are extremely rare or even detectable.
Lmao you got some criticism and now you’re saying everyone else is a bot or has an agenda. I am a software engineer and my organization does not gain any specific benefits for promoting AI in any way. They don’t sell AI products and never will. We do publish open source work however, and per its license anyone is free to use it for any purpose, AI training included. It’s actually great that our work is in training sets, because it means our users can ask tools like ChatGPT questions and it can usually generate accurate code, at least for the simple cases. Saves us time answering those questions ourselves.
I think that the anti-AI hysteria is stupid virtue signaling for luddites. LLMs are here, whether or not they train on your random project isn’t going to affect them in any meaningful way, there are more than enough fully open source works to train on. Better to have your work included so that the LLM can recommend it to people or answer questions about it.
Anything you put publicly on the internet in a well known format is likely to end up in a training set. It hasn’t been decided legally yet, but it’s very likely that training a model will fall under fair use. Commercial solutions go a step further and prevent exact 1:1 reproductions, which would likely settle any ambiguity. You can throw anti-AI licenses on it, but until it’s determined to be a violation of copyright, it is literally meaningless.
Also if you just hope to spam tab with any of the AI code generators and get good results, you’re not. That’s not how those work. Saying something like this just shows the world that you have no idea how to use the tool, not the quality of the tool itself. AI is a useful tool, it’s not a magic bullet.
In many cases you’re right. I manage some spreadsheets at work that are relatively small, have little/no automation, and are really just inventory lists with some notes on each item. Could be a CSV.
I got a MacBook Air last year because Apple Silicon is insanely good. I never have to worry about battery life. I was working on an android app at the time, and Android Studio ran so much better on that thing than it did on my 11 gen intel DPS 15. Build times were almost instant, and unlike the XPS, the battery life was well more than I needed in a day. The XPS I would close everything and could squeeze out about 3 hours, and it was hot on my lap. Got my boss to buy me an M3 Max MBP for work, I’m truly a convert, can’t see myself ever going back unless windows laptops can leapfrog Apple in terms of performance and battery life. And for the Linux crowd, much of my work takes place in a Linux VM and it’s great, build times are noticeably faster than the WSL2 environment I used on the XPS.
Since then I’ve bought the rest of the Apple ecosystem. Like, other than a Vision Pro and HomePods, I’ve got them all. Started with an iPhone simply because it was phone upgrade time, the iPhone 15 series was about to come out with USB C, figured I’d give it a try since it would sync up all the stuff really well with my Mac. Figured I’d try the watch too, I kinda like it (got the magnetic link band for Christmas too, it’s an awesome fidget toy you can wear on your wrist).
My Sony WF-1000xm4 battery died in one of the buds. Now, even though they’re out of warranty I’m pretty sure Sony would replace them, but I figured since I’m increasingly all in on the Apple ecosystem, I’d try the AirPods Pro with their new USB C case. My review: they’re not the best earbuds for noise cancelling, but they’re serviceable, the sound quality is pretty good (I switched to Apple Music at the same time, they encode their music with noticeably higher quality than Spotify, so that’s an important factor here). The real magic though: I stick them in my ears, and then they just follow me to whatever device I’m using. Listening to a podcast while cooking and cleaning, then sit down to do some work, pause my phone, they connect right to the Mac. Get up, grab the iPad to go lay down in bed, the switch automatically. Best thing I’ve ever used in that regard, and they’re quick. Highly recommend if you have many apple devices.
Most recently I bought one of the new M4 iPad Pros. My review isn’t quite as rosy here, don’t buy this thing unless you really want to piss away money. Like, I enjoy playing around with the Apple Pencil and all, but really it says more about my financial discipline than the usefulness of the device. I am hoping to contribute better iPad support to some open source iOS apps. Also bought an Apple TV because my partner was ready to destroy the HiSense google tv we have because its interface is slow and unresponsive, and the audio and video would desync all the time. Crossy road on the TV is amusing.
So that was long, but that’s how I went from no Apple products to all Apple products in a year, save for my gaming desktop which is running windows solely for the reason that it has the best games compatibility and I don’t want to have to spend any more time babysitting it than I have to. RX 5800x3d, 64 GB RAM, 3080ti. It ultimately came down to Apple silicon being the best thing to ever happen to laptops, Apple switching to USB C, and optimism that governments are going to force them to open up the ecosystem a bit more.
Most computer displays also support hdmi too though. In the last though there were usually tradeoffs in using the hdmi input. Now hdmi has caught up enough that usually there’s no difference, assuming the manufacturer is using the latest standard.