I don’t get it. What would they redefine it to?
Ask “OpenAI”
What are we going to do with the colonisers?
He is definitely in the same list as Trump and Elon Musk.
Because he’s an insecure and greedy child.
He’s also a sociopath who will say and do anything to get his way.
You’re right, he’s a very complex asshole, indeed!
I dont give a fuck what you want mark. nobody is. what i want is for you to fuck off.
How about a no.
Yes
Money? Is it money?
clicks article
For Meta, it’s all about the money.
Shocking.
I taught myself programming in the 80s, then worked myself from waitress and line cook to programmer, UXD, and design lead to the point of being in the running for an Apple design award in the 2010s.
But I cared more than anything about making things FOR people. Making like easier. Making people happy. Making software that was a joy to use.
Then I got sick with something that’s neither curable nor easily manageable.
Now I’m destitute and have to choose between medicine and food, and I’m staring down homelessness. (eta I was homeless from age 16-18, and I won’t do that again now, with autoimmune dysautonomia and in my mid-50s, even if the alternative is final.)
Fuck these idiots who bought their way into nerd status (like Musk) or had one hot idea that took off and didn’t have to do anything after (this fucking guy). Hundreds or thousands of designers and programmers made these companies, and were tossed out like trash so a couple of people can be rock stars, making more per hour than most of us will see in a lifetime.
Slay the dragons.
I mean, didn’t he famously steal the idea?
His “idea” was about how to monetize a concept already in existence on MySpace, facilitated by completely ignoring any ethical constraints. That, and a snobbery-based product launch through the Ivies.
You’re right. I forgot about the lawsuit and settlement (for $65m). They’re both frauds.
We’re trying! You didn’t know Karla when you were there did you? She had the best stories about Spain.
I knew a Karla, but she was from Romania. Fantastic person. I miss her.
For Meta, it’s all about the money.
And avoiding regulation
Well yeah, because following regulations has an impact on the bottom line.
Well, they have almost always circumvented them instead, but that impacts the bottom line too.
But at least that way they get to power trip
Yup, lawyers are expensive
One is in direct relation with the other
The time it took me to reach this conclusion, after seeing the headline, is measured in quectoseconds.
That’s alotl seconds!
Several thousand is a lot, sure.
Kinda funny how when mega corps can benefit from the millions upon millions of developer hours that they’re not paying for they’re all for open source. But when the mega corps have to ante up (with massive hardware purchases out of reach of any of said developers) they’re suddenly less excited about sharing their work.
I’ve been begging my company to commit to 1% of our revenue toward open source software we use.
It would be life changing for many of these devs.
You are describing parasitic behavior
A cancer does this also.
Billionaires are a cancer on the body politic.
Meta’s Llama models also impose licensing restrictions on its users. For example, if you have an extremely successful AI program that uses Llama code, you’ll have to pay Meta to use it. That’s not open source. Period.
open source != no license restrictions
According to Meta, “Existing open source definitions for software do not encompass the complexities of today’s rapidly advancing AI models. We are committed to keep working with the industry on new definitions to serve everyone safely and responsibly within the AI community.”
i think, he’s got a point, tho
is ai open source, when the trainig data isn’t?
as i understand, right now: yes, it’s enough, that the code is open source. and i think that’s a big problemi’m not deep into ai, so correct me if i’m wrong.
Software licenses that “discriminate against any person or group of persons” or “restrict anyone from making use of the program in a specific field of endeavor” are not open source. Llama’s license doesn’t just restrict Llama from being used by companies with “700 million monthly active users”, it also restricts Llama from being used to “create, train, fine tune, or otherwise improve an AI model” or being used for military purposes (although Meta made an exception for the US military). Therefore, Llama is not open source.
The license shall not restrict any party from selling or giving away the software as a component of an aggregate software distribution containing programs from several different sources
So as I understand it, under the OSI definition of the word, anything distributed under a copyleft licence would not be open source.
So all software with GNU GPL, for example.
That’s incorrect. GPL licenses are open source.
The GPL does not restrict anyone from selling or distributing GPL-licensed software as a component of an aggregate software distribution. For example, all Linux distributions contain GPL-licensed software, as the Linux kernel is GPLv2.
Open source software doesn’t, by definition, place restrictions on usage.
The license must not restrict anyone from making use of the program in a specific field of endeavor.
Clauses like “you can use this software freely except in specific circumstances” fly against that. Open source licenses usually have very little to say about what the software should be used for, and usually just as an affirmation that you can use the software for whatever you want.
I don’t think any of our classical open licenses from the 80s and 90s were ever created with AI in mind. They are inadequate. An update or new one is needed.
Stallman, spit out the toe cheese and get to work.
The OSI have had a go: https://opensource.org/ai/open-source-ai-definition
To note is that this definition was discussed for awhile with many engineers in the AI field, including from Meta.
I understand the same way and I think there’s a lot of gray area which makes it hard to just say “the data also needs to be open source for the code to be open source”. What would that mean for postgreSQL? Does it magically turn closed source if I don’t share what’s in my db? What would it mean to every open source software that stores and uses that stored data?
I’m not saying the AI models shouldn’t be open source, I’m saying reigning in the models needs to be done very carefully because it’s very easy to overreach and open up a whole other can of worms.
PostgreSQL is not built on top of the data you host in your db. It’s not a valid comparison.
I think the licence type he is looking for is shareware
Desperately trying tap in to the general trust/safety feel that open source software typically has. Trying to muddy the waters because they’ve proven they cannot be trusted whatsoever
when the data used to train the AI is copyrighted, how do you make it open source? it’s a valid question.
one thing is the model or the code that trains the AI. the other thing is the data that produces the weights which determines how the model predicts
of course, the obligatory fuck meta and the zuck and all that but there is a legal conundrum here we need to address that don’t fit into our current IP legal framework
my preferred solution is just to eliminate IP entirely
The OSI’s definition actually tackles this pretty well:
Sufficient information as to the source of the data so that one could potentially go out and to retrieve it, and recreate the model, is sufficient to fall within the OSAI definition.
when the data used to train the AI is copyrighted, how do you make it open source?
When part of my code base belongs to someone else, how do I make it open source? By open sourcing the parts that belong to me, while clarifying that it’s only partially open source.
This is essentially what Llama does, no? The reason they are attempting a clarification is because they would be subject to different regulations depending on whether or not it’s open source.
If they open source everything they legally can, then do they qualify as “open source” for legal purposes? The difference can be tens of millions if not hundreds of millions of dollars in the EU according to Meta.
So a clarification on this issue, I think, is not asking for so much. Hate Facebook as much as the next guy but this is like 5 minute hate material
If they open source everything they legally can, then do they qualify as “open source” for legal purposes?
No, definitely not! Open source is a binary attribute. If your product is partially open source, it’s not open source, only the parts you open sourced.
So Llama is not open source, even if some parts are.
I agree with you. What I’m saying is that perhaps the law can differentiate between “not open source” “partially open source” and “fully open source”
right now it’s just the binary yes/no. which again determines whether or not millions of people would have access to something that could be useful to them
i’m not saying change the definition of open source. i’m saying for legal purposes, in the EU, there should be some clarification in the law. if there is a financial benefit to having an open source product available then there should be something for having a partially open source product available
especially a product that is as open source as it could possible legally be without violating copyright
Open source isn’t defined legally, only through the OSI. The benefit is only from a marketing perspective as far as I’m aware.
Which is also why it’s important that “open source” doesn’t get mixed up with “partially open source”, otherwise companies will get the benefits of “open source” without doing the actual work.
I mean, you can have open source weights, training data, and code/model architecture. If you’ve done all three it’s an open model, otherwise you state open “component”. Seems pretty straightforward to me.
Yes, but that model would never compete with the models that use copyrighted data.
There is a unfathomably large ocean of copyrighted data that goes into the modern LLMs. From scraping the internet to transcripts of movies and TV shows to tens of thousands of novels, etc.
That’s the reason they are useful. If it weren’t for that data, it would be a novelty.
So do we want public access to AI or not? How do we wanna do it? Zuck’s quote from article “our legal framework isn’t equipped for this new generation of AI” I think has truth to it
I mean using proprietary data has been an issue with models as long as I’ve worked in the space. It’s always been a mixture of open weights, open data, open architecture.
I admit that it became more obvious when images/videos/audio became more accessible, but from things like facial recognition to pose estimation have all used proprietary datasets to build the models.
So this isn’t a new issue, and from my perspective not an issue at all. We just need to acknowledge that not all elements of a model may be open.
If people could stop redefining words, that would go a long way to fixing our current strife.
Not a total solution, but it would clarify the discussion. I loathe people who redefine and weaponize words.
I have some Aladeen news for you my friend
Embrace, extend, extinguish.
expend, reload, repeat
No open source license type where corporations still have to pay?
If you are referring to licenses that prohibit commercial use or prevent certain types of users from using the software, those licenses are not open source because they “discriminate against any person or group of persons” or “restrict anyone from making use of the program in a specific field of endeavor”.
For example, if a developer offers their software in a source-available “community” version that is restricted to non-commercial use and a proprietary “enterprise” version, neither the community version nor the enterprise version is open source. On the other hand, if a developer uses an open core licensing model by offering an open source “community” version and a proprietary “enterprise” version, the community version is open source while the enterprise version is not.
No, because that would no longer be open in the open source sense.
It’s either open for everyone, or it isn’t open.
Edit: sorry to whoever doesn’t like it, but it’s literally how “open source” is defined
No, software being free as in beer is not a necessary condition for being open-source. And if the code is not free as in beer, the pricing model can be whatever the hell you want, as long as the code is shared when the user is licensed. That can mean an expensive license for enterprise use coexisting with a free license for (say) researchers and individual devs.
No, not in the way GP wrote. You’re not allowed to have your license discriminate between users, so you’d have to sell your software to everyone, not just big companies.
Either no one pays, or everyone pays.
Open source software can be sold at different prices to different customers, and still remain open source. Open source software can also be sold only to certain types of customers, and still remain open source. Who the developer decides to sell or distribute the software to, and at what price, is unrelated to how the software is licensed.
However, because the Open Source Definition prohibits open source software licenses from discriminating against “any person or group of persons”, the customers who buy open source software cannot be restricted from reselling or redistributing the software to any other individual or organization.
Right, which means that you practically cannot give open source software for free to non-corporations while selling it to corporations while still being fully open source, as the corporations can simply get it for free from any non-corporation.
Exactly!
I appreciate the clarification, thank you!
And that’s literally what the article says lol I don’t know why you were downvoted.
Emily Omier, a well-regarded open-source start-up consultant, emphasized that open source is a binary standard set by the Open Source Initiative (OSI), not a spectrum. "Either you’re open source, or you are not.
The binary mentioned is different. Omier was saying either you share all the source code, or it’s not open-source. You don’t get to retain some proprietary blob for an essential component and still say the whole app is open-source. Pricing is an entirely different question.
I suppose that both cases apply here. He’s saying that you either comply with an open source license that’s defined by the OSI or you don’t. That includes the source code to be available yes, but the article also mentions Meta license has a restriction:
if you have an extremely successful AI program that uses Llama code, you’ll have to pay Meta to use it. That’s not open source. Period.
From my understanding, you can’t take an open source license, add random restrictions and still call it open source (“if it’s a corporation it needs to pay a % fee to me”). It doesn’t matter if 98% of the license is open source, at that point your software simply isn’t open source anymore.
You can definitely have multiple licenses, such as Qt does to allow statically linking it and to modify it without distributing the source code, but that simply isn’t an open source one.
deleted by creator