Why Mark Zuckerberg wants to redefine open source so badly

fossilesque@mander.xyz · 18 hours ago

Why Mark Zuckerberg wants to redefine open source so badly

SplashJackson@lemmy.ca · 33 minutes ago

I don’t get it. What would they redefine it to?

Evotech@lemmy.world · 20 minutes ago

Ask “OpenAI”

bungalowtill@lemmy.dbzer0.com · 2 hours ago

What are we going to do with the colonisers?

daniskarma@lemmy.dbzer0.com · 6 hours ago

He is definitely in the same list as Trump and Elon Musk.

latenightnoir@lemmy.world · 8 hours ago

Because he’s an insecure and greedy child.

futatorius@lemm.ee · 6 hours ago

He’s also a sociopath who will say and do anything to get his way.

latenightnoir@lemmy.world · 3 minutes ago

You’re right, he’s a very complex asshole, indeed!

3aqn5k6ryk@lemmy.world · 12 hours ago

I dont give a fuck what you want mark. nobody is. what i want is for you to fuck off.

pastermil@sh.itjust.works · edit-2 12 hours ago

How about a no.

MonkderVierte@lemmy.ml · 3 hours ago

Ulrich@feddit.org · edit-2 18 hours ago

Money? Is it money?

clicks article

For Meta, it’s all about the money.

Shocking.

LillyPip@lemmy.ca · edit-2 14 hours ago

I taught myself programming in the 80s, then worked myself from waitress and line cook to programmer, UXD, and design lead to the point of being in the running for an Apple design award in the 2010s.

But I cared more than anything about making things FOR people. Making like easier. Making people happy. Making software that was a joy to use.

Then I got sick with something that’s neither curable nor easily manageable.

Now I’m destitute and have to choose between medicine and food, and I’m staring down homelessness. (eta I was homeless from age 16-18, and I won’t do that again now, with autoimmune dysautonomia and in my mid-50s, even if the alternative is final.)

Fuck these idiots who bought their way into nerd status (like Musk) or had one hot idea that took off and didn’t have to do anything after (this fucking guy). Hundreds or thousands of designers and programmers made these companies, and were tossed out like trash so a couple of people can be rock stars, making more per hour than most of us will see in a lifetime.

Slay the dragons.

chuckleslord@lemmy.world · 13 hours ago

I mean, didn’t he famously steal the idea?

futatorius@lemm.ee · edit-2 6 hours ago

His “idea” was about how to monetize a concept already in existence on MySpace, facilitated by completely ignoring any ethical constraints. That, and a snobbery-based product launch through the Ivies.

LillyPip@lemmy.ca · 8 hours ago

You’re right. I forgot about the lawsuit and settlement (for $65m). They’re both frauds.

horse_battery_staple@lemmy.world · 17 hours ago

We’re trying! You didn’t know Karla when you were there did you? She had the best stories about Spain.

LillyPip@lemmy.ca · 16 hours ago

I knew a Karla, but she was from Romania. Fantastic person. I miss her.

manucode@infosec.pub · 18 hours ago

For Meta, it’s all about the money.

And avoiding regulation

NOT_RICK@lemmy.world · 18 hours ago

Well yeah, because following regulations has an impact on the bottom line.

Curious Canid@lemmy.ca · 17 hours ago

Well, they have almost always circumvented them instead, but that impacts the bottom line too.

grrgyle@slrpnk.net · 2 hours ago

But at least that way they get to power trip

NOT_RICK@lemmy.world · 17 hours ago

Yup, lawyers are expensive

Exec@pawb.social · 17 hours ago

One is in direct relation with the other

don@lemm.ee · 18 hours ago

The time it took me to reach this conclusion, after seeing the headline, is measured in quectoseconds.

Baron Von J@lemmy.world · 18 hours ago

That’s alotl seconds!

don@lemm.ee · 18 hours ago

Several thousand is a lot, sure.

will_a113@lemmy.ml · 18 hours ago

Kinda funny how when mega corps can benefit from the millions upon millions of developer hours that they’re not paying for they’re all for open source. But when the mega corps have to ante up (with massive hardware purchases out of reach of any of said developers) they’re suddenly less excited about sharing their work.

CosmicTurtle0@lemmy.dbzer0.com · 15 hours ago

I’ve been begging my company to commit to 1% of our revenue toward open source software we use.

It would be life changing for many of these devs.

sunzu2@thebrainbin.org · 18 hours ago

You are describing parasitic behavior

horse_battery_staple@lemmy.world · 17 hours ago

A cancer does this also.

futatorius@lemm.ee · 6 hours ago

Billionaires are a cancer on the body politic.

fuzzy_feeling@programming.dev · 18 hours ago

Meta’s Llama models also impose licensing restrictions on its users. For example, if you have an extremely successful AI program that uses Llama code, you’ll have to pay Meta to use it. That’s not open source. Period.

open source != no license restrictions

According to Meta, “Existing open source definitions for software do not encompass the complexities of today’s rapidly advancing AI models. We are committed to keep working with the industry on new definitions to serve everyone safely and responsibly within the AI community.”

i think, he’s got a point, tho

is ai open source, when the trainig data isn’t?
as i understand, right now: yes, it’s enough, that the code is open source. and i think that’s a big problem

i’m not deep into ai, so correct me if i’m wrong.

airglow@lemmy.world · edit-2 2 hours ago

Software licenses that “discriminate against any person or group of persons” or “restrict anyone from making use of the program in a specific field of endeavor” are not open source. Llama’s license doesn’t just restrict Llama from being used by companies with “700 million monthly active users”, it also restricts Llama from being used to “create, train, fine tune, or otherwise improve an AI model” or being used for military purposes (although Meta made an exception for the US military). Therefore, Llama is not open source.

Syntha@sh.itjust.works · 2 hours ago

The license shall not restrict any party from selling or giving away the software as a component of an aggregate software distribution containing programs from several different sources

So as I understand it, under the OSI definition of the word, anything distributed under a copyleft licence would not be open source.

So all software with GNU GPL, for example.

airglow@lemmy.world · edit-2 1 hour ago

That’s incorrect. GPL licenses are open source.

The GPL does not restrict anyone from selling or distributing GPL-licensed software as a component of an aggregate software distribution. For example, all Linux distributions contain GPL-licensed software, as the Linux kernel is GPLv2.

umbraroze@lemmy.world · 5 hours ago

Open source software doesn’t, by definition, place restrictions on usage.

The license must not restrict anyone from making use of the program in a specific field of endeavor.

Clauses like “you can use this software freely except in specific circumstances” fly against that. Open source licenses usually have very little to say about what the software should be used for, and usually just as an affirmation that you can use the software for whatever you want.

TimeSquirrel@kbin.melroy.org · 14 hours ago

I don’t think any of our classical open licenses from the 80s and 90s were ever created with AI in mind. They are inadequate. An update or new one is needed.

Stallman, spit out the toe cheese and get to work.

Alex@lemmy.ml · 8 hours ago

The OSI have had a go: https://opensource.org/ai/open-source-ai-definition

WalnutLum@lemmy.ml · 2 hours ago

To note is that this definition was discussed for awhile with many engineers in the AI field, including from Meta.

GoodEye8@lemm.ee · 7 hours ago

I understand the same way and I think there’s a lot of gray area which makes it hard to just say “the data also needs to be open source for the code to be open source”. What would that mean for postgreSQL? Does it magically turn closed source if I don’t share what’s in my db? What would it mean to every open source software that stores and uses that stored data?

I’m not saying the AI models shouldn’t be open source, I’m saying reigning in the models needs to be done very carefully because it’s very easy to overreach and open up a whole other can of worms.

Kilgore Trout@feddit.it · 5 hours ago

PostgreSQL is not built on top of the data you host in your db. It’s not a valid comparison.

Theoriginalthon@lemmy.world · 18 hours ago

I think the licence type he is looking for is shareware

Kompressor@lemmy.world · 18 hours ago

Desperately trying tap in to the general trust/safety feel that open source software typically has. Trying to muddy the waters because they’ve proven they cannot be trusted whatsoever

kava@lemmy.world · edit-2 12 hours ago

when the data used to train the AI is copyrighted, how do you make it open source? it’s a valid question.

one thing is the model or the code that trains the AI. the other thing is the data that produces the weights which determines how the model predicts

of course, the obligatory fuck meta and the zuck and all that but there is a legal conundrum here we need to address that don’t fit into our current IP legal framework

my preferred solution is just to eliminate IP entirely

WalnutLum@lemmy.ml · 2 hours ago

The OSI’s definition actually tackles this pretty well:

Sufficient information as to the source of the data so that one could potentially go out and to retrieve it, and recreate the model, is sufficient to fall within the OSAI definition.

FooBarrington@lemmy.world · 8 hours ago

when the data used to train the AI is copyrighted, how do you make it open source?

When part of my code base belongs to someone else, how do I make it open source? By open sourcing the parts that belong to me, while clarifying that it’s only partially open source.

kava@lemmy.world · 2 hours ago

This is essentially what Llama does, no? The reason they are attempting a clarification is because they would be subject to different regulations depending on whether or not it’s open source.

If they open source everything they legally can, then do they qualify as “open source” for legal purposes? The difference can be tens of millions if not hundreds of millions of dollars in the EU according to Meta.

So a clarification on this issue, I think, is not asking for so much. Hate Facebook as much as the next guy but this is like 5 minute hate material

FooBarrington@lemmy.world · 1 hour ago

If they open source everything they legally can, then do they qualify as “open source” for legal purposes?

No, definitely not! Open source is a binary attribute. If your product is partially open source, it’s not open source, only the parts you open sourced.

So Llama is not open source, even if some parts are.

kava@lemmy.world · 18 minutes ago

I agree with you. What I’m saying is that perhaps the law can differentiate between “not open source” “partially open source” and “fully open source”

right now it’s just the binary yes/no. which again determines whether or not millions of people would have access to something that could be useful to them

i’m not saying change the definition of open source. i’m saying for legal purposes, in the EU, there should be some clarification in the law. if there is a financial benefit to having an open source product available then there should be something for having a partially open source product available

especially a product that is as open source as it could possible legally be without violating copyright

FooBarrington@lemmy.world · 9 minutes ago

Open source isn’t defined legally, only through the OSI. The benefit is only from a marketing perspective as far as I’m aware.

Which is also why it’s important that “open source” doesn’t get mixed up with “partially open source”, otherwise companies will get the benefits of “open source” without doing the actual work.

jacksilver@lemmy.world · 12 hours ago

I mean, you can have open source weights, training data, and code/model architecture. If you’ve done all three it’s an open model, otherwise you state open “component”. Seems pretty straightforward to me.

kava@lemmy.world · 2 hours ago

Yes, but that model would never compete with the models that use copyrighted data.

There is a unfathomably large ocean of copyrighted data that goes into the modern LLMs. From scraping the internet to transcripts of movies and TV shows to tens of thousands of novels, etc.

That’s the reason they are useful. If it weren’t for that data, it would be a novelty.

So do we want public access to AI or not? How do we wanna do it? Zuck’s quote from article “our legal framework isn’t equipped for this new generation of AI” I think has truth to it

jacksilver@lemmy.world · 5 minutes ago

I mean using proprietary data has been an issue with models as long as I’ve worked in the space. It’s always been a mixture of open weights, open data, open architecture.

I admit that it became more obvious when images/videos/audio became more accessible, but from things like facial recognition to pose estimation have all used proprietary datasets to build the models.

So this isn’t a new issue, and from my perspective not an issue at all. We just need to acknowledge that not all elements of a model may be open.

paraphrand@lemmy.world · 16 hours ago

If people could stop redefining words, that would go a long way to fixing our current strife.

Not a total solution, but it would clarify the discussion. I loathe people who redefine and weaponize words.

conc@lemmy.ml · 8 hours ago

I have some Aladeen news for you my friend

futatorius@lemm.ee · 6 hours ago

Embrace, extend, extinguish.

GreenKnight23@lemmy.world · 1 hour ago

expend, reload, repeat

dilroopgill@lemmy.world · 18 hours ago

No open source license type where corporations still have to pay?

airglow@lemmy.world · edit-2 2 hours ago

If you are referring to licenses that prohibit commercial use or prevent certain types of users from using the software, those licenses are not open source because they “discriminate against any person or group of persons” or “restrict anyone from making use of the program in a specific field of endeavor”.

For example, if a developer offers their software in a source-available “community” version that is restricted to non-commercial use and a proprietary “enterprise” version, neither the community version nor the enterprise version is open source. On the other hand, if a developer uses an open core licensing model by offering an open source “community” version and a proprietary “enterprise” version, the community version is open source while the enterprise version is not.

FooBarrington@lemmy.world · edit-2 7 hours ago

No, because that would no longer be open in the open source sense.

It’s either open for everyone, or it isn’t open.

Edit: sorry to whoever doesn’t like it, but it’s literally how “open source” is defined

futatorius@lemm.ee · edit-2 6 hours ago

No, software being free as in beer is not a necessary condition for being open-source. And if the code is not free as in beer, the pricing model can be whatever the hell you want, as long as the code is shared when the user is licensed. That can mean an expensive license for enterprise use coexisting with a free license for (say) researchers and individual devs.

FooBarrington@lemmy.world · edit-2 5 hours ago

No, not in the way GP wrote. You’re not allowed to have your license discriminate between users, so you’d have to sell your software to everyone, not just big companies.

Either no one pays, or everyone pays.

airglow@lemmy.world · 3 hours ago

Open source software can be sold at different prices to different customers, and still remain open source. Open source software can also be sold only to certain types of customers, and still remain open source. Who the developer decides to sell or distribute the software to, and at what price, is unrelated to how the software is licensed.

However, because the Open Source Definition prohibits open source software licenses from discriminating against “any person or group of persons”, the customers who buy open source software cannot be restricted from reselling or redistributing the software to any other individual or organization.

FooBarrington@lemmy.world · 3 hours ago

Right, which means that you practically cannot give open source software for free to non-corporations while selling it to corporations while still being fully open source, as the corporations can simply get it for free from any non-corporation.

airglow@lemmy.world · 2 hours ago

Exactly!

FooBarrington@lemmy.world · 2 hours ago

I appreciate the clarification, thank you!

Balder@lemmy.world · 6 hours ago

And that’s literally what the article says lol I don’t know why you were downvoted.

Emily Omier, a well-regarded open-source start-up consultant, emphasized that open source is a binary standard set by the Open Source Initiative (OSI), not a spectrum. "Either you’re open source, or you are not.

futatorius@lemm.ee · 6 hours ago

The binary mentioned is different. Omier was saying either you share all the source code, or it’s not open-source. You don’t get to retain some proprietary blob for an essential component and still say the whole app is open-source. Pricing is an entirely different question.

Balder@lemmy.world · edit-2 56 minutes ago

I suppose that both cases apply here. He’s saying that you either comply with an open source license that’s defined by the OSI or you don’t. That includes the source code to be available yes, but the article also mentions Meta license has a restriction:

if you have an extremely successful AI program that uses Llama code, you’ll have to pay Meta to use it. That’s not open source. Period.

From my understanding, you can’t take an open source license, add random restrictions and still call it open source (“if it’s a corporation it needs to pay a % fee to me”). It doesn’t matter if 98% of the license is open source, at that point your software simply isn’t open source anymore.

You can definitely have multiple licenses, such as Qt does to allow statically linking it and to modify it without distributing the source code, but that simply isn’t an open source one.

Balder@lemmy.world · edit-2 1 hour ago

deleted by creator