BlackRock's Larry Fink urges trillions in AI investment to keep US ahead, citing national security risks if China advances. His proposal to use retirement savings sparks social media uproar.
Haven’t they clearly documented how they did it and what they used so that anyone can replicate it? Anyone with the compute power, which of course few have. But universities could do it.
So how is it not open source in this specific domain of problems? What would an LLM need to do to be open source then? Duplicate the whole training dataset in a big zip file for you to download?
From what I understand, you could even replicate DeepSeek by replacing the “cold start” with the latest DeepSeek instead.
Haven’t they clearly documented how they did it and what they used so that anyone can replicate it?
They don’t put up the actual code for their training pipeline though. It’s more of an “if you have enough engineers, you can do this too” whitepaper, because they wouldn’t want any rando training their own model.
Right now, even if you had the exact training set (which is a CRUCIAL part of an LLM, and you can NOT replicate the model without it), you couldn’t rebuild the thing exactly; you’d need to do a whole lot of extra work.
So how is it not open source in this specific domain of problems?
You could call all proprietary software open source then. The UI and user manual describe what it does, and you can do your own engineering to duplicate the functionality.
Why even have this discussion? Self-learning algorithms appeared more than ten years ago. AI is being used very effectively in countless areas.
The idea that there is some sort of prize waiting for whoever gets the most computing power is highly dubious.