• AlteredEgo@lemmy.ml
    link
    fedilink
    English
    arrow-up
    2
    ·
    8 hours ago

    Haven’t they clearly documented how they did it and what they used so that anyone can replicate it? Anyone with the compute power, which of course few have. But universities could do it.

    So how is it not open source in this specific domain of problems? What would a LLM model need to do to be open source then? Duplicate the whole training dataset in a big zipfile for you to download?

    From what I understand you could even replicate deepseek by replacing the “cold start” with latest deepseek instead.

    • boonhet@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      3
      ·
      4 hours ago

      Haven’t they clearly documented how they did it and what they used so that anyone can replicate it?

      They don’t put up the actual code for their training pipeline though. It’s more of a “if you have enough engineers, you can do this too” whitepaper, because they wouldn’t want any rando training their own model.

      Right now, even if you had the exact training set (which is a CRUCIAL part of an LLM and you can NOT replicate it without it), you couldn’t rebuild the thing exactly, you’d need to do a whole lot of extra work.

      So how is it not open source in this specific domain of problems?

      You could call all proprietary software open source then. The UI and user manual describe what it does, you can do your own engineering to duplicate the functionality.

    • metermatic26@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 hours ago

      Why even have this discussion? Self-learning algorithms appeared more than ten years ago. AI is being used very effectively in countless areas.

      The idea that there is some sort of prize waiting for whomever gets the most computing power, is highly dubious.