• Pennomi@lemmy.world
    4 months ago

    The open paper they published details the algorithms and techniques used to train it, and it’s been replicated by researchers already.

    • legolas@fedit.pl
      4 months ago

      So are these techniques really so novel and groundbreaking? Will we now see a burst of DeepSeek-like models everywhere? Because that's what absolutely should happen if the whole story is true. I would assume there are dozens or even hundreds of companies in the USA, especially in finance and dedicated AI research, that possess a similar number of chips (surely more) than the Chinese team claims to have trained their model on.

        • Aatube@kbin.melroy.org
          4 months ago

          Note that s1 is transparently a distilled model rather than one trained from scratch, meaning it inherits knowledge from an existing model (Gemini 2.0 in this case) and doesn't need to relearn that knowledge the way from-scratch training does. It's still an important result, but the training resources aren't directly comparable.
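
          The distillation idea described above can be sketched in miniature: a student is trained on a teacher's outputs instead of on original labeled data, so the teacher's expensive training never has to be repeated. This is a toy illustration with linear models, not the s1 paper's actual setup; all names and numbers here are made up for the example.

```python
# Toy sketch of distillation: the student never sees ground-truth labels,
# only the teacher's soft predictions. Illustrative only.
import numpy as np

rng = np.random.default_rng(0)

# "Teacher": a fixed, already-trained linear scorer standing in for a big model.
teacher_w = np.array([2.0, -1.0, 0.5])

def teacher_predict(X):
    return 1 / (1 + np.exp(-X @ teacher_w))  # soft probabilities

# Unlabeled inputs; the teacher provides the training signal.
X = rng.normal(size=(500, 3))
soft_targets = teacher_predict(X)

# Student: trained from scratch, but only to match the teacher's outputs
# (gradient descent on cross-entropy against the soft targets).
w = np.zeros(3)
lr = 0.5
for _ in range(2000):
    p = 1 / (1 + np.exp(-X @ w))
    w -= lr * X.T @ (p - soft_targets) / len(X)

# The student converges toward the teacher's weights without ever touching
# the teacher's original training data or compute.
print(np.round(w, 2))
```

          The point of the sketch is the cost asymmetry: the student's training loop is tiny compared to whatever produced `teacher_w`, which mirrors why distilled-model training budgets aren't comparable to from-scratch ones.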