• MagicShel@lemmy.zip
    link
    fedilink
    English
    arrow-up
    37
    ·
    17 hours ago

    both OpenAI and Microsoft are probing whether DeepSeek used OpenAI’s application programming interface (API) without permission to train its own models on the output of OpenAI’s systems, an approach referred to as distillation.

    That would definitely show up in the quality of responses. Surely they have better and cheaper training sources…

    • monotremata@lemmy.ca
      link
      fedilink
      English
      arrow-up
      1
      ·
      46 minutes ago

      I think it’s reasonably likely. There was a research paper about how to do basically that a couple years ago. If you need a basic LLM trained on a specialized form of input and output, getting the expensive existing LLMs to generate that text for you is pretty efficient/inexpensive, so it’s a reasonable way to get a baseline model. Then you can add stuff like chain of reasoning and mixture of experts to improve the performance back up to where you need it. It’s not going to be a way to push the state of the art forward, but it’s sure a cheap way to catch up to models that have done that pushing.

    • sunzu2@thebrainbin.org
      link
      fedilink
      arrow-up
      31
      arrow-down
      1
      ·
      16 hours ago

      And if they did… So what

      Get fucked corpo parasite. Nobody fucking care about another corpo punking u esp when it is done in spectacular manner.

    • Da Bald Eagul@feddit.nl
      link
      fedilink
      English
      arrow-up
      8
      ·
      16 hours ago

      Considering that they actively recruit young and inexperienced people to work for 'm, there’s a big chance, yeah.

    • WhyJiffie@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      4 hours ago

      only if it would be so easy. think about your data that’s taken about you and you can’t refuse. healthcare, home ownership, if you’re still learning then a bunch of data about your progress, and maybe even your handwriting

    • Mac@mander.xyz
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      3
      ·
      6 hours ago

      Lemmy.world admins have your data right here, what are you on about?