Deutsch English Français Italiano |
<vnjkii$p4aq$4@solani.org> View for Bookmarking (what is this?) Look up another Usenet article |
Path: ...!2.eu.feeder.erje.net!3.eu.feeder.erje.net!feeder.erje.net!weretis.net!feeder8.news.weretis.net!reader5.news.weretis.net!news.solani.org!.POSTED!not-for-mail From: Mild Shock <janburse@fastmail.fm> Newsgroups: sci.physics.relativity Subject: Some modern heros of DeepSeek (Re: the asteroid that kills tech dinosaurs) Date: Fri, 31 Jan 2025 23:58:26 +0100 Message-ID: <vnjkii$p4aq$4@solani.org> References: <vn98qo$ie21$5@solani.org> <vniq6a$o4ch$5@solani.org> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit Injection-Date: Fri, 31 Jan 2025 22:58:26 -0000 (UTC) Injection-Info: solani.org; logging-data="823642"; mail-complaints-to="abuse@news.solani.org" User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:128.0) Gecko/20100101 Firefox/128.0 SeaMonkey/2.53.20 Cancel-Lock: sha1:mU0/69EZdujHlOw6aOgfujdQJi4= X-User-ID: eJwNyMEBwCAIA8CVKiQo4yiS/Ueon3scPUbURDBAUZYdXoDs1YH5lY+4XcZDan22sdMbeuYg5lEpe2LdZZU/Q7QVTA== In-Reply-To: <vniq6a$o4ch$5@solani.org> Bytes: 2746 Lines: 60 Hi, Please meet Luo Fuli: The 29-Year-Old Genius Behind DeepSeek’s AI Revolution https://www.youtube.com/watch?v=B2fxh4aoQ8Q I find this paper interesting, finally some say about fine tuning during pretraing: Raise a Child in Large Language Model 13 Sep 2021 - Fuli Luo et al. https://arxiv.org/pdf/2109.05687 Bye Mild Shock schrieb: > Hi, > > So how its going? DeepSeek embraced by many cloud > providers, even by NVIDIA NIM itself. > > DeepSeek-R1 Now Live With NVIDIA NIM > https://blogs.nvidia.com/blog/deepseek-r1-nim-microservice/ > > What what are these models doing and how are they > trained. Is Geoffrey Hinton our only AI God? There > seems to be another slightly disputed AI God, > > S. Hochreiter, J. Schmidhuber. Long Short-Term Memory. Neural > Computation, 9(8):1735-1780, 1997. > https://people.idsia.ch/~juergen/deep-learning-history.html > > Bye > > P.S.: It allows a mechanistic view on our linguistic > brain if the latent space is some semantic vectors? > So that learning is a kind of control mechanism: > > Machine Learning Approach to Model Order Reduction > of Nonlinear Systems via Autoencoder and LSTM Networks > Thomas Simpson - 23 Sep 2021 > https://arxiv.org/abs/2109.11213 > > Mild Shock schrieb: >> Hi, >> >> Wait till USA figures out there is a second >> competitor besides DeepSeek, its called Yi-Lightning: >> >> Yi-Lightning Technical Report >> https://arxiv.org/abs/2412.01253 >> >> It was already discussed 2 months ago: >> >> Eric Schmidt DROPS BOMBSHELL: China DOMINATES AI! >> https://www.youtube.com/watch?v=ddWuEUjo4u4 >> >> Bye >