Path: ...!2.eu.feeder.erje.net!3.eu.feeder.erje.net!feeder.erje.net!weretis.net!feeder8.news.weretis.net!reader5.news.weretis.net!news.solani.org!.POSTED!not-for-mail
From: Mild Shock <janburse@fastmail.fm>
Newsgroups: sci.physics.relativity
Subject: Some modern heros of DeepSeek (Re: the asteroid that kills tech
 dinosaurs)
Date: Fri, 31 Jan 2025 23:58:26 +0100
Message-ID: <vnjkii$p4aq$4@solani.org>
References: <vn98qo$ie21$5@solani.org> <vniq6a$o4ch$5@solani.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 31 Jan 2025 22:58:26 -0000 (UTC)
Injection-Info: solani.org;
	logging-data="823642"; mail-complaints-to="abuse@news.solani.org"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:128.0) Gecko/20100101
 Firefox/128.0 SeaMonkey/2.53.20
Cancel-Lock: sha1:mU0/69EZdujHlOw6aOgfujdQJi4=
X-User-ID: eJwNyMEBwCAIA8CVKiQo4yiS/Ueon3scPUbURDBAUZYdXoDs1YH5lY+4XcZDan22sdMbeuYg5lEpe2LdZZU/Q7QVTA==
In-Reply-To: <vniq6a$o4ch$5@solani.org>
Bytes: 2746
Lines: 60

Hi,

Please meet Luo Fuli:

The 29-Year-Old Genius Behind DeepSeek’s AI Revolution
https://www.youtube.com/watch?v=B2fxh4aoQ8Q

I find this paper interesting; finally someone
says something about fine-tuning during pretraining:

Raise a Child in Large Language Model
13 Sep 2021 - Fuli Luo et al.
https://arxiv.org/pdf/2109.05687

Bye

Mild Shock wrote:
> Hi,
> 
> So how is it going? DeepSeek has been embraced by many
> cloud providers, even by NVIDIA NIM itself.
> 
> DeepSeek-R1 Now Live With NVIDIA NIM
> https://blogs.nvidia.com/blog/deepseek-r1-nim-microservice/
> 
> What are these models doing, and how are they
> trained? Is Geoffrey Hinton our only AI God? There
> seems to be another, slightly disputed AI God:
> 
> S. Hochreiter, J. Schmidhuber. Long Short-Term Memory. Neural 
> Computation, 9(8):1735-1780, 1997.
> https://people.idsia.ch/~juergen/deep-learning-history.html
> 
> Bye
> 
> P.S.: Does it allow a mechanistic view of our linguistic
> brain if the latent space consists of semantic vectors?
> Learning would then be a kind of control mechanism:
> 
> Machine Learning Approach to Model Order Reduction
> of Nonlinear Systems via Autoencoder and LSTM Networks
> Thomas Simpson - 23 Sep 2021
> https://arxiv.org/abs/2109.11213
> 
> Mild Shock wrote:
>> Hi,
>>
>> Wait till the USA figures out there is a second
>> competitor besides DeepSeek; it's called Yi-Lightning:
>>
>> Yi-Lightning Technical Report
>> https://arxiv.org/abs/2412.01253
>>
>> It was already discussed 2 months ago:
>>
>> Eric Schmidt DROPS BOMBSHELL: China DOMINATES AI!
>> https://www.youtube.com/watch?v=ddWuEUjo4u4
>>
>> Bye
>