Reinforcement Learning Training

Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmarks

Ai2 updates its Olmo 3 family of models to Olmo 3.1 after extended RL training to boost performance.

NVIDIA Debuts Nemotron 3 Family of Open Models

The Nemotron 3 family of open models — in Nano, Super and Ultra sizes — introduces the most efficient family of open models ...

Korean AI startup Motif reveals 4 big lessons for training enterprise LLMs

Motif-2-12.7B-Reasoning is positioned as competitive with much larger models, but its real value lies in the transparency of ...

Tech Xplore on MSN

AI gets a private tutor for learning human preferences more accurately

Why do artificial intelligence (AI) models often miss the mark on human intent, no matter how much data they learn from?

SiliconRepublic

What is reinforcement learning?

Prof Ambuj Tewari from the University of Michigan explains the origins of reinforcement learning and why it’s so valuable in AI research and development. Understanding intelligence and creating ...

usace.army.mil

Army researchers develop innovative framework for training AI

ADELPHI, Md. — Army researchers developed a pioneering framework that provides a baseline for the development of collaborative multi-agent systems. The framework is detailed in the survey paper ...
