AI, reinforcement learning and Turing Award

Teaching machines in the way that animal trainers mold the behavior of dogs or horses has been an important method for ...
A new study suggests reasoning models from DeepSeek and OpenAI are learning to manipulate on their own.
These newer models appear more likely to indulge in rule-bending behaviors than previous generations—and there’s no way to ...
Alibaba Cloud on Thursday launched QwQ-32B, a compact reasoning model built on its latest large language model (LLM), Qwen2.5 ...
Current research combined with industry development demonstrates that AI safety requires a complex approach that includes ...
Contrary to concerns that an overreliance on AI might dull critical thinking, Professor Balaraman Ravindran from IIT Madras believes these tools are transformative and encourages aspiring IITians and ...
Some employees use AI to boost productivity, creating impressive work without fully grasping the subject. Here's why that's a problem.
A hot trend is to train generative AI and LLMs by using logical reasoning exhibited by other AI. This is clever and a nifty ...