Best AI papers explained
A podcast by Enoch H. Kang

183 Episodes
Test-Time RL: Self-Evolving LLMs via Majority Voting Rewards
Published: 25-4-2025
Tina: Tiny LoRA Reasoning Models
Published: 25-4-2025
Evaluating large language models in theory of mind tasks
Published: 25-4-2025
QUEST: Quality Sampling for Machine Translation
Published: 24-4-2025
Offline Preference Learning via Simulated Trajectory Feedback
Published: 24-4-2025
Reasoning Elicitation in Language Models via Counterfactual Feedback
Published: 24-4-2025
Eliciting Human Preferences with Language Models
Published: 24-4-2025
Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Published: 24-4-2025
γ-Bench: Evaluating LLMs in Multi-Agent Games
Published: 24-4-2025
DRAFT: Self-Driven LLM Tool Mastery via Documentation Refinement
Published: 24-4-2025
Optimal Prediction Sets for Enhanced Human-AI Accuracy
Published: 24-4-2025
Self-Correction via Reinforcement Learning for Language Models
Published: 24-4-2025
Tractable Multi-Agent Reinforcement Learning through Behavioral Economics
Published: 24-4-2025
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
Published: 24-4-2025
Iterative Nash Policy Optimization for Language Model Alignment
Published: 24-4-2025
SycEval: Benchmarking LLM Sycophancy in Mathematics and Medicine
Published: 23-4-2025
Stack AI: Democratizing Enterprise AI Development
Published: 22-4-2025
Evaluating Modern Recommender Systems: Challenges and Future Directions
Published: 22-4-2025
AI in the Enterprise: Seven Lessons from Frontier Companies by OpenAI
Published: 22-4-2025
Discussion: Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Published: 21-4-2025
Men know other men best. Women know other women best. And yes, perhaps AIs know other AIs best. AI explains what you should know about this week's AI research progress.