Best AI papers explained
Een podcast door Enoch H. Kang

Categorieën:
183 Afleveringen
-
Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective
Gepubliceerd: 12-5-2025 -
Leaked Claude Sonnet 3.7 System Instruction tuning
Gepubliceerd: 12-5-2025 -
Converging Predictions with Shared Information
Gepubliceerd: 11-5-2025 -
Test-Time Alignment Via Hypothesis Reweighting
Gepubliceerd: 11-5-2025 -
Rethinking Diverse Human Preference Learning through Principal Component Analysis
Gepubliceerd: 11-5-2025 -
Active Statistical Inference
Gepubliceerd: 10-5-2025 -
Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework
Gepubliceerd: 10-5-2025 -
AI-Powered Bayesian Inference
Gepubliceerd: 10-5-2025 -
Can Unconfident LLM Annotations Be Used for Confident Conclusions?
Gepubliceerd: 9-5-2025 -
Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI
Gepubliceerd: 9-5-2025 -
Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control
Gepubliceerd: 9-5-2025 -
How to Evaluate Reward Models for RLHF
Gepubliceerd: 9-5-2025 -
LLMs as Judges: Survey of Evaluation Methods
Gepubliceerd: 9-5-2025 -
The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs
Gepubliceerd: 9-5-2025 -
Limits to scalable evaluation at the frontier: LLM as Judge won’t beat twice the data
Gepubliceerd: 9-5-2025 -
Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation
Gepubliceerd: 9-5-2025 -
Accelerating Unbiased LLM Evaluation via Synthetic Feedback
Gepubliceerd: 9-5-2025 -
Prediction-Powered Statistical Inference Framework
Gepubliceerd: 9-5-2025 -
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Gepubliceerd: 9-5-2025 -
RM-R1: Reward Modeling as Reasoning
Gepubliceerd: 9-5-2025
Men know other men best. Women know other women best. And yes, perhaps AIs know other AIs best. AI explains what you should know about this week's AI research progress.