Sitemap

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

</article> </div>

Blog Post number 3

less than 1 minute read

Published: August 14, 2014

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

</article> </div>

Blog Post number 2

less than 1 minute read

Published: August 14, 2013

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

</article> </div>

Blog Post number 1

less than 1 minute read

Published: August 14, 2012

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

</article> </div>

portfolio

Portfolio item number 1

Short description of portfolio item number 1

</article> </div>

Portfolio item number 2

Short description of portfolio item number 2

</article> </div>

publications

Offline Meta Reinforcement Learning - Identifiability Challenges and Effective Data Collection Strategies

Ron Dorfman, Idan Shenfeld, and Aviv Tamar
Published in NeurIPS, 2021

TGRL: An Algorithm for Teacher Guided Reinforcement Learning

Idan Shenfeld, Zhang-Wei Hong, Aviv Tamar, and Pulkit Agrawal
Published in ICML, 2023

Selected for Oral Presentation at 2023 ICLR RRL Workshop.

Curiosity-driven Red-teaming for Large Language Models

Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal
Published in ICLR, 2024

Value Augmented Sampling for Language Model Alignment and Personalization

Idan Shenfeld, Seungwook Han, Akash Srivastava, Yoon Kim, Pulkit Agrawal
Published in Oral presentation at Workshop on Reliable and Responsible Foundation Models, ICLR 2024, 2024

Juicer: Data-efficient Imitation Learning for Robotic Assembly

Lars Ankile, Anthony Simeonov, Idan Shenfeld, Pulkit Agrawal
Published in IROS, 2024

The Future of Open Human Feedback

Shachar Don-Yehiya, Ben Burtenshaw,... Idan Shenfeld ..., Leshem Choshen
Published in Nature Machine Intelligence, 2025

Learning How Hard to Think: Input-Adaptive Allocation of LM Computation

Mehul Damani, Idan Shenfeld, Andi Peng, Andreea Bobu, Jacob Andreas
Published in ICLR, 2025

Language Model Personalization via Reward Factorization

Idan Shenfeld, Felix Faltings, Pulkit Agrawal, Aldo Pacchiano
Published in COLM, 2025

From Imitation to Refinement: Residual RL for Precise Visual Assembly

Lars Lien Ankile, Anthony Simeonov, Idan Shenfeld, Marcel Torne Villasevil, Pulkit Agrawal
Published in ICRA, 2025

KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity

Gholamali Aminian, Amir Asadi, Idan Shenfeld, Youssef Mroueh
Published in NeurIPS, 2025

Reinforcement Learning via Self-Distillation

Jonas Hübotter, Frederike Lübeck,... Idan Shenfeld,... Andreas Krause
Published in ICML, 2026

Self-Distillation Enables Continual Learning

Idan Shenfeld, Mehul Damani, Jonas Hübotter, Pulkit Agrawal
Published in ICML, 2026
Best Paper Award at Lifelong Agents Workshop, ICLR 2026

Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty

Mehul Damani, Isha Puri, Stewart Slocum, Idan Shenfeld, Leshem Choshen, Yoon Kim, Jacob Andreas
Published in ICLR, 2026

RL’s Razor: Why Online Reinforcement Learning Forgets Less

Idan Shenfeld, Jyothish Pari, Pulkit Agrawal
Published in ICLR, 2026
Outstanding Paper Award at the CCFM Workshop, NeurIPS 2025

Aligning Language Models From User Interactions

Thomas Kleine Buening, Jonas Hübotter, Barna Pásztor, Idan Shenfeld, Giorgia Ramponi, Andreas Krause
Published in Arxiv Preprint, 2026

Best-of-n through the Smoothing Lens: KL Divergence and Regret Analysis

Gholamali Aminian, Idan Shenfeld, Amir Asadi, Ahmad Beirami, Youssef Mroueh
Published in ICLR, 2026

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Isha Puri, Mehul Damani, Idan Shenfeld, Marzyeh Ghassemi, Jacob Andreas, Yoon Kim
Published in ICML, 2026

talks

Talk 1 on Relevant Topic in Your Field

Published: March 01, 2012

This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!

</article> </div>

Tutorial 1 on Relevant Topic in Your Field

Published: March 01, 2013

More information here

</article> </div>

Talk 2 on Relevant Topic in Your Field

Published: February 01, 2014

More information here

</article> </div>

Conference Proceeding talk 3 on Relevant Topic in Your Field

Published: March 01, 2014

This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.

</article> </div>

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

</article> </div>

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

</article> </div>

Idan Shenfeld

Sitemap

Pages

Posts

portfolio

publications

talks

teaching