Gabriel Lucchesi

gabehubner

https://ghubnerr.github.io

ghubnerr

AI & ML interests

LLMs, RLHF, Model Interpretability, GNNs

Recent Activity

updated a Space 7 days ago

gabehubner/distilbert-sentiment-classifier

published a Space 7 days ago

gabehubner/distilbert-sentiment-classifier

gabehubner's activity

upvoted a paper 3 days ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published 5 days ago • 23

upvoted a paper 7 months ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 123

upvoted a paper about 1 year ago

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Paper • 2402.11450 • Published Feb 18, 2024 • 22