-
CLEAR: Character Unlearning in Textual and Visual Modalities
Paper • 2410.18057 • Published • 198 -
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
Paper • 2410.23090 • Published • 53 -
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Paper • 2410.23743 • Published • 58 -
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization
Paper • 2411.02355 • Published • 44
Omar Elcircevi
omarcevi
·
AI & ML interests
None yet
Organizations
Collections
1
models
9
omarcevi/ppo-Pyramids_Training
Reinforcement Learning
•
Updated
•
15
omarcevi/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
12
omarcevi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
omarcevi/Reinforce-CartPole1
Reinforcement Learning
•
Updated
omarcevi/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
2
omarcevi/q-Taxi-V3
Reinforcement Learning
•
Updated
omarcevi/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
omarcevi/ppo-Huggy
Reinforcement Learning
•
Updated
•
67
omarcevi/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
4
datasets
None public yet