Xing Han Lù

xhluca

https://xinghanlu.com

AI & ML interests

None yet

Recent Activity

updated a dataset 1 day ago

McGill-NLP/agent-reward-bench

new activity 1 day ago

McGill-NLP/agent-reward-bench:Add task category

updated a Space 7 days ago

McGill-NLP/agent-reward-bench-leaderboard

xhluca's activity

upvoted a paper 8 days ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published 11 days ago • 27

upvoted a paper 11 days ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 21 days ago • 82

upvoted 2 papers about 1 month ago

Exploiting Instruction-Following Retrievers for Malicious Information Retrieval

Paper • 2503.08644 • Published Mar 11 • 16

SafeArena: Evaluating the Safety of Autonomous Web Agents

Paper • 2503.04957 • Published Mar 6 • 20

upvoted 2 papers about 2 months ago

Societal Alignment Frameworks Can Improve LLM Alignment

Paper • 2503.00069 • Published Feb 27 • 17

How to Get Your LLM to Generate Challenging Problems for Evaluation

Paper • 2502.14678 • Published Feb 20 • 17

upvoted a collection about 2 months ago

CHASE

Generate challenging synthetic data to evaluate LLMs • 5 items • Updated Feb 21 • 4

upvoted a paper about 2 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 34

upvoted an article 3 months ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • Jan 15 • 173

upvoted a paper 4 months ago

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 21

upvoted a paper 7 months ago

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 25

upvoted a paper 9 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 71

upvoted 2 papers 10 months ago

BM25S: Orders of magnitude faster lexical search via eager sparse scoring

Paper • 2407.03618 • Published Jul 4, 2024 • 13

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

Paper • 2407.03471 • Published Jul 3, 2024 • 32

upvoted a collection 10 months ago

AURORA

Repository: https://github.com/McGill-NLP/AURORA • 5 items • Updated Jul 9, 2024 • 4

upvoted a paper 11 months ago

Interpretability Needs a New Paradigm

Paper • 2405.05386 • Published May 8, 2024 • 3

upvoted a paper about 1 year ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9, 2024 • 66

upvoted a collection about 1 year ago

LLM2Vec

16 items • Updated Oct 8, 2024 • 46

upvoted a paper about 1 year ago

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Paper • 2403.17804 • Published Mar 26, 2024 • 18

upvoted a collection about 1 year ago

WebLINX Models

https://mcgill-nlp.github.io/weblinx • 17 items • Updated Jun 28, 2024 • 8