J Li's picture

8 5

J Li

jiazhengli

·

https://jiazhengli.com/

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

jiazhengli/Qwen2.5-7B-RoleMRC-sft

updated a model 3 days ago

jiazhengli/Qwen2.5-7B-RoleMRC-dpo

updated a model 3 days ago

jiazhengli/Llama-3.1-8B-RoleMRC-sft

View all activity

Organizations

None yet

jiazhengli's activity

upvoted a collection 4 days ago

RoleMRC

A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following • 6 items • Updated 7 days ago • 1

upvoted an article 22 days ago

Article

大模型偏好优化技术：DPO及其变种

By

•

22 days ago

• 4

upvoted a paper about 2 months ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 44

upvoted 2 papers 5 months ago

Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

Paper • 2406.10957 • Published Jun 16, 2024 • 1

Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring

Paper • 2406.19949 • Published Jun 28, 2024 • 1

upvoted 3 collections 5 months ago

AERA

Resources for EMNLP 2023 Paper: Distilling ChatGPT for Explainable Automated Student Answer Assessment • 3 items • Updated Oct 14, 2024 • 1

MCTS with Preference Optimisation

Resources for EMNLP 2024 Paper: Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring • 8 items • Updated Oct 14, 2024 • 2

SamPO

Resources for EMNLP 2024 Paper: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence • 4 items • Updated Oct 14, 2024 • 2