Yiming Zheng
ZYM666
AI & ML interests
None yet
Organizations
Alignment
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
  Paper • 2305.18290 • Published • 64
- Towards Efficient and Exact Optimization of Language Model Alignment
  Paper • 2402.00856 • Published • 2
- A General Theoretical Paradigm to Understand Learning from Human Preferences
  Paper • 2310.12036 • Published • 19
- Statistical Rejection Sampling Improves Preference Optimization
  Paper • 2309.06657 • Published • 15
models 7
- ZYM666/swin-spe-model
  Updated • 3
- ZYM666/q-FrozenLake-v1-4x4-noSlippery
  Reinforcement Learning • Updated
- ZYM666/Alpaca
  Updated
- ZYM666/ChatDoctor_change
  Text Generation • Updated • 12 • 1
- ZYM666/text2vec-large-chinese-support-sentence-transformer
  Updated
- ZYM666/text2vec-large-chinese-support-sentence
  Updated
- ZYM666/flower_yolov5
  Updated
datasets 0
None public yet