General Preference

university

https://github.com/general-preference

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

yifAI authored a paper about 1 month ago

Scaling Image Tokenizers with Grouped Spherical Quantization

yifAI authored a paper about 2 months ago

Training and Evaluating Language Models with Template-based Data Generation

thughost authored a paper about 2 months ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

View all activity

general-preference's activity

yifAI

authored a paper about 1 month ago

Scaling Image Tokenizers with Grouped Spherical Quantization

Paper • 2412.02632 • Published Dec 3, 2024 • 10

yifAI

authored a paper about 2 months ago

Training and Evaluating Language Models with Template-based Data Generation

Paper • 2411.18104 • Published Nov 27, 2024 • 3

thughost

authored a paper about 2 months ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15, 2024 • 13

kirigayahitsugi

updated 4 models 3 months ago

thughost

authored a paper 3 months ago

DPLM-2: A Multimodal Diffusion Protein Language Model

Paper • 2410.13782 • Published Oct 17, 2024 • 20

kirigayahitsugi

updated a model 3 months ago

general-preference/GPM-Llama-3.1-8B

Updated Oct 15, 2024 • 9 • 1

yifAI

updated 2 models 3 months ago

general-preference/GPO-Llama-3-8B-Instruct-GPM-2B

Text Generation • Updated Oct 11, 2024 • 17 • 2

general-preference/SPPO-Llama-3-8B-Instruct-GPM-2B

Text Generation • Updated Oct 11, 2024 • 13 • 1

thughost

authored 2 papers 3 months ago

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 8

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 35

yifAI

authored a paper 3 months ago

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 8

yifAI

authored a paper 4 months ago

On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 13

thughost

authored a paper 4 months ago

ProteinBench: A Holistic Evaluation of Protein Foundation Models

Paper • 2409.06744 • Published Sep 10, 2024 • 8

thughost

posted an update 7 months ago

Post

701

We've open-sourced the code and models for Self-Play Preference Optimization (SPPO)! 🚀🚀🚀
🤗paper: Self-Play Preference Optimization for Language Model Alignment (2405.00675)
⭐ code: https://github.com/uclaml/SPPO
🤗models: UCLA-AGI/sppo-6635fdd844f2b2e4a94d0b9a

thughost

authored 3 papers 9 months ago

DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization

Paper • 2403.13829 • Published Mar 7, 2024

Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs

Paper • 2305.08359 • Published May 15, 2023

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

Paper • 2311.14222 • Published Nov 23, 2023

AI & ML interests

Recent Activity

Team members 4

general-preference's activity