9 21 3

Xiang Liu

Dominic789654

https://dominic789654.github.io/

Dominic789654

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

upvoted a paper about 1 month ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

commented on a paper about 1 month ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

View all activity

Organizations

None yet

Dominic789654's activity

upvoted a paper about 1 month ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

Paper • 2502.17535 • Published Feb 24 • 8

upvoted 2 papers about 2 months ago

Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research

Paper • 2502.12669 • Published Feb 18 • 2

Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

Paper • 2502.04411 • Published Feb 6 • 4

upvoted 5 papers 2 months ago

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published Feb 4 • 15

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference

Paper • 2502.00299 • Published Feb 1 • 2

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 91

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 113

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 374

upvoted 4 papers 3 months ago

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published Jan 2 • 17

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

Paper • 2412.17483 • Published Dec 23, 2024 • 33

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 54

SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation

Paper • 2412.13649 • Published Dec 18, 2024 • 20

upvoted a paper 5 months ago

Should We Really Edit Language Models? On the Evaluation of Edited Language Models

Paper • 2410.18785 • Published Oct 24, 2024 • 7

upvoted 4 papers 6 months ago

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Paper • 2410.13085 • Published Oct 16, 2024 • 22

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 51

LPZero: Language Model Zero-cost Proxy Search from Zero

Paper • 2410.04808 • Published Oct 7, 2024 • 2

LongGenBench: Long-context Generation Benchmark

Paper • 2410.04199 • Published Oct 5, 2024 • 20

upvoted a paper 8 months ago

3D Question Answering for City Scene Understanding

Paper • 2407.17398 • Published Jul 24, 2024 • 22

upvoted a paper 10 months ago

Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models

Paper • 2406.02924 • Published Jun 5, 2024 • 2

upvoted a paper about 1 year ago

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Paper • 2403.17919 • Published Mar 26, 2024 • 16