seung hwan jung
digit82
AI & ML interests
None yet
Recent Activity
updated
a collection
about 1 month ago
llm
updated
a collection
about 1 month ago
llm
upvoted
a
paper
about 1 month ago
Mixture-of-Transformers: A Sparse and Scalable Architecture for
Multi-Modal Foundation Models
Organizations
None yet
Collections
2
-
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 135 -
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Paper • 2409.20566 • Published • 53 -
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
Paper • 2411.04996 • Published • 49 -
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation
Paper • 2410.21271 • Published • 6
models
9
digit82/qwen2-7b-instruct-amazon-description
Updated
digit82/whisper-small-hi
Updated
digit82/gpt2-chat-sample
Text Generation
•
Updated
•
17
digit82/test-model2
Text Classification
•
Updated
•
12
digit82/test-model
Text Classification
•
Updated
•
12
digit82/kobart-summarization
Text2Text Generation
•
Updated
•
8.93k
•
4
digit82/dialog-sbert-base
Text Classification
•
Updated
•
13
digit82/kogpt2-summarization
Text Generation
•
Updated
•
19
digit82/kolang-t5-base
Text2Text Generation
•
Updated
•
16
datasets
None public yet