Chao-Chun (Joe) Hsu

joe32140

https://chaochunhsu.github.io

AI & ML interests

Hi, I am Joe!

Recent Activity

upvoted a collection 8 days ago

new activity 11 days ago

mixedbread-ai/mxbai-edge-colbert-v0-17m:fix: Include all Dense projection layers in ONNX export (output dim 48)

new activity 12 days ago

mixedbread-ai/mxbai-edge-colbert-v0-32m:fix: Include all Dense projection layers in ONNX export (output dim 64)

upvoted a collection 8 days ago

Qwen3.5

21 items • Updated about 16 hours ago • 857

upvoted a paper 12 days ago

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models

Paper • 2602.16609 • Published 14 days ago • 6

upvoted a collection 21 days ago

artificial-hivemind

This collection contains datasets for the Artificial Hiveminds paper. • 4 items • Updated May 16, 2025 • 13

upvoted a collection about 1 month ago

LightOnOCR-2 🦉

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated about 22 hours ago • 22

upvoted a collection about 2 months ago

Qwen3-VL-Embedding

2 items • Updated Jan 8 • 62

upvoted a collection 4 months ago

Sarashina2.2

Large Language Models developed by SB Intuitions. Pretrained and instruction-tuned models are available in three sizes: 0.5B, 1B, and 3B. • 6 items • Updated Mar 5, 2025 • 8

upvoted an article 5 months ago

Introducing RTEB: A New Standard for Retrieval Evaluation

Article • Oct 1, 2025 • 138

upvoted a collection 6 months ago

EmbeddingGemma

3 items • Updated Sep 11, 2025 • 111

upvoted an article 6 months ago

Welcome EmbeddingGemma, Google's new efficient embedding model

Article • Sep 4, 2025 • 273

upvoted an article 8 months ago

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

Article • Jul 1, 2025 • 133

upvoted a collection 9 months ago

Qwen3-Embedding

6 items • Updated Dec 31, 2025 • 149

upvoted a collection 10 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.7k

upvoted a paper 11 months ago

FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents

Paper • 2504.13128 • Published Apr 17, 2025 • 7

upvoted a collection 11 months ago

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated about 12 hours ago • 16

upvoted a collection 12 months ago

reranking series v2

V2 crispy rerank series • 3 items • Updated Jun 25, 2025 • 25

upvoted a paper about 1 year ago

CG-RAG: Research Question Answering by Citation Graph Retrieval-Augmented LLMs

Paper • 2501.15067 • Published Jan 25, 2025 • 1

upvoted 2 collections about 1 year ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Dec 31, 2025 • 126

🏟️ Long Code Arena

All the resources for our Long Code Arena benchmark! • 12 items • Updated 2 days ago • 6

upvoted a paper about 1 year ago

Measuring Taiwanese Mandarin Language Understanding

Paper • 2403.20180 • Published Mar 29, 2024 • 6

upvoted a collection about 1 year ago

OLMoE (November 2024)

Artifacts for open mixture-of-experts language models. • 13 items • Updated Dec 23, 2025 • 31