Data and filtering models for YiZhao, our open-source financial dataset.

HITsz-Text Machine Group
university
Organization Card
Text Machine Group (TMG) from Harbin Institute of Technology (Shenzhen). 🔥
Collections
3
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model
Paper • 2501.01028 • Published • 13

HIT-TMG/KaLM-embedding-multilingual-mini-v1
Sentence Similarity • Updated • 7.26k • 19

HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1
Sentence Similarity • Updated • 32.4k • 32

HIT-TMG/KaLM-embedding-multilingual-max-instruct-v1
Updated • 9
Spaces
3
Models
16

HIT-TMG/KaLM-embedding-multilingual-mini-unsupervised
Updated • 4

HIT-TMG/KaLM-embedding-multilingual-max-instruct-v1
Updated • 9

HIT-TMG/KaLM-embedding-multilingual-mini-v1
Sentence Similarity • Updated • 7.26k • 19

HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1
Sentence Similarity • Updated • 32.4k • 32

HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5
Sentence Similarity • Updated • 74.4k • 55

HIT-TMG/bge-m3_RAG-conversational-IR
Sentence Similarity • Updated • 12 • 1

HIT-TMG/Mixtral_13B_Chat_RAG-Reader
Text Generation • Updated • 12

HIT-TMG/Qwen1.5-14B-Chat_RAG-Reader
Text Generation • Updated • 14

HIT-TMG/yizhao-risk-en-scorer
Text Classification • Updated • 22 • 3

HIT-TMG/yizhao-risk-zh-scorer
Text Classification • Updated • 17 • 2
Datasets
5
HIT-TMG/YiZhao
Viewer • Updated • 36.1M • 905 • 3

HIT-TMG/KaLM-embedding-pretrain-data
Viewer • Updated • 23.7M • 1.71k • 3

HIT-TMG/MultiSkill
Viewer • Updated • 1k • 56

HIT-TMG/TruthReader_RAG_train
Viewer • Updated • 7.16k • 113 • 4

HIT-TMG/Hansel
Viewer • Updated • 7.81M • 2.12k • 8