Zhang Ruichong

ZhangRC

https://www.zhihu.com/people/triangjyeddriung

Triang-jyed-driung

AI & ML interests

Mathematics (real analysis, functional analysis, commutative algebra, etc.)

Recent Activity

liked a model 4 days ago

google-bert/bert-base-uncased

updated a model 12 days ago

ZhangRC/RWKV-World-7B-2.1-State-PengBo-Style

liked a model 15 days ago

recursal/QRWKV6-32B-Instruct-Preview-v0.1

ZhangRC's activity

New activity in THUDM/chatglm3-6b about 1 month ago

Inference error in transformers 4.42.1

#58 opened 6 months ago by

New activity in HuggingFaceFW/blogpost-fine-tasks 2 months ago

XNLI in Chinese is basically corrupted - that's why you got unreliable results

#1 opened 2 months ago by

commented a paper 5 months ago

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16 • 54

commented a paper 8 months ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 61

New activity in BlinkDL/rwkv-6-world 9 months ago

Will rwkv6-pile be released in the future?

#1 opened 9 months ago by

New activity in Jellyfish042/Bubble-Factory-V0 10 months ago

Add these datasets

#2 opened 10 months ago by

New activity in a686d380/rwkv-5-h-world about 1 year ago

🚩 Report: Ethical issue(s)

#1 opened about 1 year ago by

New activity in xiaol/rwkv-7B-world-novel-128k over 1 year ago

Hello, could you try training the model to generate and analyze chemical molecules while preserving its original capabilities?

#1 opened over 1 year ago by

New activity in BlinkDL/rwkv-5-world over 1 year ago

Which languages are used in the dataset?

#1 opened over 1 year ago by

New activity in BlinkDL/rwkv-4-world over 1 year ago

How to fine-tune this model?

#2 opened over 1 year ago by

Unable to load tokenizer

#1 opened over 1 year ago by

New activity in BlinkDL/rwkv-4-raven over 1 year ago

Any plan for bigger model such as 30B?

#10 opened over 1 year ago by