Request: DOI
#15 opened 28 days ago by Atharvab7
Request: DOI
#13 opened about 2 months ago by JK-slone
Attention matrix
1
#12 opened over 1 year ago by stolosa
Lower precision
#11 opened over 1 year ago by pipparichter
PEFT LoRA and QLoRA
#10 opened over 1 year ago by AmelieSchreiber
accessing to embedding layer and generate embeddings step by step
#9 opened almost 2 years ago by francescopatane
Understanding vocabulary size
#8 opened almost 2 years ago by dannyLCG
how visualize attention matrix
2
#7 opened almost 2 years ago by francescopatane
TorchScript export failed. Maybe related to sequence length cache.
#5 opened about 2 years ago by chenchaozhao
inferring device map for model
#4 opened over 2 years ago by mahdi-b
passing parameters to the underlying model's forward
4
#3 opened over 2 years ago by mahdi-b