Phil
phil111
AI & ML interests
None yet
Recent Activity
new activity
7 days ago
THUDM/GLM-4-32B-0414:SimpleQA Scores Are WAY off
new activity
10 days ago
meta-llama/Llama-4-Scout-17B-16E-Instruct:Less Knowledge Than Llama 3.3 70b?
new activity
14 days ago
meta-llama/Llama-4-Scout-17B-16E-Instruct:13 B and34 B Pleeease!!! Most people cannot even run this.
Organizations
None yet
phil111's activity
SimpleQA Scores Are WAY off
4
5
#3 opened 8 days ago
by
phil111
Less Knowledge Than Llama 3.3 70b?
2
5
#60 opened 12 days ago
by
phil111
13 B and34 B Pleeease!!! Most people cannot even run this.
4
4
#52 opened 15 days ago
by
UniversalLove333
SimpleQA?
7
3
#29 opened 29 days ago
by
phil111
This Mistral Small has FAR less knowledge than the last.
5
20
#5 opened 3 months ago
by
phil111
English tests and tasks are absurdly overfit.
7
21
#8 opened 3 months ago
by
phil111
A heavily filtered corpus simply doesn't work.
9
4
#19 opened 3 months ago
by
phil111
I Don't Understand This Model
7
16
#9 opened 4 months ago
by
phil111
Notably better than Phi3.5 in many ways, but something is wrong.
1
8
#5 opened 4 months ago
by
phil111
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
3
2
#27 opened 4 months ago
by
phil111
SimpleQA score
2
#1 opened 4 months ago
by
frappuccino

Exceptional creative writer
15
5
#1 opened 4 months ago
by
SubtleOne
Very High English MMLU scores, Yet Extremely Low Broad English Knowledge
2
#8 opened 4 months ago
by
phil111
How was r7b?
6
#3 opened 4 months ago
by
MRU4913
Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks
12
#1 opened 4 months ago
by
Fizzarolli

local Llama + GPU(cuda)
7
#34 opened 4 months ago
by
Luciolla

local Llama + GPU(cuda)
7
#34 opened 4 months ago
by
Luciolla

How was r7b?
6
#3 opened 4 months ago
by
MRU4913
Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks
12
#1 opened 4 months ago
by
Fizzarolli

Base Model?
3
#32 opened 4 months ago
by
User8213