Oguzhan Gencoglu

Ouz-G
·

AI & ML interests

LLM Evals

Recent Activity

updated a Space 22 days ago
root-signals/RootEvaluatorsDemo
updated a Space 22 days ago
root-signals/CustomJudgeDemo
View all activity

Organizations

Root Signals AI's profile picture

Ouz-G's activity

New activity in WildEval/ZebraLogic about 1 month ago

Human baseline

#1 opened about 1 month ago by
Ouz-G
New activity in WildEval/ZebraLogic about 1 month ago

Human baseline for ZebraLogic

#3 opened about 1 month ago by
Ouz-G
New activity in 6cf/liveideabench about 1 month ago

Do you have human baselines?

2
#2 opened about 1 month ago by
Ouz-G
New activity in marianna13/AIW-responses about 2 months ago

Is human baseline available?

#1 opened about 2 months ago by
Ouz-G
New activity in cais/hle about 2 months ago

Do you have human baseline?

#3 opened about 2 months ago by
Ouz-G
New activity in Salesforce/GIFT-Eval about 2 months ago
New activity in Salesforce/GIFT-Eval 2 months ago
New activity in justin-zk/Personalize-SAM 2 months ago
New activity in bigcode/bigcodebench-leaderboard 3 months ago

Human baseline

1
#8 opened 3 months ago by
Ouz-G
New activity in open-llm-leaderboard/open_llm_leaderboard 3 months ago

Human Performance row

1
#1050 opened 3 months ago by
Ouz-G
New activity in shenyunhang/APE_demo 4 months ago

Demo is down

#2 opened 4 months ago by
Ouz-G
New activity in huawei-noah/human_rank_eval 4 months ago
New activity in jadechoghari/OmniParser 4 months ago
New activity in gabrielvaz/microsoft-OmniParser 4 months ago

Space seems to be down

#1 opened 4 months ago by
Ouz-G
New activity in IGNF/FLAIR-INC_rgbie_12cl_resnet34-unet 5 months ago

config file is missing

1
#1 opened 5 months ago by
Ouz-G
New activity in google/frames-benchmark 5 months ago