Xintao Wang

Neph0s

https://neph0s.github.io/

AI & ML interests

None yet

Recent Activity

liked a model 12 days ago

Neph0s/CoSER-Llama-3.1-8B

updated a dataset 24 days ago

Neph0s/CoSER

Organizations

None yet

Neph0s's activity

upvoted a paper 24 days ago

JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models' Detection of Human Self-Destructive Behavior Content in Jirai Community

Paper • 2503.21679 • Published 25 days ago • 1

upvoted 2 papers about 1 month ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 24

Implicit Reasoning in Transformers is Reasoning through Shortcuts

Paper • 2503.07604 • Published Mar 10 • 21

upvoted 2 papers 2 months ago

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 53

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

Paper • 2502.09082 • Published Feb 13 • 28

upvoted a paper 3 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 105

upvoted 2 papers 6 months ago

VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI

Paper • 2410.11623 • Published Oct 15, 2024 • 49

Revealing the Barriers of Language Agents in Planning

Paper • 2410.12409 • Published Oct 16, 2024 • 28

upvoted a paper 8 months ago

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 45