weishen

fakerbaby

AI & ML interests

NLP, alignment, LLM

Recent Activity

Organizations

OpenMOSS, Fudan NLP, SII

fakerbaby's activity

upvoted an article 2 days ago
Article

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

• 59
reacted to onekq's post with 👍 7 months ago
Post
2574
Here is my latest study on OpenAI 🍓o1🍓.
A Case Study of Web App Coding with OpenAI Reasoning Models (2409.13773)

I wrote an easy-to-read blog post to explain the findings.
https://huggingface.co/blog/onekq/daily-software-engineering-work-reasoning-models

INSTRUCTION FOLLOWING is the key.

100% instruction following + Reasoning = new SOTA

But if the model misses or misunderstands one instruction, it can perform far worse than non-reasoning models.