VirtueAI

company

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

ydeng9 authored a paper 10 days ago

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

yuyangy authored a paper about 2 months ago

AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies

yuyangy authored a paper about 2 months ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

View all activity

Virtue-AI-HUB's activity

ydeng9

authored a paper 10 days ago

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

Paper • 2503.17352 • Published 13 days ago • 21

yuyangy

authored 2 papers about 2 months ago

AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies

Paper • 2406.17864 • Published Jun 25, 2024

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7 • 22

ydeng9

authored a paper about 2 months ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7 • 22

ydeng9

authored a paper 5 months ago

Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning

Paper • 2410.22304 • Published Oct 29, 2024 • 18

yuyangy

authored 8 papers 6 months ago

SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models

Paper • 2403.07384 • Published Mar 12, 2024 • 1

AIR-Bench 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies

Paper • 2407.17436 • Published Jul 11, 2024

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

Paper • 2410.11096 • Published Oct 14, 2024 • 13

yuyangy

updated a dataset 6 months ago

Virtue-AI-HUB/SecCodePLT

Viewer • Updated Oct 16, 2024 • 1.35k • 89 • 4

ydeng9

authored 2 papers 9 months ago

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

Paper • 2405.19716 • Published May 30, 2024

MIRAI: Evaluating LLM Agents for Event Forecasting

Paper • 2407.01231 • Published Jul 1, 2024 • 18

ydeng9

posted an update 9 months ago

Post

1363

Check out our new benchmark paper on LLM agents for global events forecasting! MIRAI: Evaluating LLM Agents for Event Forecasting (2407.01231)

📜 Arxiv: https://arxiv.org/abs/2407.01231
🔗 Project page: https://mirai-llm.github.io
💻 GitHub Repo: https://github.com/yecchen/MIRAI
📁 Dataset: https://drive.google.com/file/d/1xmSEHZ_wqtBu1AwLpJ8wCDYmT-jRpfrN/view?usp=sharing
📊 Interactive Demo Notebook: https://colab.research.google.com/drive/1QyqT35n6NbtPaNtqQ6A7ILG_GMeRgdnO?usp=sharing

zhangce

authored a paper 10 months ago

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 59

yizeng

authored 2 papers 12 months ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18, 2024 • 11

RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content

Paper • 2403.13031 • Published Mar 19, 2024 • 1

AI & ML interests

Recent Activity

Team members 10

Virtue-AI-HUB's activity