Vardaan Pahuja's picture

3 13 4

Vardaan Pahuja

vardaan123

·

https://vardaanpahuja.github.io/

AI & ML interests

LLM Agents, Multimodal Foundation Models, Knowledge Graphs

Recent Activity

upvoted a paper 12 days ago

Diversifying Joint Vision-Language Tokenization Learning

upvoted a paper 12 days ago

A Systematic Investigation of KB-Text Embedding Alignment at Scale

upvoted an article 23 days ago

Open-source DeepResearch – Freeing our search agents

View all activity

Organizations

vardaan123's activity

upvoted 2 papers 12 days ago

Diversifying Joint Vision-Language Tokenization Learning

Paper • 2306.03421 • Published Jun 6, 2023 • 2

A Systematic Investigation of KB-Text Embedding Alignment at Scale

Paper • 2106.01586 • Published Jun 3, 2021 • 1

upvoted an article 23 days ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.21k

upvoted 3 papers about 1 month ago

Structure Learning for Neural Module Networks

Paper • 1905.11532 • Published May 27, 2019 • 1

Learning Sparse Mixture of Experts for Visual Question Answering

Paper • 1909.09192 • Published Sep 19, 2019 • 1

Bringing Back the Context: Camera Trap Species Identification as Link Prediction on Multimodal Knowledge Graphs

Paper • 2401.00608 • Published Dec 31, 2023 • 2

upvoted a paper about 2 months ago

Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

Paper • 2502.11357 • Published Feb 17 • 10

upvoted 2 papers 6 months ago

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Paper • 2410.05080 • Published Oct 7, 2024 • 21

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Paper • 2410.05243 • Published Oct 7, 2024 • 19

upvoted 4 papers about 1 year ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 83

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Paper • 2403.04746 • Published Mar 7, 2024 • 26

Learning and Leveraging World Models in Visual Representation Learning

Paper • 2403.00504 • Published Mar 1, 2024 • 34

A Retrieve-and-Read Framework for Knowledge Graph Link Prediction

Paper • 2212.09724 • Published Dec 19, 2022 • 1