1 43 89

gerald hewes

gerald29

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

teapotai/teapotllm

liked a Space 10 days ago

enzostvs/deepsite

liked a model 10 days ago

deepseek-ai/DeepSeek-V3-0324

View all activity

Organizations

None yet

gerald29's activity

upvoted an article 13 days ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 963

upvoted a paper 13 days ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 115

upvoted a paper 17 days ago

SynCity: Training-Free Generation of 3D Worlds

Paper • 2503.16420 • Published 20 days ago • 25

upvoted 12 papers about 2 months ago

LLM-based User Profile Management for Recommender System

Paper • 2502.14541 • Published Feb 20 • 6

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

Paper • 2502.14802 • Published Feb 20 • 13

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

Paper • 2502.14044 • Published Feb 19 • 8

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Paper • 2502.14377 • Published Feb 20 • 12

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published Feb 20 • 13

NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization

Paper • 2502.14638 • Published Feb 20 • 11

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published Feb 18 • 29

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published Feb 20 • 24

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 141

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 180

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published Feb 6 • 37

upvoted an article 2 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.21k

upvoted a paper 2 months ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 56

upvoted 2 papers 3 months ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published Jan 18 • 26

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 114