1 21 33

Nwankwo samuel

Samexplorer

AI & ML interests

Multi modal

Recent Activity

updated a collection 9 days ago

GAI

liked a Space 13 days ago

bytedance-research/UNO-FLUX

liked a dataset 17 days ago

MrDragonFox/Elise

View all activity

Organizations

None yet

Samexplorer's activity

upvoted a paper 20 days ago

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance

Paper • 2504.01724 • Published 20 days ago • 64

upvoted a paper 26 days ago

FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

Paper • 2503.04919 • Published Mar 6 • 8

upvoted a collection about 1 month ago

💫StarVector Models

Collection

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 93

upvoted 3 papers 3 months ago

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Paper • 2501.15907 • Published Jan 27 • 17

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 57

Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise

Paper • 2501.08331 • Published Jan 14 • 20

upvoted 4 papers 4 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 99

upvoted a paper 5 months ago

One Shot, One Talk: Whole-body Talking Avatar from a Single Image

Paper • 2412.01106 • Published Dec 2, 2024 • 20

upvoted a collection 5 months ago

LipSync and Face Operations

Collection

18 items • Updated 13 days ago • 48

upvoted 6 papers 6 months ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 55

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 130

Distilling an End-to-End Voice Assistant Without Instruction Training Data

Paper • 2410.02678 • Published Oct 3, 2024 • 23

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 83

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Paper • 2410.05229 • Published Oct 7, 2024 • 22

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9, 2024 • 47

upvoted a paper 7 months ago

Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise

Paper • 2410.03017 • Published Oct 3, 2024 • 29

upvoted an article 7 months ago

Article

Exploring the Daily Papers Page on Hugging Face

Sep 23, 2024

• 54