WangYM999's picture

30 9

WangYM999

YimingWang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting

upvoted a paper about 1 month ago

Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining

upvoted a paper about 1 month ago

iVideoGPT: Interactive VideoGPTs are Scalable World Models

View all activity

Organizations

None yet

YimingWang's activity

upvoted 20 papers about 1 month ago

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting

Paper • 2405.15125 • Published May 24, 2024 • 8

Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining

Paper • 2405.14908 • Published May 23, 2024 • 15

iVideoGPT: Interactive VideoGPTs are Scalable World Models

Paper • 2405.15223 • Published May 24, 2024 • 16

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

Paper • 2405.15216 • Published May 24, 2024 • 16

Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach

Paper • 2405.15613 • Published May 24, 2024 • 17

CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner

Paper • 2405.14979 • Published May 23, 2024 • 19

The Road Less Scheduled

Paper • 2405.15682 • Published May 24, 2024 • 27

AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct

Paper • 2405.14906 • Published May 23, 2024 • 27

Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

Paper • 2405.15319 • Published May 24, 2024 • 29

Aya 23: Open Weight Releases to Further Multilingual Progress

Paper • 2405.15032 • Published May 23, 2024 • 31

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 40

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Paper • 2405.15738 • Published May 24, 2024 • 46

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published May 24, 2024 • 55

Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras

Paper • 2405.14866 • Published May 23, 2024 • 9

Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling

Paper • 2405.14847 • Published May 23, 2024 • 10

NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections

Paper • 2405.14871 • Published May 23, 2024 • 10

Semantica: An Adaptable Image-Conditioned Diffusion Model

Paper • 2405.14857 • Published May 23, 2024 • 11

CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers

Paper • 2405.13195 • Published May 21, 2024 • 12

RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

Paper • 2405.14677 • Published May 23, 2024 • 12

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

Paper • 2405.14598 • Published May 23, 2024 • 14