Paulson's picture

314 22

Paulson

Pnaomi

·

AI & ML interests

Yes

Recent Activity

upvoted a paper 1 day ago

NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations

upvoted a paper 1 day ago

Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages

upvoted a paper 1 day ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

View all activity

Organizations

Pnaomi's activity

upvoted 11 papers 1 day ago

NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations

Paper • 2503.23162 • Published 8 days ago • 9

Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages

Paper • 2503.23542 • Published 7 days ago • 9

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published 5 days ago • 10

Interpreting Emergent Planning in Model-Free Reinforcement Learning

Paper • 2504.01871 • Published 4 days ago • 11

Efficient Model Selection for Time Series Forecasting via LLMs

Paper • 2504.02119 • Published 4 days ago • 13

Scaling Analysis of Interleaved Speech-Text Language Models

Paper • 2504.02398 • Published 3 days ago • 24

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published 3 days ago • 71

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published 3 days ago • 27

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published 3 days ago • 50

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 3 days ago • 61

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 6 days ago • 155

upvoted 6 papers 2 days ago

Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations

Paper • 2503.18817 • Published 13 days ago • 3

MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Paper • 2502.18924 • Published Feb 26 • 8

Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models

Paper • 2503.22879 • Published 9 days ago • 9

VerifiAgent: a Unified Verification Agent in Language Model Reasoning

Paper • 2504.00406 • Published 6 days ago • 6

Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks

Paper • 2504.01308 • Published 5 days ago • 13

DASH: Detection and Assessment of Systematic Hallucinations of VLMs

Paper • 2503.23573 • Published 7 days ago • 12

upvoted 3 papers 4 days ago

MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing

Paper • 2503.24219 • Published 6 days ago • 2

m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models

Paper • 2504.00869 • Published 5 days ago • 9

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Paper • 2504.00557 • Published 5 days ago • 15