3 39 4

Dan Jacobellis PRO

danjacobellis

https://danjacobellis.net

danjacobellis

AI & ML interests

Signal processing, information theory, data compression

Recent Activity

updated a dataset 3 days ago

danjacobellis/imagenet_288_webp_fg

published a dataset 4 days ago

danjacobellis/imagenet_288_webp_fg

upvoted a paper 4 days ago

Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding

View all activity

Organizations

None yet

danjacobellis's activity

upvoted a paper 4 days ago

Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding

Paper • 2501.17578 • Published 5 days ago • 1

upvoted 3 papers 6 days ago

upvoted 3 papers 12 days ago

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6, 2024 • 28

The Geometry of Tokens in Internal Representations of Large Language Models

Paper • 2501.10573 • Published 17 days ago • 8

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 25 days ago • 87

upvoted a paper 18 days ago

Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding

Paper • 2501.07783 • Published 21 days ago • 7

upvoted a collection 29 days ago

NeMo Audio Codecs

Collection

A series of Neural Audio Codecs • 5 items • Updated 17 days ago • 11

upvoted 2 collections about 1 month ago

DC-AE

Collection

Deep Compression Autoencoder • 17 items • Updated 10 days ago • 15

Cosmos Tokenizer

Collection

A suite of image and video tokenizers • 13 items • Updated 17 days ago • 37

upvoted 3 papers about 2 months ago

Generalized Gaussian Model for Learned Image Compression

Paper • 2411.19320 • Published Nov 28, 2024 • 1

Learned Compression for Compressed Learning

Paper • 2412.09405 • Published Dec 12, 2024 • 13

I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token

Paper • 2412.06676 • Published Dec 9, 2024 • 9

upvoted a paper 2 months ago

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Paper • 2411.17459 • Published Nov 26, 2024 • 11

upvoted a collection 2 months ago

medmnist 224²

Collection

12 items • Updated Dec 31, 2024 • 1

upvoted a paper 2 months ago

Factorized Visual Tokenization and Generation

Paper • 2411.16681 • Published Nov 25, 2024 • 17

upvoted 3 papers 3 months ago

Occam's Razor for Self Supervised Learning: What is Sufficient to Learn Good Representations?

Paper • 2406.10743 • Published Jun 15, 2024 • 1

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 44

Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings

Paper • 2411.08017 • Published Nov 12, 2024 • 11