T Ruskus's picture

24 7

T Ruskus

nlsefouh

·

nlsefouh

AI & ML interests

None yet

Organizations

nlsefouh's activity

upvoted 20 collections 5 months ago

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 11 items • Updated 23 days ago • 16

Llama3-ChatQA-2

This is the collection that presents ChatQA-2, a suite of 128K long-context models, that also have exceptional RAG capabilities • 3 items • Updated Jan 17 • 3

Model Optimizer

A collection of generative models quantized and optimized with TensorRT Model Optimizer. • 10 items • Updated 17 days ago • 9

NIM Serverless Inference API

Models in this collection are available for inference via a serverless API powered by NVIDIA NIM. • 8 items • Updated Jan 17 • 23

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 6 items • Updated Jan 17 • 5

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated Jan 17 • 60

MambaVision

MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes tiny, tiny2, small, base, large and large2 variants. • 8 items • Updated Jan 17 • 18

BigVGAN

BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. • 11 items • Updated Jan 17 • 11

Nemotron 3 8B

The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated Jan 17 • 48

SSMs

A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated Jan 17 • 27

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated Jan 17 • 43

NV-Embed

NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, STS tasks. • 3 items • Updated Jan 17 • 12

RLHF

A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated Jan 17 • 5

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated Jan 17 • 42

InstructRetro

InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. • 4 items • Updated Jan 17 • 9

Canary

A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 1 item • Updated Jan 17 • 18

Parakeet

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 8 items • Updated Jan 17 • 21

SteerLM

A collection of models and datasets relating to SteerLM and HelpSteer. • 7 items • Updated Jan 17 • 14

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Jan 17 • 162

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" • 7 items • Updated Jan 17 • 13