Ian J

iyanello

MIkeLP

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

Team-ACE/ToolACE-2-Llama-3.1-8B

liked a model 8 days ago

nvidia/Llama-3.1-Nemotron-8B-UltraLong-1M-Instruct

liked a model 13 days ago

bartowski/meta-llama_Llama-4-Scout-17B-16E-Instruct-GGUF

Organizations

None yet

iyanello's activity

upvoted a collection 6 months ago

Pangea

A Fully Open Multilingual Multimodal LLM for 39 Languages • 26 items • Updated Feb 1 • 18

upvoted a collection 7 months ago

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 20 days ago • 85

upvoted 2 collections 8 months ago

Knowledge graph

25 items • Updated Feb 11, 2024 • 6

VideoLLaMA2

Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability • 13 items • Updated Mar 11 • 19

upvoted a collection 11 months ago

Cohere Labs Aya 23

Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated 8 days ago • 55

upvoted a paper 11 months ago

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

Paper • 2405.12981 • Published May 21, 2024 • 33

upvoted a paper about 1 year ago

Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

Paper • 2404.07973 • Published Apr 11, 2024 • 33

upvoted a collection about 1 year ago

StarChat2 15B

Model, datasets, and demo for StarChat2 15B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 10 items • Updated Apr 12, 2024 • 13

upvoted a paper about 1 year ago

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15, 2024 • 60