Dharun Krishna K B

kbdharun

https://kbdharun.dev

AI & ML interests

Deep Neural Networks, Convolutional Neural Networks, Generative Adversarial Networks, Natural Language Processing

Recent Activity

liked a dataset about 2 months ago

abhinand/tamil-alpaca

liked a dataset about 2 months ago

azharmo/tamil-orca

liked a dataset about 2 months ago

AnanthZeke/oscar_tamil_clean

kbdharun's activity

upvoted a paper about 2 months ago

Tamil-Llama: A New Tamil Language Model Based on Llama 2

Paper • 2311.05845 • Published Nov 10, 2023 • 3

upvoted 4 papers 3 months ago

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published Oct 10 • 45

Robust Speech Recognition via Large-Scale Weak Supervision

Paper • 2212.04356 • Published Dec 6, 2022 • 25

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 168

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144

upvoted a collection 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 21 days ago • 548

upvoted 2 papers 3 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 138

upvoted 9 papers 4 months ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10 • 55

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published Sep 5 • 30

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2 • 94

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28 • 34

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29 • 92

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27 • 138

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27 • 121

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 124

ImageNet Large Scale Visual Recognition Challenge

Paper • 1409.0575 • Published Sep 1, 2014 • 8