Samuel Azran (SamuelAzran)
AI & ML interests: None yet
Recent Activity
- liked a model about 12 hours ago: Qwen/QVQ-72B-Preview
- liked a model 12 days ago: openai/whisper-large-v2
- liked a model 18 days ago: ruliad/deepthought-8b-llama-v0.01-alpha
Organizations: None yet
SamuelAzran's activity
- New Gemma 2 27B? (2) · #3 opened 6 months ago by SamuelAzran
- Was it trained after the latest Hugging Face Transformers Gemma fix? If not, any plans to update? · #4 opened 10 months ago by SamuelAzran
- Should not be called Mixtral; the models made into the MoE are Yi-based (9) · #2 opened 12 months ago by teknium
- How does the MoE work? (3) · #5 opened 12 months ago by PacmanIncarnate
- One or two models during inference? (3) · #3 opened 12 months ago by Venkman42
- You know Mixtral, Llama 2 70b, GPT3.5... Are All Much Better (1) · #13 opened about 1 year ago by deleted
- Awesome! Could you help with pointers on doing the same for other languages (Swedish)? (3) · #2 opened about 1 year ago by Olofp
- QLoRA or full fine-tuning? (1) · #1 opened about 1 year ago by SamuelAzran
- Was a system message used during training? (1) · #8 opened about 1 year ago by SamuelAzran
- NEW! OpenLLMLeaderboard 2023 fall update (20) · #356 opened about 1 year ago by clefourrier
- Did you do full model fine-tuning (all layers) or only adapters? (1) · #2 opened about 1 year ago by SamuelAzran
- Can you release a chat version soon? (11) · #8 opened about 1 year ago by dong0213
- Great work, but why only 2048 context length? (1) · #4 opened over 1 year ago by SamuelAzran
- Would it work well with sequence length > 2048? (2) · #1 opened over 1 year ago by SamuelAzran
- Thank you very much! (10) · #2 opened over 1 year ago by AiCreatornator
- Error running the example code (21) · #6 opened almost 2 years ago by will33am