Martin Viewegger

Viewegger

AI & ML interests

None yet

Recent Activity

Organizations

None yet

Viewegger's activity

New activity in jpgallegoar/F5-Spanish about 1 month ago
New activity in PetrosStav/F5-TTS-Greek about 1 month ago

Dataset size and output quality

#2 opened about 1 month ago by
Viewegger
New activity in marduk-ra/F5-TTS-German about 1 month ago

Training process details

4
#2 opened about 1 month ago by
Nils11
liked a Space about 1 month ago
reacted to m-ric's post with 🔥 about 1 month ago
view post
Post
787
𝗔𝗿𝗲 𝘀𝗰𝗮𝗹𝗶𝗻𝗴 𝗹𝗮𝘄𝘀 𝗼𝘃𝗲𝗿? 𝗔 𝗿𝗲𝗽𝗼𝗿𝘁 𝗳𝗿𝗼𝗺 𝘁𝗵𝗲 𝗜𝗻𝗳𝗼𝗿𝗺𝗮𝘁𝗶𝗼𝗻 𝗮𝗻𝗻𝗼𝘂𝗻𝗰𝗲𝗱 𝘁𝗵𝗮𝘁 𝗢𝗽𝗲𝗻𝗔𝗜 𝗶𝘀 𝘀𝗲𝗲𝗶𝗻𝗴 𝗱𝗶𝗺𝗶𝗻𝗶𝘀𝗵𝗶𝗻𝗴 𝗿𝗲𝘁𝘂𝗿𝗻𝘀 𝗳𝗿𝗼𝗺 𝘀𝗰𝗮𝗹𝗶𝗻𝗴 𝘂𝗽 𝘁𝗵𝗲 𝗻𝗲𝘅𝘁 𝗚𝗣𝗧 𝗺𝗼𝗱𝗲𝗹𝘀.

📊 What are scaling laws? These are empiric laws that say "Every time you increase compute spent in training 10-fold, your LLM's performance will go up by a predictable tick". Of course, they apply only if you train your model with the right methods.

The image below illustrates it: they're from a paper by Google, "Scaling Autoregressive Models for Content-Rich Text-to-Image Generation", and they show how quality and instruction following of models improve when you scale the model up (which is equivalent to scaling up the compute spent in training).

➡️ These scaling laws have immense impact: they triggered the largest gold rush ever, with companies pouring billions into scaling up theiur training. Microsoft and OpenAI spent 100B into their "Startgate" mega training cluster, due to start running in 2028.

🤔 So, what about these reports of scaling laws slowing down?

If they are true, they would mean a gigantic paradigm shift, as the hundreds of billions poured by AI companies into scaling could be a dead-end. ⛔️

But I doubt it: until the most recent publications, scaling laws showed no signs of weakness, and the researchers at the higher end of the scale-up seems to imply the scaling up continues.

Wait and see!
  • 1 reply
·
reacted to yongchanghao's post with 🔥 about 2 months ago
New activity in gokaygokay/Flux-Seamless-Texture-LoRA about 2 months ago

Size of the dataset?

8
#1 opened about 2 months ago by
Viewegger
New activity in nerijs/pixel-art-3.5L about 2 months ago

Thank you!

1
#1 opened about 2 months ago by
Viewegger
New activity in kodoqmc/XTTS-v2_PeterDrury 2 months ago

Hyperaparameters

#1 opened 2 months ago by
Viewegger