Etched/oasis-500m · Can you train a real world model?

Nov 5, 2024

If we capture videos from the real world, label them with the corresponding user input (WASD etc.), and train a model the way this one was trained, could we get an absolutely real-world e-print game? I think the real world can be considered a complex game.

Araki

Nov 6, 2024

My guess is that it would be heavily limited to the environments the footage was recorded in. They probably can't just take random videos from the Internet, since the footage likely needs a fair level of consistency, which video games have: think of the consistent camera shake during movement, or a jump height that is always the same. So we're probably looking at very long camera rail systems across hundreds of places worldwide, which would cost quite a lot of money.

Kirpyy

Nov 6, 2024

I think this is essentially possible. It is, to some extent, what video generation models already do.

gtx1020

Nov 8, 2024

Yes, this is where we are heading.

yooon123

Dec 21, 2024

My guess is that it would be heavily limited to the environments the footage was recorded in. They probably can't just take random videos from the Internet, since the footage likely needs a fair level of consistency, which video games have: think of the consistent camera shake during movement, or a jump height that is always the same. So we're probably looking at very long camera rail systems across hundreds of places worldwide, which would cost quite a lot of money.

How much video footage do you think we would need? And how large would the dataset have to be?