arxiv:2604.19698
Michal Valko
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
updated a dataset about 12 hours ago
misovalko/my-research-papers authored a paper 1 day ago
Budgeted Online Influence Maximization authored a paper 1 day ago
Planning in entropy-regularized Markov decision processes and games