Vladislav Kurenkov
Sergey Kolesnikov
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Nikita Balagansky
Daniil Gavrilov
PALBERT: Teaching ALBERT to Ponder
Ivan Karpukhin
Stanislav Dereka
Sergey Kolesnikov
Probabilistic Embeddings Revisited
Mark Rofin
Nikita Balagansky
Daniil Gavrilov
Linear Interpolation In Parameter Space is Good Enough for Fine-Tuned Language Models
Stanislav Dereka
Ivan Karpukhin
Sergey Kolesnikov
Deep Image Retrieval is not Robust to Label Noise
Open-Domain Reasoning Under Multi-Modal Settings Workshop
Denis Tarasov
Vladislav Kurenkov
Sergey Kolesnikov
Prompts and Pre-Trained Language Models for Offline Reinforcement Learning
Workshop on Generalizable Policy Learning in Physical World
Askhat Sitdikov
Nikita Balagansky
Daniil Gavrilov
Alexander Markov
Classifiers are Better Experts for Controllable Text Generation
Workshop on Transfer Learning for Natural Language Processing
Ivan Karpukhin
Stanislav Dereka
Sergey Kolesnikov
EXACT: How to Train Your Accuracy
Topology, Algebra, and Geometry in Machine Learning Workshop
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Dmitriy Akimov
Vladislav Kurenkov
Alexander Nikulin
Denis Tarasov
Sergey Kolesnikov
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows