prof_pic.png

📍 Europe

Moscow · Yerevan · Tel Aviv
Zürich · London · Abu Dhabi

Michael Diskin

Head of LLM R&D · Wildberries
Researcher · HSE University

Industry. Head of LLM R&D at Wildberries — Russia’s largest e-commerce platform. Built the LLM & embeddings organization from scratch (30+ people, 4–5 teams), shipping search, retrieval, machine translation, and RAG systems at scale. Previously ML at Yandex.

Research. Published at NeurIPS, ICML, ICLR, and EMNLP (700+ citations). Core topics: distributed training, collaborative deep learning, graph neural networks. Co-created Hivemind — an open-source framework for decentralized training.

Teaching. Lecturer at Harbour.Space University and Yandex School of Data Analysis (NLP, Deep Vision & Graphics, Reinforcement Learning).

news

Apr 23, 2026 New paper: Learning When to Be Sparse: Adaptive Activations via Two-Parameter Entropy appeared on OpenReview (Sci4DL workshop @ ICLR 2026). Also arrived in Rio de Janeiro for the conference, happy to connect on site.
Apr 09, 2026 Talk video from Data Fusion 2026: “Do You Really Need a Multi-Agent System? Lessons from Teams That Already Tried.”
Feb 13, 2026 Attended MLWS @ MBZUAI; no talk this time, but many valuable conversations and networking with the community.
Nov 09, 2025 Attended EMNLP 2025, presented at a workshop, and had many productive discussions with colleagues; also helped organize and connect the Russian-speaking NLP/ML community on site.
Nov 01, 2025 Synthetic Proofs with Tool-Integrated Reasoning: Contrastive Alignment for LLM Mathematics with Lean appeared in ACL Anthology (MathNLP @ EMNLP 2025).

selected publications

  1. SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
    Max Ryabinin, Tim Dettmers, Michael Diskin, and 1 more author
    In International Conference on Machine Learning (ICML), 2023
  2. A Critical Look at the Evaluation of GNNs under Heterophily: Are We Really Making Progress?
    Oleg Platonov, Denis Kuznedelev, Michael Diskin, and 2 more authors
    In International Conference on Learning Representations (ICLR), 2023
  3. Distributed Deep Learning in Open Collaborations
    Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin, and 13 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2021