Leadership for AI

Practical Reinforcement Learning 

Leverage Reinforcement Learning to solve complex real-world problems.

Dive into RL to control complex systems, optimize decisions, and build advanced AI models that tackle real-world challenges.

  • Next Training: To be announced
  • Duration: To be announced
  • Price: To be announced
  • Languages: English (German available upon request)
  • Location: Online (Onsite available upon request)

Details

Reinforcement Learning (RL) is an integral part of various algorithms and applications: from controlling robots and complex systems like weather balloons to breakthroughs in games (Go, Chess, Dota, Starcraft) and algorithm development (Matrix Multiplication, Compression, and others). Elements of RL also play crucial roles in the newest developments in large language models, like alignment of models with RL from Human Feedback (RLHF), prompting techniques that use Monte Carlo Tree Search, and reasoning traces in OpenAI’s o1 models.

Despite such successes and the long history of RL, there are comparatively few real-world applications and even fewer mature software resources. Unlike areas like Computer Vision and Natural Language Processing, as of 2024, Reinforcement Learning is far from being commoditized. This training by Oraios is a step towards improving the current situation.

We will discuss which problems are most suitable for RL, and when other techniques should be preferred. Then we go on to building custom environments, highlighting the most common problems and pitfalls in environment design, and demonstrate the structure and the strengths and weaknesses of various RL algorithms. As the developers behind a large open-source RL library (Tianshou) who have also worked on industry applications of RL for multiple years, Oraios’ trainers are uniquely suited for teaching how the challenging techniques behind reinforcement learning can be leveraged for creating real value in applications.

Modules

Understanding RL: Basics, History, and Modern Applications

Understand the foundations of RL, its history, and its applications in the latest AI innovations.

Building RL Environments

Create custom environments and overcome common design challenges to maximize your RL models’ potential.

Algorithm Analysis and Pitfalls

Analyze the strengths, weaknesses, and implementation of key RL algorithms.

Real-World Case Studies

Learn how RL is used to create real value across different sectors and develop skills for practical implementation.

Meet the trainers

Dr. Dominik Jain

Dominik is a computer scientist with a passion for algorithms,
software design and automation in general. As a researcher at
Technische Universität München, he gave courses on the topics of artificial intelligence, knowledge representation, statistical relational models and discrete probability theory. After receiving
his PhD in artificial intelligence in 2012, he spent the next 11 years
working on applied research and development, primarily in the
automotive sector. His work leveraged machine learning,
probabilistic modelling, combinatorial optimisation, search
algorithms, and other methodologies to advance AI-based
capabilities across a diverse array of problem domains. In 2023,
he joined appliedAI’s TransferLab, working on applications of
reinforcement learning and generative AI and developing trainings as well as open-source software for the wider community, before co-founding Oraios AI in September 2024.

Dr. Michael Panchenko

Michael is an AI researcher with a strong background in mathematics and theoretical physics. After completing his master’s degrees in mathematical and theoretical physics as well as mathematics and subsequently working as a researcher in theoretical physics at Ludwig Maximilian University, Michael transitioned into applied AI research. As a lead AI researcher at appliedAI since 2020, he has spearheaded numerous innovative
projects, including the application of reinforcement learning for controlling electron microscopes and the development of large language model-based applications. Michael has always been dedicated to education and knowledge sharing. He has taught courses ranging from theoretical physics and mathematics at the university level to specialized industry trainings in ML engineering, reinforcement learning, anomaly detection, and Bayesian methods. His ability to convey complex concepts clearly and effectively has made him a valued educator in both academic and professional settings. He co-founded Oraios AI in 2024.

Dr. Dominik Jain

Dominik is a computer scientist with a passion for algorithms,
software design and automation in general. As a researcher at
Technische Universität München, he gave courses on the topics of artificial intelligence, knowledge representation, statistical relational models and discrete probability theory. After receiving
his PhD in artificial intelligence in 2012, he spent the next 11 years
working on applied research and development, primarily in the
automotive sector. His work leveraged machine learning,
probabilistic modelling, combinatorial optimisation, search
algorithms, and other methodologies to advance AI-based
capabilities across a diverse array of problem domains. In 2023,
he joined appliedAI’s TransferLab, working on applications of
reinforcement learning and generative AI and developing trainings as well as open-source software for the wider community, before co-founding Oraios AI in September 2024.

Dr. Michael Panchenko

Michael is an AI researcher with a strong background in mathematics and theoretical physics. After completing his master’s degrees in mathematical and theoretical physics as well as mathematics and subsequently working as a researcher in theoretical physics at Ludwig Maximilian University, Michael transitioned into applied AI research. As a lead AI researcher at appliedAI since 2020, he has spearheaded numerous innovative projects, including the application of reinforcement learning for controlling electron microscopes and the development of large language model-based applications. Michael has always been dedicated to education and knowledge sharing. He has taught courses ranging from theoretical physics and mathematics at the university level to specialized industry trainings in ML engineering, reinforcement learning, anomaly detection, and Bayesian methods. His ability to convey complex concepts clearly and effectively has made him a valued educator in both academic and professional settings. He co-founded Oraios AI in 2024.

Get your ticket now!

Coming Soon

Contact for inquiries