Online Workshop Every Week
Join our free weekly interactive learning sessions.
Master AI/ML with instant feedback and personalized learning
"Cogito, ergo sum" (I think, therefore I am)
β RenΓ© Descartes
Free Problems
Chapter 22 - Multi-Agent Reinforcement Learning Concepts
This problem set covers key concepts from Chapter 22 on Multi-Agent Reinforcement Learning (MARL), including environment setup, agent interactions, observation spaces, and training approaches. The problems progress from basic MARL concepts to advanced implementation details and analytical reasoning about multi-agent systems.
38 pts
Medium
102
reinforcement learning
multi-agent systems
agent interaction
+7
Chapter 21 - RL in Discrete Optimization - Rubik's Cube Applications
This problem set explores reinforcement learning applications in discrete optimization, specifically focusing on the Rubik's cube puzzle. Based on the autodidactic iteration (ADI) method from McAleer et al., these problems test understanding of state representations, neural network architectures, training processes, and Monte Carlo Tree Search (MCTS) for solving combinatorial optimization problems.
49 pts
Medium
94
group theory
combinatorics
rubiks cube
+7
Chapter 20 - AlphaGo Zero and MuZero Concepts
This problem set covers key concepts from Chapter 20 on AlphaGo Zero and MuZero model-based reinforcement learning methods. These methods revolutionized board game AI by enabling agents to improve through self-play without human knowledge. The problems test understanding of MCTS, neural network architectures, training processes, and the differences between model-based and model-free approaches.
35 pts
Medium
96
reinforcement learning
model-based learning
model-free learning
+7
Chapter 19 - Reinforcement Learning with Human Feedback
This problem set covers key concepts from Chapter 19 on Reinforcement Learning with Human Feedback (RLHF). You'll explore the motivation behind RLHF, its theoretical foundations, implementation details, and practical applications in both traditional RL environments and modern LLM training pipelines. The problems progress from basic conceptual understanding to advanced analytical thinking about RLHF systems.
33 pts
Medium
98
reinforcement learning
human feedback
machine learning
+7
Chapter 10 - Stocks Trading Using RL
This problem set covers key concepts from Chapter 10: Stocks Trading Using RL, which demonstrates how to apply reinforcement learning to financial trading. The problems test your understanding of the trading environment design, data representation, reward systems, and model architectures used in this practical RL application.
22 pts
Medium
101
reinforcement learning
rl components
stock trading
+7
Chapter 01 - Deep Reinforcement Learning Hards-On Chapter 1
This problem set covers the fundamental concepts of reinforcement learning from Chapter 1, including the differences between RL and other ML paradigms, Markov processes, reward systems, and the core components of RL systems. These questions test your understanding of the theoretical foundations that underpin modern reinforcement learning approaches.
13 pts
Medium
102
reinforcement learning
supervised learning
unsupervised learning
+7
Premium Problems
Knowledge Graphs
USA AI Olympiad
Explore competitive programming and AI contest preparation concepts
Grade 5 Math
Discover elementary mathematics concepts and learning paths
Featured PDFs
View All PDFsSystem Design Interview: An Insider's Guide Volume 2
116 questions
348 pts
System Design Interview: An Insider's Guide
108 questions
317 pts
UNICALLI: A UNIFIED DIFFUSION FRAMEWORK FOR COLUMN-LEVEL GENERATION AND RECOGNITION OF CHINESE CALLIGRAPHY
10 questions
38 pts
The Principles of Deep Learning Theory
107 questions
418 pts
Featured Books
View All BooksAcing the System Design Interview
153 questions
456 pts
Numerical Python: Scientific Computing and Data Science Applications with Numpy, SciPy and Matplotlib
190 questions
543 pts
Hands-On Machine Learning with Scikit-Learn and PyTorch
200 questions
554 pts
Deep Reinforcement Learning Hands-On - Third Edition
222 questions
720 pts
Featured Videos
View All VideosFlow-Matching vs Diffusion Models explained side by side
10 questions
29 pts
Attention in transformers, step-by-step | Deep Learning Chapter 6
10 questions
30 pts
Knowledge Distillation: How LLMs train each other
10 questions
27 pts
Diffusion Model
10 questions
32 pts
Popular Topics
machine learning
56
deep learning
40
neural networks
35
reinforcement learning
33
system-design
28
grade5
27
optimization
14
large language models
13
attention mechanisms
13
combinatorics
13
system-architecture
13
natural language processing
12
aime problems
12
Number Sense
12
scalability
11
beginner
10
number theory
10
performance
10
transformers
9
capacity-planning
9
Click on any tag to filter problems by that topic