MockSphere - 🧠 AI/ML Online Judge

Online Workshop Every Week

Join our free weekly interactive learning sessions.

Master AI/ML with instant feedback and personalized learning

"Cogito, ergo sum" (I think, therefore I am)

— René Descartes

Free Problems

Chapter 01 - Improving LLM Accuracy

This problem set covers key concepts from Chapter 1 on improving LLM accuracy, including LLM limitations, retrieval-augmented generation (RAG), knowledge graphs, and strategies for overcoming LLM constraints. These problems test your understanding of how LLMs work, their inherent limitations, and practical approaches to enhance their accuracy and reliability in real-world applications.

28 pts Medium 103 llm training andrew karpathy machine learning +7

AgentFlow by Stanford - Multi-Agent System Optimization

This problem set explores the AgentFlow framework developed by Stanford researchers, which demonstrates how a 7B parameter agent can outperform much larger 200B LLMs through optimized multi-agent system design. The problems cover key concepts including agentic system architecture, reinforcement learning optimization, tool integration, and the limitations of current approaches.

55 pts Medium 91 agent flow framework multi-agent systems system optimization +7

Proximal Policy Optimization (PPO) for LLMs Explained Intuitively

This problem set tests your understanding of Proximal Policy Optimization (PPO) for Large Language Models based on the intuitive explanation from the YouTube video. The problems cover core reinforcement learning concepts, PPO algorithm components, and practical implementation considerations for aligning LLMs with human preferences.

26 pts Medium 98 reinforcement learning llms policy optimization +7

DeepSeek's GRPO (Group Relative Policy Optimization) | Reinforcement Learning for LLMs

This problem set explores DeepSeek's GRPO (Group Relative Policy Optimization), a reinforcement learning algorithm for fine-tuning Large Language Models (LLMs). GRPO is designed to be more computationally efficient than traditional methods like PPO while maintaining effectiveness for reasoning tasks. These problems will test your understanding of where GRPO fits in the LLM training pipeline, its core mechanisms, and how it compares to other reinforcement learning approaches.

48 pts Hard 100 reinforcement learning llm training policy optimization +7

Chapter 19 - Reinforcement Learning Fundamentals

This problem set covers key concepts from Chapter 19 on Reinforcement Learning, including policy gradients, value-based methods, actor-critic algorithms, and practical implementations using Gymnasium and PyTorch. The problems progress from fundamental concepts to advanced implementations and analytical thinking.

28 pts Medium 98 reinforcement learning agent environment interaction reward function +7

Chapter 18 - Autoencoders, GANs, and Diffusion Models

This problem set covers key concepts from Chapter 18 on Autoencoders, GANs, and Diffusion Models. These unsupervised learning techniques are used for dimensionality reduction, feature extraction, anomaly detection, and generative modeling. The problems progress from basic concepts to advanced implementation details, testing your understanding of how these models work, their applications, and their relative strengths and weaknesses.

31 pts Medium 94 autoencoders neural networks deep learning +7

Premium Problems

Python I/O and Data Pipeline Assessment - Part 4

20 questions focused on PyTorch Dataset/DataLoader design: map/iterable datasets, transforms, custom collate/padding, worker seeding/sharding, num_workers/pin_memory/prefetch_factor, caching, memmap/shared memory, batching by size, profiling, and performance tuning.

10.00 60 pts Medium 98 torch.utils.data.dataset pytorch dataset +7

Chapter 02 - Numeric Python

This problem set covers key concepts from Chapter 2: Vectors, Matrices, and Multidimensional Arrays. The problems test understanding of NumPy array fundamentals, including array creation, indexing, slicing, operations, and vectorized computing. Each question is designed to reinforce the core concepts presented in the chapter.

5.00 26 pts Medium 97 numpy-arrays array-attributes shape +7

USAAIO 2025 R1P3 - Logistic Regression Implementation

This problem focuses on implementing logistic regression from scratch using the Titanic dataset. You will work through data pre-processing, mathematical derivations, and implement both gradient descent and Newton's method for logistic regression. The dataset contains passenger information from the Titanic, and your goal is to predict survival based on various features.

10.00 48 pts Easy 93 data-loading pandas data-exploration +7

USAAIO 2025 R1P2 - Basics of Neural Network - From Linear Regression to DNN Training

This problem is about the basics of neural network. Each part has its particular purpose to intentionally test you something. Do not attempt to find a shortcut to circumvent the rule. And all coding tasks shall run on CPUs, **not GPUs**.

10.00 36 pts Easy 96 learning-rate-scheduler pytorch optimization +12

USAAIO 2025 R1P1 - Fibonacci Matrix Form

Let us consider the following sequence: $$ F_n = F_{n-1} + F_{n-2},\ \forall\ n \ge 2. $$

8.00 27 pts Medium 96 fibonacci sequence linear algebra matrix form +7

IAIO 2024 Part 2 - Machine Learning Algorithms and Deep Learning

This problem covers the remaining categories of the 2024 International Artificial Intelligence Olympiad (IAIO), focusing on machine learning algorithms and deep learning. You'll work through practical implementations of k-means clustering, deep learning architectures, and advanced machine learning theory including kernel methods and the Perceptron algorithm. The problems cover: - K-means clustering algorithm implementation and convergence - Deep learning architectures (DALL-E, Transformers) - Perceptron algorithm and kernel methods - Mathematical proofs and theoretical analysis - Parameter counting and computational complexity

10.00 44 pts Hard 99 k-means clustering euclidean distance machine learning +7

Knowledge Graphs

USA AI Olympiad

Explore competitive programming and AI contest preparation concepts

View Full Graph

Grade 5 Math

Discover elementary mathematics concepts and learning paths

View Full Graph

Featured PDFs

View All PDFs

System Design Interview: An Insider's Guide Volume 2

116 questions 348 pts

System Design Interview: An Insider's Guide

108 questions 317 pts

UNICALLI: A UNIFIED DIFFUSION FRAMEWORK FOR COLUMN-LEVEL GENERATION AND RECOGNITION OF CHINESE CALLIGRAPHY

10 questions 38 pts

The Principles of Deep Learning Theory

107 questions 418 pts

Featured Books

View All Books

Acing the System Design Interview

153 questions 456 pts

Numerical Python: Scientific Computing and Data Science Applications with Numpy, SciPy and Matplotlib

190 questions 543 pts

Hands-On Machine Learning with Scikit-Learn and PyTorch

200 questions 554 pts

Deep Reinforcement Learning Hands-On - Third Edition

222 questions 720 pts

Online Workshop Every Week

Free Problems

Chapter 01 - Improving LLM Accuracy

AgentFlow by Stanford - Multi-Agent System Optimization

Proximal Policy Optimization (PPO) for LLMs Explained Intuitively

DeepSeek's GRPO (Group Relative Policy Optimization) | Reinforcement Learning for LLMs

Chapter 19 - Reinforcement Learning Fundamentals

Chapter 18 - Autoencoders, GANs, and Diffusion Models

Premium Problems

Python I/O and Data Pipeline Assessment - Part 4

Chapter 02 - Numeric Python

USAAIO 2025 R1P3 - Logistic Regression Implementation

USAAIO 2025 R1P2 - Basics of Neural Network - From Linear Regression to DNN Training

USAAIO 2025 R1P1 - Fibonacci Matrix Form

IAIO 2024 Part 2 - Machine Learning Algorithms and Deep Learning

Knowledge Graphs

USA AI Olympiad

Grade 5 Math

Featured PDFs

System Design Interview: An Insider's Guide Volume 2

System Design Interview: An Insider's Guide

UNICALLI: A UNIFIED DIFFUSION FRAMEWORK FOR COLUMN-LEVEL GENERATION AND RECOGNITION OF CHINESE CALLIGRAPHY

The Principles of Deep Learning Theory

Featured Books

Acing the System Design Interview

Numerical Python: Scientific Computing and Data Science Applications with Numpy, SciPy and Matplotlib

Hands-On Machine Learning with Scikit-Learn and PyTorch

Deep Reinforcement Learning Hands-On - Third Edition

Featured Videos

Flow-Matching vs Diffusion Models explained side by side

Attention in transformers, step-by-step | Deep Learning Chapter 6

Knowledge Distillation: How LLMs train each other

Diffusion Model

Popular Topics

🚀 Join MockSphere Today!

Unlock Your AI/ML Learning Potential

Online Workshop Every Week

Free Problems

Chapter 01 - Improving LLM Accuracy

AgentFlow by Stanford - Multi-Agent System Optimization

Proximal Policy Optimization (PPO) for LLMs Explained Intuitively

DeepSeek's GRPO (Group Relative Policy Optimization) | Reinforcement Learning for LLMs

Chapter 19 - Reinforcement Learning Fundamentals

Chapter 18 - Autoencoders, GANs, and Diffusion Models

Premium Problems

Python I/O and Data Pipeline Assessment - Part 4

Chapter 02 - Numeric Python

USAAIO 2025 R1P3 - Logistic Regression Implementation

USAAIO 2025 R1P2 - Basics of Neural Network - From Linear Regression to DNN Training

USAAIO 2025 R1P1 - Fibonacci Matrix Form

IAIO 2024 Part 2 - Machine Learning Algorithms and Deep Learning

Knowledge Graphs

USA AI Olympiad

Grade 5 Math

Featured PDFs

System Design Interview: An Insider's Guide Volume 2

System Design Interview: An Insider's Guide

UNICALLI: A UNIFIED DIFFUSION FRAMEWORK FOR COLUMN-LEVEL GENERATION AND RECOGNITION OF CHINESE CALLIGRAPHY

The Principles of Deep Learning Theory

Featured Books

Acing the System Design Interview

Numerical Python: Scientific Computing and Data Science Applications with Numpy, SciPy and Matplotlib

Hands-On Machine Learning with Scikit-Learn and PyTorch

Deep Reinforcement Learning Hands-On - Third Edition

Featured Videos

Flow-Matching vs Diffusion Models explained side by side

Attention in transformers, step-by-step | Deep Learning Chapter 6

Knowledge Distillation: How LLMs train each other

Diffusion Model

Popular Topics