Prof by Lex AI Labs — home
Profby Lex AI
EARLY ACCESS

Shipping daily.

Last shippedMAY 20

New curriculum map — see how all 7 courses build on each other, search any lesson with /.

View what’s new→
COMING NEXT
  • Adaptive quizzes
  • Tutor v2.0
  • More company interview guides
  • Voting board for content
Request a feature or report a bug→
CurriculumPracticeFor OrganizationsPricing
Sign InStart Learning
← Practice

reinforcement-learning

8 problems

  • Bellman equation for value iterationMedium
  • GRPO objectiveHard
  • Policy gradient with REINFORCEMedium
  • PPO clipped objectiveMedium
  • Prioritized experience replayMedium
  • Q-learning for MDPsMedium
  • TD(0) value updateEasy
  • UCB1 multi-armed banditEasy