RL Proof Bot
Scrape database for math questions.
Feed questions into math proof bot to output proofs.
Feed proofs into a cost function that checks correctness of the proofs.
Use the cost function to update the math proof bot with reinforcement learning.