The Gym environment

1- Environment (gym_classification)
2- Main code (RL_MNIST_example)
This code is a reinforcement learning (RL) environment that implements the OpenAI Gym interface so that classification models can be developed using the latest RL algorithms.
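For orientation, the idea of treating classification as an RL problem can be sketched as follows. This is a minimal illustration, not the actual gym_classification code: the class name ClassificationEnvSketch, the +1/-1 reward scheme, and the one-pass episode are assumptions made for this example. It only mirrors the classic Gym reset()/step() convention (observation, reward, done, info) and does not import gym, so that it runs on its own.

```python
import random


class ClassificationEnvSketch:
    """Hypothetical stand-in for a classification environment.

    Mirrors the classic Gym convention: reset() returns an observation,
    step(action) returns (observation, reward, done, info).
    """

    def __init__(self, samples, labels, seed=0):
        assert len(samples) == len(labels) and len(samples) > 0
        self.samples = samples
        self.labels = labels
        self.rng = random.Random(seed)
        self.order = list(range(len(samples)))
        self.pos = 0

    def reset(self):
        # Shuffle the dataset and return the first sample as the observation.
        self.rng.shuffle(self.order)
        self.pos = 0
        return self.samples[self.order[self.pos]]

    def step(self, action):
        # The action is the predicted class label for the current sample.
        idx = self.order[self.pos]
        # Assumed reward scheme: +1 for a correct label, -1 otherwise.
        reward = 1.0 if action == self.labels[idx] else -1.0
        self.pos += 1
        # Assumed episode length: one pass over the dataset.
        done = self.pos >= len(self.order)
        # After the last sample, keep returning it so the observation stays valid.
        next_idx = self.order[min(self.pos, len(self.order) - 1)]
        info = {"true_label": self.labels[idx]}
        return self.samples[next_idx], reward, done, info


if __name__ == "__main__":
    # Toy data: each "image" is a one-element list whose value equals its label.
    env = ClassificationEnvSketch([[0.0], [1.0], [2.0]], [0, 1, 2])
    obs, done, total = env.reset(), False, 0.0
    while not done:
        obs, reward, done, _ = env.step(int(obs[0]))  # a perfect "classifier"
        total += reward
    print(total)
```

In this framing the agent's action is the predicted label, so any RL algorithm for discrete actions could in principle be trained against it. With MNIST, each observation would be an image and the action space would have ten classes.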
Requirements:
1- The methodology should include:
• Describe, in full sentences, the methodology that was used, and support the description with recent references (2017 to the present).
• Which specific kind of Gym environment is used? Describe it in a few sentences, supported by recent references (2017 to the present). As an example only (the actual environment may be different): the CartPole-v1 environment modified to be suitable for classifying images……
2- Length: one page, single-spaced, not counting the references. The references should be on a separate page.