In MDP, I've worked on HMM or Hidden Markov Model generally used for state estimation in robotics. 2nd, I've used MCMC or Markov Chain Monte Carlo in reinforcement learning. The Markov Decision Process is the primary research focus as I'm working on estimating dynamics of objects in mobile robots path planning. Further, using convex optimization like CVXOpt for finding a feasible solution. I also use Taylor-Series approximations quite few times.
PLATFORM
MATLAB
Python
REGARDING YOUR PROJECT
Since you've not described anything, placing a budget or timeline is not reasonable, although keeping in mind a graduate-level project, I've placed a bid.
1) Kindly prepare a brief with all details
2) Put down exact deliverables
3) Attach documents and material related to it
TIMELINE
I shall prepare a TIMELINE pdf and discuss what exactly we shall complete, goals, deliverables, milestone payments and track progress. If it's 4 days, we divide into 4 milestones and update every 24 hours.
I'm a research scholar in machine learning and robotics and take projects along my research area. Kindly initiate a chat with the above details. If not online, shall reply ASAP.
Thank you!