Monoshiz Mahbub Khan

(মনসিজ মাহবুব খান)

Hello There!

My name is Monoshiz Mahbub Khan. I am a final year PhD candidate in the PhD in Computing and Information Sciences program at Rochester Institute of Technology, having started in Fall 2021. I have also worked as a Graduate Research Assistant under Dr. Zhe Yu at hil-se lab. My research and internship have primarily focused on using traditional NLP and ML methods, and contemporary deep learning based methods to build ranking and classification tools to support software engineering tasks, including code search and story point estimation. I also have experience building ML pipelines in a cross-domain production setting through an internship at ABB.

I am interested working on AI/ML based projects with tangible and visible outcomes across domains. I am currently looking for full-time opportunities starting from September 2026.

Feel free to reach out. I’d love to collaborate, network and build new opportunities together!

‣

Education

‣

Work Experience

‣

Publications

‣

Research Experience

Code Search (2021-2024) Research project focusing on retrieving programming language artifacts related to some natural language queries from a pool of possible programming language artifacts and ranking them by relevance, using dual encoder models. Model is built in Python using TensorFlow and Keras modules. This model showed an average improvement of 10.03% over state-of-the-art methods in terms of MRR scores. The research was conducted under the guidance of Dr. Zhe Yu. This work has been published in Empirical Software Engineering (EMSE) Journal and presented at FSE 2025 in the journal-first track. https://github.com/hil-se/CodeSearch

Comparative Learning (2023-2026) Research project focusing on modeling learning comparative judgments for Agile story point estimation through machine learning and human subject experiments. Machine learning experiments involved building a model to learn from pairwise story point data and rank them. These experiments involved using GPT2, SBERT, FastText language models and traditional machine learning methods. The framework was built using TensorFlow modules. The proposed model showed an average increase of 21.84% in Spearman’s rank correlation coefficient scores over state-of-the-art models. The research has been conducted under the guidance of Dr. Zhe Yu.

https://github.com/hil-se/EfficientSPEComparativeLearning

Explainable image classification (2024)

Image processing and Explainable AI-based research project focusing on explaining a pre-trained VGG model's classification decisions on face image data. This work involved fine-tuning a pre-trained VGG model on SCUT face image data for classification, and using the model's gradients on the images to explain why the model made those decisions.

Comparative learning for face image attractiveness (2024)

Research project focused on modeling comparative learning on face image data. This work involved using the comparative judgment framework with a pre-trained VGG model as the encoder to predict a ranked preference order for the images.

Comparative learning for image captioning (2024 - 2026)

Research project focused on modeling comparative learning on image and associated caption data. This work involved using the comparative judgment framework on this multi-modal data to predict whether a paired image and text caption are likely to be connected.

Outdated comment detection for repository commits (2024 - 2025)

Research project focused on detecting whether the comment associated with repository commits are up-to-date or outdated after new commits. This work involved the use of various deep learning structures, including dual encoders.

Modeling Art Evaluations from Comparative Judgments (2024 - 2026)

Research project focused on modeling comparative learning on image data. This work involved using the comparative judgment framework on image based data to evaluate direct and comparative judgments on image data.

Bangla Abstractive Text Summarization using Encoder-Decoder Model (2019-2020) A research project on constructing a dataset for the task of abstractive text summarization in Bangla, and constructing a deep learning based model capable of using said dataset. The model was written in Python using Tensorflow modules. The research was conducted as the final year research project at University of Dhaka under the supervison of Dr. Muhammad Asif Hossain Khan. https://github.com/monoshizmkhan/Bangla-Abstractive-Text-Summarization

‣

Mentorship & Supervision Experience

‣

Other Past Projects

Personal projects

MLOps and Data Pipeline Projects (2025) Small toy project to learn and brush up on several tools, including - MLflow, Airflow, PySpark. https://github.com/monoshizmkhan/BostonToyProjects/

LLM and RAG Projects (2025) Small toy project to learn fine-tuning an LLM (GPT2) and implementing a RAG. Planned future parts of this project include using LangChain modules. https://github.com/monoshizmkhan/LLM-Experiments/

Course projects

Kabaddi (2016) A single or multiplayer video game based on the sport of the same name. Written in C++ as the Fundamentals of Programming Lab project at University of Dhaka. https://github.com/monoshizmkhan/Kabaddi
Trapped (2017) A single or online multiplayer interactive puzzle game. Written in JAVA as the Object Oriented Programming Lab project at University of Dhaka. https://github.com/monoshizmkhan/Trapped
Musyc (2018) A music-based social networking application with a built-in offline music player on Android platform. Made using JAVA and SQLite as the Application Development Lab project at University of Dhaka. https://github.com/monoshizmkhan/Musyc
EasyML (2018) A web-based application for the purpose of applying and visualizing several machine learning algorithms on datasets. Written using Python and JavaScript Served as the project in the course Software Engineering Lab at University of Dhaka. https://github.com/Saad-Mahmud/EasyML
Pharmassistant (2018) A software as a user interface for the use of online product inventory, searching, sales and finances management by employees of a pharmacy. Written in Python and JavaScript as the Software Design Patterns Lab project at University of Dhaka. https://github.com/HHMoon13/Pharmassist
CSEDU Project Hub (2019) A web application for the purpose of storing, sharing and viewing undergrad research projects. Written in Python and JavaScript as the Internet Programming Lab project at University of Dhaka.

BackPack (2022) An e-store for selling hiking, camping and miscellaneous equipment and equipment collections (known on the store as backpacks). Written using Java Spring and Angular frameworks as the Foundations of Software Engineering course project at Rochester Institute of Technology.

‣

Technical Strengths

‣

Scholarships

‣

Volunteering Experience

Programming languages	Python, JAVA, R, C, C++, JavaScript
Machine Learning & AI	TensorFlow, Keras, PyTorch, scikit-learn, LLM fine-tuning, RAG
MLOps & Data Engineering	MLflow, Airflow, PySpark, Docker
Frameworks & Databases	Flask, Spring, Angular, SQL (Oracle, SQLite), NoSQL (mongoDB)
Tools & Methodologies	Git (GitHub, Azure DevOps), LaTeX, Agile, Scrum