I analyzed non-matriculated international students by using probabilistic Machine Learning. The tricky part about this dataset is that it would get overfitted easily since the dataset are time-series, and usually college decisions are determined in a steps-by-steps sequence (which means certain features are heavily depended on another feature). If we were to use normal classification approach, this would not work and will be overfitted.
Motivation: Since offering scholarships involves a significant cost, and resources are wasted on those who are accepted but don't end up signing the offer letter, a good way to contribute is to built an early prediction model to gain insights into this group. That way, we can leave the scholarship to the students who are actually in need.
Joint work with Dr. Kefei WangI implemented t-SNE and PCA to understand the correlations to understand what features have the most significant impact towards the rate of leaving university in STEM students, reduce dimensions to find a good fit, and implemented a Lasso Regression on the entire dataset.
Joint work with Dr. Longfei Zhou and Dr. Davide Piovesan
I worked on a project to build a serverless IPS system in AWS using ECS, EC2, API Gateway, Sagemaker.
Motivation: The goal of this project is to reduce as much cost as possible while maintaining the real-time nature of network. We implemnted light-LSTM, a model that could be deployed in ECS and inferenced using serverless endpoint, without too long warmup (~1,2 seconds.) Joint work with Dr. Ronny Bazan and Md. Anisur