Computer Vision 👁️‍🗨️

Thyroid Cancer Identification with Deep Learning

In my research, I used deep learning to improve identification of cancer. I started with using a modified VGG-19 network. Later on, I used 3-dimensional vision transformer for this task. I adapted the system to train spectral images with great success and found improvement over regular RGB images.

Tissue

Personal Contributions 👨🏻‍💻

  • Developed software in C++ to control imaging microscope and capture 100K images of thyroid
  • Used OpenCV to identify and remove 25% of images that are whitespace or out-of-focus
  • Trained a vision transformer in PyTorch to detect thyroid cancer with 0.906 AUC
  • Used Amazon SageMaker and wandb to monitor training and select hyperparameters
  • Presented research result orally at the 2022 SPIE Medical Imaging Conference

Technology Used ⚙️ Python (PyTorch, pandas, OpenCV, PIL, sklearn, matplotlib), AWS, wandb, C++ (OpenCV)

Publication Link 📝

U-Net for Image Segmentation

Implementation of U-Net for various image segmentation and image reconstruction tasks. To validate my network, I generated synthesized segmentation data using OpenCV.

UNet

Output

Technology Used ⚙️ Python (PyTorch, OpenCV, PIL, sklearn, matplotlib),

GitHub Link 💻 View on Github

Stanford CS231n: Convolutional Neural Networks for Visual Recognition

Solutions for Stanford CS231n coursework in Spring 2020.

In this course, I used Python, NumPy, TensorFlow, and Keras to implement the following:

  • Multi-layered networks
  • Regularization techniques (batchnorm, dropout)
  • Convolutional neural networks (CNNs)
  • Recurrent neural networks (RNNs)
  • Long-short term memory networks (LSTMs)
  • Style Transfer

GitHub Link 💻 View on Github

Machine Learning 👩‍🏫

Sleep Quality Analysis

Sleep scoring is used to judge the quality of sleep and identify potential problems (sleep apnea, for example). In this study performed at Northwestern University, I used clustering to automatically classify 30-second epochs of sleep into different stages.

Brain

Stages

Personal Contributions 👨🏻‍💻

  • Downloaded and analyzed the ISRUC-SLEEP Dataset using MATLAB
  • Analyzed 100 recordings totalling 500 hours.
  • Used wavelet transform to extract relevant features.
  • Used hierarchical clustering to cluster sleep stages with 84% accuracy

Technology Used ⚙️ MATLAB, Python (SciPy, OpenCV, pandas, matplotlibt)

Posture Detection

At the University of Wyoming, I performed two studies on using inertial measurement units (IMUs) to collect kinematic data from an exoskeleton wearer.

Personal Contributions 👨🏻‍💻

  • Planned and conducted scientific studies on 17 volunteers wearing exoskeletons and inertial measurement units (IMU) sensors.
  • Analyzed 20 hours of sensor data taken during various activities
  • Reduced input feature space by a factor of 5 using stepwise forward selection method
  • Used Naïve Bayes classifier to classify five activities (squatting, slouching, sitting, standing, and walking) with 92.2% accuracy.
  • Presented research result at the IEEE Conference on Rehabilitation Robotics

Technology Used ⚙️ MATLAB, R

Publication Link 📝

[In Progress] Machine Learning From Scratch

Implementation from scratch using NumPy and PyTorch popular machine learning and deep learning algorithms:

  • Machine learning algorithms
    • Linear regression
    • Decision tree
    • K-nearest neighbors
  • Deep learning architechtures
    • ResNet
    • U-Net
    • MobileNet
    • SqueezeNet
    • Vision transformers

GitHub Link 💻 View on Github

Natural Language Processing 🗪

Question-Answering Network

QA

I used BERT, a language model pretrained on hundreds of millions of natural language text, for the task of implementing a question-answering network. The network is finetuned on BioASQ, a dataset of biomedical questions.

Technology Used ⚙️ Python (HuggingFace, Gensim, SpaCy, sklearn), AWS EC2.

GitHub Link 💻 View on Github

[In Progress] CS224n: Natural Language Processing with Deep Learning

Solutions to answers for CS224n. In this course I used PyTorch to implement:

  • Word embedding
  • Attention models (transformers)
  • Question-answering networks
  • Language model pipelines (sentiment analysis)

Publications 📝

Journal/Conference Papers

  • Tran, Minh Ha, et al. “Thyroid carcinoma detection on whole histologic slides using hyperspectral imaging and deep learning.” Medical Imaging 2022: Digital and Computational Pathology. Vol. 12039. SPIE, 2022.
  • Ma, Ling, et al. “Unsupervised super-resolution reconstruction of hyperspectral histology images for whole-slide imaging.” Journal of Biomedical Optics 27.5 (2022): 056502.
  • Tran, Minh Ha, et al. “Toward real-world evaluations of trunk exoskeletons using inertial measurement units.” 2019 IEEE 16th International Conference on Rehabilitation Robotics (ICORR). IEEE, 2019.
  • Goršič, Maja, Minh Ha Tran, and Domen Novak. “A novel virtual environment for upper limb rehabilitation.” 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, 2018.

Blog/Technical Writings

I wrote almost exclusively for Towards Data Science, a technical blog aimed at data science community.