Masud Ahmed

Graduate Research Assistant
University of Maryland, Baltimore County

Welcome to my personal webpage! I am an enthusiastic Ph.D. candidate at the University of Maryland, Baltimore County, specializing in Artificial Intelligence and Machine Learning. Under the guidance of Dr. Nirmalya Roy in the Department of Information Systems, I am honing my skills in the cutting-edge fields of generative modeling, domain adaptation, and various forms of learning, including continual, self-supervised, and active learning. As a member of the Mobile, Pervasive and Sensor Computing (MPSC) Lab, I thrive in collaborative settings and am passionate about exploring theoretical and application-driven research.

To download my CV click here

Education

Ph.D. in Information Systems (January 2020 - Present)

University of Maryland, Baltimore County
Supervisor: Dr. Nirmalya Roy, Professor
CGPA: 3.90/4.00

B.Sc. in Electrical and Electronic Engineering (January 2014 - April 2018)

University of Dhaka
Supervisor: Dr. Md Atiqur Rahman Ahad, Professor
CGPA: 3.18/4.00

To download my transcripts, click the following links (authorization required):
B.Sc. transcript
Ph.D. transcript (unofficial)

Research Areas

Theoretical

Domain Adaptation, Continual Learning, Self-Supervised Learning, Active Learning, Foundation Models, Transformers, Large Language Models, Large Vision Models

Application

Computer Vision, Natural Language Processing, Healthcare, Robotics, Wearable Device Data Analysis, Sensor Data Analysis

Programming Languages

Python, C++, C, SQL (Oracle), MATLAB, HTML, R, ROS (Robot Operating System)

PyTorch, HuggingFace Transformers, JAX, TensorFlow, spaCy

Dataset

CAD-EdgeTune

MPSC Multi-view Dataset

Projects

Transformer-based LIDAR Semantic Segmentation Through Vector Quantization

Explored the application of Vector Quantization (VQ) techniques to LIDAR semantic segmentation, addressing challenges in generalization and interpretability present in traditional models

Proposed a novel approach using Vector Quantized Variational Autoencoders (VQ-VAE) to encode LIDAR point cloud data into a discrete and compact codebook representation

Leveraged an autoregressive transformer model to generate high-quality semantic segmentation from the quantized representation

Employed video prompting techniques, enabling the model to also generate LIDAR point clouds, expanding its versatility for various autonomous system applications

Active Learning for Semantic Segmentation in Mobile Robotics

Developed a real-time framework for actively selecting informative regions in visual data for continual learning in semantic segmentation

Applied entropy-driven ranking with a cyclical feedback loop

Reduced data transfer overhead, improving model performance with minimal labeled data

Collected an RGB dataset on the UMBC campus under different lighting conditions (noon, dawn, and dusk)

Semantic Clustering Innovation: Novel Categories Discovery (NCD)

Develop an NCD-based algorithm for clustering novel data based on known class semantics, overcoming pseudo-labeling limitations

Leverage data sampling and a multinoulli distribution for implicit semantic clustering without extensive annotations

Align class neuron activation distributions through Monte-Carlo sampling, explore directional statistics, and conduct ablation studies to advance state-of-the-art clustering approaches

Learning the Optical & Physiological Mechanics of rPPG with Self-Supervision

In this computational biology project, proposed a self-supervised learning approach for estimating heart rate from remote photoplethysmography (rPPG) signals obtained from skin videos without the need for synchronized ground truth annotations

Developed a contrastive learning-based pretraining strategy to learn the underlying diffusion signals' frequency, phase, and temporal coherence from unlabeled video frame sequences

Distributed Collaborative Robotics and Federated Learning in Vision

Developed a framework for Federated Class-Incremental Learning (FCIL) that enables collaborative training of machine learning models across geographically distributed agents without sharing raw data

Combined virtual simulations and real-world data collected from multiple physical sites, enabling domain adaptation to learn from both simulated and real environments

Improved decision-making capabilities in real-time by enabling agents to adapt to evolving environments and data streams, reducing reliance on extensive real-world data collection

Strata and Viewpoint Invariant Encoding for Robust Video Action Recognition

Address the challenge of robust video action recognition (VAR) in diverse settings with varying viewpoints and sensors

Propose a joint optimization method leveraging contrastive and adversarial losses for learning sensor- and viewpoint-invariant representations from unlabeled synchronous multiview (MV) video data

Collect a large-scale, time-synchronous MV video dataset encompassing diverse settings, actions, viewpoints, and sensor properties

Publications

Google Scholar profile link

ResearchGate profile link

Book

Md Atiqur Rahman Ahad, Anindya Das Antar, Masud Ahmed, "IoT Sensor-Based Activity Recognition - Human Activity Recognition," Springer Nature.

Preview of the book

Journal Paper

Anindya Das Antar, Masud Ahmed, Md Atiqur Rahman Ahad, "Recognition of human locomotion on various transportations fusing smartphone sensors," Pattern Recognition Letters, 2021.

Conference Paper

arXiv Preprint Paper

Work & Research Experiences

Center for Real-time Distributed Sensing and Autonomy, University of Maryland Baltimore County

MPSC Lab, Department of Information Systems, University of Maryland Baltimore County

Yagi Laboratory, Department of Intelligent Media, ISIR, Osaka University

Joykoly Publication Ltd.

Fab Lab, University of Dhaka

Sakura Science Program, Osaka Prefecture University

Additional Information

Global Competition Awards

Community Involvement

Skills

Contact