Skip to content
Deep Learning Engineer · AI Researcher

Building intelligent systems that see, reason, and learn.

I research vision transformers, multimodal learning, and large-scale model training. I turn cutting-edge papers into shipped, production-grade AI.

View ResearchRead CV
40+
Models trained
12k+
GPU hours
7
Papers
3
SOTA tasks
01 — About

Researcher. Builder. Practitioner.

I bridge cutting-edge AI research with engineering rigor — building deep learning systems that actually ship.

Profile photo
Vision Transformers · Self-Supervised Learning · Multimodal Reasoning
MD. Faysal Islam Fahad
Currently
Senior AI Researcher
Based in
Remote · Worldwide

I am a Deep Learning Engineer and AI researcher working at the intersection of computer vision, multimodal learning, and large-scale model training. My work focuses on pushing transformer architectures into resource-constrained, real-world settings — from medical imaging to autonomous perception. I enjoy turning recent papers into production systems, and shipping models that survive contact with real data.

01

Trained ViT-B/16 from scratch on a custom 5M image dataset

02

Published at top-tier vision conferences (CVPR / ICCV workshop track)

03

Led on-device CV inference on Jetson Orin (sub-30ms latency)

04

Open-source contributor with 1.5k+ GitHub stars across repos

03 — Projects

AI research, shipped.

A selection of deep learning case studies — from architecture design to deployment-grade inference pipelines.

Featured
Computer Vision

ViT for Medical Imaging

A vision transformer that detects pathology in chest X-rays at radiologist-level accuracy.

0.912
AUC (CheXpert)
0.86
F1 (Pneumonia)
1,200
GPU hours
vision-transformermedical-imagingself-supervisedclassification
04 — Skills

The deep learning toolbox.

Frameworks, models, and ops I use daily — calibrated by years of training, debugging and shipping.

Deep Learning

3 skills
  • Vision Transformers92%
  • Self-Supervised Learning88%
  • Diffusion Models85%

Computer Vision

3 skills
  • CNN Architectures93%
  • Object Detection90%
  • Image Segmentation88%

NLP & Multimodal

2 skills
  • Multimodal Models82%
  • LLM Fine-tuning80%

ML Frameworks

4 skills
  • PyTorch95%
  • Hugging Face90%
  • TensorFlow85%
  • JAX75%

Languages

3 skills
  • Python96%
  • TypeScript82%
  • CUDA70%

Tools & Ops

2 skills
  • Weights & Biases88%
  • Docker86%

Cloud

1 skill
  • AWS80%

Web

2 skills
  • FastAPI85%
  • Next.js80%
05 — Publications

Peer-reviewed contributions.

Papers, workshops and preprints — the slowest-moving but most rewarding part of the work.

2025
01
Workshop
CVPR Workshop on Efficient Deep Learning
Featured

Token-Aware Vision Transformers for Efficient Inference

MD. Faysal Islam Fahad, A. Chen, J. Smith

We present a learnable token-pruning module that adapts the computation graph of a Vision Transformer at inference.

2024
02
Workshop
MICCAI Workshop

MAE Pretraining Across Medical Modalities

MD. Faysal Islam Fahad, R. Patel

Cross-modality masked autoencoder pretraining for medical image classification.

2024
03
Preprint
arXiv preprint

Distilling YOLO for Edge Robotics

MD. Faysal Islam Fahad

A study of distillation strategies for compact object detectors targeting embedded hardware.

06 — Experience

A timeline of the work.

Roles, programs and research positions that shaped how I think about AI.

Senior AI Researcher
Frontier AI Lab
Jan 2024 — Present
Remote

Leading research on efficient multimodal models and edge deployment.

  • Architected the lab's ViT distillation pipeline
  • Mentored 3 junior researchers
PyTorchTritonCUDAWeights & Biases
Deep Learning Engineer
Vision Robotics Inc.
Mar 2022 — Dec 2023
Berlin / Remote

Shipped real-time perception models for warehouse robotics.

  • Reduced detection latency 3.5×
  • Owned the on-device CV stack on Jetson Orin
TensorRTONNXC++Python
M.Sc. in Artificial Intelligence
Tech University
Sep 2020 — Feb 2022
Munich

Master's degree with thesis on self-supervised learning for medical imaging.

  • Graduated with distinction
  • Published a workshop paper
PyTorchPandasLaTeX
07 — Achievements

Awards & recognition.

Selected highlights — the moments that pushed the work forward.

Best Paper - Workshop Track

CVPR Workshop on Efficient DL

Awarded best paper for our work on token-aware Vision Transformers.

Jun 2025

NVIDIA AI Hackathon Winner

NVIDIA

Won first place for an autonomous drone perception stack.

Mar 2024

Top 0.3% Kaggle Competitor

Kaggle

Ranked Master tier across image classification competitions.

Aug 2023
08 — Contact

Let's build something intelligent.

Whether it's a research collaboration, a hard CV problem, or a deep learning hire — I'd love to hear about it.

Send a message
Typical reply within 24h

Minimum 10 characters · Max 4,000.