Education
Degree | University | GPA | Thesis |
---|---|---|---|
MSc in Computer Engineering (Artificial Intelligence & Robotics) | Sharif University of Technology (SUT) | 18.32/20 (4.00/4.00) | Blind Image Super-Resolution using Deep Generative Neural Network Architectures |
BSc in Computer Engineering | Amirkabir University of Technology (AUT) | 19.09/20 (3.96/4.00) | Graph-based Convolutional Multivariate Time Series Forecasting Approach for Urban Traffic Forecasting |
Research Interests
- Deep Generative Models (VAEs, GANs, Diffusion Models)
- Image and Video Segmentation
- Image/Video Restoration
- 3D Vision Reconstruction (e.g., NeRF, Gaussian Splatting)
- Computer Vision
- Natural Language Processing
- Graph Neural Networks
Publications and Ongoing Work
A Comprehensive Survey on Knowledge Distillation
Preprint published on ArXiv, March 2025 (ArXiv)CLBSR: A Deep Curriculum Learning-based Blind Image Super-Resolution Network using Geometrical Prior
Published in Image and Vision Computing Journal, February 2025 (DOI)KeyVIS: Improving Weakly-supervised Video Instance Segmentation Using Keypoints Consistency
Submitted to Computer Vision and Image Understanding Journal, November 2024GSCINet: Graph-based Convolutional Multivariate Time Series Forecasting Approach for Urban Traffic Forecasting
In preparation
For a full list of my publications, visit my Google Scholar.
Teaching Assistance
Date | Course | Supervisor | University |
---|---|---|---|
Spring 2025 | Digital Image Processing | Prof. Kasaei | SUT |
Fall 2024 | Deep Learning | Prof. Beigy | SUT |
Fall 2024 | Advanced 3D Computer Vision | Prof. Kasaei | SUT |
Spring 2024 | Fundamental of 3D Computer Vision | Dr. Naderi | SUT |
Spring 2023 | Data Mining | Prof. Nazerfard | AUT |
Spring 2023 | Applied Linear Algebra | Prof. AmirMazlaghani | AUT |
Spring 2022 | Signals and Systems | Dr. TermehChi | AUT |
Fall 2022 | Applied Linear Algebra | Prof. Nazerfard | AUT |
Fall 2022 | Data Structures and Algorithms | Prof. Shirali Shahreza | AUT |
Professional Experience
Position | Organization | Duration | Description |
---|---|---|---|
Data Scientist | Bale Messenger | May 2023 - Feb. 2024 | Contributed to projects including channel classifier, recommender systems, and intelligent advertisement systems. |
Machine Learning Engineer | Asr Gooyesh Pardaz | Jul. 2022 - Sep. 2022 | Developed audiovisual speech recognition for Persian language and created Persian audiovisual datasets. |
Projects
Project | Description | Link |
---|---|---|
Deep Learning Homework | Includes dimensionality reduction, autoencoders, graph embeddings, and reinforcement learning tasks using methods like PCA, t-SNE, VAE | View on GitHub |
Deep Generative Models Homework | Implementations of VAEs, GANs, Diffusion Models (DDPMs), and Energy-Based Models (EBMs) on datasets like MNIST and CIFAR-10 | View on GitHub |
Digital Image Processing Homework | Solutions to assignments covering filtering, transformations, compression, and segmentation | View on GitHub |
For a full list of my projects, visit my GitHub profile.
Skills and Expertise
- Programming Languages: Python, Java, C/C++, MATLAB
- Frameworks & Libraries: PyTorch, PyTorch Geometric, TensorFlow, Hugging Face, JAX, OpenCV
- Tools: Git, Linux, Bash, CUDA, FFmpeg, LangChain, AWS EC2, $\LaTeX$
- Data Science: Numpy, Pandas, Scikit-learn, Matplotlib
Languages
- English
- TOEFL iBT: 101
Reading Listening Speaking Writing 28 27 23 23
- TOEFL iBT: 101
- Persian (Native)
Contact Information
- Email: amir.m.babaei.academic@gmail.com
- LinkedIn: Amir Mohammad Babaei
- Google Scholar: Amir Mohammad Babaei
I am always open to discussing potential PhD positions and collaborative research. Please feel free to reach out!