👋 Hi, I’m Abdul Sami

I am a dedicated Machine Learning researcher with a focus on generative models, including expertise in GANs, VAEs, and Diffusion Models. My current work is centered on advancing Diffusion Models to generate high-quality images, and I’m expanding my expertise in Large Language Models (LLMs) to explore the dynamic intersection of vision and language. Currently, I’m completing my Master’s in Computer Science at Soongsil University, Seoul, under the guidance of Prof. Jaeyong Choi. I am passionate about generative AI, computer vision, and building models that push the boundaries of creativity.

🎓 Education

M.S. in Computer Science, Soongsil University, Seoul, South Korea (Expected Aug 2025)
GPA: 4.16/4.5
Researcher at System Software Lab under Prof. Jaeyong Choi
Thesis: Diffusion-Driven Image Generation with Disentangled Style and Structure-Aware Fidelity
B.E. in Software Engineering, Mehran University of Engineering and Technology, Pakistan (2018--2023)
GPA: 3.73/4.0
Graduated as the top student in the class
Final Year Project: Real-Time Face Recognition Attendance System Using Computer Vision

[Back to Top]

🔬 Research Interests

Image Generation (Diffusion Models, GANs)
Multilingual Font Generation
Few-shot and Zero-shot Learning
Style Encoding and Transfer
Computer Vision & Deep Learning
Object Detection & Semantic Segmentation
Human-Centered Design Tools using AI and Computer Vision

[Back to Top]

📚 Projects

DK-Font: Diffusion-Driven Multilingual Font Generation with Phonetic Awareness and Iterative Refinement
Sami Abdul, Jaeyong Choi
Tools: PyTorch, Diffusion Models, U‑Net, VGG‑19, ResNet, CLIP

Developed a diffusion model for font synthesis across Korean, Chinese, and Latin scripts.
Used phonetic-aware encoding and iterative refinement to enhance structural accuracy and style consistency.
Achieved SOTA results in SSIM, FID, and LPIPS, surpassing prior work like Diff‑Font and MX‑Font.
Implemented few-shot capabilities to synthesize full font sets from just 3–5 reference glyphs.
Code & demos: DK-Font repository on GitHub

Unified Diffusion Model with Multi-Scale Style Infusion and Structure-Aware Losses (Ongoing)
Tools: PyTorch, Diffusion Models, Sobel Filtering, CLIP, VGG, U-Net

Developing an enhanced diffusion-based font generation framework with unified single-phase training.
Introduced Multi-Scale Style Infusion to inject style representations at encoder, bottleneck, and decoder stages.
Integrated Sobel-based structural consistency loss to enforce stroke-level preservation during generation.
Employed CLIP-based style loss for perceptual alignment between reference and generated glyphs.
Designed for high-fidelity, structure-aware font synthesis across multilingual scripts.

Real‑Time Face Recognition Attendance System
Tools: Python, OpenCV, Face Recognition, SQLite, Tkinter

Built a desktop UI that recognizes faces from video and automatically logs attendance.
Features include face registration, verification, live video feed, and database integration.

Hangul Font Classifier
Tools: PyTorch, AlexNet, torchvision

Implemented a CNN to classify 2,780 Hangul characters across font styles.
Achieved ~95% accuracy in real-time image-based prediction.

[Back to Top]

💻 Technical Skills

Deep Learning Frameworks: PyTorch, TensorFlow, Keras
Languages: Python (fluent), C++ (basic), HTML/CSS (for fun)
ML Tools: HuggingFace Diffusers, VGG Feature Extractors, OpenCV, NumPy
Model Types: Diffusion Models, GANs, Style Encoders, CNNs
Data Tools: Pandas, Matplotlib, Jupyter, Weights & Biases
Other Tools: Git, Docker, Linux, LaTeX

[Back to Top]

🧑‍🏫 Teaching & Talks

Guest Speaker – "Modern Diffusion Models for Image Generation", System Software Lab Seminar, Soongsil University (2024)
Teaching Assistant – "Deep Learning Programming", Soongsil University (Fall 2023)
Presenter – "Font Generation using AI", Internal Lab Meeting (2024)

[Back to Top]

🔗 Get in Touch

📧 Email: abdulsamimahar001@gmail.com
🌐 Portfolio: abdulsami101.github.io (Coming Soon)
📍 Based in Seoul, South Korea
💼 Open to PhD opportunities in Europe (Fall 2025)

💡 I'm open to collaborations in generative AI and creative deep learning applications. Whether you're working on a new idea, looking for a partner in research, or just curious about fonts and image generation — let’s connect!

[Back to Top]