👋 Hi, I’m Abdul Sami

Education | Research Interests | Projects & Publications | Technical Skills | Talks & Teaching | Get in Touch

I am a dedicated Machine Learning researcher with a focus on generative models, including expertise in GANs, VAEs, and Diffusion Models. My current work is centered on advancing Diffusion Models to generate high-quality images, and I’m expanding my expertise in Large Language Models (LLMs) to explore the dynamic intersection of vision and language. Currently, I’m completing my Master’s in Computer Science at Soongsil University, Seoul, under the guidance of Prof. Jaeyong Choi. I am passionate about generative AI, computer vision, and building models that push the boundaries of creativity.

🎓 Education

  • M.S. in Computer Science, Soongsil University, Seoul, South Korea (Expected Aug 2025)
    GPA: 4.16/4.5
    Researcher at System Software Lab under Prof. Jaeyong Choi
    Thesis: Diffusion-Driven Image Generation with Disentangled Style and Structure-Aware Fidelity
  • B.E. in Software Engineering, Mehran University of Engineering and Technology, Pakistan (2018--2023)
    GPA: 3.73/4.0
    Graduated as the top student in the class
    Final Year Project: Real-Time Face Recognition Attendance System Using Computer Vision
[Back to Top]

🔬 Research Interests

  • Image Generation (Diffusion Models, GANs)
  • Multilingual Font Generation
  • Few-shot and Zero-shot Learning
  • Style Encoding and Transfer
  • Computer Vision & Deep Learning
  • Object Detection & Semantic Segmentation
  • Human-Centered Design Tools using AI and Computer Vision
[Back to Top]

📚 Projects

DK-Font: Diffusion-Driven Multilingual Font Generation with Phonetic Awareness and Iterative Refinement
Sami Abdul, Jaeyong Choi
Tools: PyTorch, Diffusion Models, U‑Net, VGG‑19, ResNet, CLIP

  • Developed a diffusion model for font synthesis across Korean, Chinese, and Latin scripts.
  • Used phonetic-aware encoding and iterative refinement to enhance structural accuracy and style consistency.
  • Achieved SOTA results in SSIM, FID, and LPIPS, surpassing prior work like Diff‑Font and MX‑Font.
  • Implemented few-shot capabilities to synthesize full font sets from just 3–5 reference glyphs.
  • Code & demos: DK-Font repository on GitHub

Unified Diffusion Model with Multi-Scale Style Infusion and Structure-Aware Losses (Ongoing)
Tools: PyTorch, Diffusion Models, Sobel Filtering, CLIP, VGG, U-Net

  • Developing an enhanced diffusion-based font generation framework with unified single-phase training.
  • Introduced Multi-Scale Style Infusion to inject style representations at encoder, bottleneck, and decoder stages.
  • Integrated Sobel-based structural consistency loss to enforce stroke-level preservation during generation.
  • Employed CLIP-based style loss for perceptual alignment between reference and generated glyphs.
  • Designed for high-fidelity, structure-aware font synthesis across multilingual scripts.

Real‑Time Face Recognition Attendance System
Tools: Python, OpenCV, Face Recognition, SQLite, Tkinter

  • Built a desktop UI that recognizes faces from video and automatically logs attendance.
  • Features include face registration, verification, live video feed, and database integration.

Hangul Font Classifier
Tools: PyTorch, AlexNet, torchvision

  • Implemented a CNN to classify 2,780 Hangul characters across font styles.
  • Achieved ~95% accuracy in real-time image-based prediction.
[Back to Top]

💻 Technical Skills

  • Deep Learning Frameworks: PyTorch, TensorFlow, Keras
  • Languages: Python (fluent), C++ (basic), HTML/CSS (for fun)
  • ML Tools: HuggingFace Diffusers, VGG Feature Extractors, OpenCV, NumPy
  • Model Types: Diffusion Models, GANs, Style Encoders, CNNs
  • Data Tools: Pandas, Matplotlib, Jupyter, Weights & Biases
  • Other Tools: Git, Docker, Linux, LaTeX
[Back to Top]

🧑‍🏫 Teaching & Talks

  • Guest Speaker – "Modern Diffusion Models for Image Generation", System Software Lab Seminar, Soongsil University (2024)
  • Teaching Assistant – "Deep Learning Programming", Soongsil University (Fall 2023)
  • Presenter – "Font Generation using AI", Internal Lab Meeting (2024)
[Back to Top]

🔗 Get in Touch

  • 📧 Email: abdulsamimahar001@gmail.com
  • 🌐 Portfolio: abdulsami101.github.io (Coming Soon)
  • 📍 Based in Seoul, South Korea
  • 💼 Open to PhD opportunities in Europe (Fall 2025)

💡 I'm open to collaborations in generative AI and creative deep learning applications. Whether you're working on a new idea, looking for a partner in research, or just curious about fonts and image generation — let’s connect!

[Back to Top]