BBL 596E/696E Seminars
BLU 596E Seminars

CRN 23045/25213
CRN 24461

  1     BBL 596E  
Title Challenges and Approaches for Clustering Large-Scale Molecular Datasets Using Similarity-Aware Methods
Speaker Arshia Lamei
Date May 7th, 2025
Time 13:30 (GMT+3)
Advisor Sefer Baday
Abstract
Clustering large molecular datasets, often represented by high-dimensional fingerprints, is essential in cheminformatics but poses significant computational challenges. Standard algorithms like Agglomerative Clustering can struggle with scalability (O(N2) complexity) and memory requirements when applied to hundreds of thousands of molecules. This paper reviews these challenges based on preliminary experiments using Morgan fingerprints and Jaccard distance. Advanced generative models are subsequently explored, specifically the regularized transformer approach proposed by Tibo et al. (2024), which aligns generation negative log-likelihood (NLL) with molecular similar ity (Tanimoto). While originally designed for local chemical space exploration, the potential for such similarity-aware generation to facilitate more effective downstream clustering by providing better-structured molecular sets or similarity-informed features is discussed. This perspective is compared with traditional methods like the Butina algorithm, and future directions involving scalable computing and deep learning for tackling large-scale molecular clustering are outlined.
Location Informatics Institute
Room 412
       
  2     BBL 596E  
Title Analysis of Microscopic Data on the Behavior of Metallic Nickel Ore During Magnetic Enrichment Using SAM2 and Tensor Decomposition-Based Zero-Shot Image Segmentation
Speaker Berk Münci İnanç
Date May 7th, 2025
Time 13:30 (GMT+3)
Advisor Behçet Uğur Töreyin
Abstract
The accurate separation and classification of minerals are crucial for integrating deep learning-based computer vision approaches into mineral enrichment processes. In this work, labeled microscopic mineral images were processed using various tensor decomposition algorithms and then segmented with the SAM2 model using a zero-shot prediction. The main goal of this study is to understand how different tensor decomposition schemes influence segmentation performance and to support further research in mineral enrichment. The results shed light on the impact of these algorithms on segmentation accuracy, providing useful guidance for choosing the most effective methods in the related field.
Location Informatics Institute
Room 412
       
  3     BBL 596E  
Title Underwater Turbid River Image Restoration Using Diffusion Models
Speaker Fatima Iqbal
Date May 7th, 2025
Time 13:30 (GMT+3)
Advisor Behçet Uğur Töreyin
Abstract
Underwater images often face challenges due to turbidity caused by suspended particles, leading to hazy and distorted visuals. The lack of real-life underwater data also reduces the efficiency of trained models. This paper introduces a diffusion models-based denoising architecture to restore under water turbid images. The method first quantifies the turbidity noise by optimizing the variance parameter using a data set that replicates the diffusion process of forward noise addition. The trained U-Net architecture then iteratively reconstructs turbid images by implementing a reverse Markov diffusion chain process. In addition to visual enhancements, restored images are evaluated using perceptual evaluation measures such as entropy and NIQE. The results of this project can contribute significantly to the monitoring of marine ecosystems and the study of fish migration patterns.
Location Informatics Institute
Room 412
       
  4     BBL 596E  
Title Network Attack Classification and Incident Response Support Using a Fine-tuned Large Language Model
Speaker Haluk Çağatay Kır
Date May 7th, 2025
Time 13:30 (GMT+3)
Advisor Enver Özdemir
Abstract
The limitations of traditional, manual threat hunting in Incident Response highlight the need for more innovative cybersecurity strategies. Manually crafted rules to detect malicious behavior are not scalable and often yield low accuracy. This research proposes a shift toward integrating Large Language Models (LLMs) into Incident Response processes to enhance threat detection accuracy, lower false positives, and optimize resource allocation. By leveraging LLMs, the project seeks to reduce repetitive tasks, speeding up decision-making and overall incident response effectiveness. The methodology involves fine-tuning the Gemma-2b model on the CIC-IDS2017 dataset and creating a Retrieval Augmented Generation (RAG) agent that offers a natural language interface for log analysis. Results demonstrate that Gemma-2b acts as a conversational AI assistant, improving the efficiency of cybersecurity professionals by automating routine tasks. With an overall accuracy of 96.85%, the model shows strong predictive capabilities in identifying diverse network attacks, underscoring its potential for threat detection and classification in cybersecurity operations.
Location Informatics Institute
Room 412
       
  5     BBL 596E  
Title Weakly Supervised HER2 Equivocal WSI Classification
Speaker Mücahit Ertano
Date May 14th, 2025
Time 13:30 (GMT+3)
Advisor Abdülkerim Çapar
Abstract
HER2/neu (ERBB2) gene amplification and over-expression are critical biomarkers in breast cancer, with imunohistochemistry (IHC) used to classify HER2 expression levels. Equivocal (2+) cases present challenges, often requiring additional In Situ Hybridization (ISH) tests, which are costly and time-intensive, to guide treatment decisions. This study developed an automated approach using weakly supervised deep learning methods to classify HER2 IHC equivocal (2 +) whole slide images (WSIs) as HER2 positive or HER2-negative. Tumor samples from 113 breast cancer patients were digitized, annotated according to the intensity of HER2 staining, and analyzed using multiple instance learning. These findings demonstrate the potential of multiple instance learning to streamline HER2 testing, reduce reliance on ISH, and improve decision-making for equivocal cases.
Location Informatics Institute
Room 412
       
  6     BBL 596E  
Title A Comprehensive Survey on Multi-Modal Attention Mechanisms in Vision-Language Models
Speaker Cansu Nur Demirkıran
Date May 14th, 2025
Time 13:30 (GMT+3)
Advisor Ertuğrul Karaçuha
Abstract
Recent advancements in Vision-Language Models (VLMs) have led to significant improvements in tasks requiring joint reasoning over visual and textual modalities. A core component driving this progress is the development of sophisticated multi-modal attention mechanisms. This paper provides a comprehensive survey of multi-modal attention architectures in VLMs, including cross-attention, self-attention, co-attention, and hierarchical attention mechanisms. These mechanisms are categorized and analyzed based on their design and effectiveness in benchmark tasks such as Visual Question Answering (VQA), image captioning, and cross-modal retrieval. In addition, computational trade-offs are examined, and emerging challenges and research directions in multi-modal attention are highlighted.
Location Informatics Institute
Room 412
       
  7     BBL 596E  
Title Predicting Hit Songs with Machine Learning
Speaker Çağatay Kurt
Date May 14th, 2025
Time 13:30 (GMT+3)
Advisor Şule Öğüdücü
Abstract
Hit song prediction is a subfield of Music Information Retrieval (MIR) that is focused on analyzing hit songs that have appeared in one or more music charts around the world. These charts and their rankings are usually based on physical/digital sales and downloads, streaming/airplay count from radio and web platforms. Excluding the artist based factors, analyzing hit songs by their raw waveforms may unveil highly valuable information. This information can be used by producers and record labels in order to determine which songs are likely to be a hit prior to their release and to discover new talents. We gathered a dataset that contains hit songs from Billboard year-end charts of last 20 years, and equal amount of non-hit songs from same genres. Their high level audio features are extracted from Spotify API and this dataset is used to train five machine learning classifiers (logistic regression, support vector machine, decision tree, random forest, XGBoost) as baseline models. Some of these models were able to achieve 86% accuracy. We proposed new audio-based features as a part of ongoing work.
Location Informatics Institute
Room 412
       
  8     BBL 596E  
Title Medical Image Analysis Using Deep Learning
Speaker Yusuf Emir Meşe
Date May 14th, 2025
Time 13:30 (GMT+3)
Advisor Abdulkerim Çapar
Abstract
Deep learning has revolutionized medical image analysis by enabling automated and accurate diagnosis from medical scans such as X-rays, MRIs, and CT scans. Convolutional neural networks (CNNs) and transformer-based models have significantly improved image classification, segmentation, and anomaly detection. These techniques enhance diagnostic efficiency, reduce human error, and assist healthcare professionals in early disease detection.
Location Informatics Institute
Room 412
       
  9     BBL 596E  
Title Interpreting Noise in Pulse-Controlled Quantum Gate Operations: A Comparative Study of Modeling Approaches
Speaker Negar Dokhtmirzahasanvahid
Date May 21st, 2025
Time 13:30 (GMT+3)
Advisor Deniz Turkpence
Abstract
Accurately modeling quantum gate operations in the presence of noise is crucial for improving quantum hardware performance and error mitigation strategies. In this work, we investigate the effects of noise on single- and two-qubit gates implemented using pulse control. We compare three different approaches: unitary models, which assume ideal closed-system evolution; nonunitary models, which incorporate decoherence and dissipation; and pulse-based simulations using ModelingLib. By benchmarking these models against experimental results from noisy quantum processors, we analyze their ability to capture real-world noise sources, including decoherence, calibration errors, and crosstalk. Our findings provide insights into the strengths and limitations of each modeling approach, highlighting discrepancies between theoretical predictions and hardware behavior. This work contributes to a more accurate interpretation of noise in pulse-based quantum operations and informs the development of improved simulation techniques for near-term quantum devices.
Location Informatics Institute
Room 412
       
  10     BBL 596E  
Title MaskDINO Instance Segmentation for Chromosome Images: Enhancing Accuracy through Boosting
Speaker Ikra Nergiz
Date May 21st, 2025
Time 13:30 (GMT+3)
Advisor Behçet Uğur Töreyin
Abdulkerim Çapar
Abstract
Cytogenetic analysis plays a crucial role in the diagnosis of genetic diseases and involves the examination of chromosomes. Metaphase images are analyzed to detect any numerical or structural anomalies in the chromosomes. This process is error-prone and time-consuming due to human intervention. Errors in this process can lead to inappropriate treatments and incorrect clinical diagnoses. In recent years, deep learning techniques in computer vision have been widely used to automate chromosome analysis and improve the accuracy of chromosome segmentation. This paper demonstrates the effectiveness of MaskDINO for instance segmentation, based on Detectron2 and trained on chromosome image datasets. A boosting technique was applied using custom datasets comprising over 50,000 chromosome images. The model was trained on a specific dataset and then evaluated on a separate one. The 1,000 images with the lowest mAP50@95 scores were added to the training dataset, and the model was retrained on this extended dataset. The scores of the model trained on the extended dataset show a %X improvement compared to the initial training dataset.
Additionally, data augmentation strategies to enhance the model's performance on low-resolution and blurry images were examined.
Location Informatics Institute
Room 412
       
  11     BBL 596E  
Title Review and Implementation of The Article of Dynamic Routing Scheme for Linking Wireless Sensor Network towards Internet of Things
Speaker İsmail Fırat Sürkit
Date May 21st, 2025
Time 13:30 (GMT+3)
Advisor Ömer Melih Gül
Abstract
The rapid advancements in wireless sensor networks (WSNs) have paved the way for significant progress in the Internet of Things (IoT), enabling various large-scale applications. For these applications to function effectively, it is crucial that routing protocols ensure efficient and reliable packet delivery to their intended destinations. Although existing routing protocols are designed to fulfill basic network requirements, they often fall short in achieving a balance between energy efficiency and secure data transmission. This imbalance can result in reduced network performance and reliability, particularly in large-scale IoT systems. To address these limitations, we propose a secure and energy-efficient routing scheme specifically tailored to meet the unique challenges of IoT networks. The scheme incorporates a controller node that plays a central role in monitoring the behavior of nodes along the routing path. This includes evaluating node cooperation, energy consumption, and identifying any malicious behavior that could compromise the network. The controller node actively observes data transmission processes, detecting potential issues such as packet loss, transmission failures, congestion, or malicious activity. Upon identifying these challenges, the controller node works collaboratively with neighboring nodes to restore reliable communication by dynamically establishing alternative multipath routes. By integrating these mechanisms, the proposed scheme not only ensures energy efficiency but also enhances the security and reliability of data transmission in IoT networks, making it a robust solution for large-scale deployments.
Location Informatics Institute
Room 412
       
  12     BBL 596E  
Title Machine-learning Based Detection of Mononuclear Inflammatory Cells in the Kidney
Speaker İrem Ayşe Kayacıoğlu
Date May 21st, 2025
Time 13:30 (GMT+3)
Advisor Volkan Müjdat Tiryaki
Abstract
Banff Lesion Scores are among the most common systems for deciding whether an organ transplant was successful or not. However, the reproducibility of Banff-scoring is mediocre at best and it is time-consuming in daily practice. Human error exists and differences ensue from one pathologist to another. The aim is to reduce the workload of pathologists and increase the scoring consistency by developing an automated assessment system
Location Informatics Institute
Room 412
       
  13     BLU 596E  
Title Addressing the Leakage Problem Between Virtual Machines in Cloud Environments
Speaker Berkay Akar
Date May 28th, 2025
Time 13:30 (GMT+3)
Advisor Mehmet Tahir Sandıkkaya
Abstract
Cloud computing systems, offering increased resource sharing and availability, represent a ubiquitous technology providing businesses and individuals with flexibility and efficiency. However, the leakage problem between virtual machines (VMs) in these systems poses a significant security concern. In the cloud environment, where virtual machines run on the same physical hardware and utilize shared resources, improper configuration of one VM or a successful attack by a malicious actor gaining unauthorized access from one VM to another can lead to leakage of sensitive data or denial of service. This issue underscores the need for cloud providers and users to continually enhance security measures and improve virtual machine isolation. This study examines the leakage problem between virtual machines in cloud environments, emphasizing the need to assess existing security measures and develop new strategies to address this pressing concern.
Location Informatics Institute
Room 412
       
  14     BLU 596E  
Title Machine Learning Guided Ultra-Large Library Screening for Androgen Receptor
Speaker Doruk Can Ul
Date May 28th, 2025
Time 13:30 (GMT+3)
Advisor Sefer Baday
Abstract
The androgen receptor (AR) plays a crucial role in various physiological processes and is a key target in treating conditions such as prostate cancer. Identifying potent AR inhibitors from ultra-large chemical libraries presents a significant challenge due to the immense computational cost of traditional docking methods. This study leverages machine learning-driven deep docking to efficiently screen potential AR inhibitors. Initially, deep docking models are trained on a dataset of 1,440 ligands with known binding scores to AR. These trained models are then applied to the extensive ZINC20 database, consisting of approximately 750 million compounds. By prioritizing high-affinity candidates, this approach significantly reduces the number of compounds requiring full docking simulations, thereby improving computational efficiency while maintaining screening accuracy. The proposed methodology enables a more targeted and cost-effective drug discovery pipeline for AR inhibitors, accelerating the identification of promising therapeutic candidates.
Location Informatics Institute
Room 412
       
  15     BLU 596E  
Title Classification of Melanoma Skin Cancer Using Deep Learning-Based Analysis
Speaker Furkan Yaşar
Date May 28th, 2025
Time 13:30 (GMT+3)
Advisor Volkan Müjdat Tiryaki
Abstract
Since melanomas are among the most severe forms of skin cancer, prompt and precise diagnosis is essential to successful treatment. In this work, we provide a deep learning-based method for dermoscopic image-based melanoma classification. To automate and enhance diagnosis precision, a convolutional neural network (CNN) model is trained on a dataset comprising benign and malignant skin lesions. Key performance indicators including accuracy, sensitivity, and specificity are used to assess the model. Our findings show that deep learning methods can distinguish between benign and malignant instances with high accuracy, which may help dermatologists detect and diagnose melanoma in its early stages.
Location Informatics Institute
Room 412
       
  16     BLU 596E  
Title UAV-Driven Energy-Aware Data Collection in Robotic Wireless Sensor Networks
Speaker Beyza Duran
Date May 28th, 2025
Time 13:30 (GMT+3)
Advisor Ömer Melih Gül
Abstract
Minimizing energy consumption is essential for robotic wireless sensor networks (RWSNs) to ensure long-term operation and efficiency. Traditional static sink-based cluster protocols often increase energy costs for distant nodes, reducing overall network performance. To overcome this, UAV-assisted data collection offers an effective alternative by enabling mobile sink functionality, thus lowering transmission distances and energy usage. However, many approaches overlook the UAV’s limited battery capacity. This study proposes an energy-aware data collection strategy where a UAV selectively visits cluster head (CH) robots based on remaining battery and CH positions. Unvisited CHs transmit their data through multi-hop communication. The proposed strategy achieves zero total common cost in some scenarios and significantly outperforms existing methods in computation time, offering a practical and scalable solution for real-world RWSNs.
Location Informatics Institute
Room 412
       
  17     BBL 696E  
Title Experimental Assessment of Misalignment Effects in Terahertz Communications
Speaker Hasan Nayir
Date May 28th, 2025
Time 15:30 (GMT+3)
Advisor Ömer Melih Gül
Abstract
Terahertz (THz) frequencies play a crucial role in the advancement of next-generation wireless systems, primarily owing to their substantial available bandwidths. The inherent limitation of limited range, attributed to high attenuation in these frequencies, can be effectively addressed by implementing densely deployed heterogeneous networks, complemented by Unmanned Aerial Vehicles (UAVs) within a three-dimensional hyperspace. Yet, the success of THz communications relies on the precise alignment of beams. Inadequate handling of beam alignment can lead to diminished signal strength at the receiver, significantly affecting THz signals more than their conventional counterparts. This research underscores the paramount importance of meticulous alignment in THz communication systems. The profound impact of proper alignment is substantiated through comprehensive measurements conducted using a state-of-the-art measurement setup, facilitating accurate data collection across the 240 GHz to 300 GHz spectrum. These measurements encompass varying angles and distances within an anechoic chamber to eliminate reflections. Through a meticulous analysis of the channel frequency and impulse responses derived from these extensive measurements, this study pioneers quantifiable results, providing an assessment of the effects of beam misalignment in THz frequencies.
Location Informatics Institute
Room 412
       
  18     BBL 696E  
Title Interpreting Transformer Models: Understanding Attention Mechanisms
Speaker Ruken Missonnier
Date May 28th, 2025
Time 15:30 (GMT+3)
Advisor Sefer Baday
Abstract
This study delves into the inner workings of transformer-based models by analyzing attention patterns. It aims to provide insights into how these models capture contextual relationships and discusses methods for improving model interpretability.
Location Informatics Institute
Room 412
       
  19     BBL 696E  
Title Enabling Memory-Efficient Inference for Large Language Models with Highly Multithreaded, Distributed Architectures
Speaker Cihan Çalışır
Date May 28th, 2025
Time 15:30 (GMT+3)
Advisor Ayse Yılmazer Metin
Abstract
The increasing adoption of Large Language Models (LLMs) is hindered by their high GPU memory demands and computational constraints, particularly on resource-limited hardware. This thesis introduces a novel Ray-based distributed architecture designed to optimize low-GPU-memory inference for open-source LLMs. The proposed system integrates advanced optimization techniques, including dynamic model sharding, intelligent KV cache management, and strategies to maximize computational throughput. Specifically, the architecture employs dynamic layer partitioning across GPUs to minimize per-device GPU memory usage and incorporates methods such as Grouped-Query Attention (GQA), expert-aware KV cache routing for sparse models, and mixed-precision inference to enhance memory efficiency and computational performance. The framework is evaluated using representative open-source LLM architectures, demonstrating its ability to enable efficient inference on constrained hardware while maintaining competitive performance against state-of-the-art solutions. This research aims to provide a scalable and accessible approach to low-GPU-memory LLM inference, addressing a key challenge in broadening access to advanced language models.
Location Informatics Institute
Room 412
       
  20     BBL 696E  
Title Tensor Decomposition-Based Methods in Hyperspectral Imaging
Speaker Ülker Başar
Date May 28th, 2025
Time 15:30 (GMT+3)
Advisor Süha Tuna
Abstract
Hyperspectral imaging (HSI) captures data across numerous spectral bands, offering rich information for various applications, including remote sensing, medical diagnostics, and material classification. However, the high dimensionality and vast data volumes necessitate efficient processing and compression techniques. Tensor decomposition-based methods provide a promising solution by enabling the representation of hyperspectral data in lower-dimensional components while preserving crucial information. In particular, Enhanced Multivariance product representation (EMPR) and its optimized variants have gained significant attention due to their efficiency in lossy compression and feature extraction. EMPR-based approaches decompose HSI data into hierarchical structures, reducing redundancy and computational complexity. Moreover, recent advancements integrate optimization techniques, such as the Alternating Direction Method of Multipliers (ADMM), to enhance support vector selection, further improving classification performance. This paper reviews tensor decomposition techniques in hyperspectral imaging, highlighting their applications, advantages, and challenges. The findings demonstrate that tensor based methods, particularly EMPR, play a critical role in advancing HSI processing and improving accuracy and efficiency in hyperspectral image analysis.
Location Informatics Institute
Room 412