Zero-/Few-Shot Anomaly Classification for Transistor Using Multimodal CLIP Retrieval Augmented

dc.contributor.authorNatdhanai Praneenatthavee
dc.contributor.authorTuchsanai Ploysuwan
dc.date.accessioned2026-05-08T19:24:23Z
dc.date.issued2024-11-11
dc.description.abstractDetecting anomalies in transistors is a challenging task due to the intricate features distinguishing normal from abnormal components. This paper introduces a zero-shot and few-shot anomaly classification framework for transistors using a retrieval-augmented multimodal approach. We focus on two critical parts of the transistor: the Component BOX and the Metal Legs. Utilizing Florence2 for prompt-based bounding box generation and Segment Anything Model2 (SAM2) for segmentation, we create precise masks for each part. Embeddings generated through Contrastive Language-Image Pre-training (CLIP) are employed to classify each component effectively. For the few-shot learning scenario, we implement Retrieval-Augmented Generation (RAG) to simulate learning from both images and textual data, enhancing the anomaly classification performance. Our zero-shot model achieved an fl score of 70.5, while the few-shot model attained an improved fl score of 77.0, demonstrating the efficacy of our approach in transistor anomaly detection.
dc.identifier.doi10.1109/isai-nlp64410.2024.10799271
dc.identifier.urihttps://dspace.kmitl.ac.th/handle/123456789/19564
dc.subjectIntegrated Circuits and Semiconductor Failure Analysis
dc.subjectVLSI and Analog Circuit Testing
dc.subjectAdvancements in Semiconductor Devices and Circuit Design
dc.titleZero-/Few-Shot Anomaly Classification for Transistor Using Multimodal CLIP Retrieval Augmented
dc.typeArticle

Files

Collections