Automated Verification of English Proficiency Test Scores Using OCR for Graduation Qualification
Loading...
Date
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
An algorithm is proposed to verify English proficiency scores using OCR for graduation qualification, aiming to reduce errors and workload in the current manual verification process of officer. Score reports in image or PDF format, including TOEIC and KMITL-TEP, are processed using two deep learning-based document classifiers that identify the test type and score type prior to OCR execution. Preprocessing techniques such as noise reduction, contrast enhancement, and skew correction are applied to improve OCR accuracy. Three OCR models—Tesseract, TrOCR, and EasyOCR—are evaluated for text extraction performance. The extracted textual data are then converted into structured JSON, enabling automated rule-based comparison against graduation criteria. Evaluation performance is measured using Accuracy, F1-Score, Character Error Rate (CER), and Word Error Rate (WER). Experimental results show that the proposed system achieves 99% accuracy, demonstrating both high reliability and adaptability to institutional scoring standards. The integration of OCR significantly reduces processing time while maintaining flexibility to accommodate future changes in assessment policies.