Student Engagement Analysis using Bi-Up Sampling Feature Pyramid Network with Inter Cross Coordinate Self Attention
Naveen A. Scholar, Department of Computer Science and Engineering, GITAM School of Technology, Bengaluru Campus, India. a.naveen21@gmail.com0000-0001-6273-2520
I. Jeena JacobProfessor, Department of Computer Science and Engineering, GITAM School of Technology, Bengaluru Campus, India. ijacob@gitam.edu0000-0001-6706-1017
Ajay Kumar MandavaAssociate Professor, Department of Electrical, Electronics and Communication Engineering, GITAM School of Technology, Bengaluru Campus, India. amandava@gitam.edu0009-0001-8902-6914
Keywords: Machine Learning, Deep Learning, Facial Expression.
Abstract
Facial expressions are physical changes that reflect an individual's feelings, emotions, intentions, or social interactions. In the field of computer vision, facial expression analysis needs a higher level of knowledge. To increase student involvement in class, in recent years increased in employing technology to track and evaluate students' facial expressions. Facial expressions, as a nonverbal form of communication, can provide valuable insights into students' emotional states. This research article aims to explore the potential of class engagement monitoring using facial expressions. Hence, we proposed a Bi-upsampling Feature Pyramid Network (BiusFPN) with multiple attention map integration. We used ResNet18 as a backbone and we also introduced an inter-coordinate attention model which improves the feature extraction from local spatial extended mode with different coordinate representations. This engagement analysis model precisely detects the facial changes and yields an accurate outcome. We integrate both Channel and spatial attention mechanisms to fuse different attention maps for the final representation. The result of the classification layer will be Disengaged or partially engaged or engaged. This approach achieves 68.16% accuracy for the DAiSEE dataset and 83% for the WACV dataset. In this way, the proposed