B05 – Attention selection and recognition in scene-viewing

This project terminated in June 2021 and will not be continued in the second funding period.

Objectives

Scene viewing is an observable process of rapid decision making at the interface between perception, attention, and motor control. Knowledge about the organisation of the visual system has stimulated the development of attentional (“saliency”) models. Saliency models make predictions on regions that viewers are likely to fixate during scene exploration. Cognitive models for the control of eye movements study the dynamic interaction between perception and visuomotor control. Prior work predicts the distribution of fixation positions by a model of spatially-limited access to visual-saliency information and a leaky-memory model of re-inspection of positions. Besides macroscopic saccades, the eye performs involuntary micro-movements during saccades which are commonly categorized into micro-saccades, drift, and tremor. It is known that macroscopic eye movements are highly individual, and eye movements have been explored as a mode of biometric identification. However, the accuracy of known approaches to oculomotor biometric identification is too low and the time that is needed for an identification is too long by one to two orders of magnitude for any practical application.

Project B05 has explored the roles of data and prior knowledge in the form of generative models of eye gaze for discriminative tasks such as viewer identification, and estimation of viewer familiarity. The first principal goal of the project was to develop dynamical, hierarchical Bayesian models of attentional selection in scene viewing that account for the viewers' familiarity with and conceptually-driven anticipation of the image content as well as for viewer-specific distributional properties of fixational patterns. The second principal goal was to exploit these generative models of fixation sequences as background knowledge within discriminative models that accurately predict values of latent variables, such as levels of familiarity, based on a given fixation sequence.

Preprints

No available.

Publications

Makowski, S., Jäger, L. A., Prasse, P., & Scheffer, T. (2020). Biometric identification and presentation-attack detection using micro- and macro-movements of the eyes. International Joint Conference on Biometrics (IJCB), in press. Preprint: [1]
Prasse, P., Jäger, L. A., Makowski, S., Feuerpfeil, M., & Scheffer, T. (2020): On the Relationship between Eye Tracking Resolution and Performance of Oculomotoric Biometric Identification. Procedia Computer Science, 176, 2088-2097. doi: 10.1016/j.procs.2020.09.245
Makowski, S., Jäger, L. A., Schwetlick, L., Trukenbrod, H., Engbert, R., & Scheffer, T. (2020): Discriminative Viewer Identification using Generative Models of Eye Gaze. Procedia Computer Science, 176, 1348-1357. doi: 10.1016/j.procs.2020.09.144
Schwetlick, L., Rothkegel, L.O.M., Trukenbrod, H.A., Engbert, R. (2020). Modeling the effects of perisaccadic attention on gaze statistics during scene viewing. Communications Biology, 3, 727. doi: 10.1038/s42003-020-01429-8
Jäger, L. A., Makowski, S., Prasse, P., Liehr, S., Seidler, M., & Scheffer, T. (2019). Deep Eyedentification: Biometric Identification using Micro-Movements of the Eye. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD), 299-314. Springer, Cham. doi: 10.1007/978-3-030-46147-8_18
Trukenbrod, H. A., Barthelmé, S., Wichmann, F. A. and Engbert, R. (2019). Spatial statistics for gaze patterns in scene viewing: Effects of repeated viewing, Journal of Vision, 19(6):5, 1-19. doi: 10.1167/19.6.5
Rothkegel, L. O., Schütt, H. H., Trukenbrod, H. A., Wichmann, F. A. and Engbert, R. (2019). Searchers adjust their eye-movement dynamics to target characteristics in natural scenes. Scientific Reports, 9, article no. 1635. doi: 10.1038/s41598-018-37548-w
Makowski, S., Jäger, L., Abdelwahab, A., Landwehr, N. and Scheffer, T. (2018). A discriminative model for identifying readers and assessing text comprehension from eye movements. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD). doi: 10.1007/978-3-030-10925-7_13, arxiv preprint: https://arxiv.org/pdf/1809.08031.pdf