Prediction and Memory in Naturalistic Language Processing

Investigating how prediction and working memory influence everyday language processing

What are the moment-by-moment computations that allow us to rapidly comprehend what others say to us? Evidence has been accumulating for decades that we incrementally build up meaning representations by (1) storing and retrieving representations in working memory and (2) predicting future words and meanings. However, we still don’t have a good handle on how these computations work, how prediction and working memory relate to each other, or whether the responses we see to decontextualized, constructed laboratory sentences still happen during everyday language use.

One of my core lines of research is to study the algorithmic structure and degree of dissociability of prediction and memory effects during naturalistic sentence comprehension. To do so, I analyze large-scale naturalistic eye-tracking, self-paced reading, and fMRI data using tools from natural language processing, linguistic theory, and machine learning.

Related publications

  1. Cur Dir Psych Sci
    Similarity of computations across domains does not imply shared implementation: The case of language comprehension
    Fedorenko, Evelina, and Shain, Cory
    Current Directions in Psychological Science 2021
  2. Npsy
    fMRI reveals language-specific predictive coding during naturalistic sentence comprehension
    Neuropsychologia 2020
  3. COLING
    Coreference information guides human expectations during natural reading
    Jaffe, Evan, Shain, Cory, and Schuler, William
    In Proceedings of the 28th International Conference on Computational Linguistics 2020
  4. NAACL
    A large-scale study of the effects of word frequency and predictability in naturalistic reading
    Shain, Cory
    In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) 2019
  5. LREC
    Deep Syntactic Annotations for Broad-Coverage Psycholinguistic Modeling
    In Proceedings of the Eleventh International Conference on Language Resources and Evaluation 2018
  6. CMCL
    Coreference and focus in reading times
    Jaffe, Evan, Shain, Cory, and Schuler, William
    In Proceedings of the 8th Workshop on Cognitive Modeling and Computational Linguistics 2018
  7. CL4LC
    Memory access during incremental sentence processing causes reading time latency
    In Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity 2016