Cognitive Science and Mapping

External reference: https://openalex.org/T12805

  1. TRAVELER benchmark reveals weaker LLM temporal reasoning when references are vague