Faculty Publications, Presentations, Data & Software

Faculty presenting

The faculty of the School of Information are leaders in the information, data and library sciences. Their research appears in top peer-reviewed journals, they develop world-class datasets and software, and they participate in a range of engaging presentations, both scholarly and for the general public.

Scroll down to view select faculty publications, presentations, data and software by year, or follow a link below:


Select 2024 Faculty Publications

Young, S., Catherine F. Brooks, and Pridmore, J. (2024). Societal implications of quantum technologies through a technocriticism of quantum key distribution. First Monday 3(1), online.

Gerwin JN, de Oliveira Almeida G, Boyce MW, Joseph M, Wong AH, Winslow Burleson and Evans LV (2024) HRVEST: a novel data solution for using wearable smart technology to measure physiologic stress variables during a randomized clinical trial. Frontiers in Computer Science 6:1343139.

Swetnam, T. L., Antin, P. B., Bartelme, R., Bucksch, A., Camhy, D., Greg Chism, ... & Lyons, E. (2024). CyVerse: Cyberinfrastructure for open science. PLOS Computational Biology20(2), e1011270.

Dharma KC, Venkata Ravi, Kiran Dayana, Meng-Lin Wu, Venkateswara Rao Cherukuri, Hau Hwang and Clayton T. Morrison. Towards Light Weight Object Detection System. International Workshop on Advanced Image Technology (IWAIT), 2024.

Rayburn, A. J., Punzalan, R. L., & Andrea K. Thomer. (2024). Persisting through friction: Growing a community driven knowledge infrastructure. Archival Science.


Select 2023 Faculty Publications

Tristan Naumann; Asma Ben Abacha; Steven Bethard; Kirk Roberts; and Anna Rumshisky, editors. Proceedings of the 5th Clinical Natural Language Processing Workshop. Association for Computational Linguistics. Toronto, Canada, July 2023.

Samuel Gonzalez-Lopez; and Steven Bethard. Transformer-based cynical expression detection in a corpus of Spanish YouTube reviews. In Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, pages 194–201, Toronto, Canada, July 2023. Association for Computational Linguistics.

Zeyu Zhang; and Steven Bethard. Improving Toponym Resolution with Better Candidate Generation, Transformer-based Reranking, and Two-Stage Resolution. In Proceedings of the The 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023), pages 48–60, Toronto, Canada, July 2023. Association for Computational Linguistics.

Nimet Beyza Bozdag; Tugay Bilgis; and Steven Bethard. Arizonans at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis with XLM-T. In Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 1656–1659, Toronto, Canada, July 2023. Association for Computational Linguistics.

Tugay Bilgis; Nimet Beyza Bozdag; and Steven Bethard. Gallagher at SemEval-2023 Task 5: Tackling Clickbait with Seq2Seq Models. In Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 1650–1655, Toronto, Canada, July 2023. Association for Computational Linguistics.

Jiarui Yao; Steven Bethard; Kristin Wright-Bettner; Eli Goldner; David Harris; and Guergana Savova. Textual Entailment for Temporal Dependency Graph Parsing. In Proceedings of the 5th Clinical Natural Language Processing Workshop, pages 191–199, Toronto, Canada, July 2023. Association for Computational Linguistics.

Kadir Bulut Ozler; and Steven Bethard. clulab at MEDIQA-Chat 2023: Summarization and classification of medical dialogues. In Proceedings of the 5th Clinical Natural Language Processing Workshop, pages 144–149, Toronto, Canada, July 2023. Association for Computational Linguistics.

Timothy Miller; Steven Bethard; Dmitriy Dligach; and Guergana Savova. End-to-end clinical temporal information extraction with multi-head attention. In The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, pages 313–319, Toronto, Canada, July 2023. Association for Computational Linguistics.

Lijing Wang; Yingya Li; Timothy Miller; Steven Bethard; and Guergana Savova. Two-Stage Fine-Tuning for Improved Bias and Variance for Large Pretrained Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15746–15761, Toronto, Canada, July 2023. Association for Computational Linguistics.

Egoitz Laparra; Alex Binford-Walsh; Kirk Emerson; Marc L. Miller; Laura López-Hoffman; Faiz Currim; and Steven Bethard. Addressing structural hurdles for metadata extraction from environmental impact statements. Journal of the Association for Information Science and Technology, n/a(n/a). June 2023.

Stephen A. Rains; Kate Kenski; Leah Dajches; Kaylin Duncan; Kun Yan; Yejin Shin; Jules L. Barbati; Steven Bethard; Kevin Coe; and Yotam Shmargad. Engagement with incivility in tweets from and directed at local elected officials. Communication and Democracy, 57(1): 143-152. 2023.

Qin, J., Sarah Bratt, Hemsley, J., Smith, A., & Liu, Q. (2023). A FAIR Data Ecosystem for Science of Science. Proceedings of the Association for Information Science and Technology, 60(1), 1107-1109.

Sarah Bratt. (2023). ‘Routine Infrastructuring’: How Social Scientists Appropriate Resources to Deposit Qualitative Data to ICPSR and Implications for FAIR and CARE. Proceedings of the Association for Information Science and Technology, 60(1), 61-72.

Sarah Bratt, Langalia, M., & Nanoti, A. (2023). North-south scientific collaborations on research datasets: a longitudinal analysis of the division of labor on genomic datasets (1992–2021). Frontiers in Big Data, 6, 1054655.

Sarah Bratt. (2023, January). Articulating Institutionalization: How US Academic Faculty Organize Work to Deposit Data and the Impacts on Long-Term Research Data Sustainability. In The 2023 ACM International Conference on Supporting Group Work (GROUP'23) Companion (pp. 82-84).

Y He, C Impey, Winslow Burleson. StellarScape: An Immersive Multimedia Performance Inspired by the Life of a Star. Leonardo, 2023.

Ruoyao Wang and Peter Jansen. 2023. Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 5555–5565, Singapore. Association for Computational Linguistics.

Wang, R., Todd, G., Yuan, E., Xiao, Z., Côté, M. A., & Peter Jansen. (2023). ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games. arXiv preprint arXiv:2305.14879.

Peter Jansen. (2023). From words to wires: Generating functioning electronic devices from natural language descriptions. arXiv preprint arXiv:2305.14874.

Wang, R., Peter Jansen, Côté, M. A., & Ammanabrolu, P. (2023). Behavior cloned transformers are neurosymbolic reasoners. arXiv preprint arXiv:2210.07382.

Peter Jansen, & Côté, M. A. (2023). TextWorldExpress: Simulating text games at one million steps per second. arXiv preprint arXiv:2208.01174.

Andrew Kemp-Wilcox, "Colonialist and Anti-Colonialist Play in Spirit Island: A Ludo-Textual Analysis." In Heritage, Memory and Identity in Postcolonial Board Games (ed. Michal Mochocki). Routledge, 2023.

Jamie A. Lee, aems emswiler, and Bianca Finley Alper, “Origin Stories and the Shaping of the Community-Based Archives,” Archival Science, vol. 23 (2023), 381-410.

Jamie A. Lee, “Archives as Spaces of Radical Hospitality,” in New Feminist Research Ethics, ed. Maryanne Dever, London: Routledge, 2023.

Jamie A. Lee, Kristen Suagee-Beauduy, and Samantha Montes,Community-Based Archives and Their Pedagogies,” in The Critical Librarianship and Pedagogy Symposium: An Anthology of Works, ACRL Press, 2023.

Zack Lischer-Katz. (2023). Methods for Exploring Indeterminate Textuality in John Cage's Practices of Bibliographic Encoding: The Case of M. Textual Cultures: Texts, Contexts, Interpretation16(2), 180-208. 

Zack Lischer-Katz. (2023). Information Borderlands in the US Southwest. Proceedings of the Association for Information Science and Technology, 60(1), 1055-1058.

Clark, J., & Zack Lischer-Katz. (2023). (In)accessibility and the technocratic library: Addressing institutional failures in library adoption of emerging technologies. First Monday, 28(1).

Alice Saebom Kwak, Cheonkam Jeong, Gaetano Vincent Forte, Derek Bambauer, Clayton T. Morrison, Mihai Surdeanu. Information Extraction from Legal Wills: How Well Does GPT-4 Do? Findings of EMNLP (EMNLP), 2023.

Adarsh Pyarelal, Eric Duong, Caleb Jones Shibu, Paulo Soares, Savannah Boyd, Payal Khosla, Valeria Pfeifer, Diheng Zhang, Eric S. Andrews, Rick Champlin, Vincent Paul Raymond, Meghavarshini Krishnaswamy, Clayton T. Morrison, Emily Butler and Kobus Barnard. The ToMCAT Dataset. Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2023. (poster: https://neurips.cc/virtual/2023/poster/73548)

Dharma KC, Tito Ferra and Clayton T. Morrison. Neural Machine Translation for Recovering ASTs from Binaries. The 3rd IEEE International Conference on Software Engineering and Artificial Intelligence (SEAI), 2023.

Adarsh Pyarelal, E. Duong◦, C. J. Shibu◦, P. Soares◦, S. Boyd, P. Khosla, V. Pfeifer, D. Zhang, E. S. Andrews, R. Champlin, V. P. Raymond, M. Krishnaswamy◦, C. Morrison, E. Butler, and K. Barnard. 2023d. The ToMCAT Dataset. In Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track.

A. Qamar, Adarsh Pyarelal, and R. Huang. Dec. 2023. Who is Speaking? Speaker-Aware Multiparty Dialogue Act Classification. In Findings of the Association for Computational Linguistics: EMNLP 2023. Ed. by H. Bouamor, J. Pino, and K. Bali. Singapore: Association for Computational Linguistics, pp. 10122–10135.

M. M. M. Miah, Adarsh Pyarelal, and R. Huang. Dec. 2023. Hierarchical Fusion for Online Multimodal Dialog Act Classification. In Findings of the Association for Computational Linguistics: EMNLP 2023. Ed. by H. Bouamor, J. Pino, and K. Bali. Singapore: Association for Computational Linguistics, pp. 7532–7545.

Fan, L., Lafia, S., Wofford, M., Thomer, A., Yakel, E., & Hemphill, L. (2023). Mining Semantic Relations in Data References to Understand the Roles of Research Data in Academic Literature. 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 215–227.

King, K. B. S., Giacomini, H. C., Wehrly, K., López‐Fernández, H., Andrea K. Thomer, & Alofs, K. M. (2023). Using historical catch data to evaluate predicted changes in fish relative abundance in response to a warming climate. Ecography, e06798.

Lafia, S., Andrea Thomer, Moss, E., Bleckley, D., & Hemphill, L. (2023). How and Why Do Researchers Reference Data? A Study of Rhetorical Features and Functions of Data References in Academic Articles (1). 22(1), Article 1.

LIS Forward, Acker, A., Aden, C., Bonn, M., Coward, C., Hunt, C., Knox, E., Lankes, D., Martin, M. H., Melo, M., Ndumu, A., Palmer, C. L., Patin, B., Sturm, B., Subramaniam, M., & Andrea K. Thomer. (2023). Ensuring a Vibrant Future for LIS in iSchools, Part I. University of Washington Information School.

Plantin, J.C., & Andrea Thomer. (2023). Platforms, programmability, and precarity: The platformization of research repositories in academic libraries. New Media & Society, 14614448231176758.

Raia, N., Damerow, J., Stanley, V., Lehnert, K., O’Ryan, D., Plomp, E., ESIP Physical Sample Curation Cluster, & Andrea Thomer. (2023). 4 Steps to Publish Open Earth Science Samples. 1253529 Bytes.

Song, H., Cui, H., Vieglais, D., Mandel, D., & Andrea K. Thomer. (2023). Automated Metadata Enhancement for Physical Sample Record Aggregation in the iSamples Project. Proceedings of the Association for Information Science and Technology, 60(1), 1131–1133.

Andrea K. Thomer, & Rayburn, A. J. (2023). “A Patchwork of Data Systems”: Quilting as an Analytic Lens and Stabilizing Practice for Knowledge Infrastructures. Science, Technology, & Human Values, 016224392311755.

Andrea K. Thomer, Wofford, M. F., Lenard, M. C., Dominguez Vidana, S., & Goring, S. J. (2023). Revealing Earth science code and data-use practices using the Throughput Graph Database. In X. Ma, M. Mookerjee, L. Hsu, & D. Hills (Eds.), Recent Advancement in Geoinformatics and Data Science. Geological Society of America.

Wofford, M. F., & Andrea K. Thomer. (2023). Curating for Contrarian Communities: Data Practices of Anthropogenic Climate Change Skeptics. Proceedings of the Association for Information Science and Technology, 60(1), 442–455.


Select 2024 Faculty Presentations

Diana Daly, Pate McMichael, Kate Kenski and Keith Allred. AI & Elections Panel Discussion. AI at Arizona Town Hall Series, The University of Arizona. March 20, 2024

Jamie A. Lee. The Ambivalences of the Unfixed. Center for Archival Futures, University of Maryland. March 2024.


Select 2023 Faculty Presentations

Sarah Bratt. Making Qualitative Data Machine Readable. Society for the Social Studies of Science (4S). 2023.

Sarah Bratt, Charles Gomez, Erin Leahey, Jina Lee, Abhishek Nanoti, Mrudang Langalia. The Division of Labor on Scientific Datasets: Implications for Innovation and Equity. ICSSI 2023, 2023.

Sarah Bratt. Institutionalizing ‘Care’ in Data Curation: Research Data Management Practices for Long-Term Sustainability & Ethical AI. UArizona Sociology Colloquium, 2023.

Diana Daly, Schneider, N., and Ahmad, H. (2023) An Invitation to Shared Governance. Open Education Conference. November 7-9, 2023. Virtual.

Jamie A. Lee. Kairotic & Kin-Centric Archives: Addressing Abundances and Abandonments. Keynote Address, DigitPres 2023, Digital Library Foundation, November 2023.

Jamie A. Lee. Digital Storytelling, Climate Justice, and Coalitional Possibilities. Corbett Lecture, Global Arts + Humanities Discover Theme and the Rhetoric, Composition and Literacy Studies Program, The Ohio State University, April 2023.

Jamie A. Lee. Producing the Archival Body Workshop. The Ohio State University and their Global Arts + Humanities Discovery Theme, February 2023.

Jamie A. Lee. Producing the Archival Body and community collaborations. Rare Book School / Andrew W. Mellon Fellowship for Diversity, Inclusion and Cultural Heritage at The City College of New York and CUNY Dominican Studies Institute, February 2023.

Jamie A. Lee. Producing the Archival Body and Oral History Productions. Oral History Forum Webinar Series, sponsored by University of North Texas through IMLS funding, February 2023.

Zack Lischer-Katz. (2023). Invited panelist on the panel, “Retrospective, subjunctive, prospective: Provenance challenges across time,” R. Bettivia, Y.-Y. Cheng, and M. R. Gryk. (co-organizers). International Conference on Digital Preservation (iPRES) 2023, Champaign-Urbana, IL, Sept. 19-23.

Zack Lischer-Katz. (2023). Resisting surveillance and documentation infrastructures in the U.S. Southwestern Borderlands. Society for the Social Studies of Science (4S) Annual Meeting, Honolulu, HI, Nov. 8-11.

Zack Lischer-Katz, Braggs, R. K., & Carter, B. (2023). Volumetric video for preservation: Exploring the possibilities and challenges for immersive BIPOC storytelling. International Conference on Digital Preservation (iPRES) 2023, Champaign-Urbana, IL, Sept. 19-23.

Chen, A.T., Cole, C.L., Chassanoff, A., Ma, R., Huvila, I., Zack Lischer-Katz, & Krtalić, M. (2023). Exploring collaborative interpretive practice. Association for Information Science & Technology (ASIS&T) Annual Meeting, London, UK, Sept. 27-31. [Workshop organizer and paper/posters co-chair]

Robinson, E., Anderson, J., Buys, M., Chodacki, J., Choe, S., Davies, N., Kansa, S., Lehnert, K., Meyer, C., Praetzellis, M., Andrea Thomer, Vieglais, D., Walls, R., & Wimalaratne, S. (2023, December 14). Exploring the Hidden Afterlives of Material Samples: How the Internet of Samples (iSamples) and Transdisciplinary Collaboration Across the Material Sample Community Enables Open Science with Samples. AGU Fall Meeting, San Francisco, CA. Zenodo.

Thomer, T., Andrea Thomer, Lehnert, K., Vieglais, D., & Davies, Neil. (2023, June 1). Defragmenting physical collections and digital records through the iSamples project. 38th Annual Meeting of the Society for the Preservation of Natural History Collections, San Francisco, CA.

Damerow, J., Ramdeen, S., Stanley, V., & Andrea K. Thomer (2023, January 24). Opening doors for open samples: Developing templates for sample and specimen citation, linking, and credit. 2023 January ESIP Meeting, Virtual.

Lehnert, K., Andrea K. Thomer, Choe, S., & Vieglais, D. (2023, December 13). The System for Earth Sample Registration SESAR: Engaging the Community to Improve Service [Poster Presentation]. AGU Fall Meeting, San Francisco, CA.