Accepted Contributions

December 15-19, 2025

Jump to: Long Papers, Short Papers, Resources, Doctoral Consortium, Posters / Demos, Workshops

Long Papers

  • SciCom Wiki: A Digital Library to Support the Science Communication Knowledge Infrastructure for Videos and Podcasts
    Tim Wittenborg, Niklas Stehr, Oliver Karras, Sören Auer
  • How a Tortured Conference Becomes a Series: An Analysis of Conference Manipulations
    Wendeline Swart, Guillaume Cabanac, Ophélie Fraisier-Vannier, Gilles Hubert
  • CC30k: A Citation Contexts Dataset for Reproducibility-Oriented Sentiment Analysis
    Rochana R. Obadage, Sarah Rajtmajer, Jian Wu
  • Beyond CER and WER: How Does OCR Really Impact Information Retrieval?
    Alexandre Jaud, Ahmed Hamdi, Antoine Doucet, Adam Jatowt, Mickaël Coustaty
  • LLM-Generated Description and Reasoning: Use-case for Library Recommendations
    Arash Sal Moslehian, Eya Briki, Michalis Vlachos
  • Adaptive Progressive Fine-Tuning of VLMs for Long-Tailed Multimodal Retrieval
    Farid Alijani, Elina Late, Sanna Kumpulainen
  • Resilience, Volume, and Temporal Trends Across 25 Years of the Wayback Machine
    Kritika Garg, Sawood Alam, Dietrich Ayala, Michele Weigle, Michael Nelson
  • The Indispensable Ren: A Multi-Method Study of Confucian Ethical Configurations in Trolling with Misinformation
    Xiao-Liang Shen, Qian Wen Qian, Jian Mou
  • How to Evaluate the Carbon Footprint of Digital Libraries Services: The Case of Citation Verification
    Aysegul Demir, Cyril Labbé, Qinyue Liu
  • ARTO: An Artwork Object Ontology for Descriptive and Contextual Representation
    Can Yang, Bernardo Pereira Nunes, Sergio Rodríguez Méndez, Yige Chen, Rubén Manrique, Marco Antonio Casanova
  • Scaffolding Inquiry-Oriented Web Search using LLM-based Question Generation
    Yusuke Yamamoto
  • Multi-Disciplinary Dataset Discovery from Citation-Verified Literature Contexts
    Zhiyin Tan, Changxu Duan
  • Screening Crossref for Integrity Issues: Massive and Longitudinal Mining of Publication Metadata
    Jules di Scala, Guillaume Cabanac, Ophélie Fraisier-Vannier, Ronan Tournier
  • Learning from LLM Disagreement in Retrieval Evaluation
    William Ingram, Bipasha Banerjee, Edward Fox
  • A Multi-Agent System for Complex Reasoning in Radiology Visual Question Answering
    Ziruo Yi, Jinyu Liu, Mark Albert, Ting Xiao
  • Generation, Evaluation, and Explanation of Novelists’ Styles with Single-Token Prompts
    Mosab Rezaei, Mina Rajaei Moghadam, Abdul Rahman Shaikh, Hamed Alhoori, Reva Freedman
  • CiteScreener: A Pipeline for Citation Verification in Digital Libraries with Datasets
    Qinyue Liu, Yağmur Öztürk, Tiziri Terkmani, François Portet, Cyril Labbé
  • A Self-Questioning Framework Towards Knowledge Self-Organization in Children’s Readings via Prompt Learning and Fine-tuning
    Jiacheng Yao, Guoxiu He, Xin Xu
  • Identifying Future Work Chapters in Electronic Theses and Dissertations
    Amr Aboelnaga, William Ingram, Hajra Klair, Hoda Eldardiry
  • SciKGDash: The Scientific Knowledge Graph Dashboard for Supporting Knowledge Curation
    Lena John, Sören Auer, Oliver Karras
  • Context-Based URL Classification for Open Access Datasets and Software in Scholarly Documents
    Lamia Salsabil, Rochana R. Obadage, Bipasha Banerjee, Yasasi Achinthya Abeysinghe, Sawood Alam, Michael Färber, William Ingram, Edward Fox, Jian Wu
  • ReviewGuard: Enhancing Deficient Peer Review Detection via LLM-Driven Data Augmentation
    Haoxuan Zhang, Ruochi Li, Sarthak Shrestha, Shree Harshini Mamidala, Revanth Putta, Arka Krishan Aggarwal, Ting Xiao, Junhua Ding, Haihua Chen
  • Linking Policies for the Permanent Web
    Quang Bui, Alexander Grigorian, Alexey Kuraev, Thiyazan Salman, Tu Nguyen, Mat Kelly, Sawood Alam
  • Age-Specific Fine-Tuning of Large Language Models with Sentiment-Guided Emotion Vectors for Teen Book Recommendations
    Yiu-Kai Ng

Short Papers

  • How Devices Shape Mental Effort in Digital Document Reading: An Eye-Tracking Study
    Kumushini Thennakoon, Yasasi Abeysinghe, Pasindu Thenahandi, Lawrence Obiuwevwi, Vikas Ashok, Sampath Jayarathna
  • NIED: A Corpus for Numeric Information Extraction from Dataset Descriptions
    Moriyuki Kamoto, Akihiro Tamura, Marie Katsurai
  • Detecting Framing Bias in News via Probabilistic Graphical Modeling
    Ginel Dorleon, Shirin Shujaa
  • OntExtract: PROV-O Provenance Tracking for Document Analysis Workflows
    Christopher Rauch, Hyung Wook Choi, Mat Kelly
  • K-Span Select and Multi-Dimensional Judging for Reliable Scholarly Question Answering
    Preetam Pati, Sayan De, Saurabh Tiwari, Debarshi Kumar Sanyal, Imon Mukherjee
  • What Lies Beneath: A Call for Distribution-based Visual Question & Answer Datasets
    Jill Naiman, Daniel Evans, Jooyoung Seo
  • Identifying and Classifying Software Mentions in Full Text Scholarly Documents
    David Pride, Matteo Guenci, Martin Docekal, Silvio Peroni, Petr Knoth
  • Efficient Citation Screening by Weak Classifier Ensemble
    Xiaorui Jiang, Opeoluwa Akinseloyin, Vasile Palade
  • Better by Comparison: Retrieval-Augmented Contrastive Reasoning for Automatic Prompt Optimization
    Wonduk Seo, Juhyeon Lee, Hyunjin An, Seunghyun Lee, Yi Bu
  • Pre-Crawl Prioritization and Seed Classification for Large-Scale Web Archiving
    Akshith Garapati, Sawood Alam

Resources

  • A Longitudinal Dataset of URLs Sampled From the Wayback Machine
    Kritika Garg, Sawood Alam, Dietrich Ayala, Michele Weigle, Michael Nelson
  • BookReconciler: An Open-Source Tool for Metadata Enrichment and Work-Level Clustering
    Matt Miller, Dan Sinykin, Melanie Walsh
  • Fact-Stance: A Stance-Aware Dataset of Structured Scientific Claims with LLM Annotators
    Xin Lin, Yang Zhao, Zhixiong Zhang, Yajiao Wang, Yang Li, Mengting Zhang
  • PaperCross: A Cross-Document and Multi-Modal Question Answering Benchmark for Scientific Papers
    Guangyin Zhang, Liping Gu, Yang Li, Meng Wang, Mengting Zhang
  • End of Term Web Archive Dataset: Documenting the 2024 Presidential Transition
    Mark Phillips, Kristy Phillips, Sawood Alam
  • MAT-VB: MAthematical Text-Vision Benchmark
    Behrooz Mansouri, Aidan Bell, Nicholas Largey, Abigail Pitcairn

Doctoral Consortium

  • The Ethical Duality of Human-AI Interaction: How Human and AI Ethics Shape Collaborative Rumor Combating Behavior
    Qian Wen Qian
  • Fostering Knowledge Infrastructures in Science Communication and Aerospace Engineering
    Tim Wittenborg
  • AI Agents as Knowledge Explorers: A Case Study of 398 U.S. Case Law Quotations
    Jeremiah Milbauer
  • Equipping a Virtual Reading Promoter for Digital Libraries with Retrieval-Augmented Generation
    Zhenyu Li

Posters / Demos

  • Multimodal Emotion Classification in Artwork: A Comparative Study Across Modalities
    Clayton Durepos, Abigail Pitcairn, Behrooz Mansouri
  • A Lexically-Driven Adaptive Validation Framework for Search-Based Reference Matching
    Katsuyuki Hirai, Teruhito Kanazawa, Takahiro Hayashi
  • LLM-Based Active Learning for Identifying References to Archival Repositories
    Tokinori Suzuki
  • From Speech to LaTeX: Large Language Models for Mathematical Accessibility in Digital Libraries
    Abigail Pitcairn, Clayton Durepos, Nicholas Largey, Behrooz Mansouri
  • Revisiting the Link Accessibility Problem in Scholarly Papers with PLoS ONE Papers
    Tyler Chen, Jian Wu
  • Extending the Usability of Digital Music Libraries with Analytical Interfaces: Case Study – Traditional Music Collections
    Anna Maria Matuszewska
  • REACT-EXTRACT: A Tool for Source-Grounded Automated Data Extraction in Systematic Reviews
    Sebastian Krawczyk, Paweł Jemioło, Jan Karkowski, Ilinka Ivanoska, Miroslav Mirchev, Wojciech Kusa
  • Preserving and Retrieving Hindustani Music Notation: A Symbolic Music Processing Architecture for Digital Libraries
    Chandan Misra, Attreye Chakraborty, Shreya Dutta
  • MathMex-V2: A Large Language Model Enabled Math Search Engine
    Clayton Durepos, Ian McLaughlin, Connor Lund, Anthony Sienbenmorgen, Nicholas Largey, Abigail Pitcairn, Behrooz Mansouri

Workshops

  • Smarter Extraction of ScholArly MEtadata using Knowledge Graphs and Language Models (SESAME)
  • The 2nd International Workshop on Artificial Intelligence for the Science of Science (AI4SciSci)
  • Beyond Search Engines: Intelligent Knowledge Discovery From Scholarly Publications
  • 3rd International Workshop on Digital Language Archives (LangArc)