Jump to: Long Papers, Short Papers, Resources, Doctoral Consortium, Poster/Demos, Workshops
Long Papers
- SciCom Wiki: A Digital Library to Support the Science Communication Knowledge Infrastructure for Videos and Podcasts
- How a Tortured Conference Becomes a Series: An Analysis of Conference Manipulations
- CC30k: A Citation Contexts Dataset for Reproducibility-Oriented Sentiment Analysis
- Beyond CER and WER: How Does OCR Really Impact Information Retrieval?
- LLM-Generated Description and Reasoning: Use-case for Library Recommendations
- Adaptive Progressive Fine-Tuning of VLMs for Long-Tailed Multimodal Retrieval
- 28 Resilience, Volume, and Temporal Trends Across 25 Years of the Wayback Machine
- The Indispensable Ren: A Multi-Method Study of Confucian Ethical Configurations in Trolling with Misinformation
- How to Evaluate the Carbon Footprint of Digital Libraries Services: The Case of Citation Verification
- ARTO: An Artwork Object Ontology for Descriptive and Contextual Representation
- Scaffolding Inquiry-Oriented Web Search using LLM-based Question Generation
- Multi-Disciplinary Dataset Discovery from Citation-Verified Literature Contexts
- Screening Crossref for Integrity Issues: Massive and Longitudinal Mining of Publication Metadata
- Learning from LLM Disagreement in Retrieval Evaluation
- A Multi-Agent System for Complex Reasoning in Radiology Visual Question Answering
- Generation, Evaluation, and Explanation of Novelists’ Styles with Single-Token Prompts
- CiteScreener: A Pipeline for Citation Verification in Digital Libraries with Datasets
- A Self-Questioning Framework Towards Knowledge Self-Organization in Children’s Readings via Prompt Learning and Fine-tuning
- Identifying Future Work Chapters in Electronic Theses and Dissertations
- SciKGDash: The Scientific Knowledge Graph Dashboard for Supporting Knowledge Curation
- Context-Based URL Classification for Open Access Datasets and Software in Scholarly Documents
- ReviewGuard: Enhancing Deficient Peer Review Detection via LLM-Driven Data Augmentation
- Linking Policies for the Permanent Web
- Age-Specific Fine-Tuning of Large Language Models with Sentiment-Guided Emotion Vectors for Teen Book Recommendations
Short Papers
- How Devices Shape Mental Effort in Digital Document Reading: An Eye-Tracking Study
- NIED: A Corpus for Numeric Information Extraction from Dataset Descriptions
- Detecting Framing Bias in News via Probabilistic Graphical Modeling
- OntExtract: PROV-O Provenance Tracking for Document Analysis Workflows
- K-Span Select and Multi-Dimensional Judging for Reliable Scholarly Question Answering
- What Lies Beneath: A Call for Distribution-based Visual Question & Answer Datasets
- Identifying and Classifying Software Mentions in Full Text Scholarly Documents
- Efficient Citation Screening by Weak Classifier Ensemble
- Better by Comparison: Retrieval-Augmented Contrastive Reasoning for Automatic Prompt Optimization
- Pre-Crawl Prioritization and Seed Classification for Large-Scale Web Archiving
Resources
- A Longitudinal Dataset of URLs Sampled From the Wayback Machine
- BookReconciler: An Open-Source Tool for Metadata Enrichment and Work-Level Clustering
- Fact-Stance: A Stance-Aware Dataset of Structured Scientific Claims with LLM Annotators
- PaperCross: A Cross-Document and Multi-Modal Question Answering Benchmark for Scientific Papers
- End of Term Web Archive Dataset: Documenting the 2024 Presidential Transition
- MAT-VB: MAthematical Text-Vision Benchmark
Doctoral Consortium
- The Ethical Duality of Human-AI Interaction: How Human and AI Ethics Shape Collaborative Rumor Combating Behavior
- Fostering Knowledge Infrastructures in Science Communication and Aerospace Engineering
- AI Agents as Knowledge Explorers: A Case Study of 398 U.S. Case Law Quotations
- Equipping a Virtual Reading Promoter for Digital Libraries with Retrieval-Augmented Generation
Posters / Demos
- Multimodal Emotion Classification in Artwork: A Comparative Study Across Modalities
- A Lexically-Driven Adaptive Validation Framework for Search-Based Reference Matching
- LLM-Based Active Learning for Identifying References to Archival Repositories
- From Speech to LaTeX: Large Language Models for Mathematical Accessibility in Digital Libraries
- Revisiting the Link Accessibility Problem in Scholarly Papers with PLoS ONE Papers
- Extending the Usability of Digital Music Libraries with Analytical Interfaces: Case Study – Traditional Music Collections
- REACT-EXTRACT: A Tool for Source-Grounded Automated Data Extraction in Systematic Reviews
- Preserving and Retrieving Hindustani Music Notation: A Symbolic Music Processing Architecture for Digital Libraries
- MathMex-V2: A Large Language Model Enabled Math Search Engine
Workshops
Smarter Extraction of ScholArly MEtadata using Knowledge Graphs and Language Models (SESAME)
- Organizers: Muhammad Asif Suryani, Brigitte Mathiak, Florian Reitz, Florian Jäckel and Ansgar Scherp
- Web: https://sesame-workshop.github.io/SESAME/
The 2nd International Workshop on Artificial Intelligence for the Science of Science (AI4SciSci)
- Organizers: Jian Wu, Sarah Rajtmajer, Yian Yin, Yi He and Staša Milojević
- Web: https://ai4scisci.github.io/2025/
Beyond Search Engines: Intelligent Knowledge Discovery From Scholarly Publications
- Organizers: Gautam Kishore Shahi and Oliver Hummel
- Web: http://besides-workshop.github.io/
3rd International Workshop on Digital Language Archives (LangArc)
- Organizers: Oksana L. Zavalina, Shobhana L. Chelliah and Mary Burke
- Web: https://sites.google.com/view/langarc-2025




