Context-Based URL Classification for Open Access Datasets and Software in Scholarly Documents

Published in the 2025 Web Archiving and Digital Libraries Workshop (WADL@HT 2025), 2025

Rochana R. Obadage, Lamia Salsabil, Sawood Alam, William A. Ingram, Bipasha Banerjee, Edward A. Fox, Jian Wu. Toward Robust URL Extraction for Open Science: A Study of arXiv File Formats and Temporal Trends. In the 2025 Web Archiving and Digital Libraries (WADL@HT 2025) Workshop, co-located with the 36th ACM Conference on Hypertext and Social Media (HT 2025). Chicago, IL, United States.