Efficient Context-Limited Telescope Bibliography Classification for the WASP-2025 Shared Task Using SciBERT

Madhusudhan Naidu

Efficient Context-Limited Telescope Bibliography Classification for the WASP-2025 Shared Task Using SciBERT

Abstract

The creation of telescope bibliographies is a crucial part of assessing the scientific impact of observatories and ensuring reproducibility in astronomy. This task involves identifying, categorizing, and linking scientific publications that reference or use specific telescopes. However, this process remains largely manual and resource intensive. In this work, we present an efficient SciBERT-based approach for automatic classification of scientific papers into four categories — science, instrumentation, mention, and not telescope. Despite strict context-length constraints (maximum 512 tokens) and limited compute resources, our approach achieved a macro F1 score of 0.89, ranking at the top of the WASP-2025 leaderboard. We analyze the effect of truncation and show that even with half the samples exceeding the token limit, SciBERT’s domain alignment enables robust classification. We discuss trade-offs between truncation, chunking, and long-context models, providing insights into the efficiency frontier for scientific text curation.

Anthology ID:: 2025.wasp-main.21
Volume:: Proceedings of the Third Workshop for Artificial Intelligence for Scientific Publications
Month:: December
Year:: 2025
Address:: Mumbai, India and virtual
Editors:: Alberto Accomazzi, Tirthankar Ghosal, Felix Grezes, Kelly Lockhart
Venues:: WASP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 192–194
Language:
URL:: https://aclanthology.org/2025.wasp-main.21/
DOI:
Bibkey:
Cite (ACL):: Madhusudhan Naidu. 2025. Efficient Context-Limited Telescope Bibliography Classification for the WASP-2025 Shared Task Using SciBERT. In Proceedings of the Third Workshop for Artificial Intelligence for Scientific Publications, pages 192–194, Mumbai, India and virtual. Association for Computational Linguistics.
Cite (Informal):: Efficient Context-Limited Telescope Bibliography Classification for the WASP-2025 Shared Task Using SciBERT (Naidu, WASP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.wasp-main.21.pdf

PDF Cite Search Fix data