Exploring Scientific Hypothesis Generation with Mamba

Miaosen Chai, Emily Herron, Erick Cervantes, Tirthankar Ghosal


Abstract
Generating scientifically grounded hypotheses is a challenging frontier task for generative AI models in science. The difficulty arises from the inherent subjectivity of the task and the extensive knowledge of prior work required to assess the validity of a generated hypothesis. Large Language Models (LLMs), trained on vast datasets from diverse sources, have shown a strong ability to utilize the knowledge embedded in their training data. Recent research has explored using transformer-based models for scientific hypothesis generation, leveraging their advanced capabilities. However, these models often require a significant number of parameters to manage Long sequences, which can be a limitation. State Space Models, such as Mamba, offer an alternative by effectively handling very Long sequences with fewer parameters than transformers. In this work, we investigate the use of Mamba for scientific hypothesis generation. Our preliminary findings indicate that Mamba achieves similar performance w.r.t. transformer-based models of similar sizes for a higher-order complex task like hypothesis generation. We have made our code available here: https://github.com/fglx-c/Exploring-Scientific-Hypothesis-Generation-with-Mamba
Anthology ID:
2024.nlp4science-1.17
Volume:
Proceedings of the 1st Workshop on NLP for Science (NLP4Science)
Month:
November
Year:
2024
Address:
Miami, FL, USA
Editors:
Lotem Peled-Cohen, Nitay Calderon, Shir Lissak, Roi Reichart
Venue:
NLP4Science
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
197–207
Language:
URL:
https://aclanthology.org/2024.nlp4science-1.17
DOI:
Bibkey:
Cite (ACL):
Miaosen Chai, Emily Herron, Erick Cervantes, and Tirthankar Ghosal. 2024. Exploring Scientific Hypothesis Generation with Mamba. In Proceedings of the 1st Workshop on NLP for Science (NLP4Science), pages 197–207, Miami, FL, USA. Association for Computational Linguistics.
Cite (Informal):
Exploring Scientific Hypothesis Generation with Mamba (Chai et al., NLP4Science 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.nlp4science-1.17.pdf