Title: Slender-Mamba: Fully Quantized Mamba in 1.58 Bits From Head to Toe
Authors: Zhenxuan Yu, Takeshi Kojima, Yutaka Matsuo, Yusuke Iwasawa
Date: 2025-01
Venue: Proceedings of the 31st International Conference on Computational Linguistics
Editors: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Publisher: Association for Computational Linguistics
Location: Abu Dhabi, UAE
Type: conference publication
Pages: 4715-4724
Identifier: yu-etal-2025-slender
URL: https://aclanthology.org/2025.coling-main.316/