Ken Gu


2021

A Package for Learning on Tabular and Text Data with Transformers
Ken Gu | Akshay Budhkar
Proceedings of the Third Workshop on Multimodal Artificial Intelligence

Recent progress in natural language processing has led to Transformer architectures becoming the predominant model used for natural language tasks. However, many real-world datasets include additional modalities that the Transformer does not directly leverage. We present Multimodal-Toolkit, an open-source Python package to incorporate text and tabular (categorical and numerical) data with Transformers for downstream applications. Our toolkit integrates well with Hugging Face’s existing API, such as tokenization and the model hub, which allows easy download of different pre-trained models.
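
To make the idea of combining text with tabular features concrete, here is a minimal sketch in plain Python. It is not the toolkit's API: the helper names (`one_hot`, `standardize`, `combine`), the example values, and the choice of simple concatenation are all assumptions made for illustration, and the `text_embedding` list merely stands in for the pooled output a Transformer would produce.

```python
from typing import List, Sequence


def one_hot(value: str, vocabulary: Sequence[str]) -> List[float]:
    """Encode a categorical value as a one-hot vector over a fixed vocabulary."""
    return [1.0 if value == item else 0.0 for item in vocabulary]


def standardize(value: float, mean: float, std: float) -> float:
    """Scale a numerical feature using a given mean and standard deviation."""
    return (value - mean) / std


def combine(
    text_embedding: Sequence[float],
    categorical: Sequence[float],
    numerical: Sequence[float],
) -> List[float]:
    """Concatenate text, categorical, and numerical features into one vector."""
    return [*text_embedding, *categorical, *numerical]


# Hypothetical placeholder for a pooled Transformer output (normally much larger).
text_embedding = [0.12, -0.48, 0.91, 0.05]

# Hypothetical tabular columns: a product category and a price.
category = one_hot("shoes", ["shirts", "shoes", "hats"])
price = standardize(30.0, mean=20.0, std=5.0)

# The combined vector is what a downstream classifier or regressor would consume.
combined = combine(text_embedding, category, [price])
print(len(combined))
```

Concatenation is only one possible way to merge the modalities; it is used here because it is the simplest to read.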