ACL Anthology
News
(current)
FAQ
(current)
Corrections
(current)
Submissions
(current)
GitHub
Brett H.
Meyer
2024
pdf
bib
Intermediate Layer Distillation with the Reused Teacher Classifier: A Study on the Importance of the Classifier of Attention-based Models
Hang Zhang
|
Seyyed Hasan Mozafari
|
James J. Clark
|
Brett H. Meyer
|
Warren J. Gross
Findings of the Association for Computational Linguistics: EMNLP 2024
Search
Co-authors
James J. Clark
1
Warren J. Gross
1
Seyyed Hasan Mozafari
1
Hang Zhang
1
Venues
findings
1
Fix author