Mohammad Bahrani
Also published as: M. Bahrani
2024
Persian Abstract Meaning Representation: Annotation Guidelines and Gold Standard Dataset
Reza Takhshid
|
Tara Azin
|
Razieh Shojaei
|
Mohammad Bahrani
Proceedings of the 2024 UMR Parsing Workshop
This paper introduces the Persian Abstract Meaning Representation (AMR) guidelines, a detailed guide for annotating Persian sentences with AMR, focusing on the necessary adaptations to fit Persian’s unique syntactic structures. We discuss the development process of a Persian AMR gold standard dataset consisting of 1562 sentences created following the guidelines. By examining the language specifications and nuances that distinguish AMR annotations of a low-resource language like Persian, we shed light on the challenges and limitations of developing a universal meaning representation framework. The guidelines and the dataset introduced in this study highlight such challenges, aiming to advance the field.
2023
Developing an Annotated Persian Dataset from COVID-19 News for Enhanced Fake News Detection
Foroogh Zahed
|
Seyedeh Fatemeh Ebrahimi
|
Mohammad Bahrani
|
Alireza Mansouri
Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation
2006
Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems
M. Bahrani
|
H. Sameti
|
N. Hafezi
|
H. Movasagh
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
In this paper building statistical language models for Persian language using a corpus and incorporating them in Persian continuous speech recognition (CSR) system are described. We used Persian Text Corpus for building the language models. First we preprocessed the texts of corpus by correcting the different orthography of words. Also, the number of POS tags was decreased by clustering POS tags manually. Then we extracted word based monogram and POS-based bigram and trigram language models from the corpus. We also present the procedure of incorporating language models in a Persian CSR system. By using the language models 27.4% reduction in word error rate was achieved in the best case.
Search
Fix data
Co-authors
- Tara Azin 1
- Seyedeh Fatemeh Ebrahimi 1
- Nazila Hafezi 1
- Alireza Mansouri 1
- Hamed Movasagh 1
- show all...