<?xml version="1.0" encoding="UTF-8" ?>
<volume id="W17">
  <paper id="7700" href="https://doi.org/10.26615/978-954-452-038-0_">
    <title>Proceedings of the 1st Workshop on Natural Language Processing and Information Retrieval associated with RANLP 2017</title>
    <editor><first>University of Wolverhampton</first><last>Mireille Makary</last></editor>
    <editor><first>University of Wolverhampton</first><last>Michael Oakes</last></editor>
    <month>September</month>
    <year>2017</year>
    <address>Varna, Bulgaria</address>
    <publisher>INCOMA Inc.</publisher>
    <doi>10.26615/978-954-452-038-0_</doi>
    <url>https://doi.org/10.26615/978-954-452-038-0_</url>
    <bibtype>book</bibtype>
    <bibkey>NLPIR:2017</bibkey>
  </paper>

  <paper id="7701" href="https://doi.org/10.26615/978-954-452-038-0_001">
    <title>Deception Detection for the Russian Language: Lexical and Syntactic Parameters</title>
    <author><first>Dina</first><last>Pisarevskaya</last></author>
    <author><first>Tatiana</first><last>Litvinova</last></author>
    <author><first>Olga</first><last>Litvinova</last></author>
    <booktitle>Proceedings of the 1st Workshop on Natural Language Processing and Information Retrieval associated with RANLP 2017</booktitle>
    <month>September</month>
    <year>2017</year>
    <address>Varna, Bulgaria</address>
    <publisher>INCOMA Inc.</publisher>
    <pages>1&#8211;10</pages>
    <doi>10.26615/978-954-452-038-0_001</doi>
    <url>https://doi.org/10.26615/978-954-452-038-0_001</url>
    <abstract>The field of automated deception detection in written texts is methodologically
	challenging. Different linguistic levels (lexics, syntax and semantics) are
	basically used for different  types of English texts to reveal if they are
	truthful or deceptive. Such parameters as POS tags and POS tags n-grams,
	punctuation marks, sentiment polarity of words, psycholinguistic features,
	fragments of syntaсtic structures are taken into consideration. The importance
	of different types of parameters was not compared for the Russian language
	before and should be investigated before moving to complex models and higher
	levels of linguistic processing. On the example of the Russian Deception Bank
	Corpus we estimate the impact of three groups of features (POS features
	including bigrams, sentiment and psycholinguistic features, syntax  and
	readability features) on the successful deception detection and find out that
	POS features can be used for binary text classification, but the results should
	be double-checked and, if possible, improved.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>pisarevskaya-litvinova-litvinova:2017:NLPIR</bibkey>
  </paper>

  <paper id="7702" href="https://doi.org/10.26615/978-954-452-038-0_002">
    <title>oIQa: An Opinion Influence Oriented Question Answering Framework with Applications to Marketing Domain</title>
    <author><first>Dumitru-Clementin</first><last>Cercel</last></author>
    <author><first>Cristian</first><last>Onose</last></author>
    <author><first>Stefan</first><last>Trausan-Matu</last></author>
    <author><first>Florin</first><last>Pop</last></author>
    <booktitle>Proceedings of the 1st Workshop on Natural Language Processing and Information Retrieval associated with RANLP 2017</booktitle>
    <month>September</month>
    <year>2017</year>
    <address>Varna, Bulgaria</address>
    <publisher>INCOMA Inc.</publisher>
    <pages>11&#8211;18</pages>
    <doi>10.26615/978-954-452-038-0_002</doi>
    <url>https://doi.org/10.26615/978-954-452-038-0_002</url>
    <abstract>Understanding questions and answers in QA system is a major challenge in the
	domain of natural language processing. In this paper, we present a question
	answering system that influences the human opinions in a conversation. The
	opinion words are quantified by using a lexicon-based method. We apply Latent
	Semantic Analysis and the cosine similarity measure between candidate answers
	and each question to infer the answer of the chatbot.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>cercel-EtAl:2017:NLPIR</bibkey>
  </paper>

  <paper id="7703" href="https://doi.org/10.26615/978-954-452-038-0_003">
    <title>Automatic Summarization of Online Debates</title>
    <author><first>Nattapong</first><last>Sanchan</last></author>
    <author><first>Ahmet</first><last>Aker</last></author>
    <author><first>Kalina</first><last>Bontcheva</last></author>
    <booktitle>Proceedings of the 1st Workshop on Natural Language Processing and Information Retrieval associated with RANLP 2017</booktitle>
    <month>September</month>
    <year>2017</year>
    <address>Varna, Bulgaria</address>
    <publisher>INCOMA Inc.</publisher>
    <pages>19&#8211;27</pages>
    <doi>10.26615/978-954-452-038-0_003</doi>
    <url>https://doi.org/10.26615/978-954-452-038-0_003</url>
    <abstract>Debate summarization is one of the novel and challenging research areas in
	automatic text summarization which has been largely unexplored. In this paper,
	we develop a debate summarization pipeline to summarize key topics which are
	discussed or argued in the two opposing sides of online debates. We view that
	the generation of debate summaries can be achieved by clustering, cluster
	labeling, and visualization. In our work, we investigate two different
	clustering approaches for the generation of the summaries. In the first
	approach, we generate the summaries by applying purely term-based clustering
	and cluster labeling. The second approach makes use of X-means for clustering
	and Mutual Information for labeling the clusters. Both approaches are driven by
	ontologies.  We visualize the results using bar charts. We think that our
	results are a smooth entry for users aiming to receive the first impression
	about what is discussed within a debate topic containing waste number of
	argumentations.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>sanchan-aker-bontcheva:2017:NLPIR</bibkey>
  </paper>

  <paper id="7704" href="https://doi.org/10.26615/978-954-452-038-0_004">
    <title>A Game with a Purpose for Automatic Detection of Children's Speech Disabilities using Limited Speech Resources</title>
    <author><first>Reem</first><last>Salem</last></author>
    <author><first>Mohamed</first><last>Elmahdy</last></author>
    <author><first>Slim</first><last>Abdennadher</last></author>
    <author><first>Injy</first><last>Hamed</last></author>
    <booktitle>Proceedings of the 1st Workshop on Natural Language Processing and Information Retrieval associated with RANLP 2017</booktitle>
    <month>September</month>
    <year>2017</year>
    <address>Varna, Bulgaria</address>
    <publisher>INCOMA Inc.</publisher>
    <pages>28&#8211;34</pages>
    <doi>10.26615/978-954-452-038-0_004</doi>
    <url>https://doi.org/10.26615/978-954-452-038-0_004</url>
    <abstract>Speech therapists and researchers are becoming more concerned with the use of
	computer-based systems in the therapy of speech disorders. In this paper, we
	propose a computer-based game with a purpose (GWAP) for speech therapy of
	Egyptian speaking children suffering from Dyslalia. Our aim is to detect if a
	certain phoneme is pronounced correctly. An Egyptian Arabic speech corpus has
	been collected. A baseline acoustic model was trained using the Egyptian
	corpus. In order to benefit from existing large amounts of Modern Standard
	Arabic (MSA) resources, MSA acoustic models were adapted with the collected
	Egyptian corpus. An independent testing set that covers common speech disorders
	has been collected for Egyptian speakers. Results show that adapted acoustic
	models give better recognition accuracy which could be relied on in the game
	and that children show more interest in playing the game than in visiting the
	therapist. A noticeable progress in children Dyslalia appeared with the
	proposed system.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>salem-EtAl:2017:NLPIR</bibkey>
  </paper>

</volume>

