<?xml version="1.0" encoding="UTF-8" ?>
<volume id="W18">
  <paper id="5800">
    <title>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</title>
    <editor>Sandra Kuebler</editor>
    <editor>Garrett Nicolai</editor>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <url>http://www.aclweb.org/anthology/W18-58</url>
    <bibtype>book</bibtype>
    <bibkey>SIGMORPHON:2018</bibkey>
  </paper>

  <paper id="5801">
    <title>Efficient Computation of Implicational Universals in Constraint-Based Phonology Through the Hyperplane Separation Theorem</title>
    <author><first>Giorgio</first><last>Magri</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>1&#8211;10</pages>
    <url>http://www.aclweb.org/anthology/W18-5801</url>
    <abstract>This paper focuses on the most basic implicational universals in phonological theory, called T-orders after Anttila and Andrus (2006). It develops necessary and sufficient constraint characterizations of T-orders within Harmonic Grammar and Optimality Theory. These conditions rest on the rich convex geometry underlying these frameworks. They are phonologically intuitive and have significant algorithmic implications.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>magri:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5802">
    <title>Lexical Networks in !Xung</title>
    <author><first>Syed-Amad</first><last>Hussain</last></author>
    <author><first>Micha</first><last>Elsner</last></author>
    <author><first>Amanda</first><last>Miller</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>11&#8211;20</pages>
    <url>http://www.aclweb.org/anthology/W18-5802</url>
    <abstract>We investigate the lexical network properties of the large phoneme inventory Southern African language Mangetti Dune !Xung as it compares to English and other commonly-studied languages. Lexical networks are graphs in which nodes (words) are linked to their minimal pairs; global properties of these networks are believed to mediate lexical access in the minds of speakers. We show that the network properties of !Xung are within the range found in previously-studied languages. By simulating data ("pseudolexicons") with varying levels of phonotactic structure, we find that the lexical network properties of !Xung diverge from previously-studied languages when fewer phonotactic constraints are retained. We conclude that lexical network properties are representative of an underlying cognitive structure which is necessary for efficient word retrieval and that the phonotactics of !Xung may be shaped by a selective pressure which preserves network properties within this cognitively useful range.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>hussain-elsner-miller:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5803">
    <title>Acoustic Word Disambiguation with Phonogical Features in Danish ASR</title>
    <author><first>Andreas Søeborg</first><last>Kirkedal</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>21&#8211;31</pages>
    <url>http://www.aclweb.org/anthology/W18-5803</url>
    <abstract>Phonological features can indicate word class and we can use word class information to disambiguate both homophones and homographs in automatic speech recognition (ASR). </abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>kirkedal:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5804">
    <title>Adaptor Grammars for the Linguist: Word Segmentation Experiments for Very Low-Resource Languages</title>
    <author><first>Pierre</first><last>Godard</last></author>
    <author><first>Laurent</first><last>Besacier</last></author>
    <author><first>François</first><last>Yvon</last></author>
    <author><first>Martine</first><last>Adda-Decker</last></author>
    <author><first>Gilles</first><last>Adda</last></author>
    <author><first>H&#233;lène</first><last>Maynard</last></author>
    <author><first>Annie</first><last>Rialland</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>32&#8211;42</pages>
    <url>http://www.aclweb.org/anthology/W18-5804</url>
    <abstract>Computational Language Documentation attempts to make the most recent research in speech and language technologies available to linguists working on language preservation and documentation. In this paper, we pursue two main goals along these lines. The first is to improve upon a strong baseline for the unsupervised word discovery task on two very low-resource Bantu languages, taking advantage of the expertise of linguists on these particular languages. The second consists in exploring the Adaptor Grammar framework as a decision and prediction tool for linguists studying a new language. We experiment 162 grammar configurations for each language and show that using Adaptor Grammars for word segmentation enables us to test hypotheses about a language. Specializing a generic grammar with language specific knowledge leads to great improvements for the word discovery task, ultimately achieving a leap of about 30% token F-score from the results of a strong baseline.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>godard-EtAl:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5805">
    <title>String Transduction with Target Language Models and Insertion Handling</title>
    <author><first>Garrett</first><last>Nicolai</last></author>
    <author><first>Saeed</first><last>Najafi</last></author>
    <author><first>Grzegorz</first><last>Kondrak</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>43&#8211;53</pages>
    <url>http://www.aclweb.org/anthology/W18-5805</url>
    <abstract>Many character-level tasks can be framed as</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>nicolai-najafi-kondrak:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5806">
    <title>Complementary Strategies for Low Resourced Morphological Modeling</title>
    <author><first>Alexander</first><last>Erdmann</last></author>
    <author><first>Nizar</first><last>Habash</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>54&#8211;65</pages>
    <url>http://www.aclweb.org/anthology/W18-5806</url>
    <abstract>Morphologically rich languages are challenging for natural language processing tasks due to data sparsity. This can be addressed either by introducing out-of-context morphological knowledge, or by developing machine learning architectures that specifically target data sparsity and/or morphological information. We find these approaches to complement each other in a morphological paradigm modeling task in Modern Standard Arabic, which, in addition to being morphologically complex, features ubiquitous ambiguity, exacerbating sparsity with noise. Given a small number of out-of-context rules describing closed class morphology, we combine them with word embeddings leveraging subword strings and noise reduction techniques. The combination outperforms both approaches individually by about 20% absolute. While morphological resources already exist for Modern Standard Arabic, our results inform how comparable resources might be constructed for non-standard dialects or any morphologically rich, low resourced language, given scarcity of time and</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>erdmann-habash:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5807">
    <title>Modeling Reduplication with 2-way Finite-State Transducers</title>
    <author><first>Hossep</first><last>Dolatian</last></author>
    <author><first>Jeffrey</first><last>Heinz</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>66&#8211;77</pages>
    <url>http://www.aclweb.org/anthology/W18-5807</url>
    <abstract>This article describes a novel approach to the computational modeling of reduplication. Reduplication is a well-studied linguistic phenomenon. However, it is often treated as a stumbling block within finite-state treatments of morphology. Most finite-state implementations of computational morphology cannot adequately capture the productivity of unbounded copying in reduplication, nor can they adequately capture bounded copying. We show that an understudied type of finite-state machines, two-way finite-state transducers (2-way FSTs), captures virtually all reduplicative processes, including total reduplication. 2-way FSTs can model reduplicative typology in a way which is convenient, easy to design and debug in practice, and linguistically-motivated. By virtue of being finite-state, 2-way FSTs are likewise incorporable into existing finite-state systems and programs. A small but representative typology of reduplicative processes is described in this article, alongside their corresponding 2-way FST models.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>dolatian-heinz:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5808">
    <title>Automatically Tailoring Unsupervised Morphological Segmentation to the Language</title>
    <author><first>Ramy</first><last>Eskander</last></author>
    <author><first>Owen</first><last>Rambow</last></author>
    <author><first>Smaranda</first><last>Muresan</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>78&#8211;83</pages>
    <url>http://www.aclweb.org/anthology/W18-5808</url>
    <abstract>Morphological segmentation is beneficial for several natural language processing tasks dealing with large vocabularies. Unsupervised methods for morphological segmentation are essential for handling a diverse set of languages, including low-resource languages. Eskander et al. (2016) introduced a Language Independent Morphological Segmenter (LIMS) using Adaptor Grammars (AG) based on the best-on-average performing AG configuration. However, while LIMS worked best on average and outperforms other state-of-the-art unsupervised morphological segmentation approaches, it did not provide the optimal AG configuration for five out of the six languages. We propose two language-independent classifiers that enable the selection of the optimal or nearly-optimal configuration for the morphological segmentation of unseen languages.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>eskander-rambow-muresan:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5809">
    <title>A Comparison of Entity Matching Methods between English and Japanese Katakana</title>
    <author><first>Michiharu</first><last>Yamashita</last></author>
    <author><first>Hideki</first><last>Awashima</last></author>
    <author><first>Hidekazu</first><last>Oiwa</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>84&#8211;92</pages>
    <url>http://www.aclweb.org/anthology/W18-5809</url>
    <abstract>Japanese Katakana is one component of the Japanese writing system and is used to express English terms, loanwords, and onomatopoeia in Japanese characters based on the phonemes. The main purpose of this research is to find the best entity matching methods between English and Katakana. We built two research questions to clarify which types of entity matching systems works better than others. The first question is what transliteration should be used for conversion. We need to transliterate English or Katakana terms into the same form in order to compute the string similarity. We consider five conversions that transliterate English to Katakana directly, Katakana to English directly, English to Katakana via phoneme, Katakana to English via phoneme, and both English and Katakana to phoneme. The second question is what should be used for the similarity measure at entity matching. To investigate the problem, we choose six methods, which are Overlap Coefficient, Cosine, Jaccard, Jaro-Winkler, Levenshtein, and the similarity of the phoneme probability predicted by RNN. Our results show that 1) matching using phonemes and conversion of Katakana to English works better than other methods, and 2) the similarity of phonemes outperforms other methods while other similarity score is changed depending on data and models.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>yamashita-awashima-oiwa:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5810">
    <title>Seq2Seq Models with Dropout can Learn Generalizable Reduplication</title>
    <author><first>Brandon</first><last>Prickett</last></author>
    <author><first>Aaron</first><last>Traylor</last></author>
    <author><first>Joe</first><last>Pater</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>93&#8211;100</pages>
    <url>http://www.aclweb.org/anthology/W18-5810</url>
    <abstract>Natural language reduplication can pose a challenge to neural models of language, and has been argued to require variables (Marcus et al., 1999). Sequence-to-sequence neural networks have been shown to perform well at a number of other morphological tasks (Cotterell et al., 2016), and produce results that highly correlate with human behavior (Kirov, 2017; Kirov &#38; Cotterell, 2018) but do not include any explicit variables in their architecture. We find that they can learn a reduplicative pattern that generalizes to novel segments if they are trained with dropout (Srivastava et al., 2014). We argue that this matches the scope of generalization observed in human reduplication.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>prickett-traylor-pater:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5811">
    <title>A Characterwise Windowed Approach to Hebrew Morphological Segmentation</title>
    <author><first>Amir</first><last>Zeldes</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>101&#8211;110</pages>
    <url>http://www.aclweb.org/anthology/W18-5811</url>
    <abstract>This paper presents a novel approach to the segmentation of orthographic word forms in contemporary Hebrew, focusing purely on splitting without carrying out morphological analysis or disambiguation. Casting the analysis task as character-wise binary classification and using adjacent character and word-based lexicon-lookup features, this approach achieves over 98% accuracy on the benchmark SPMRL shared task data for Hebrew, and 97% accuracy on a new out of domain Wikipedia dataset, an improvement of ~4% and 5% over previous state of the art performance.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>zeldes:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5812">
    <title>Phonetic Vector Representations for Sound Sequence Alignment</title>
    <author><first>Pavel</first><last>Sofroniev</last></author>
    <author><first>Çağrı</first><last>Çöltekin</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>111&#8211;116</pages>
    <url>http://www.aclweb.org/anthology/W18-5812</url>
    <abstract>This study explores a number of data-driven vector representations of the</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>sofroniev-ltekin:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5813">
    <title>Sounds Wilde. Phonetically Extended Embeddings for Author-Stylized Poetry Generation</title>
    <author><first>Aleksey</first><last>Tikhonov</last></author>
    <author><first>Ivan</first><last>Yamshchikov</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>117&#8211;124</pages>
    <url>http://www.aclweb.org/anthology/W18-5813</url>
    <abstract>This paper addresses author-stylized text generation. Using a version of a language model with extended phonetic and semantic embeddings for poetry generation we show that phonetics has comparable contribution to the overall model performance as the information on the target author. Phonetic information is shown to be important for English and Russian language. Humans tend to attribute machine generated texts to the target author.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>tikhonov-yamshchikov:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5814">
    <title>On Hapax Legomena and Morphological Productivity</title>
    <author><first>Janet</first><last>Pierrehumbert</last></author>
    <author><first>Ramon</first><last>Granell</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>125&#8211;130</pages>
    <url>http://www.aclweb.org/anthology/W18-5814</url>
    <abstract>Quantifying and predicting morphological productivity is a long-standing challenge in corpus linguistics and psycholinguistics. The same challenge reappears in natural language processing in the context of handling words that were not seen in the training set (out-of-vocabulary, or OOV, words). Prior research showed that a good indicator of the productivity of a morpheme is the number of words involving it that occur exactly once (the hapax legomena). A technical connection was adduced between this result and Good-Turing smoothing, which assigns probability mass to unseen events on the basis of the simplifying assumption that word frequencies are stationary. In a large-scale study of 133 affixes in Wikipedia, we develop evidence that success in fact depends on tapping the frequency range in which the assumptions of Good-Turing are violated.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>pierrehumbert-granell:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5815">
    <title>A Morphological Analyzer for Shipibo-Konibo</title>
    <author><first>Ronald</first><last>Cardenas</last></author>
    <author><first>Daniel</first><last>Zeman</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>131&#8211;139</pages>
    <url>http://www.aclweb.org/anthology/W18-5815</url>
    <abstract>We present a morphological analyzer for Shipibo-Konibo, a low-resourced native language spoken in the Amazonian region of Peru.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>cardenas-zeman:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5816">
    <title>An Arabic Morphological Analyzer and Generator with Copious Features</title>
    <author><first>Dima</first><last>Taji</last></author>
    <author><first>Salam</first><last>Khalifa</last></author>
    <author><first>Ossama</first><last>Obeid</last></author>
    <author><first>Fadhl</first><last>Eryani</last></author>
    <author><first>Nizar</first><last>Habash</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>140&#8211;150</pages>
    <url>http://www.aclweb.org/anthology/W18-5816</url>
    <abstract>We introduce CALIMA-Star, a very rich Arabic morphological analyzer and generator that provides functional and form-based morphological features as well as built-in tokenization, phonological representation, lexical rationality and much more. This tool includes a fast engine that can be easily integrated into other systems, as well as an easy-to-use API and a web interface. CALIMA-Star also supports morphological reinflection. We evaluate CALIMA-Star against four commonly used analyzers for Arabic in terms of speed and morphological content.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>taji-EtAl:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5817">
    <title>Sanskrit n-Retroflexion is Input-Output Tier-Based Strictly Local</title>
    <author><first>Thomas</first><last>Graf</last></author>
    <author><first>Connor</first><last>Mayer</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>151&#8211;160</pages>
    <url>http://www.aclweb.org/anthology/W18-5817</url>
    <abstract>Sanskrit /n/-retroflexion (nati) is one of the most complex segmental processes in phonology. While it is still star-free, it does not fit in any of the subregular classes that are commonly entertained in the literature. We show that when construed as a phonotactic dependency, the process fits into a class we call input-output tier-based strictly local (IO-TSL), a natural extension of the familiar class TSL. IO-TSL increases the power of TSL's tier projection function by making it an input-output strictly local transduction. Assuming that /n/-retroflexion represents the upper bound on the complexity of segmental phonology, this shows that all of segmental phonology can be captured by combining the intuitive notion of tiers with the independently motivated machinery of strictly local mappings.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>graf-mayer:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5818">
    <title>Phonological Features for Morphological Inflection</title>
    <author><first>Adam</first><last>Wiemerslage</last></author>
    <author><first>Miikka</first><last>Silfverberg</last></author>
    <author><first>Mans</first><last>Hulden</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>161&#8211;166</pages>
    <url>http://www.aclweb.org/anthology/W18-5818</url>
    <abstract>Modeling morphological inflection is an important task in Natural Language Processing. In contrast to earlier work that has largely used orthographic representations, we experiment with this task in a phonetic character space, representing inputs as either IPA segments or bundles of phonological distinctive features. We show that both of these inputs, somewhat counterintuitively, achieve similar accuracies on morphological inflection, slightly lower than orthographic models. We conclude that providing detailed phonological representations is largely redundant when compared to IPA segments, and that articulatory distinctions relevant for word inflection are already latently present in the distributional properties of many graphemic writing systems.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>wiemerslage-silfverberg-hulden:2018:SIGMORPHON</bibkey>
  </paper>

  <paper id="5819">
    <title>Extracting Morphophonology from Small Corpora</title>
    <author><first>Marina</first><last>Ermolaeva</last></author>
    <booktitle>TOBEFILLED-Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology</booktitle>
    <month>October</month>
    <year>2018</year>
    <address>Brussels, Belgium</address>
    <publisher>Association for Computational Linguistics</publisher>
    <pages>167&#8211;175</pages>
    <url>http://www.aclweb.org/anthology/W18-5819</url>
    <abstract>Probabilistic approaches have proven themselves well in learning phonological structure. In contrast, theoretical linguistics usually works with deterministic generalizations. The goal of this paper is to explore possible interactions between information-theoretic methods and deterministic linguistic knowledge and to examine some ways in which both can be used in tandem to extract phonological and morphophonological patterns from a small annotated dataset. Local and nonlocal processes in Mishar Tatar (Turkic/Kipchak) are examined as a case study.</abstract>
    <bibtype>inproceedings</bibtype>
    <bibkey>ermolaeva:2018:SIGMORPHON</bibkey>
  </paper>

</volume>

