Annotation of Clinical Narratives in Bulgarian language

Ivajlo Radev; Kiril Simov; Galia Angelova; Svetla Boytcheva

doi:10.26615/978-954-452-044-1_011

Annotation of Clinical Narratives in Bulgarian language

Ivajlo Radev, Kiril Simov, Galia Angelova, Svetla Boytcheva

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use ... for bold, ... for italic, ... for underline, <sc>...</sc> for small-caps, <tt>...<tt> for typewriter text, <url>...</url> for URLs, <a href=...> for hyperlinks, and <par/> for paragraph breaks.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the external publication .)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

In this paper we describe annotation process of clinical texts with morphosyntactic and semantic information. The corpus contains 1,300 discharge letters in Bulgarian language for patients with Endocrinology and Metabolic disorders. The annotated corpus will be used as a Gold standard for information extraction evaluation of test corpus of 6,200 discharge letters. The annotation is performed within Clark system — an XML Based System For Corpora Development. It provides mechanism for semi-automatic annotation first running a pipeline for Bulgarian morphosyntactic annotation and a cascaded regular grammar for semantic annotation is run, then rules for cleaning of frequent errors are applied. At the end the result is manually checked. At the end we hope also to be able to adapted the morphosyntactic tagger to the domain of clinical narratives as well.

Anthology ID:: W17-8011
Volume:: Proceedings of the Biomedical NLP Workshop associated with RANLP 2017
Month:: September
Year:: 2017
Address:: Varna, Bulgaria
Editors:: Svetla Boytcheva, Kevin Bretonnel Cohen, Guergana Savova, Galia Angelova
Venue:: RANLP
SIG:
Publisher:: INCOMA Ltd.
Note:
Pages:: 81–87
Language:
External URL:: https://doi.org/10.26615/978-954-452-044-1_011
DOI:: 10.26615/978-954-452-044-1_011
Bibkey:
Cite (ACL):: Ivajlo Radev, Kiril Simov, Galia Angelova, and Svetla Boytcheva. 2017. Annotation of Clinical Narratives in Bulgarian language. In Proceedings of the Biomedical NLP Workshop associated with RANLP 2017, pages 81–87, Varna, Bulgaria. INCOMA Ltd..
Cite (Informal):: Annotation of Clinical Narratives in Bulgarian language (Radev et al., RANLP 2017)
Copy Citation:

External Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{radev-etal-2017-annotation,
    title = "Annotation of Clinical Narratives in {B}ulgarian language",
    author = "Radev, Ivajlo  and
      Simov, Kiril  and
      Angelova, Galia  and
      Boytcheva, Svetla",
    editor = "Boytcheva, Svetla  and
      Cohen, Kevin Bretonnel  and
      Savova, Guergana  and
      Angelova, Galia",
    booktitle = "Proceedings of the Biomedical {NLP} Workshop associated with {RANLP} 2017",
    month = sep,
    year = "2017",
    address = "Varna, Bulgaria",
    publisher = "INCOMA Ltd.",
    url = "https://aclanthology.org/W17-8011/",
    doi = "10.26615/978-954-452-044-1_011",
    pages = "81--87",
    abstract = "In this paper we describe annotation process of clinical texts with morphosyntactic and semantic information. The corpus contains 1,300 discharge letters in Bulgarian language for patients with Endocrinology and Metabolic disorders. The annotated corpus will be used as a Gold standard for information extraction evaluation of test corpus of 6,200 discharge letters. The annotation is performed within Clark system {---} an XML Based System For Corpora Development. It provides mechanism for semi-automatic annotation first running a pipeline for Bulgarian morphosyntactic annotation and a cascaded regular grammar for semantic annotation is run, then rules for cleaning of frequent errors are applied. At the end the result is manually checked. At the end we hope also to be able to adapted the morphosyntactic tagger to the domain of clinical narratives as well."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="radev-etal-2017-annotation">
    <titleInfo>
        <title>Annotation of Clinical Narratives in Bulgarian language</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Ivajlo</namePart>
        <namePart type="family">Radev</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Kiril</namePart>
        <namePart type="family">Simov</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Galia</namePart>
        <namePart type="family">Angelova</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Svetla</namePart>
        <namePart type="family">Boytcheva</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2017-09</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Biomedical NLP Workshop associated with RANLP 2017</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Svetla</namePart>
            <namePart type="family">Boytcheva</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Kevin</namePart>
            <namePart type="given">Bretonnel</namePart>
            <namePart type="family">Cohen</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Guergana</namePart>
            <namePart type="family">Savova</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Galia</namePart>
            <namePart type="family">Angelova</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>INCOMA Ltd.</publisher>
            <place>
                <placeTerm type="text">Varna, Bulgaria</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>In this paper we describe annotation process of clinical texts with morphosyntactic and semantic information. The corpus contains 1,300 discharge letters in Bulgarian language for patients with Endocrinology and Metabolic disorders. The annotated corpus will be used as a Gold standard for information extraction evaluation of test corpus of 6,200 discharge letters. The annotation is performed within Clark system — an XML Based System For Corpora Development. It provides mechanism for semi-automatic annotation first running a pipeline for Bulgarian morphosyntactic annotation and a cascaded regular grammar for semantic annotation is run, then rules for cleaning of frequent errors are applied. At the end the result is manually checked. At the end we hope also to be able to adapted the morphosyntactic tagger to the domain of clinical narratives as well.</abstract>
    <identifier type="citekey">radev-etal-2017-annotation</identifier>
    <identifier type="doi">10.26615/978-954-452-044-1_011</identifier>
    <location>
        <url>https://aclanthology.org/W17-8011/</url>
    </location>
    <part>
        <date>2017-09</date>
        <extent unit="page">
            <start>81</start>
            <end>87</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Annotation of Clinical Narratives in Bulgarian language
%A Radev, Ivajlo
%A Simov, Kiril
%A Angelova, Galia
%A Boytcheva, Svetla
%Y Boytcheva, Svetla
%Y Cohen, Kevin Bretonnel
%Y Savova, Guergana
%Y Angelova, Galia
%S Proceedings of the Biomedical NLP Workshop associated with RANLP 2017
%D 2017
%8 September
%I INCOMA Ltd.
%C Varna, Bulgaria
%F radev-etal-2017-annotation
%X In this paper we describe annotation process of clinical texts with morphosyntactic and semantic information. The corpus contains 1,300 discharge letters in Bulgarian language for patients with Endocrinology and Metabolic disorders. The annotated corpus will be used as a Gold standard for information extraction evaluation of test corpus of 6,200 discharge letters. The annotation is performed within Clark system — an XML Based System For Corpora Development. It provides mechanism for semi-automatic annotation first running a pipeline for Bulgarian morphosyntactic annotation and a cascaded regular grammar for semantic annotation is run, then rules for cleaning of frequent errors are applied. At the end the result is manually checked. At the end we hope also to be able to adapted the morphosyntactic tagger to the domain of clinical narratives as well.
%R 10.26615/978-954-452-044-1_011
%U https://aclanthology.org/W17-8011/
%U https://doi.org/10.26615/978-954-452-044-1_011
%P 81-87

Download as File

Markdown (Informal)

[Annotation of Clinical Narratives in Bulgarian language](https://aclanthology.org/W17-8011/) (Radev et al., RANLP 2017)

Annotation of Clinical Narratives in Bulgarian language (Radev et al., RANLP 2017)

ACL

Ivajlo Radev, Kiril Simov, Galia Angelova, and Svetla Boytcheva. 2017. Annotation of Clinical Narratives in Bulgarian language. In Proceedings of the Biomedical NLP Workshop associated with RANLP 2017, pages 81–87, Varna, Bulgaria. INCOMA Ltd..