Evaluating the Reliability of Human–AI Collaborative Scoring of Written Arguments Using Rational Force Model

Noriko Takahashi, Abraham Onuorah, Alina Reznitskaya, Evgeny Chukharev, Ariel Sykes, Michele Flammia, Joe Oyler


Abstract
This study aims to improve the reliability of a new human–AI collaborative scoring system used to assess the quality of students’ written arguments. The system draws on the Rational Force Model and focuses on classifying the functional relation of each proposition in terms of support, opposition, acceptability, and relevance.
Anthology ID:
2025.aimecon-wip.16
Volume:
Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Works in Progress
Month:
October
Year:
2025
Address:
Wyndham Grand Pittsburgh, Downtown, Pittsburgh, Pennsylvania, United States
Editors:
Joshua Wilson, Christopher Ormerod, Magdalen Beiting Parrish
Venue:
AIME-Con
Publisher:
National Council on Measurement in Education (NCME)
Pages:
135–140
URL:
https://aclanthology.org/2025.aimecon-wip.16/
Cite (ACL):
Noriko Takahashi, Abraham Onuorah, Alina Reznitskaya, Evgeny Chukharev, Ariel Sykes, Michele Flammia, and Joe Oyler. 2025. Evaluating the Reliability of Human–AI Collaborative Scoring of Written Arguments Using Rational Force Model. In Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Works in Progress, pages 135–140, Wyndham Grand Pittsburgh, Downtown, Pittsburgh, Pennsylvania, United States. National Council on Measurement in Education (NCME).
Cite (Informal):
Evaluating the Reliability of Human–AI Collaborative Scoring of Written Arguments Using Rational Force Model (Takahashi et al., AIME-Con 2025)
PDF:
https://aclanthology.org/2025.aimecon-wip.16.pdf