Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/46328
Title: | Improving AI Text Classification: A Cascaded Approach | Authors: | THYS, Jarne VANACKEN, Davy ROVELO RUIZ, Gustavo |
Issue Date: | 2025 | Source: | 3rd Workshop on Engineering Interactive Systems Embedding AI Technologies, Trier, Germany, 2025, June 24 | Status: | In press | Abstract: | LLMs have rapidly evolved into versatile ''foundation models'', repurposed - despite persistent gaps in reliability - for a variety of tasks, such as legal document summarization, medical question answering, and text classification. In this paper, we propose an approach to engineer better text classification solutions for educational grading. We address this challenge with a solution that couples (i) a transformer cascade for rubric-level prediction with (ii) a transparent, traffic-light feedback interface powered by a Mixture-of-Agents LLM system. We compared our approach to a standard LLM and a single transformer architecture using the ASAG dataset. Results show that our approach increases recall for incorrect answers by more than 50% and precision on fully correct answers by 20% compared to a single transformer. Finally, we describe a prototype implementing our approach in an end-to-end, minimally intrusive solution for semi-automatic grading, which allows the teaching staff to review and revise the feedback generated by a Mixture-of-Agents LLM system based on the grade classification. | Keywords: | Cascade Models;AI-Augmented Workflows;Automated Grading Systems;AI Text Classification | Document URI: | http://hdl.handle.net/1942/46328 | Category: | C2 | Type: | Conference Material |
Appears in Collections: | Research publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
EICS_workshop_2025_AI_assisted_grading.pdf Restricted Access | Conference material | 1.17 MB | Adobe PDF | View/Open Request a copy |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.