A Neural Network model for the Evaluation of Text Complexity in Italian Language: a Representation Point of View

Risultato della ricerca: Article

1 Citazione (Scopus)

Abstract

The goal of a text simplification system (TS) is to create a new text suited to the characteristics of a reader, with the final goal of making it more understandable.The building of an Automatic Text Simplification System (ATS) cannot be separated from a correct evaluation of the text complexity. In fact the ATS must be capable of understanding if a text should be simplified for the target reader or not. In a previous work we have presented a model capable of classifying Italian sentences based on their complexity level. Our model is a Long Short Term Memory (LSTM) Neural Network capable of learning the features of easy-to-read and complex-to-read sentences autonomously from a annotated corpus created specifically for text simplification. In this paper we further investigate on the role of the text representation, i.e. how different ways of representing the input text can affect the accuracy of the proposed system. In detail, we will use our Neural Network model for evaluating the sentence complexity using different kind of representations such as GloVe, Word2vec, FastTex and a new one based on a representation learning scheme.
Lingua originaleEnglish
pagine (da-a)464-470
Numero di pagine7
RivistaProcedia Computer Science
Volume145
Stato di pubblicazionePublished - 2018

All Science Journal Classification (ASJC) codes

  • Computer Science(all)

Cita questo

@article{b3af3c92466645368eb10075f71fdaee,
title = "A Neural Network model for the Evaluation of Text Complexity in Italian Language: a Representation Point of View",
abstract = "The goal of a text simplification system (TS) is to create a new text suited to the characteristics of a reader, with the final goal of making it more understandable.The building of an Automatic Text Simplification System (ATS) cannot be separated from a correct evaluation of the text complexity. In fact the ATS must be capable of understanding if a text should be simplified for the target reader or not. In a previous work we have presented a model capable of classifying Italian sentences based on their complexity level. Our model is a Long Short Term Memory (LSTM) Neural Network capable of learning the features of easy-to-read and complex-to-read sentences autonomously from a annotated corpus created specifically for text simplification. In this paper we further investigate on the role of the text representation, i.e. how different ways of representing the input text can affect the accuracy of the proposed system. In detail, we will use our Neural Network model for evaluating the sentence complexity using different kind of representations such as GloVe, Word2vec, FastTex and a new one based on a representation learning scheme.",
author = "{Lo Bosco}, Giosue' and Giovanni Pilato and Daniele Schicchi",
year = "2018",
language = "English",
volume = "145",
pages = "464--470",
journal = "Procedia Computer Science",
issn = "1877-0509",
publisher = "Elsevier BV",

}

TY - JOUR

T1 - A Neural Network model for the Evaluation of Text Complexity in Italian Language: a Representation Point of View

AU - Lo Bosco, Giosue'

AU - Pilato, Giovanni

AU - Schicchi, Daniele

PY - 2018

Y1 - 2018

N2 - The goal of a text simplification system (TS) is to create a new text suited to the characteristics of a reader, with the final goal of making it more understandable.The building of an Automatic Text Simplification System (ATS) cannot be separated from a correct evaluation of the text complexity. In fact the ATS must be capable of understanding if a text should be simplified for the target reader or not. In a previous work we have presented a model capable of classifying Italian sentences based on their complexity level. Our model is a Long Short Term Memory (LSTM) Neural Network capable of learning the features of easy-to-read and complex-to-read sentences autonomously from a annotated corpus created specifically for text simplification. In this paper we further investigate on the role of the text representation, i.e. how different ways of representing the input text can affect the accuracy of the proposed system. In detail, we will use our Neural Network model for evaluating the sentence complexity using different kind of representations such as GloVe, Word2vec, FastTex and a new one based on a representation learning scheme.

AB - The goal of a text simplification system (TS) is to create a new text suited to the characteristics of a reader, with the final goal of making it more understandable.The building of an Automatic Text Simplification System (ATS) cannot be separated from a correct evaluation of the text complexity. In fact the ATS must be capable of understanding if a text should be simplified for the target reader or not. In a previous work we have presented a model capable of classifying Italian sentences based on their complexity level. Our model is a Long Short Term Memory (LSTM) Neural Network capable of learning the features of easy-to-read and complex-to-read sentences autonomously from a annotated corpus created specifically for text simplification. In this paper we further investigate on the role of the text representation, i.e. how different ways of representing the input text can affect the accuracy of the proposed system. In detail, we will use our Neural Network model for evaluating the sentence complexity using different kind of representations such as GloVe, Word2vec, FastTex and a new one based on a representation learning scheme.

UR - http://hdl.handle.net/10447/328126

M3 - Article

VL - 145

SP - 464

EP - 470

JO - Procedia Computer Science

JF - Procedia Computer Science

SN - 1877-0509

ER -