Masking and BERT-based Models for Stereotype Identification

Sánchez-Junquera, Javier; Rosso, Paolo; Montes y Gómez, Manuel; Chulvi, Berta

Masking and BERT-based Models for Stereotype Identification

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/117481

Información del item - Informació de l'item - Item information
Título:	Masking and BERT-based Models for Stereotype Identification
Título alternativo:	Modelos Basados en Enmascaramiento y en BERT para la Identificación de Estereotipos
Autor/es:	Sánchez-Junquera, Javier \| Rosso, Paolo \| Montes y Gómez, Manuel \| Chulvi, Berta
Palabras clave:	Social bias \| Immigrant stereotypes \| BETO \| Masking technique \| Sesgo social \| Estereotipos hacia inmigrantes \| Técnica de enmascaramiento
Área/s de conocimiento:	Lenguajes y Sistemas Informáticos
Fecha de publicación:	sep-2021
Editor:	Sociedad Española para el Procesamiento del Lenguaje Natural
Cita bibliográfica:	Procesamiento del Lenguaje Natural. 2021, 67: 83-94. https://doi.org/10.26342/2021-67-7
Resumen:	Stereotypes about immigrants are a type of social bias increasingly present in the human interaction in social networks and political speeches. This challenging task is being studied by computational linguistics because of the rise of hate messages, offensive language, and discrimination that many people receive. In this work, we propose to identify stereotypes about immigrants using two different explainable approaches: a deep learning model based on Transformers; and a text masking technique that has been recognized by its capabilities to deliver good and human-understandable results. Finally, we show the suitability of the two models for the task and offer some examples of their advantages in terms of explainability. \| Los estereotipos sobre inmigrantes son un tipo de sesgo social cada vez más presente en la interacción humana en redes sociales y en los discursos políticos. Esta desafiante tarea está siendo estudiada por la lingüística computacional debido al aumento de los mensajes de odio, el lenguaje ofensivo, y la discriminación que reciben muchas personas. En este trabajo, nos proponemos identificar estereotipos sobre inmigrantes utilizando dos enfoques diametralmente opuestos prestando atención a la explicabilidad de los mismos: un modelo de aprendizaje profundo basado en Transformers; y una técnica de enmascaramiento de texto que ha sido reconocida por su capacidad para ofrecer buenos resultados a la vez que comprensibles para los humanos. Finalmente, mostramos la idoneidad de los dos modelos para la tarea, y ofrecemos algunos ejemplos de sus ventajas en términos de explicabilidad.
Patrocinador/es:	The work of the authors from the Universitat Politècnica of València was funded by the Spanish Ministry of Science and Innovation under the research project MISMIS-FAKEnHATE on MISinformation and MIScommunication in social media: FAKE news and HATE speech (PGC2018-096212-B-C31). Experiments were carried out on the GPU cluster at PRHLT thanks to the PROMETEO/2019/121 (DeepPattern) research project funded by the Generalitat Valenciana.
URI:	http://hdl.handle.net/10045/117481
ISSN:	1135-5948
DOI:	10.26342/2021-67-7
Idioma:	eng
Tipo:	info:eu-repo/semantics/article
Derechos:	© Sociedad Española para el Procesamiento del Lenguaje Natural
Revisión científica:	si
Versión del editor:	https://doi.org/10.26342/2021-67-7
Aparece en las colecciones:	Procesamiento del Lenguaje Natural - Nº 67 (2021)

Archivos en este ítem:

Archivos en este ítem:
Archivo	Descripción	Tamaño	Formato
PLN_67_07.pdf		864,05 kB	Adobe PDF	Abrir Vista previa Cerrar vista previa

Ver citas en Google Académico

Muestra el registro completo