Improving Subjectivity Detection for Spanish Texts using Subjectivity Word Sense Disambiguation based on Knowledge
Marco Sobrevilla Cabezudo$^{1}$, Nora La Serna Palomino$^{1}$, Rolando Alberto Maguiña Perez$^{1}$
$^{1}$Universidad Nacional Mayor de San Marcos. Lima Peru
email: msobrevillac@gmail.com, nlasernap@unmsm.edu.pe, rmaguinap@unmsm.edu.pe
Schedule:Mon 19th@16:15, Room: A

In this paper, we present a Sentence-level Subjectivity Detection method for Spanish using Subjectivity Word Sense Disambiguation (SWSD) based on Knowledge. We use a classic method of Word Sense Disambiguation, using the Spanish WordNet included in Mutilingual Central Repository 3.0 and the WordNet-Pr as Knowledge base. Because of the alignment between the WordNet and the SentiWordNet, we use this latter as semantic resource annotated with polarity values to determine when a word expresses subjectivity and objectivity, defining subjectivity levels using a fuzzy clustering algorithm previously. Due to the few resources focused on Sentiment Analysis for Spanish, the Semcor corpus was used for analyzing the attributes to be used. Finally, a Rule-based classifier was created to detect subjective sentences. This method was executed over a Spanish corpus, created in this work. The results show that our approach contributes positively to Subjectivity Detection task, despite of using resources created for English.

BibTex

@InProceedings{CLEI-2015:144408,
	author 		= {Marco Sobrevilla Cabezudo and Nora La Serna Palomino and Rolando Alberto MaguiƱa Perez},
	title 		= {Improving Subjectivity Detection for Spanish Texts using Subjectivity Word Sense Disambiguation based on Knowledge},
	booktitle 	= {2015 XLI Latin American Computing Conference (CLEI)},
	pages 		= {269--275},
	year 		= {2015},
	editor 		= {Hector Cancela and Alex Cuadros-Vargas and Ernesto Cuadros-Vargas},
	address 	= {Arequipa-Peru},
	month 		= {October},
	organization 	= {CLEI},
	publisher 	= {CLEI},
	url 		= {http://clei.org/clei2015/144408},
	isbn 		= {978-1-4673-9143-6},
	}


Generated by Ernesto Cuadros-Vargas , Sociedad Peruana de Computación-Peru, Universidad Católica San Pablo, Arequipa-Perú