Systems and Means of Informatics
2015, Volume 25, Issue 1, pp 20-33
COMBINING CORPUS AND THE SAURUS INFORMATION FOR EXTRACTING SENTIMENT WORDS
- N. V. Loukachevitch
- I. I. Chetviorkin
Abstract
The paper describes a combined approach to extraction of a domain-
specific sentiment lexicon. At first, an initial version of a domain-specific lexicon
is obtained by application of a supervised model. At the second stage, the ordered
list of sentiment words is refined using the thesaurus information. This combined
model is applied to several domains and at last, the domain-specific sentiment
lexicons are united to create an improved version of the Russian sentiment lexicon
in the generalized domain of products.
[+] References (16)
- Mihalcea, R., C. Banea, and J. Wiebe. 2007. Learning multilingual subjective
language via cross-lingual projections. 45th Annual Meeting of the Association of
Computational Linguistics ACL-2007 Proceedings. Prague, Czech Republic. 976-983.
- Steinberger, J., P. Lenkova, M. Ebrahim, M. Ehrmann, A. Hurriyetogly, M. Kabadjov, R. Steinberger,H. Tanev, V. Zavarella, and S. Vazquez. 2011. Creating sentiment
dictionaries via triangulation. 2nd Workshop on Computational Approaches to Subjec-
tivity and Sentiment Analysis, ACL-HLT-2011 Proceedings. Oregon. 28-36.
- Chetviorkin, I., and N. Loukachevitch. 2012. Extraction of Russian sentiment lexicon
for product meta-domain. COLING-2012 Proceedings. Mumbai. 593-610.
- Choi, Y., and C. Cardie. 2009. Adapting a polarity lexicon using integer linear
programming for domain-specific sentiment classification. Conference on Empirical
Methods in Natural Language Processing EMNLP-2009 Proceedings. Edinburgh.
2:590-598.
- Hatzivassiloglou, V., and K. McKeown. 1997. Predicting the semantic orientation of
adjectives. 35th Annual Meeting of the Association for Computational Linguistics,
ACL-1 997 Proceedings. 174-181.
- Velikovich, L., S. Blair-Goldensohn, K. Hannan, and R. McDonald. 2010. The
viability of web-derived polarity lexicons. NAACL-2010 Proceedings. Los Angeles,
CA. 777-785.
- Volkova, S., Th. Wilson, and D. Yarowsky. 2013. Exploring sentiment in social
media: Bootstrapping subjectivity clues from multilingual twitter streams. 51st Annual
Meeting of the Association of Computational Linguistics, ACL-13 Proceedings. Sofia,
Bulgaria. 505-510.
- Feng, S., J. S. Kang, P. Kuznetsova, and Y. Choi. 2013. Connotation Lexicon: A dash
of sentiment beneath the surface meaning. 51th Annual Meeting of the Association for
Computational Linguistics ACL-2013 Proceedings. Sofia, Bulgaria. 1774-1784.
- Lau, R., C. Lai, P. Bruza, and K.-F. Wong. 2011. Leveraging web 2.0 data for
scalable semi-supervised learning of domain-specific sentiment lexicons. 20th ACM
Conference (International) on Information and Knowledge Management CIKM-2011
Proceedings. New York, N.Y.: ACM. 2457-2460.
- Esuli, A., and F. Sebastiani. 2006. SentiWordnet:A publicly available lexical resource
for opinion mining. LREC-2006 Proceedings. Genoa. 417-422.
- Fellbaum, Ch. 1998. WordNet: An Electronic Lexical Database. Cambridge, MA:
MIT Press.
- Rao,D., andD. Ravichandran. 2009. Semi-supervised polarity lexicon induction. 12th
Conference of the European Chapter of the ACL, EACL-2009 Proceedings. Athens.
675-682.
- Zhu, X., and Z. Ghahramani. 2002. Learning from labeled and unlabeled data with
label propagation. Pittsburg: Carnegie Mellon University. Technical Report CMU-
CALD-02-107.
- Loukachevitch, N., and B. Dobrov. 2014, RuThes Linguistic Ontology vs. Russian
Wordnets. Global Wordnet Conference GWC-2014 Proceedings. Tartu, Estonia. 154-
162.
- Ahmad, K., Gillam L., and L. Tostevin. 1999. University of Surrey participation in
Trec8:Weirdness indexing for logical documents extrapolation and retrieval. 8th Text
Retrieval Conference TREC-8 Proceedings. Gaithersburg,MD: NIST. 717-724.
- Callan, J.P., W.B. Croft, and S.M. Harding. 1992. The INQUERY retrieval
system. 3rd Conference (International) on Database and Expert Systems Applications
DEXA-92 Proceedings. New York: Springer Verlag. 78-93.
[+] About this article
Title
COMBINING CORPUS AND THE SAURUS INFORMATION FOR EXTRACTING SENTIMENT WORDS
Journal
Systems and Means of Informatics
Volume 25, Issue 1, pp 20-33
Cover Date
2013-11-30
DOI
10.14357/08696527150102
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
sentiment analysis; domain adaptation; natural language processing; thesaurus
Authors
N. V. Loukachevitch and I. I. Chetviorkin
Author Affiliations
Research Computing Center, M.V. Lomonosov Moscow State University, 4 Leninskie Gory, Moscow 119991, Russian Federation
|