Sense inventory alignment using lexical substitutions and crowdsourcing / Ustalov D., Igushkin S. // Proceedings of the International FRUCT Conference on Intelligence, Social Media and Web, ISMW FRUCT 2016. - 2016.

ISSN:
no data
Type:
Conference Paper
Abstract:
Sense inventory induction is a topical problem of deriving a set of synsets representing concepts using various automatic or human-assisted methods. There might be, and actually are, mistakes in such synsets. Here we are focused on the problem of eliminating potentially duplicate synsets having exactly two words in common as the broader intersection is known to be successfully addressed by heuristics. We exploit the phenomena of lexical substitutions and microtask-based crowdsourcing for aligning the synsets to the individual word senses. We also present an open source mobile application implementing our approach. Our experiments on the Russian language show that the approach scales well and dramatically reduces the number of duplicate synsets in the inventory. © 2016 FRUCT.
Author keywords:
Index keywords:
Social networking (online); Mobile applications; Open sources; Russian languages; Sense inventories; Synsets; Word sense; Crowdsourcing
DOI:
10.1109/FRUCT.2016.7584771
View in Scopus:
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84994460760&doi=10.1109%2fFRUCT.2016.7584771&partnerID=40&md5=1e700aa5e433151957d44b3bd8d51ec2
Co-authors in MNS:
Other fields
Field Value
Art. No. 7584771
Link https://www.scopus.com/inward/record.uri?eid=2-s2.0-84994460760&doi=10.1109%2fFRUCT.2016.7584771&partnerID=40&md5=1e700aa5e433151957d44b3bd8d51ec2
Affiliations Ural Federal University, Yekaterinburg, Russian Federation; ITMO University, Saint Petersburg, Russian Federation
Editors Tyutina T., Balandin S.
Publisher Institute of Electrical and Electronics Engineers Inc.
Conference name 2016 International FRUCT Conference on Intelligence, Social Media and Web, ISMW FRUCT 2016
Conference date 28 August 2016 through 4 September 2016
Conference code 124277
ISBN 9789526839769
Language of Original Document English
Abbreviated Source Title Proc. Int. FRUCT Conf. Intel., Soc. Media Web, ISMW FRUCT
Source Scopus