Crowdsourcing synset relations with Genus-Species-Match / Ustalov D. // Proceedings of Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference, AINL-ISMW FRUCT 2015. - 2016. - V. , l. . - P. 118-124.

ISSN:
нет данных
Type:
Conference Paper
Abstract:
Enabling a domain-specific lexical resource is useful for improving the performance of a natural language processing system. However, such resources may be represented in the form of glossaries-terms provided with their sense definitions. Despite the problem of integrating such domain-specific glossaries into more sophisticated general purpose resources like thesuari being highly topical, it is complicated by ambiguity of the individual terms. This paper presents Genus-Species-Match, a crowdsourcing workflow for matching noisy pairs of synsets representing hyponymic/hypernymic relations. The system demonstrates F1 score of 80% on an experiment conducted on an online labor marketplace using the EMERCOM glossary and the Yet Another RussNet sense inventory. © 2015 FRUCT.
Author keywords:
Index keywords:
Artificial intelligence; Computational linguistics; Crowdsourcing; Glossaries; Information analysis; Information retrieval; Social networking (online); Websites; World Wide Web; Domain specific; F1 sc
DOI:
10.1109/AINL-ISMW-FRUCT.2015.7
Смотреть в Scopus:
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84969557111&doi=10.1109%2fAINL-ISMW-FRUCT.2015.7382980&partnerID=40&md5=575294fc198baa566440c6b6565a0bcd
Соавторы в МНС:
Другие поля
Поле Значение
Art. No. 7382980
Link https://www.scopus.com/inward/record.uri?eid=2-s2.0-84969557111&doi=10.1109%2fAINL-ISMW-FRUCT.2015.7382980&partnerID=40&md5=575294fc198baa566440c6b6565a0bcd
Affiliations IMM UB RAS, Yekaterinburg, Russian Federation; Ural Federal University, Yekaterinburg, Russian Federation
References Gurevych, I., Kim, J., (2013) The People's Web Meets NLP, , Springer Berlin Heidelberg: Berlin; Glossary of EMERCOM, , http://www.mchs.gov.ru/dop/tenns, Web; Braslavski, P., Ustalov, D., Mukhin, M., A spinning wheel for YARN: User interface for a crowdsourced thesaurus (2014) Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 101-104. , Apr; Astrakhantsev, N.A., Turdakov, D.Y., Automatic construction and enrichment of informal ontologies: A survey (2013) Programming and Computer Software, 39 (1), pp. 34-42. , Feb; Hearst, M.A., Automatic acquisition of hyponyms from large text corpora (1992) Proceedings of the 14th Conference on Computational Linguistics, 2, pp. 539-545. , Aug; Yang, D., Powers, D.M., Automatic thesaurus construction (2008) Proceedings of the Thirty-first awsfra/asz'An Conference on Computer Science, 74, pp. 147-156. , Jan; Giuliano, C., Gliozzo, A.M., Gangemi, A., Tymoshenko, K., Acquiring thesauri from wikis by exploiting domain models and lexical substitution (2010) The Semantic Web: Research and Applications, 6089, pp. 121-135. , May; Hagiwara, M., Ogawa, Y., Toyama, K., PLSI utilization for automatic thesaurus construction (2005) Natural Language Processing - IJCNLP 2005, 8876, pp. 334-345. , Oct; Sarasua, C., Simperl, E., Noy, N.F., CrowdMap: Crowdsourcing ontology alignment with microtasks (2012) The Semantic Web-ISWC 2012, 7649, pp. 525-541. , Nov; Bernstein, M.S., Little, G., Miller, R.C., Hartmann, B., Ackerman, M.S., Karger, D.R., Crowell, D., Panovich, K., Soylent: A word processor with a crowd inside (2010) Proceedings of the 23Nd Annual ACM Symposium on User Interface Software and Technology, pp. 313-322. , Oct; Wang, J., Kraska, T., Franklin, M.J., Feng, J., CrowdER: Crowdsourcing entity resolution (2012) Proc. VLDB Endow, 5 (11), pp. 1483-1494. , Jul; Biemann, C., Creating a system for lexical substitutions from scratch using crowdsourcing (2012) Language Resources and Evaluation, 47 (1), pp. 97-122. , Mar; Ustalov, D., Kiselev, Y., Add-remove-confirm: Crowdsourcing synset cleansing (2015) 2015 IEEE 9th International Conference on Application of Information and Communication Technologies (AICT), pp. 143-147. , Oct; Dawid, A.P., Skene, A.M., Maximum likelihood estimation of observer error-rates using the em algorithm (1979) Journal of the Royal Statistical Society, 28 (1), pp. 20-28; Sheng, V.S., Provost, F., Ipeirotis, P.G., Get another label? Improving data quality and data mining using multiple, noisy labelers (2008) Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 614-622. , Jun; Whitehill, J., Ruvolo, P., Wu, T., Bergsma, J., Movellan, J., Whose vote should count more: Optimal integration of labels from labelers of unknown expertise (2009) Advances in Neural Information Processing Systems, 22, pp. 2035-2043. , Dec; Demartini, G., Difallah, D.E., Cudre-Mauroux, P., Large-scale linked data integration using probabilistic reasoning and crowdsourcing (2013) The VLDB Journal, 22 (5), pp. 665-687. , Jul; Karger, D.R., Oh, S., Shah, D., Budget-optimal task allocation for reliable crowdsourcing systems (2014) Operations Research, 62 (1), pp. 1-24. , Feb; Sheshadri, A., Lease, M., SQUARE: A benchmark for research on computing crowd consensus (2013) First AAAI Conference on Human Computation and Crowdsourcing, pp. 156-164. , Nov; Snow, R., O'Connor, B., Jurafsky, D., Ng, A.Y., Cheap and Fast-but is it good?: Evaluating non-expert annotations for natural language tasks (2008) Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 254-263. , Oct; Welinder, P., Perona, P., Online crowdsourcing: Rating annotators and obtaining cost-effective labels (2010) 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 25-32. , Jun; Fan, J., Li, G., Ooi, B.C., Tan, K., Feng, J., ICrowd: An adaptive crowdsourcing framework (2015) Proceedings of the 2015 ACM SIGMOD International on Management Data, pp. 1015-1030. , Jun; Kiselev, Y., Ustalov, D., Porshnev, S., Eliminating fuzzy duplicates in crowdsourced lexical resources Proc. of the Eight Global Wordnet Conference, , in press; Sharov, S., Nivre, J., The proper place of men and machines in language technology: Processing Russian without any linguistic knowledge (2011) Computational Linguistics and Intellectual Technologies: Papers from the Annual Conference "dialogue", 10 (17), pp. 657-670. , May; Docker Hub, , https://hub.docker.com/r/nlpub/maltparser/, Web; Krizhanovsky, A.A., Smirnov, A.V., An approach to automated construction of a general-purpose lexical ontology based on Wiktionary (2013) Journal of Computer and Systems Sciences International, 52 (2), pp. 215-225. , Mar; Amazon Mechanical Turk-Welcome, , https://www.mturk.com/mturk/welcome, Web; CrowdFlower | People-powered Data Enrichment Platform, , https://www.crowdflower.com, Web; Microworkers-work & Earn or Offer A Micro Job, , https://microworkers.com/, Web; Yandex.Toloka, , https://toloka.yandex.com/, Web; TurboText, A Convenient Copywriting Market, , http://www.turbotext.ru/, Web; Ustalov, D.A., A crowdsourcing engine for mechanized labor (2015) Proceedings of the Institute for System Programming, 27 (3), pp. 351-364. , Jul; Lyashevskaya, O.N., Sharov, S.A., (2009) The Frequency Dictionary of Modern Russian Language, , Moscow: Azbukovnik; Mechanical Tsar, , http://mtsar.nlpub.org/, Web; Powers, D.M.W., Evaluation: From precision, recall and F-measure to ROC, informedness, markedness & correlation (2011) Journal of Machine Learning Technologies, 2 (1), pp. 37-63. , Feb; Tran-Thanh, L., Huynh, T.D., Rosenfeld, A., Ramchurn, S.D., Jennings, N.R., Crowd sourcing complex workflows under budget constraints (2015) Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, pp. 1298-1304. , Jan; Panchenko, A., Loukachevitch, N.V., Ustalov, D., Paperno, D., Meyer, C.M., Konstantinova, N., RUSSE: The first workshop on Russian semantic similarity (2015) Computational Linguistics and Intellectual Technologies: Papers from the Annual Conference "dialogue", 2, pp. 89-105. , May
Correspondence Address Ustalov, D.; IMM UB RASRussian Federation; email: dau@imm.uran.ru
Publisher Institute of Electrical and Electronics Engineers Inc.
Conference name Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference, AINL-ISMW FRUCT 2015
Conference date 9 November 2015 through 14 November 2015
Conference code 119090
ISBN 9789526839707
Language of Original Document English
Abbreviated Source Title Proc. Artif. Intell. Nat. Lang. Inf. Extr., Soc. Media Web Search FRUCT Conf., AINL-ISMW FRUCT 2015
Source Scopus