Add-Remove-Confirm: Crowdsourcing synset cleansing / Ustalov D., Kiselev Y. // 9th International Conference on Application of Information and Communication Technologies, AICT 2015 - Proceedings. - 2015. - V. , l. . - P. 143-147.

ISSN:
нет данных
Type:
Conference Paper
Abstract:
Thesaurus is a crucial resource for many natural language processing and artificial intelligence problems, which require common sense reasoning. It is becoming highly topical to put special effort to ensure the high quality of synsets when a thesaurus is created collaboratively by non-expert annotators. This paper proposes Add-Remove-Confirm, a novel workflow for crowdsourcing synset cleansing. The present workflow has been empirically evaluated using a Russian thesaurus created through crowdsourcing showing that it does improve the synset quality as according to the expert assessment with high level of agreement. © 2015 IEEE.
Author keywords:
collaborative lexicography; crowdsourcing; data cleansing; lexical resource; mechanized labor; natural language processing; thesaurus
Index keywords:
Artificial intelligence; Computational linguistics; Crowdsourcing; Thesauri; collaborative lexicography; Commonsense reasoning; Data cleansing; Expert assessment; High quality; Lexical resources; NAtu
DOI:
10.1109/ICAICT.2015.7338534
Смотреть в Scopus:
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84960945144&doi=10.1109%2fICAICT.2015.7338534&partnerID=40&md5=a22caa55be3fab96c7e99074ab205891
Соавторы в МНС:
Другие поля
Поле Значение
Art. No. 7338534
Link https://www.scopus.com/inward/record.uri?eid=2-s2.0-84960945144&doi=10.1109%2fICAICT.2015.7338534&partnerID=40&md5=a22caa55be3fab96c7e99074ab205891
Affiliations N.N. Krasovskii Institute of Mathematics and Mechanics, Ural Branch of the Russian Academy of Sciences, 16 Sofia Kovalevskaya st, Yekaterinburg, Russian Federation; Ural Federal University, 19 Mira st., Yekaterinburg, Russian Federation
Author Keywords collaborative lexicography; crowdsourcing; data cleansing; lexical resource; mechanized labor; natural language processing; thesaurus
References Kiselev, Y., Porshnev, S., Mukhin, M., Current status of Russian electronic thesauri: Quality, completeness and availability (2015) Programmnaya Ingeneria, (6), pp. 34-40. , in Russian; Meyer, C.M., Gurevych, I., Wiktionary: A new rival for expert-built lexicons exploring the possibilities of collaborative lexicography (2012) Electronic Lexicography, pp. 259-291. , S. Granger and M. Paquot, Eds. Oxford: Oxford University Press; Kittur, A., Nickerson, J.V., Bernstein, M., Gerber, E., Shaw, A., Zimmerman, J., Lease, M., Horton, J., The future of crowd work (2013) Proceedings of the 2013 Conference on Computer Supported Cooperative Work, pp. 1301-1318. , ser. CSCW '13. New York, NY, USA: ACM; Bernstein, M.S., Little, G., Miller, R.C., Hartmann, B., Ackerman, M.S., Karger, D.R., Crowell, D., Panovich, K., Soylent: A word processor with a crowd inside (2010) Proceedings of the 23Nd Annual ACM Symposium on User Interface Software and Technology, pp. 313-322. , ser. UIST '10. New York, NY, USA: ACM; Noronha, J., Hysen, E., Zhang, H., Gajos, K.Z., PlateMate: Crowdsourcing nutritional analysis from food photographs (2011) Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology, pp. 1-12. , ser. UIST '11. New York, NY, USA: ACM; Kittur, A., Smus, B., Khamkar, S., Kraut, R.E., CrowdForge: Crowdsourcing complex work (2011) Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology, pp. 43-52. , ser. UIST '11. New York, NY, USA: ACM; Wang, J., Kraska, T., Franklin, M.J., Feng, J., CrowdER: Crowdsourcing entity resolution (2012) Proc. VLDB Endow, 5 (11), pp. 1483-1494; Biemann, C., Creating a system for lexical substitutions from scratch using crowdsourcing (2013) Language Resources and Evaluation, 47 (1), pp. 97-122; Tong, Y., Cao, C., Zhang, C., Li, Y., Chen, L., CrowdCleaner: Data cleaning for multi-version data on the web via crowdsourcing (2014) 2014 IEEE 30th International Conference on Data Engineering (ICDE), pp. 1182-1185; Ustalov, D., A crowdsourcing engine for mechanized labor (2015) Preliminary Proceedings of the 9th Spring/Summer Young Researchers' Colloquium on Software Engineering (SYRCoSE 2015), pp. 52-55. , May 28-30, Samara, Russia; Braslavski, P., Ustalov, D., Mukhin, M., A spinning wheel for YARN user interface for a crowdsourced thesaurus (2014) Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics. Gothenburg, Sweden: Association for Computational Linguistics, pp. 101-104; Kiselev, Y., Krizhanovsky, A., Braslavski, P., Russian lexicographic landscape: A tale of 12 dictionaries (2015) Computational Linguistics and Intellectual Technologies: Papers from the Annual Conference Dialogue, 1, pp. 254-271. , Moscow: RGGU; Fleiss, J.L., Levin, B., Paik, M.C., (2003) Statistical Methods for Rates and Proportions, , 3rd ed. John Wiley &Sons; Powers, D.M.W., The problem with kappa (2012) Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp. 345-355. , ser. EACL '12. Stroudsburg, PA, USA: Association for Computational Linguistics; Panchenko, A., Loukachevitch, N.V., Ustalov, D., Paperno, D., Meyer, C.M., Konstantinova, N., RUSSE: The first workshop on Russian semantic similarity (2015) Computational Linguistics and Intellectual Technologies: Papers from the Annual Conference Dialogue, 2, pp. 89-105. , Moscow, Russia: RGGU
Publisher Institute of Electrical and Electronics Engineers Inc.
Conference name 9th International Conference on Application of Information and Communication Technologies, AICT 2015
Conference date 14 October 2015 through 16 October 2015
Conference code 118382
ISBN 9781467368551
Language of Original Document English
Abbreviated Source Title 9th Int. Conf. Appl. Inf. Commun. Technol., AICT - Proc.
Source Scopus