Methodology and evaluation of the Galician WordNet expansion with the WN-Toolkit
- Xavier Gómez Guinovart
- Antoni Oliver
ISSN: 1135-5948
Year of publication: 2014
Issue: 53
Pages: 43-50
Type: Article
More publications in: Procesamiento del lenguaje natural
Abstract
This article presents the methodology used to expand the Galician WordNet with the WN-Toolkit, together with a detailed evaluation of the results obtained. The set of tools included in the WN-Toolkit allows wordnets to be created or extended following the expand strategy. The experiments presented in this article use strategies based on dictionaries and on parallel corpora. The results were evaluated both automatically and manually, which allows the resulting precision values to be compared. The manual evaluation also details the source of the errors, which has been useful both for improving the WN-Toolkit itself and for correcting errors in the reference WordNet for Galician.
Bibliographic References
- Aliabadi, Purya, Mohamed Sina Ahmadi, Shahin Salavati, and Kyumars Sheykh Esmaili. 2014. Towards building KurdNet, the Kurdish WordNet. In Proceedings of the 7th Global WordNet Conference, Tartu, Estonia.
- Alvez, Javier, Jordi Atserias, Jordi Carrera, Salvador Climent, Antoni Oliver, and German Rigau. 2008. Consistent annotation of EuroWordNet with the Top Concept Ontology. In Proceedings of the 4th Global WordNet Conference, Szeged, Hungary.
- Atserias, Jordi, Salvador Climent, Xavier Farreres, German Rigau, and Horacio Rodriguez. 1997. Combining multiple methods for the automatic construction of multi-lingual WordNets. In Recent Advances in Natural Language Processing II. Selected papers from RANLP, volume 97, pages 327-338.
- Bentivogli, Luisa, Pamela Forner, Bernardo Magnini, and Emanuele Pianta. 2004. Revising WordNet Domains hierarchy: Semantics, coverage, and balancing. In Proceedings of COLING Workshop on Multilingual Linguistic Resources, pages 101-108, Geneva.
- Benítez, Laura, Sergi Cervell, Gerard Escudero, Mònica López, German Rigau, and Mariona Taulé. 1998. Methods and Tools for Building the Catalan WordNet. In Proceedings of the ELRA Workshop on Language Resources for European Minority Languages.
- Bond, Francis and Paik Kyonghee. 2012. A survey of wordnets and their licenses. In Proceedings of the 6th International Global WordNet Conference, pages 64-71, Matsue, Japan.
- Fellbaum, Christiane. 1998. WordNet: An electronic lexical database. The MIT press.
- Gómez Guinovart, Xavier, Xosé María Gómez Clemente, Andrea González Pereira, and Verónica Taboada Lorenzo. 2011. Galnet: WordNet 3.0 do galego. Linguamática, 3(1):61-67.
- Gómez Guinovart, Xavier, Xosé María Gómez Clemente, Andrea González Pereira, and Verónica Taboada Lorenzo. 2013. Sinonimia e rexistros na construción do WordNet do galego. Estudos de lingüística galega, 5:27-42.
- Gómez Guinovart, Xavier and Alberto Simões. 2013. Retreading dictionaries for the 21st century. In José Paulo Leal, Ricardo Rocha, and Alberto Simões, editors, 2nd Symposium on Languages, Applications and Technologies, pages 115-126, Saarbrücken. Dagstuhl Publishing.
- González Agirre, Antoni and German Rigau. 2013. Construcción de una base de conocimiento léxico multilingüe de amplia cobertura: Multilingual central repository. Linguamática, 5(1):13-28.
- Izquierdo, Rubén, Armando Suarez, and German Rigau. 2007. Exploring the automatic selection of basic level concepts. In Proceedings of the International Conference on Recent Advances on Natural Language Processing (RANLP'07), Borovetz, Bulgaria.
- Miháltz, M., C. Hatvani, J. Kuti, G. Szarvas, J. Csirik, G. Prószéky, and T. Váradi. 2008. Methods and results of the Hungarian wordnet project. In Proceedings of the Fourth Global WordNet Conference. GWC, pages 387-405, Szeged, Hungary.
- Navigli, Roberto and Simone Paolo Ponzetto. 2012. BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence, 193:217-250.
- Och, Franz Josef and Hermann Ney. 2003. A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1):19-51.
- Oliver, A. and S. Climent. 2012. Building wordnets by machine translation of sense tagged corpora. In Proceedings of the Global WordNet Conference, Matsue, Japan.
- Oliver, Antoni. 2012. WN-Toolkit: un toolkit per a la creació de wordnets a partir de diccionaris bilingües. Linguamática, 4(2):93-101.
- Oliver, Antoni. 2014. WN-Toolkit: Automatic generation of wordnets following the expand model. In Proceedings of the 7th Global WordNet Conference, Tartu, Estonia.
- Oliver, Antoni and Salvador Climent. 2014. Automatic creation of wordnets from parallel corpora. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, May. European Language Resources Association (ELRA).
- Padró, L., S. Reese, E. Agirre, and A. Soroa. 2010. Semantic services in FreeLing 2.1: WordNet and UKB. In Proceedings of the 5th International Conference of the Global WordNet Association (GWC-2010).
- Pease, Adam, Ian Niles, and John Li. 2002. The suggested upper merged ontology: A large ontology for the semantic web and its applications. In Working Notes of the AAAI-2002 Workshop on Ontologies and the Semantic Web, Edmonton.
- Pianta, E., L. Bentivogli, and C. Girardi. 2002. MultiWordNet: Developing an aligned multilingual database. In 1st International WordNet Conference, pages 293-302, Mysore, India.
- Pradet, Quentin, Gaël de Chalendar, and Jaume Baguenier Desormeaux. 2014. WoNeF, an improved, expanded and evaluated automatic French translation of WordNet. In Proceedings of the 7th Global WordNet Conference, Tartu, Estonia.
- Putra, D. D., A. Arfan, and R. Manurung. 2008. Building an Indonesian WordNet. In Proceedings of the 2nd International MALINDO Workshop.
- Raffaeli, Ida, Bekavac Bozo, Zeljko Agic, and Marko Tadic. 2014. Building Croatian WordNet. In Proceedings of the 4th Global WordNet Conference, Szeged, Hungary.
- Sagot, Benoît and Darja Fiser. 2008. Building a free French wordnet from multilingual resources. In Proceedings of OntoLex.
- Vossen, Piek. 1998. EuroWordNet: a multilingual database with lexical semantic networks. Springer.