Hostname: page-component-cd9895bd7-gxg78 Total loading time: 0 Render date: 2024-12-27T14:19:13.871Z Has data issue: false hasContentIssue false

FinnFN 1.0: The Finnish frame semantic database

Published online by Cambridge University Press:  14 August 2017

Krister Lindén
Affiliation:
Nykykielten laitos, PL 24, 00014 Helsingin yliopisto, Unioninkatu 40, Finland. krister.linden@helsinki.fi, heidi.haltia@gmail.com, juha.luukkonen@iki.fi, antti.o.laine@helsinki.fi, hege.roivainen@helsinki.fi, niina.vaisanen@helsinki.fi
Heidi Haltia
Affiliation:
Nykykielten laitos, PL 24, 00014 Helsingin yliopisto, Unioninkatu 40, Finland. krister.linden@helsinki.fi, heidi.haltia@gmail.com, juha.luukkonen@iki.fi, antti.o.laine@helsinki.fi, hege.roivainen@helsinki.fi, niina.vaisanen@helsinki.fi
Juha Luukkonen
Affiliation:
Nykykielten laitos, PL 24, 00014 Helsingin yliopisto, Unioninkatu 40, Finland. krister.linden@helsinki.fi, heidi.haltia@gmail.com, juha.luukkonen@iki.fi, antti.o.laine@helsinki.fi, hege.roivainen@helsinki.fi, niina.vaisanen@helsinki.fi
Antti O. Laine
Affiliation:
Nykykielten laitos, PL 24, 00014 Helsingin yliopisto, Unioninkatu 40, Finland. krister.linden@helsinki.fi, heidi.haltia@gmail.com, juha.luukkonen@iki.fi, antti.o.laine@helsinki.fi, hege.roivainen@helsinki.fi, niina.vaisanen@helsinki.fi
Henri Roivainen
Affiliation:
Nykykielten laitos, PL 24, 00014 Helsingin yliopisto, Unioninkatu 40, Finland. krister.linden@helsinki.fi, heidi.haltia@gmail.com, juha.luukkonen@iki.fi, antti.o.laine@helsinki.fi, hege.roivainen@helsinki.fi, niina.vaisanen@helsinki.fi
Niina Väisänen
Affiliation:
Nykykielten laitos, PL 24, 00014 Helsingin yliopisto, Unioninkatu 40, Finland. krister.linden@helsinki.fi, heidi.haltia@gmail.com, juha.luukkonen@iki.fi, antti.o.laine@helsinki.fi, hege.roivainen@helsinki.fi, niina.vaisanen@helsinki.fi
Get access

Abstract

The article describes the process of creating a Finnish language FrameNet or FinnFN, based on the original English language FrameNet hosted at the International Computer Science Institute in Berkeley, California. We outline the goals and results relating to the FinnFN project and especially to the creation of the FinnFrame corpus. The main aim of the project was to test the universal applicability of frame semantics by annotating real Finnish using the same frames and annotation conventions as in the original Berkeley FrameNet project. From Finnish newspaper corpora, 40,721 sentences were automatically retrieved and manually annotated as example sentences evoking certain frames. This became the FinnFrame corpus. Applying the Berkeley FrameNet annotation conventions to the Finnish language required some modifications due to Finnish morphology, and a convention for annotating individual morphemes within words was introduced for phenomena such as compounding, comparatives and case endings. Various questions about cultural salience across the two languages arose during the project, but problematic situations occurred only in a few examples, which we also discuss in the article. The article shows that, barring a few minor instances, the universality hypothesis of frames is largely confirmed for languages as different as Finnish and English.

Type
Research Article
Copyright
Copyright © Nordic Association of Linguistics 2017 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

REFERENCES

Baker, Collin F., Fillmore, Charles J. & Lowe, John B.. 1998. The Berkeley FrameNet project. In Association for Computational Linguistics (ed.), Proceedings of the 17th International Conference on Computational Linguistics (Coling 1998), 86–90.Google Scholar
Bauer, Daniel, Fürstenau, Hagen & Rambow, Owen. 2012. The dependency-parsed FrameNet corpus. Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey, 38613867.Google Scholar
Bertoldi, Anderson & de Oliveira Chishman, Rove Luiza. 2011. The limits of using FrameNet Frames to build a legal ontology. CEUR Workshop Proceedings 776, 207–212.Google Scholar
Bertoldi, Anderson & de Oliveira Chishman, Rove Luiza. 2012. Developing a framebased lexicon for the Brazilian legal language: The case of the Criminal_Process Frame. In Palmirani, Monica, Pagallo, Ugo, Casanovas, Pompeu & Sartor, Giovanni (eds.), AI Approaches to the Complexity of Legal Systems: Models and Ethical Challenges for Legal Systems, Legal Language and Legal Ontologies, Argumentation and Software Agents (Lecture Notes in Computer Science 7639), 256270.Google Scholar
Bick, Eckhard. 2011. A FrameNet for Danish. Proceedings of NODALIDA 2011. Riga, Latvia (NEALT Proceedings Series 11), 3441. Tartu: Tartu University Library.Google Scholar
Borin, Lars, Dannélls, Dana, Forsberg, Markus, Gronostaj, Maria Toporowska & Kokkinakis, Dimitrios. 2010. Swedish FrameNet++. Swedish Language Technology Conference 2010. https://svn.spraakdata.gu.se/sb/fnplusplus/pub/sltc2010.pdf (accessed 23 November 2015).Google Scholar
Borin, Lars, Forsberg, Markus & Roxendal, Johan. 2012. Korp – the corpus infrastructure of Språkbanken. Proceedings of Eighth International Conference on Language Resources (LREC 2012) , Istanbul, Turkey, 474478.Google Scholar
Burchardt, Aljoscha, Erk, Katrin, Frank, Anette, Kowalski, Andrea, Padó, Sebastian & Pinkal, Manfred. 2006. The SALSA corpus: A German corpus resource for lexical semantics. Proceedings of Fifth International Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, 969974.Google Scholar
Candito, Mandito, Amsili, Pascal, Barque, Lucie, Benamara, Farah, de Chalendar, Gaël, Djemaa, Marianne, Haas, Pauline, Huyghe, Richard, Mathieu, Yvette Yannick, Muller, Philippe, Sagot, Benoît & Vieu, Laure. 2014. Developing a French FrameNet: Methodology and first results. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014), Reykjavik, Iceland, 13721379. http://www.linguist.jussieu.fr/~amsili/papers/asfalda_lrec2014.final.pdf (accessed 23 September 2015).Google Scholar
Dannélls, Dana. 2010. Applying semantic frame theory to automate natural languagetemplate generation from ontology statements. Proceedings of the 6th International Natural Language Generation Conference 2010, 179–183.Google Scholar
Duran, Magali Sanches & Aluísio, Sandra Maria. 2011. Propbank-Br: A Brazilian Portuguese corpus annotated with semantic role labels. 8th Brazilian Symposium in Information and Human Language Technology, 164–168.Google Scholar
Fillmore, Charles J. 1976. Frame semantics and the nature of language. Origins and Evolution of Language and Speech (Annals of the New York Academy of Sciences 280) , 2032.Google Scholar
Fillmore, Charles J. 1982. Frame semantics. In Linguistics Society of Korea (ed.), Linguistics in the Morning Calm, 111137. Seoul: Hanshin Publishing Company.Google Scholar
Fillmore, Charles J. & Baker, Collin F.. 2010. A frames approach to semantic analysis. In Heine, Bernd & Narrog, Heiko (ed.), The Oxford Handbook of Linguistic Analysis, 313339. New York: Oxford University Press.Google Scholar
Fillmore, Charles J., Johnson, Christopher R. & Petruck, Miriam R. L.. 2003. Background to FrameNet. International Journal of Lexicography 16 (3), 235250.Google Scholar
Fontenelle, Thierry (ed.). 2003. International Journal of Lexicography 16 (3). [Special issue on FrameNet] Google Scholar
Friberg Heppin, Karin & Friberg, Håkan. 2012. Using FrameNet in communicative language teaching. Proceedings of the XV EURALEX International Congress in Oslo, Norway, 640–647.Google Scholar
Friberg Heppin, Karin & Petruck, Miriam R. L.. 2014. Encoding of compounds in Swedish FrameNet. Proceedings of the 10th Workshop on Multiword Expressions (MWE 2014), 67–71.Google Scholar
Friberg Heppin, Karin & Gronostaj, Maria Toporowska. 2014. Exploiting FrameNet for Swedish mismatch? Constructions and Frames 6 (1), 5272.Google Scholar
Hahm, Younggun, Kim, Youngsik, Won, Yousung, Woo, Jongsung, Seo, Jiwoo, Kim, Jiseong, Park, Seongbae, Hwang, Dosam & Key-Sun-Choi. 2014. Toward matching the relation instantiation from DBpedia ontology to Wikipedia text: Fusing FrameNet to Korean. Proceedings of the 10th International Conference on Semantic Systems, 1319. Association for Computing Machinery (ACM).Google Scholar
Hamunen, Markus. 2012. Koloratiivirakenne, liike ja tapa [The colorative construction and manner]. In Herlin, Ilona & Kotilainen, Lari (eds.), Verbit ja konstruktiot [Verbs and constructions], 104140. Helsinki: Suomalaisen Kirjallisuuden Seura.Google Scholar
Hayoun, Avi & Elhadad, Michael. 2015. The Hebrew FrameNet project. Israeli Seminar on Computational Linguistics. Ra'anana: The Open University of Israel. http://www.openu.ac.il/iscol2015/downloads/ISCOL2015_submission_30_c_18.pdf (accessed 4 June 2017).Google Scholar
Jarva, Vesa & Kytölä, Samu. 2007. The Finnish colorative construction and expressivity. SKY Journal of Linguistics 20, 235272.Google Scholar
Lenci, Alessandro, Johnson, Martina & Lapesa, Gabriella. 2010. Building an Italian FrameNet through Semi-automatic Corpus Analysis. Proceedings of the International Conference on Language Resources and Evaluation (LREC 2010), Valletta, Malta. 1219.Google Scholar
Lindén, Krister, Haltia, Heidi, Laine, Antti, Luukkonen, Juha, Piitulainen, Jussi & Väisänen, Niina. To appear. TransFrame – Translating frames in the FinnFrameNet Project. Language Resources and Evaluation.Google Scholar
Lönneker-Rodman, Birte. 2007a. Beyond syntactic valence: FrameNet markup of example sentences in a Slovenian–German online dictionary. Proceedings of Fourth International Seminar on Computer Treatment of Slavic and East European Languages (Slovko 2007), Bratislava, Slovakia, 152164.Google Scholar
Lönneker-Rodman, Birte. 2007b. Multilinguality and FrameNet. Technical Report TR-07-001, ICSI, Berkeley, CA.Google Scholar
Meurs, Marie-Jean, Duvert, Frédéric, Béchet, Frédéric, Lefèvre, Fabrice & de Mori, Renato. 2008. Semantic frame annotation on the French MEDIA corpus. Proceedings of the Sixth Language Resources and Evaluation Conference (LREC 2008), Marrakech, Marocco, 10141018.Google Scholar
Ohara, Kyoko. 2012. Semantic annotations in Japanese FrameNet: Comparing frames in Japanese and English. Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey, 15591562.Google Scholar
Ohara, Kyoko., Fujii, Seiko, Ohori, Toshio, Suzuki, Ryoko, Saito, Hiroaki & Ishizaki, Shun. 2004. The Japanese FrameNet project: An introduction. Proceedings of the Workshop on Building Lexical Resources from Semantically Annotated Corpora, 9–12.Google Scholar
Palmer, Alexis & Sporleder, Caroline. 2010. Evaluating FrameNet-style semantic parsing: The role of coverage gaps in FrameNet. Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010): Poster Volume, 928–936.Google Scholar
Petruck, Miriam R. L. 2013. Introduction. In Fried, Mirjam & Nikiforidou, Kiki (eds.), Advances in Frame Semantics (Benjamins Current Topics 58), 112. Amsterdam: John Benjamins.Google Scholar
Ruppenhofer, Josef, Ellsworth, Michael, Petruck, Miriam R. L., Johnson, Christopher R. & Scheffczyk, Jan. 2010. FrameNet II: Extended theory and practice. https://framenet2.icsi.berkeley.edu/docs/r1.5/book.pdf (accessed 19 October 2015).Google Scholar
Subirats, Carlos & Petruck, Miriam R. L.. 2003. Surprise: Spanish FrameNet. Proceedings of CIL 17.Google Scholar
Tonelli, Sara & Pianta, Emanuele. 2008. Frame information transfer from English to Italian. Proceedings of the Sixth Language Resources and Evaluation Conference (LREC 2008), Marrakech, Marocco, 22522256.Google Scholar
You, Liping & Liu, Kaiying. 2005. Building Chinese Framenet database. Proceedings of the Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE 2005), 301–306.Google Scholar
Zawisławska, Magdalena, Derwojedowa, Magdalena & Linde-Usiekniewicz, Jadwiga. 2008. A FrameNet for Polish. Converging Evidence: Proceedings to the Third International Conference of the German Cognitive Linguistics Association (GCLA’08), 116–117.Google Scholar