Possible but not probable :A quantitative analysis of valency behaviour of Czech nouns in the Prague Dependency Treebank

Open access

Abstract

In order to optimize corpus searches for valency lexicon production, we analyse the relative frequencies of different combinations of valency complementations of Czech deverbal nouns in the Prague Dependency Treebank, considering differences between productively and non-productively derived nouns and their semantic class. We also classify combinations of forms of participants according to their frequency.

[1] Bejček, E. et al. (2013). Prague Dependency Treebank 3.0. Data/software, Univerzita Karlova v Praze, MFF, ÚF AL, Prague, Czech Republic.

[2] Benko, V. (2014). Aranea: Yet Another Family of (Comparable) Web Corpora. In Sojka, P. et al., editors, TSD 2014. LNAI 8655, pages 247–256, Springer International Publishing.

[3] Bick, E. (2009). DeepDict – A Graphical Corpus-based Dictionary of Word Relations. In Bick, E., Hagen, K., Müürisep, K., and Trosterud, T., editors, Proceedings of the NODALIDA 2009 workshop Constraint Grammar and robust parsing. NEALT Proceedings Series, Vol. 8, i-ii, Northern European Association for Language Technology (NEALT), Tartu University.

[4] Collins COBUILD Advanced Learner’s Dictionary (2014).

[5] Čermáková, A. (2009). Valence českých substantiv. Lidové noviny, Praha.

[6] Hajič, J., Panevová, J., Urešová, Z., Bémová, A., Kolářová, V., and Pajas, P. (2003). PDT-VALLEX: Creating a Largecoverage Valency Lexicon for Treebank Annotation. In Proceedings of The Second Workshop on Treebanks and Linguistic Theories, pages 57–68, Växjö University Press, Sweden.

[7] Herbst, T., Heath, D., Roe, I. F., and Götz, D. (2004). A valency dictionary of English: a corpus-based analysis of the complementation patterns of English verbs, nouns, and adjectives. Walter de Gruyter, Berlin, Germany.

[8] Kettnerová, V., Lopatková, M., and Hrstková, K. (2008). Semantic Classes in Czech Valency Lexicon: Verbs of Communication and Verbs of Exchange. In Lecture Notes in Computer Science, Vol. 5246, Proceedings of the 11th International Conference, TSD 2008, pages 109–116, Springer, Berlin – Heidelberg, Germany.

[9] Kilgarriff, A. and Tugwell, D. (2001). WORD SKETCH: Extraction and display of significant collocations for lexicography. In Proc Collocations workshop, pages 32–38, ACL, Toulouse, France.

[10] Kilgarriff, A., Husák, M., McAdam, K., Rundell, M., and Rychlý, P. (2008). GDEX: Automatically finding good dictionary examples in a corpus. In Proceedings of the 13th EURALEX International Congress, pages 425–432, Spain.

[11] Klímová, J., Kolářová, V., and Vernerová, A. (2016). Towards a Corpus-based Valency Lexicon of Czech Nouns. In Kernerman, I. et al., editors, Globalex 2016, Lexicographic Resources for Human Language Technology, pages 1–7. Accessible at: http://ailab.ijs.si/globalex/files/2016/06/LREC2016Workshop-GLOBALEX_Proceedings-v2.pdf.

[12] Kolářová, V. (2014). Special valency behavior of Czech deverbal nouns. In Spevak, O., editor, Noun Valency, pages 19–60, John Benjamins, Amsterdam, Netherlands.

[13] Lexikální databáze Pralex: Accessible at: http://lexiko.ujc.cas.cz/index.php?-page=23.

[14] Longman Dictionary of Contemporary English. (2014). 6th Edition. Pearson.

[15] Lopatková, M., Kettnerová, V., Bejček, E., Vernerová, A., and Žabokrtský, Z. (2015). VALLEX 3.0 – Valenční slovník českých sloves. Charles University in Prague. Accessible at: http://ufal.mff.cuni.cz/vallex/3.0/.

[16] Lopatková, M., Kettnerová, V., Bejček, E., Vernerová, A., and Žabokrtský, Z. (2016). Valenční slovník českých sloves VALLEX. Karolinum, Praha.

[17] Mikulová, M. et al. (2006). Annotation on the tectogrammatical level in the Prague Dependency Treebank. Annotation manual. Technical Report TR-2006-30, ÚF AL MFF UK, Praha.

[18] Panevová, J. (2002). K valenci substantiv (s ohledem na jejich derivaci). In Zbornik matice srpske za slavistiku, vol. 61, pages 29–36.

[19] Panevová, J. (2014). Contribution of Valency to the Analysis of Language. In Spevak, O., editor, Noun Valency, pages 1–18, John Benjamins Publishing Company, Amsterdam, Netherlands.

[20] Příruční slovník jazyka českého. (1935–1957). Státní nakladatelství – Školní nakladatelství – Státní pedagogické nakladatelství, Praha.

[21] Sgall, P., Hajičová, E., and Panevová, J. (1986). The Meaning of the Sentence in its Semantic and Pragmatic Aspects. Reidel, Dordrecht.

[22] Stevenson A., editor (2010). Oxford Dictionary of English. Oxford University Press.

[23] Štěpánek, J. and Pajas, P. (2010). Querying Diverse Treebanks in a Uniform Way. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC’10), pages 1828–1835, European Language Resources Association (ELRA), Valletta, Malta.

[24] Urešová, Z. (2012). Building the PDT-VALLEX valency lexicon. In Proceedings of the fifth Corpus Linguistics Conference, pages 1–18, University of Liverpool, Liverpool, UK.

Journal of Linguistics/Jazykovedný casopis

The Journal of Ludovít Štúr Institute of Linguistics, SAV

Journal Information


CiteScore 2017: 0.03

SCImago Journal Rank (SJR) 2017: 0.101
Source Normalized Impact per Paper (SNIP) 2017: 0.203

Metrics

All Time Past Year Past 30 Days
Abstract Views 0 0 0
Full Text Views 143 142 11
PDF Downloads 69 68 1