Profiling a set of personality traits of text author: what our words reveal about us

Open access


Authorship profiling, i.e. revealing information about an unknown author by analyzing their text, is a task of growing importance. One of the most urgent problems of authorship profiling (AP) is selecting text parameters which may correlate to an author’s personality. Most researchers’ selection of these is not underpinned by any theory. This article proposes an approach to AP which applies neuroscience data. The aim of the study is to assess the probability of self-destructive behaviour of an individual via formal parameters of their texts. Here we have used the “Personality Corpus”, which consists of Russian-language texts. A set of correlations between scores on the Freiburg Personality Inventory scales that are known to be indicative of self-destructive behaviour (“Spontaneous Aggressiveness”, “Depressiveness”, “Emotional Lability”, and “Composedness”) and text variables (average sentence length, lexical diversity etc.) has been calculated. Further, a mathematical model which predicts the probability of self-destructive behaviour has been obtained.

Angst, J. and P. Clayton. 1986. Premorbid Personality of Depressive, Bipolar, and Schizophrenic Patients with Special Reference to Suicidal Issues. Comprehensive Psychiatry 27(6). 511‒532.

Argamon, S. et al. 2009. Automatically profiling the author of an anonymous text. Communications of the ACM 52(2). 119‒123.

Baddeley, J. L., Daniel, G. R. and J. W. Pennebaker. 2011. How Henry Hellyer’s Use of Language Foretold His Suicide. Crisis 32 (5). 288‒292.

Bloom, L. R. et al. 1994. Hemispheric Responsibility and Discourse Production: Contrasting Patients with Unilateral Left and Right Hemisphere Damage. In L. R. Bloom, L. K. Obler, S. D. Santi and J. S. Ehrlich (eds.). Discourse Analysis and Applications: Studies in Adult Clinical Populations, 91-94. Lawrence Erlbaum Associates Publishers.

Chung, C. K. and J. W. Pennebaker. 2009. The psychological functions of function words. In K. Fiedler (ed.), Social communication, 343-359. New York: Psychology Press.

Demjen, Z. 2015. Sylvia Plath and the Language of Affective States: Written Discourse and the Experience of Depression. Bloomsbury.

Fernбndez-Cabana, M. et al. 2013. Suicidal Traits in Marilyn Monroe’s Fragments: An LIWC Analysis. Crisis: The Journal of Crisis Intervention and Suicide Prevention 34(2). 124‒130.

Fotekova, T. A. and T. V. Akhutina. 2002. Diagnostika rechevikh narushenii shkol’nikov s ispol’zovaniem neiropsikhologicheskikh metodov [Detecting Speech Impediments in School Children Using Neuropsychological Methods]. Moscow: ARKTI.

Handelman, L. D. and D. Lester. 2007. The Content of Suicide Notes from Attempters and Completers. Crisis 28, 102‒104.

Joiner, T. E., Brown, J. S. and L. R. Jr. Wingate. 2005. The Psychology and Neurobiology of Suicidal Behaviour. Annu Rev Psychol 56. 287‒314.

Jones, N. and C. Bennell. 2007. The Development and Validation of Statistical Prediction Rules for Discriminating Between Genuine and Simulated Suicide Notes. Archives of Suicide Research: Official Journal of the International Academy for Suicide Research 11(2). 219.

Koppel, M., Argamon, S. and A. Shimoni. 2003. Automatically Categorizing Written Texts by Author Gender. Lit and Ling Computing 17(4). 401‒412.

Lester, D. 2014. The “I” of the Storm: Understanding the Suicidal Mind. De Gruyter Open Ltd.

Lightman, E. J. et al. 2007. Using Computational Text Analysis Tools to Compare the Lyrics of Suicidal and non-suicidal Songwriters. In D. S. McNamara & G. Trafton (eds.), Proceedings of the 29th Annual Cognitive Science Society, 1217-1222. Hillsdale, NJ: Erlbaum.

Litvinova, T. A. 2014. Profiling the Author of a Written Text in Russian. Journal of Language and Literature 5(4). 210‒216.

Litvinova, T. A., Seredin, P. V. and O. A. Litvinova. 2015. Using Part-of-Speech Sequences Frequencies in a Text to Predict Author Personality: a Corpus Study. Indian Journal of Science and Technology 8(9). 93‒97.

Long, D. L., et al. 2012. The Organization of Discourse in the Brain: Results from the Item-Priming-in-Recognition Paradigm. In M. Faust (ed.), The Handbook of the Neuropsychology of Language, 77‒99. Wiley-Blackwell.

Marciсczuk, M., Zaњko-Zieliсska, M. and M. Piasecki. 2011. Structure Annotation in the Polish Corpus of Suicide Notes. In I. Habernal and V. Matoušek (ed.), Text, Speech and Dialogue. 14th International Conference, TSD 2011, Pilsen, Czech Republic, September 1-5, 2011. Proceedings, 419‒426. Springer Berlin Heidelberg.

Nini, A. 2014. Authorship Profiling in a Forensic Context. PhD thesis. Aston University.

Noecker Jr, J. W., Ryan, M. and P. Juola. 2013. Psychological Profiling Through Textual Analysis. Lit Linguist Computing 28(3). 382‒387.

Oborneva, I. V. 2005. Avtomatizatsiia otsenki kachestva vostriiatiya vospriiatiya teksta [Automatisation of the Assessment of Perception of a Text]. Vestnik Moskovskogo gorodskogo pedagogicheskogo universiteta [Herald Journal of Moscow State Pedagogical University] 2(5). 86‒92.

Pennebaker, J. W. 2011. The Secret Life of Pronouns: What Our Words Say About Us. New York: Bloomsbury Publishing.

Pennebaker, J. W., Mehl, M. R. and K. Niederhoffer. 2003. Psychological Aspects of Natural Language Use: Our Words, Our Selves. Annual Review of Psychology 54. 547‒577.

Pennebaker, J. W. and L. D. Stone. 2004. What Was She Trying To Say? A Linguistic Analysis of Katie’s Diaries. In D. Lester (ed.), Katie’s Diary: Unlocking the Mystery of a Suicide, 55‒80. New York: Brunner-Routledge.

Pestian, J. et al. 2010. Suicide Note Classification Using Natural Language Processing: A Content Analysis. Biomed Inform Insights 3. 19‒28.

Pilyagina, G. Ya. 2003. Mekhanismi suitsidogeneza i otsenka suitsidal’nogo riska pri razlichnikh formah autoagressivnogo povedeniya [Mechanisms of Suicidogenesis and Assessments of Suicidal Risks in Different Forms of Self-destructive Behaviour]. Arhiv psihiatriї [Psychiatry Archives] 9(4). 18‒26.

Rangel, F. et al. 2014. Overview of the 2nd Author Profiling Task at PAN 2014. In L. Cappellato, N. Ferro, M. Halvey and W. Kraaij (eds.), CLEF 2014 Labs and Workshops, Notebook Papers., vol. 1180 898‒827.

Rangel, F. et al. 2015. Overview of the 3rd Author Profiling Task at PAN 2015. In CEUR Workshop Proceedings. [Online] Available from: [Accessed: 19.12.2016]

Rozanov, V. A. 2004. Neirobiologicheskie osnovi suitsidal’nogo povedeniya [Neurobiuological Foundations of Suicidal Behaviour]. Vestnik biologicheskoj psihiatrii [Herald Journal of Biological Psychiatry] 6. [Online] Available from: [Accessed: 19.12.2016]

Rude, S., Gortner, E. M. and J. Pennebaker. 2004. Language use of depressed and depression-vulnerable college students. Cognition and Emotion 18(8). 1121-1133.

Sakharniy, L. V. 1994. Chelovek i tekst: dve grammatiki teksta [Man and Text: Two Grammars of a Text]. Chelovek – tekst – kul’tura [Man – Text – Culture]. Yekaterinburg. 17‒20.

Schler, J. et al. 2006. Effects of Age and Gender on Blogging. In Proc. of AAAI Spring Symposium on Computational Approaches for Analyzing Weblogs, 199-205. AAAI.

Sedov, K. F. 2007. Neiropsikholingvistika [Neurolinguistics]. Moscow: Labirint.

Stirman, S. W. and J. W. Pennebaker. 2001. Word Use in the Poetry of Suicidal and Non-Suicidal Poets. Psychosom Med 63(4). 517‒522.

Tausczik, Y. R. and J. W. Pennebaker. 2010. The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods. Journal of Language And Social Psychology 29(1). 24‒54.

Yegorov, A. Yu. 1999. Koordinatsiya dejatel’nosti polusharii mozga cheloveka pri osushestvlenii kognitivnikh funktsii [Coordination of the Activities of the Right Hemisphere of the Human Brain]: abstract of thesis for PhD in Medicine. Saint Petersburg.

Yegorov, A. Yu. and O. V. Ivanov. 2007. Osobennosti individual’nykh profilei funktsional’noi assimetrii u lits sovershivshikh suitsidal’nuiu popytku [Features of Individual Profiles of Functional AssymetryAsymmetry in Individuals Committed a Suicidal Attempt]. Social and Clinical Psychiatry 2. 20‒24.

Research in Language

The Journal of University of Lodz

Journal Information

CiteScore 2016: 0.27

SCImago Journal Rank (SJR) 2016: 0.271
Source Normalized Impact per Paper (SNIP) 2016: 0.453


All Time Past Year Past 30 Days
Abstract Views 0 0 0
Full Text Views 74 74 55
PDF Downloads 10 10 7