Personality and Word Use: Study on Czech Language and the Big Five

. 2022 Oct ; 51 (5) : 1165-1196. [epub] 20220517

Jazyk angličtina Země Spojené státy americké Médium print-electronic

Typ dokumentu časopisecké články

Perzistentní odkaz   https://www.medvik.cz/link/pmid35579837

Grantová podpora
16-19087S Grantová Agentura České Republiky
2020-28-11 j. william fulbright commision czech republic

Odkazy

PubMed 35579837
DOI 10.1007/s10936-022-09892-6
PII: 10.1007/s10936-022-09892-6
Knihovny.cz E-zdroje

The study is a follow-up to three published anglophone researches examining the relation between the use of linguistic categories and personality characteristics as outlined in the Big Five model, with the purpose of replicating these and elaborating for the Czech language. The comparative research study in Czech focuses on analysis of both grammatical and semantic variables in six types of text (written and oral), produced by N = 200 participants. Within the study, there were six confirmed relations, however, these appear only in certain types of text. The results show not only an essential role of the text register, but they also allow us to evaluate the universality of findings of studies in English in comparison with other, especially Slavic, languages.

Zobrazit více v PubMed

Avolio, B. J., & Gardner, W. L. (2005). Authentic leadership development: Getting to the root of positive forms of leadership. The Leadership Quarterly, 16(3), 315–338. https://doi.org/10.1016/j.leaqua.2005.03.001 DOI

Azucar, D., Marengo, D., & Settanni, M. (2018). Predicting the Big 5 personality traits from digital footprints on social media: A meta-analysis. Personality and Individual Differences, 124, 150–159. https://doi.org/10.1016/j.paid.2017.12.018 DOI

Basnight-Brown, D. M., & Altarriba, J. (2018). The influence of emotion and culture on language representation and processing. In Advances in culturally-aware intelligent systems and in cross-cultural psychological studies (pp. 415–432). Springer. https://doi.org/10.1007/978-3-319-67024-9_19

Berry, D. S., Pennebaker, J. W., Mueller, J. S., & Hiller, W. S. (1997). Linguistic bases of social perception. Personality and Social Psychology Bulletin, 23(5), 526–537. https://doi.org/10.1177/0146167297235008 DOI

Biber, D. (1991). Variation across speech and writing. Cambridge University Press.

Biber, D. (1993). Using register-diversified corpora for general language studies. Computational Linguistics, 19(2), 219–241.

Biber, D. (1995). Dimensions of register variation: A cross-linguistic comparison. Cambridge University Press. https://doi.org/10.1017/CBO9780511519871 DOI

Bjekić, J., Lazarević, L., Erić, M., Stojimirović, E., & Đokić, T. (2012). Development of Serbian dictionary for automatic text analysis (LIWCser). Psiholoska Istrazivanja, 15(1), 85–110. https://doi.org/10.5937/PsIstra1201085B DOI

Brewer, M. B., & Gardner, W. (1996). Who is this “We”? Levels of collective identity and self-representations. Journal of Personality and Social Psychology, 71(1), 83. https://doi.org/10.1037/0022-3514.71.1.83 DOI

Chen, S. X., & Bond, M. H. (2010). Two languages, two personalities? Examining language effects on the expression of personality in a bilingual context. Personality and Social Psychology Bulletin, 36(11), 1514–1528. https://doi.org/10.1177/0146167210385360 PubMed DOI

Chen, J., Qiu, L., & Ho, M.-H.R. (2020). A meta-analysis of linguistic markers of extraversion: Positive emotion and social process words. Journal of Research in Personality, 89, 104035. https://doi.org/10.1016/j.jrp.2020.104035 DOI

Cobb-Clark, D. A., & Schurer, S. (2012). The stability of big-five personality traits. Economics Letters, 115(1), 11–15. https://doi.org/10.1016/j.econlet.2011.11.015 DOI

Cvrček, V., Komrsková, Z., & Lukeš, D. (2018a). Rozsah registrové variability textů [Scope of register variability of texts]. In D. Kučera, J. M. Havigerová, J. Haviger, V. Cvrček, Z. Komrsková, D. Lukeš, T. Jelínek, T. Urbánek, J. Franková (Eds.), Výzkum CPACT: Komputační psycholingvistická analýza českého textu [CPACT Research: Computational psycholinguistic analysis of Czech text] (pp. 153–172). České Budějovice: PF JU.

Cvrček, V., Komrsková, Z., Lukeš, D., Poukarová, P., Řehořková, A., & Zasina, A. (2018b). From extra- to intratextual characteristics: Charting the space of variation in Czech through MDA. Corpus Linguistics and Linguistic Theory. https://doi.org/10.1515/cllt-2018-0020 DOI

Cvrček, V., Laubeová, Z., Lukeš, D., Poukarová, P., Řehořková, A., & Zasina, A. J. (2020). Author and register as sources of variation: A corpus-based study using elicited texts. International Journal of Corpus Linguistics, 25(4), 461–488. https://doi.org/10.1075/ijcl.19020.cvr DOI

Czech Statistical Office – Český statistický úřad (2015). Věk a vzdělání populace. Retrieved from https://www.czso.cz

ČNK (Czech National Corpus) (2018). The Koditex Corpus. Retrieved from http://wiki.korpus.cz/doku.php/en:cnk:koditex

Demjén, Z. (2014). Drowning in negativism, self-hate, doubt, madness: Linguistic insights into Sylvia Plath’s experience of depression. Communication and Medicine, 11(1), 41–54. https://doi.org/10.1558/cam.v11i1.18478 PubMed DOI

Dino, A., Reysen, S., & Branscombe, N. R. (2009). Online interactions between group members who differ in status. Journal of Language and Social Psychology, 28(1), 85–93. https://doi.org/10.1177/0261927X08325916 DOI

Dudău, D. P., & Sava, F. A. (2021). Performing multilingual analysis with Linguistic Inquiry and Word Count 2015 (LIWC2015). An equivalence study of four languages. Frontiers in Psychology. https://doi.org/10.3389/fpsyg.2021.570568 PubMed DOI PMC

Fleková, L., Lampos, V., & Cox, I. J. (2018). Changes in psycholinguistic attributes of social media users before, during, and after self-reported influenza symptoms. In Proceedings of the 2018 EMNLP workshop SMM4H: The 3rd social media mining for health applications workshop & shared task (pp. 17–21). https://doi.org/10.18653/v1/W18-5905

Freud, S. (1901). Psychopathology of everyday life. Basic Books.

Garimella, A., Mihalcea, R., & Pennebaker, J. (2016). Identifying cross-cultural differences in word usage. In Proceedings of COLING 2016, the 26th international conference on computational linguistics: Technical Papers (pp. 674–683). https://www.aclweb.org/anthology/C16-1065

Gill, A. J., & Oberlander, J. (2019). Taking care of the linguistic features of extraversion. In Proceedings of the twenty-fourth annual conference of the cognitive science society (pp. 363–368). https://doi.org/10.4324/9781315782379-99

Goldberg, L. R. (1992). The development of markers for the big-five factor structure. Psychological Assessment, 4, 26–42. https://doi.org/10.1037/1040-3590.4.1.26 DOI

Goldberg, L. R., Johnson, J. A., Eber, H. W., Hogan, R., Ashton, M. C., Cloninger, C. R., & Gough, H. G. (2006). The international personality item pool and the future of public-domain personality measures. Journal of Research in Personality, 40(1), 84–96. https://doi.org/10.1016/j.jrp.2005.08.007 DOI

Hajič, J. (2001). Disambiguation of rich inflection. Karolinum.

Hamilton, R. V. (1957). A psycholinguistic analysis of some interpretive processes of three basic personality types. Journal of Social Psychology, 46, 153–177. https://doi.org/10.1080/00224545.1957.9714317 DOI

Hanel, P. H., & Vione, K. C. (2016). Do student samples provide an accurate estimate of the general public? PLoS ONE, 11(12), e0168354. https://doi.org/10.1371/journal.pone.0168354 PubMed DOI PMC

Harley, T. A. (2013). The psychology of language: From data to theory. Psychology Press. https://doi.org/10.4324/9781315859019 DOI

Harris, M. A., Brett, C. E., Johnson, W., & Deary, I. J. (2016). Personality stability from age 14 to age 77 years. Psychology of Aging, 31(8), 862–874. https://doi.org/10.1037/pag0000133 DOI

Holtgraves, T. (2011). Text messaging, personality, and the social context. Journal of Research in Personality, 45(1), 92–99. https://doi.org/10.1016/j.jrp.2010.11.015 DOI

Hornová, L. (2003). Referenční slovník gramatických termínů. Univerzita Palackého v Olomouci.

Hřebíčková, M., Jelínek, M., Blatný, M., Brom, C., Burešová, I., Graf, S., Mejzlíková, T., Vazsonyi, A. T., & Zábrodská, K. (2016). Big Five Inventory: Základní psychometrické charakteristiky české verze BFI-44 A BFI-10. Československá Psychologie, 60(6), 567.

Irawan, S. S. (2018). Power of social class and its impact on language use. International Journal of Multicultural and Multireligious Understanding, 5(6), 166–171. https://doi.org/10.18415/ijmmu.v5i6.550 DOI

Ireland, M. E., & Mehl, M. R. (2014). Natural language use as a marker. In T. M. Holtgraves (Ed.), The Oxford handbook of language and social psychology (pp. 201–237). Oxford University Press.

Irvine, J. T. (1985). Status and style in language. Annual Review of Anthropology, 14, 557–581. DOI

Jackson, J. C., Watts, J., Henry, T. R., List, J.-M., Forkel, R., Mucha, P. J., Greenhill, S. J., Gray, R. D., & Lindquist, K. A. (2019). Emotion semantics show both cultural variation and universal structure. Science, 366(6472), 1517–1522. https://doi.org/10.1126/science.aaw8160 PubMed DOI

Jelínek, T. (2018). Současná východiska komputační lingvistiky a její aplikace. In D. Kučera, J. M. Havigerová, J. Haviger, V. Cvrček, T. Jelínek, Z. Komrsková, D. Lukeš, T. Urbánek, & J. Franková (Eds.), Výzkum CPACT: Komputační psycholingvistická analýza českého textu [CPACT Research: Computational psycholinguistic analysis of Czech text]. České Budějovice: Vydavatelství Pedagogické fakulty Jihočeské univerzity v Českých Budějovicích.

John, O. P., Donahue, E. M., & Kentle, R. L. (1991). The Big Five Inventory – Versions 4a and 4b (Technical Report). Institute for Personality and Social Research, University of California. https://doi.org/10.1037/t07550-000 DOI

John, O. P., Naumann, L. P., & Soto, C. J. (2008). Paradigm shift to the integrative big five trait taxonomy. Handbook of Personality: Theory and Research, 3(2), 114–158.

Kacewicz, E., Pennebaker, J. W., Davis, M., Jeon, M., & Graesser, A. C. (2014). Pronoun use reflects standings in social hierarchies. Journal of Language and Social Psychology, 33(2), 125–143. https://doi.org/10.1177/0261927X13502654 DOI

Kartelj, A., Filipović, V., & Milutinović, V. (2012, May). Novel approaches to automated personality classification: Ideas and their potentials. In MIPRO, 2012 Proceedings of the 35th international convention (pp. 1017–1022). IEEE.

Kim, U., Park, Y.-S., & Park, D. (2000). The challenge of cross-cultural psychology: The role of the indigenous psychologies. Journal of Cross-Cultural Psychology, 31(1), 63–75. https://doi.org/10.1177/0022022100031001006 DOI

Kučera, D. (2017). Computational psycholinguistic analysis of Czech text and the CPACT research. In ISC SGEM (Eds.), 4th international multidisciplinary scientific conference on social sciences and Arts SGEM 2017: Science & society conference proceedings (Vol. 2, pp. 77–84). Albena, Bulgaria: ISC SGEM. https://doi.org/10.5593/sgemsocial2017/32/S11.010

Kučera, D. (2018). Výzkumný soubor a metody výzkumu CPACT [Research sample and methods in the CPACT research]. In D. Kučera, J. M. Havigerová, J. Haviger, V. Cvrček, Z. Komrsková, D. Lukeš, T. Jelínek, T. Urbánek, J. Franková (Eds.), Výzkum CPACT: Komputační psycholingvistická analýza českého textu [CPACT Research: Computational psycholinguistic analysis of Czech text] (pp. 59–86). České Budějovice: PF JU.

Kučera, D., & Havigerová, J. M. (2015). Interpersonální charakteristiky komunikátora z pohledu kvantitativní psycholingvistické analýzy (předběžné sdělení z výzkumné studie) [Interpersonal Characteristics of a Communicator from the Perspective of Quantitative Psycholinguistic Analysis Research Study Preliminary Report]. In M. Bozogáňová et al. (Eds.), Sociálne procesy a osobnosť 2014: Človek a spoločnosť: Zborník príspevkov zo 17. ročníka medzinárodnej konferencie (pp. 267–274). Košice: Spoločenskovedný ústav SAV.

Lee, C. H., Kim, K., Seo, Y. S., & Chung, C. K. (2007). The relations between personality and language use. Journal of General Psychology, 134(4), 405–413. https://doi.org/10.3200/GENP.134.4.405-414 PubMed DOI

Linkov, V., & Šmerk, P. (2009). Rozdíly mezi pravdivou a lživou online textovou komunikací (Differences between true and deceptive online text communication). Sociální studia/Social Studies, 6(2). https://doi.org/10.5817/SOC2009-2-73

Litvinova, O., Seredin, P., Litvinova, T., & Lyell, J. (2017). Deception detection in Russian texts. In Proceedings of the student research workshop at the 15th conference of the European Chapter of the association for computational linguistics (pp. 43–52). https://doi.org/10.18653/v1/E17-4005

Lukeš, D. (2018). Text parameters comparison for the CPACT research [Unpublished manuscript], Czech National Corpus, Praha.

Majumder, N., Poria, S., Gelbukh, A., & Cambria, E. (2017). Deep learning-based document modeling for personality detection from text. IEEE Intelligent Systems, 32(2), 74–79. https://doi.org/10.1109/MIS.2017.23 DOI

Meehl, P. E. (1986). Trait language and behaviorese. In T. Thompson & M. Zeiler (Eds.), Analysis and integration of behavioral units (pp. 315–334). Lawrence Erlbaum.

Mehl, M. R., & Pennebaker, J. W. (2003). The sounds of social life: A psychometric analysis of students’ daily social environments and natural conversations. Journal of Personality and Social Psychology, 84(4), 857. https://doi.org/10.1037/0022-3514.84.4.857 PubMed DOI

Newman, M. L., Groom, C. J., Handelman, L. D., & Pennebaker, J. W. (2008). Gender differences in language use: An analysis of 14,000 text samples. Discourse Processes, 45(3), 211–236. https://doi.org/10.1080/01638530802073712 DOI

Panicheva, P., Ledovaya, Y., & Bogolyubova, O. (2016, November). Lexical, morphological and semantic correlates of the dark triad personality traits in Russian Facebook texts. In Artificial intelligence and natural language conference (AINL), IEEE (pp. 1–8). IEEE.

Park, G., Schwartz, H. A., Eichstaedt, J. C., Kern, M. L., Kosinski, M., Stillwell, D. J., & Seligman, M. E. (2015). Automatic personality assessment through social media language. Journal of Personality and Social Psychology, 108(6), 934–952. https://doi.org/10.1037/pspp0000020 PubMed DOI

Parkvall, M. (2007). Världens 100 största språk 2007 [The world‘s 100 largest languages in 2007]. In Nationalencyklopedin, 32.

Pennebaker, J. W., Boyd, R. L., Jordan, K., & Blackburn, K. (2015). The development and psychometric properties of LIWC2015. University of Texas at Austin.

Pennebaker, J. W., Chung, C. K., Ireland, M., Gonzales, A. L., & Booth, R. J. R. J. (2007). The development and psychometric properties of LIWC2007. The University of Texas at Austin, USA & The University of Auckland, New Zealand.

Pennebaker, J. W., & Graybeal, A. (2001). Patterns of natural language use: Disclosure, personality, and social integration. Current Directions in Psychological Science, 10(3), 90–93. https://doi.org/10.1111/1467-8721.00123 DOI

Pennebaker, J. W., & King, L. A. (1999). Linguistic styles: Language use as an individual difference. Journal of Personality and Social Psychology, 77(6), 1296–1312. https://doi.org/10.1037/0022-3514.77.6.1296 PubMed DOI

Pennebaker, J. W., & Lay, T. C. (2002). Language use and personality during crises: Analyses of Mayor Rudolph Giuliani’s press conferences. Journal of Research in Personality, 36(3), 271–282. https://doi.org/10.1006/jrpe.2002.2349 DOI

Pennebaker, J. W., Mehl, M. R., & Niederhoffer, K. G. (2003). Psychological aspects of natural language use: Our words, our selves. Annual Review of Psychology, 54(1), 547–577. https://doi.org/10.1146/annurev.psych.54.101601.145041 PubMed DOI

Petkevič, V. (2006). Reliable morphological disambiguation of Czech: Rule-based approach is necessary. In M. Šimková (Ed.), Insight into the Slovak and Czech corpus linguistics (pp. 26–44). Veda.

Qui, L., Lu, J., Ramsay, J., Yang, S., Qu, W., & Zhu, T. (2017). Personality expression in Chinese language use. International Journal of Psychology, 52(6), 463–472. https://doi.org/10.1002/ijop.12259 DOI

Ramírez-Esparza, N., Chung, C. K., Kacewicz, E., & Pennebaker, J. W. (2008). The psychology of word use in depression forums in English and in Spanish: Testing two text analytic approaches. In Proceedings of the 2008 International Conference on Weblogs and Social Media (pp. 102–108). Association for the Advancement of Artificial Intelligence (AAAI).

Roberts, G. (2013). Perspectives on language as a source of social markers. Language and Linguistics Compass, 7(12), 12052. https://doi.org/10.1111/lnc3.12052 DOI

Sánchez-Rada, J. F., & Iglesias, C. A. (2019). Social context in sentiment analysis: Formal definition, overview of current trends and framework for comparison. Information Fusion, 52, 344–356. https://doi.org/10.1016/j.inffus.2019.05.003 DOI

Sboev, A., Litvinova, T., Gudovskikh, D., Rybka, R., & Moloshnikov, I. (2016). Machine learning models of text categorization by author gender using topic-independent features. Procedia Computer Science, 101, 135–142. https://doi.org/10.1016/j.procs.2016.11.017 DOI

Schwartz, H. A., Eichstaedt, J. C., Kern, M. L., Dziurzynski, L., Ramones, S. M., Agrawal, M., & Ungar, L. H. (2013). Personality, gender, and age in the language of social media: The open-vocabulary approach. PLoS ONE, 8(9), e73791. https://doi.org/10.1371/journal.pone.0073791 PubMed DOI PMC

Seidlhofer, B. (2013). Understanding English as a lingua franca—Oxford applied linguistics. Oxford University Press. https://doi.org/10.1002/9781405198431.wbeal0243 DOI

Shoda, Y., Mischel, W., & Wright, J. C. (1994). Intraindividual stability in the organization and patterning of behavior: Incorporating psychological situations into the idiographic analysis of personality. Journal of Personality and Social Psychology, 67(4), 674. https://doi.org/10.1037/0022-3514.67.4.674 PubMed DOI

Sikos, J., David, P., Habash, N., & Faraj, R. (2014). Authorship analysis of Inspire Magazine through stylometric and psychological features. In 2014 IEEE Joint Intelligence and Security Informatics Conference. https://doi.org/10.1109/jisic.2014.15

Skoumalová, H. (2011). Porovnání úspěšnosti tagování korpusu. In V. Petkevič, A. Rosen, Korpusová lingvistika Praha (Eds.), 3 Gramatika a značkování korpusů (pp.199–207). Praha, Nakladatelství Lidové noviny.

Specht, J., Egloff, B., & Schmukle, S. C. (2011). Stability and change of personality across the life course: The impact of age and major life events on mean-level and rank-order stability of the Big Five. Journal of Personality and Social Psychology, 101(4), 862. https://doi.org/10.1037/a0024950 PubMed DOI

Tausczik, Y., & Pennebaker, J. (2010). The psychological meaning of words: LIWC and computerized text analysis methods. Journal of Language and Social Psychology, 29, 24–54. https://doi.org/10.1177/0261927X09351676 DOI

Thompson, B., Roberts, S. G., & Lupyan, G. (2020). Cultural influences on word meanings revealed through large-scale semantic alignment. Nature Human Behaviour, 4(10), 1029–1038. https://doi.org/10.1038/s41562-020-0924-8 PubMed DOI

Vyse, S. (2004). Stability over time: Is behavior analysis a trait psychology? The Behavior Analyst, 27(1), 43–53. https://doi.org/10.1007/BF03392091 PubMed DOI PMC

W3Techs (2015). Usage of content languages for websites. W3Techs.com. Retrieved 24 March 2015 from http://w3techs.com

Weisbuch, M., Slepian, M. L., Clarke, A., Ambady, N., & Veenstra-VanderWeele, J. (2010). Behavioral stability across time and situations: Nonverbal versus verbal consistency. Journal of Nonverbal Behavior, 34(1), 43–56. https://doi.org/10.1007/s10919-009-0079-9 PubMed DOI PMC

Yarkoni, T. (2010). Personality in 100,000 words: A large-scale analysis of personality and word use among bloggers. Journal of Research in Personality, 44(3), 363–373. https://doi.org/10.1016/j.jrp.2010.04.001 PubMed DOI PMC

Zasina, A. J., Lukeš, D., Komrsková, Z., Poukarová, P., & Řehořková, A. (2018). Koditex: korpus diverzifikovaných textů. Ústav Českého národního korpusu FF UK.

Nejnovějších 20 citací...

Zobrazit více v
Medvik | PubMed

Beyond English: Considering Language and Culture in Psychological Text Analysis

. 2022 ; 13 () : 819543. [epub] 20220304

Najít záznam

Citační ukazatele

Nahrávání dat ...

Možnosti archivace

Nahrávání dat ...