Determining the personality of Internet users by analyzing their digital footprints in the light of selected research by Michał Kosiński
Keywords: digital traces, digital footprints, measuring user personality, Five Factor Model, privacy, Michał Kosiński
Abstract
According to Internet Live Stats, in April 2021 the Google search engine handled about 92,000 queries every second. Every activity performed by users of digital devices is recorded as so-called digital footprints (also called digital traces). With appropriate technologies and methods, these footprints make it possible to precisely determine users' personality traits, political views, and sexual orientation.
The article was inspired by the work of Dr. Michał Kosiński and describes issues related to the analysis of digital traces left by Internet users, mainly on social media.
The main aim of the article is to present Dr. Michał Kosiński's research and to draw the attention of the IT community to the issues involved in analyzing the digital traces of Internet users. The work is of a popularizing nature and is not a comprehensive description of Kosiński's achievements. It presents no new information or original research; rather, it is intended to encourage readers to explore the literature on digital trace analysis and privacy in the digital age.
The work uses database content analysis to collect and analyze the literature on the subject. The focus was on the resources that Dr. Kosiński makes available through his personal website. Additionally, data mining techniques were used to summarize the archived content published on the Cambridge Analytica website.
The literature search covered the period from 2011 to the first quarter of 2021, focusing mainly on 2013–2021, that is, from the publication of the article "Private traits and attributes are predictable from digital records of human behavior" to the publication of "Facial recognition technology can expose political orientation from naturalistic facial images".
The article presents selected works by Dr. Michał Kosiński that constitute "milestones" in his research on determining the personality of Internet users. Consequently, it does not cover works that described only part of that research or that contributed to larger, later measurements.
References
1 Second – Internet Live Stats. ([2021]). Internet Live Stats [online]. Retrieved 2 May 2021, from:
Bachrach, Yoram, Kosinski, Michal, Graepel, Thore, Kohli, Pushmeet, & Stillwell, David J. (2012). Personality and patterns of Facebook usage. In: Proceedings of the 4th Annual ACM Web Science Conference, WebSci’12 (pp. 24–32) [online]. Retrieved 30 June 2021, from:
Cambridge Analytica. (2021). In: Wayback Machine [online]. Retrieved 2 May 2021, from:
Cambridge Analytica. (2021). In: Wikipedia. The Free Encyclopedia [online]. Retrieved 30 June 2021, from:
Kasprzak, Wojciech A. (2015). Ślady cyfrowe. Studium prawno-kryminalistyczne. Warszawa: Difin.
Kosinski, Michal (2021). Facial recognition technology can expose political orientation from naturalistic facial images. Scientific Reports, 11(1), Article number: 100 [online]. Retrieved 30 June 2021, from:
Kosinski, Michal, Bachrach, Yoram, Kohli, Pushmeet, Stillwell, David J., & Graepel, Thore (2014). Manifestations of user personality in website choice and behavior on online social networks. Machine Learning, 95, 357–380 [online]. Retrieved 30 June 2021, from:
Kosinski, Michal, Stillwell, David J., & Graepel, Thore (2013). Private traits and attributes are predictable from digital records of human behavior. Proceedings of the National Academy of Sciences of the United States of America, 110(15), 5802–5805 [online]. Retrieved 30 June 2021, from:
Kosinski, Michal, Wang, Yilun, Lakkaraju, Himabindu, & Leskovec, Jure (2016). Mining big data to extract patterns and predict real-life outcomes. Psychological Methods, 21(4), 493–506 [online]. Retrieved 30 June 2021, from:
Lambiotte, Renaud, & Kosinski, Michal (2014). Tracking the digital footprints of personality. Proceedings of the IEEE, 102(12), 1934–1939 [online]. Retrieved 30 June 2021, from:
Markovikj, Dejan, Gievska, Sonja, Kosinski, Michal, & Stillwell, David J. (2013). Mining Facebook data for predictive personality modeling. Proceedings of the International AAAI Conference on Web and Social Media, 7(1), 23–26. Retrieved 30 June 2021, from:
Matz, Sandra C., Kosinski, Michal, Nave, Gideon, & Stillwell, David J. (2017). Psychological targeting as an effective approach to digital mass persuasion. Proceedings of the National Academy of Sciences of the United States of America, 114(48), 12714–12719 [online]. Retrieved 30 June 2021, from:
McCrae, Robert R., & John, Oliver P. (1992). An Introduction to the Five‐Factor Model and Its Applications. Journal of Personality, 60(2), 175–215 [online]. Retrieved 30 June 2021, from:
Michal Kosinski (2021, May 2). In: Google Scholar [online]. Retrieved 2 May 2021, from:
Pamięć podręczna (2021). In: Wikipedia. Wolna encyklopedia [online]. Retrieved 30 June 2021, from:ęć_podręczna
Quercia, Daniele, Kosinski, Michal, Stillwell, David J., & Crowcroft, Jon (2011). Our twitter profiles, our selves: Predicting personality with twitter. In: Proceedings – 2011 IEEE International Conference on Privacy, Security, Risk and Trust and IEEE International Conference on Social Computing, PASSAT/SocialCom 2011 (pp. 180–185) [online]. Retrieved 30 June 2021, from:
Quercia, Daniele, Lambiotte, Renaud, Stillwell, David J., Kosinski, Michal, & Crowcroft, Jon (2012). The personality of popular facebook users. In: Proceedings of the ACM Conference on Computer Supported Cooperative Work, CSCW’12 (pp. 955–964) [online]. Retrieved 30 June 2021, from:
Stillwell, David J. (2021). myPersonality database [online]. Retrieved 30 June 2021, from:
Stillwell, David J., & Kosinski, Michal (2012). myPersonality project: Example of successful utilization of online social networks for large-scale social research. In: Proceedings of the ACM Workshop on Mobile Systems for Computational Social Science (MobiSys) [online]. The Psychometric Centre, University of Cambridge. Retrieved 30 June 2021, from:
Wang, Yilun, & Kosinski, Michal (2018). Deep neural networks are more accurate than humans at detecting sexual orientation from facial images. Journal of Personality and Social Psychology, 114(2), 246–257 [online]. Retrieved 30 June 2021, from:
Wasilewski, Janusz (2013). Zarys definicyjny cyberprzestrzeni. Przegląd Bezpieczeństwa Wewnętrznego, 9(5), 225–234 [online]. Agencja Bezpieczeństwa Wewnętrznego. Retrieved 30 June 2021, from:
Youyou, Wu, Kosinski, Michal, & Stillwell, David J. (2015). Computer-based personality judgments are more accurate than those made by humans. Proceedings of the National Academy of Sciences of the United States of America, 112(4), 1036–1040 [online]. Retrieved 30 June 2021, from:
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.