Read also: Writing a Master's Thesis in Language Technology
The way we write texts give a lot of information about the background personalities of the authors: their age, gender, native language (if writing in a foreign language), if they're human or bots, and possibly their actual identity. This type of information can be used to, e.g., give fair indications of user profiles, to deduce if a text (or a part of it) has been plagiarised, or to uncover social media software misuse. The thesis could thus focus on tasks such as author profiling (what can we say about the author, e.g., their gender, age, if they're a human or a bot), author identification (did a specific author write this text?) and/or plagiarism detection (did somebody else than the author claiming the text actually write all or part of the text?)