Research

In my current research project my goal is to identify personality from text documents using linguistics methodologies to find latent features, particularly in Spanish Language.

personality-identification Personality identification from texts
:scroll: Aplicación web para identificar personalidad, género y edad de usuarios en Twitter
:scroll: TxPI-u: A resource for Personality Identification of undergraduates
:scroll: Shared task at ICPR 2018

Other projects my students and I are working on are:

data-depression Texts from depressed users: representation features and its possible detection. We analyze English texts of people with and without depression from Reddit in order to describe the language and behavior of people with depression in social media.
:scroll: Towards modelling depressed blogers: We use a graph-based representation to find depressed bloggers in a early detection fashion.
:scroll: Finding signs of anorexia and depression in social media users: In this paper (in Spanish) we explore some traditional texts representations to identify mental disorders.
:information_source: Information of depression in English texts: Web page (in Spanish) that present the result of this analysis.

influence-social-media Influence in social media. Is it possible to identify influence of a user or posts in social media by analyzing the texts and engagements of other participants in the social media platform?
:scroll: User influence in Twitter: We identify the influence of a Twitter account by means of style and behavior attributes of that account.
:scroll: Predicting impact of Facebook posts
:open_file_folder: Corpus Reacción: The data set of almost 14000 posts in Spanish for impact predictions.

source-code-plagiarism Source code plagiarism identification. We represent each source code with features such as the writer style (in the context of code writing and in comments in natural language) as well as structural attributes and the lexicon use in the name of variables.
:scroll: On the importance of lexicon, structure and style for identifying source code plagiarism
:scroll: High level features for detecting source code plagiarism across programming languages
:scroll: Retrieving and classifying instances of source code plagiarism

:point_up: Si eres un estudiante interesado en realizar tu proyecto terminal ponte en contacto conmigo. (If you are an interested student, contact me.)

Collaborators

  • Esaú Villatoro Tello, Héctor Jiménez Salazar, Christian Sánchez Sánchez, Carlos Rodriguez Lucatero, Wulfrano Luna Ramírez, Alba Nuñez Reyes, Margarita Espinosa Meneses, Dina Rochman Beer. DCCD/UAM Cuajimalpa
  • Manuel Montes y Gómez, Luis Villaseñor Pineda, Hugo J. Escalante, Fernando Sánchez Vega. LabTL/INAOE
  • Thamar Solorio. RiTUAL/UH
  • Verónica Reyes Meza. Centro Tlaxcala de Biología de la Conducta/UATx
  • Leticia C. Cagnina, Marcelo L. Errecalde. LIDIC/UNSL
  • Ivan Meza. IIMAS/UNAM