Publication: Boosting gender identification using author preference
cris.virtual.department | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
cris.virtual.orcid | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
cris.virtualsource.department | fd57787d-d723-465c-8fe7-a3f165ddfdf0 | |
cris.virtualsource.orcid | fd57787d-d723-465c-8fe7-a3f165ddfdf0 | |
dc.contributor.affiliation | Ted University; Middle East Technical University; Turk Hava Kurumu University; Turkish Aeronautical Association | |
dc.contributor.author | Kucukyilmaz, Tayfun; Deniz, Ayca; Kiziloz, Hakan Ezgi | |
dc.date.accessioned | 2024-06-25T11:45:07Z | |
dc.date.available | 2024-06-25T11:45:07Z | |
dc.date.issued | 2020 | |
dc.description.abstract | Predicting the gender of a text document's author, also known as gender identification, is a well-studied authorship categorization task in the literature. A common theme in gender identification studies is that gender is considered a binary task. However, digital communications provide users with the ability to select virtual genders leveraging physical anonymity. In this study, the additional duality on gender due to author preferences is examined along with the biological gender. Formally, the objective of this paper is to investigate whether the gender preference of an author contains any additional linguistic information. Furthermore, we explore whether this information can be exploited to improve the author characterization task. In particular, the self-assigned gender, i.e., virtual gender, of the users in text-based real-time online messaging services, along with the biological sex, is evaluated quantitatively via comparing/assessing the gender prediction performance under various settings. Experiment results show that by integrating the virtual gender into the binary classification problem of predicting an author's gender, it is possible to further improve the prediction performance by 2.6%, up to 85.4%. (c) 2020 Elsevier B.V. All rights reserved. | |
dc.description.doi | 10.1016/j.patrec.2020.10.002 | |
dc.description.endpage | 251 | |
dc.description.pages | 7 | |
dc.description.researchareas | Computer Science | |
dc.description.startpage | 245 | |
dc.description.uri | http://dx.doi.org/10.1016/j.patrec.2020.10.002 | |
dc.description.volume | 140 | |
dc.description.woscategory | Computer Science, Artificial Intelligence | |
dc.identifier.issn | 0167-8655 | |
dc.identifier.uri | https://acikarsiv.thk.edu.tr/handle/123456789/1233 | |
dc.language.iso | English | |
dc.publisher | ELSEVIER | |
dc.relation.journal | PATTERN RECOGNITION LETTERS | |
dc.subject | Gender identification; Text classification; Authorship attribution; Machine learning; Gender-swapping; Virtual gender | |
dc.subject | COMPUTER; DISCOURSE; FEATURES; DIALECT | |
dc.title | Boosting gender identification using author preference | |
dc.type | Article | |
dspace.entity.type | Publication |